Oiva007141.1
Basic Information
- Insect
- Oeneis ivallda
- Gene Symbol
- -
- Assembly
- GCA_029955525.1
- Location
- JARPMR010000005.1:376069-398577[-]
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 20 0.45 2.6e+03 -0.4 0.1 15 26 78 89 76 90 0.89 2 20 0.0016 9.3 7.4 1.3 1 34 300 340 300 342 0.85 3 20 0.0061 35 5.6 0.3 14 34 392 412 389 414 0.85 4 20 0.0061 35 5.6 0.3 14 34 464 484 461 486 0.85 5 20 0.0061 35 5.6 0.3 14 34 536 556 533 558 0.85 6 20 0.0061 35 5.6 0.3 14 34 608 628 605 630 0.85 7 20 0.0061 35 5.6 0.3 14 34 680 700 677 702 0.85 8 20 0.0014 7.9 7.6 0.2 14 34 752 772 749 774 0.85 9 20 0.0061 35 5.6 0.3 14 34 824 844 821 846 0.85 10 20 0.0061 35 5.6 0.3 14 34 896 916 893 918 0.85 11 20 0.0061 35 5.6 0.3 14 34 968 988 965 990 0.85 12 20 0.0061 35 5.6 0.3 14 34 1040 1060 1037 1062 0.85 13 20 0.0014 7.9 7.6 0.2 14 34 1112 1132 1109 1134 0.85 14 20 0.0061 35 5.6 0.3 14 34 1184 1204 1181 1206 0.85 15 20 0.0061 35 5.6 0.3 14 34 1256 1276 1253 1278 0.85 16 20 0.0061 35 5.6 0.3 14 34 1328 1348 1325 1350 0.85 17 20 0.0061 35 5.6 0.3 14 34 1400 1420 1397 1422 0.85 18 20 0.0061 35 5.6 0.3 14 34 1472 1492 1469 1494 0.85 19 20 0.0061 35 5.6 0.3 14 34 1544 1564 1541 1566 0.85 20 20 2.3 1.3e+04 -2.7 0.0 19 29 1693 1703 1689 1705 0.75
Sequence Information
- Coding Sequence
- ATGACTGCCACTGCAGTAAATAGCAGTATAGAGACCAAGTGTTCACTGAAGGATGATGAAGCGCTCGGGGTTAAAGAAGAAACCGACGTCATTGTGAAGCAGGAGTTAGTGGAATGTTTGGTATGCTTGGGACGCAGCGGGAGCTACTGCGACCTCAACACGGCGACAACGGCCGCGGGCACTTCGCTAAGCGCTTTCCTGACTAAATTCGCTCACGCTGGCCTCGGCAAGTCACCGAGTGGCTCCAAACATGTTTGCAGGACGTGCCTCAACCTAGTTAACATATTAGAACAAGCTGAATTGGAGTATTATAAAGTAAGAGAAGAATTTCACGCTGTAATAAGTAAGAACCCACTCTTCGAACCATCGGCGACGAACGCACATATAACTCTAGAATTAGTTAAGAATGAGGATTTAGAAACATTTGTTGATAATGATTGTGACAACGACTCCGAAGACGAGCCCCTCGCTCTCACAAAGAAGAAACGTCACCGAAAAGTTGAAAAGCGGAAAAAGAAGCCCACCGCAGAGAATAAACGAAAGTTGCGCACTAAAAACAATGCCGACAGTTGGGAGTGCCCGGAGTGTGAAGCGAGCGGGGAGTGCGGGGGGGAGGCTGCTCTCACCGCCCACATGCTGGTGACACATCTCAACATACAGAAAGTTAAGAAGGAGGAGTCACCCGACAGATCCCAAAGTCCCCACTTGAACCATCTGGATGAAATGCTCAAGTTGGAAGATGACACATTCGACGATGACGACGACAGCTCCAACTACGTGCCGCACCAGATCAGCAACGGCAAGCAGAGCAAGGCCAAGCACCTCAGCAGAACCAAGAAGAAAAGCTTCAAAAACATCAGCGTCAAGAAGAAAAAGGACCCCAAAGTGATGCATCAGTGTGACCAGTGCCCGGCCAGATACACCTCACTGGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGTGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGTGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTGTGCGGCGCGCACTACAAGCACAAGCGGGCCTGCGACATACACGTCGCTCTGCACAAGGGTATTGCGGCCTTTGCTATTTACTTTGTGCATTGCAGTTCGTCTGGAGCAGCACAAGCTGAAGCACGACGACTCCAAGCCGCCCTACATCTGCGAGGTATCAGCGACTGGAAGTGTGAAGAGTGCAATAAGTTGTTCCCCTCTAAGGGCGCCCTGCAGAGGCACAACAACATACACACCGGCAAACTGAATTATCAGTGCGACCTGTGCGGCAAGTCGTTCATCCACACGTCGTCGTTCAAGATGCACAAGCTGTCGCACTCGGGCGTGAAGCCGCACGCGTGCGACGTGTGCGGGCTGGCGCTGATGACGCGCTCGCACCTCAAGCGGCACAAGCGCGTGCACTCGGGCGAGAAGCGGCACGAGTGCGCCGTGTGCGGCAAGCGCTTCAGCGAGCGCTACAACCTGATCGCGCACGCGCGCTCGCACGCGCCGCACGACGCGCCCGACGCGCCGCCCGCATCCGCACCCACCGCTCCGCGCCGCCGCCTCTTCCGCTGCGCCTTCTGCCCCGAGCGCTTCGAGCGCCGCTACATGCTGGAGCGGCACGCGGGCGCCGCGCACGGCCGCGCGCTCGAGCGCCCGCCGCCCACGCCGCGCAACACCATGAGCAAGCTGCTCAAGGCGCAGGCGCAGCGCCGCGACGCCGCACCACCACCCGCCGCACCaacacccgcacccgcaccagCACCCACACCCACAGACGACAAGCTCAACCGCAGCCCCGAGTCGCGGGCCGCCGCCGGCTCCAAGCTGCACGCGGCGCTGGTTCTCAATGTGGCGGCTTCAAGTGACGTGGTTGTTGTTGCAGCAGCAgagggcgcgggcggcgacgcgggcggcgcggcgggcgccgtgTCCGCCATCACGTGGGCCGGCGCCTACGCGGCCGAGTTCGGCCTCAGGCCGGACTACTCGCACTGA
- Protein Sequence
- MTATAVNSSIETKCSLKDDEALGVKEETDVIVKQELVECLVCLGRSGSYCDLNTATTAAGTSLSAFLTKFAHAGLGKSPSGSKHVCRTCLNLVNILEQAELEYYKVREEFHAVISKNPLFEPSATNAHITLELVKNEDLETFVDNDCDNDSEDEPLALTKKKRHRKVEKRKKKPTAENKRKLRTKNNADSWECPECEASGECGGEAALTAHMLVTHLNIQKVKKEESPDRSQSPHLNHLDEMLKLEDDTFDDDDDSSNYVPHQISNGKQSKAKHLSRTKKKSFKNISVKKKKDPKVMHQCDQCPARYTSLVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGVHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGVHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRVRLEQHKLKHDDSKPPYICEVCGAHYKHKRACDIHVALHKGIAAFAIYFVHCSSSGAAQAEARRLQAALHLRGISDWKCEECNKLFPSKGALQRHNNIHTGKLNYQCDLCGKSFIHTSSFKMHKLSHSGVKPHACDVCGLALMTRSHLKRHKRVHSGEKRHECAVCGKRFSERYNLIAHARSHAPHDAPDAPPASAPTAPRRRLFRCAFCPERFERRYMLERHAGAAHGRALERPPPTPRNTMSKLLKAQAQRRDAAPPPAAPTPAPAPAPTPTDDKLNRSPESRAAAGSKLHAALVLNVAASSDVVVVAAAEGAGGDAGGAAGAVSAITWAGAYAAEFGLRPDYSH
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -