Eabb006289.1
Basic Information
- Insect
- Eupithecia abbreviata
- Gene Symbol
- -
- Assembly
- GCA_943735975.1
- Location
- CALSER010000038.1:1-9763[+]
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 0.015 22 5.0 0.1 15 29 18 32 15 36 0.84 2 15 0.015 22 5.0 0.1 15 29 98 112 95 116 0.84 3 15 0.025 38 4.2 0.0 16 29 180 193 176 197 0.87 4 15 0.025 38 4.2 0.0 16 29 261 274 257 278 0.87 5 15 0.015 22 5.0 0.1 15 29 340 354 337 358 0.84 6 15 0.025 38 4.2 0.0 16 29 422 435 418 439 0.87 7 15 0.025 38 4.2 0.0 16 29 503 516 499 520 0.87 8 15 0.025 38 4.2 0.0 16 29 584 597 580 601 0.87 9 15 0.025 38 4.2 0.0 16 29 665 678 661 682 0.87 10 15 0.025 38 4.2 0.0 16 29 746 759 742 763 0.87 11 15 0.025 38 4.2 0.0 16 29 827 840 823 844 0.87 12 15 0.015 22 5.0 0.1 15 29 906 920 903 924 0.84 13 15 0.015 22 5.0 0.1 15 29 986 1000 983 1004 0.84 14 15 0.013 19 5.2 0.0 15 30 1066 1081 1063 1084 0.84 15 15 1.8 2.7e+03 -1.7 0.0 23 29 1102 1108 1095 1114 0.79
Sequence Information
- Coding Sequence
- AAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCATACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCATACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCATGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGATGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTTGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGAGTTCGTGGTCGCTGCTAAACCACCTCAACACGCACTCCCGCTCCCACCGCTTCACCTGCAAGACGTGTGGCCTCCAGCTCAGCTCCAGAAGTGTTCTACAGAGGCATCAGCTGACCCACGGTCACGAAAAGAGTTTTGTCTGTGACCGTTGCCACAAACGGTTTAATCATCGCAACGGGCTCAGAGTTCATCTACGCACACATGAGAAGGAACGAGACAGTCCTGGCAGAAAGAAAGAGACGCAACATGTCATGGATTACTTCAAACATTATGGCCCTTTAGGAAATAACTATGGTTATGAGGCTAAAAAACTCTAA
- Protein Sequence
- KHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLMPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTCASSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLASSWSLLNHLNTHSRSHRFTCKTCGLQLSSRSVLQRHQLTHGHEKSFVCDRCHKRFNHRNGLRVHLRTHEKERDSPGRKKETQHVMDYFKHYGPLGNNYGYEAKKL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00696912;
- 90% Identity
- iTF_00696912;
- 80% Identity
- iTF_00696912;