Basic Information

Gene Symbol
-
Assembly
GCA_943735975.1
Location
CALSER010000038.1:1-9763[+]

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 15 0.015 22 5.0 0.1 15 29 18 32 15 36 0.84
2 15 0.015 22 5.0 0.1 15 29 98 112 95 116 0.84
3 15 0.025 38 4.2 0.0 16 29 180 193 176 197 0.87
4 15 0.025 38 4.2 0.0 16 29 261 274 257 278 0.87
5 15 0.015 22 5.0 0.1 15 29 340 354 337 358 0.84
6 15 0.025 38 4.2 0.0 16 29 422 435 418 439 0.87
7 15 0.025 38 4.2 0.0 16 29 503 516 499 520 0.87
8 15 0.025 38 4.2 0.0 16 29 584 597 580 601 0.87
9 15 0.025 38 4.2 0.0 16 29 665 678 661 682 0.87
10 15 0.025 38 4.2 0.0 16 29 746 759 742 763 0.87
11 15 0.025 38 4.2 0.0 16 29 827 840 823 844 0.87
12 15 0.015 22 5.0 0.1 15 29 906 920 903 924 0.84
13 15 0.015 22 5.0 0.1 15 29 986 1000 983 1004 0.84
14 15 0.013 19 5.2 0.0 15 30 1066 1081 1063 1084 0.84
15 15 1.8 2.7e+03 -1.7 0.0 23 29 1102 1108 1095 1114 0.79

Sequence Information

Coding Sequence
AAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCATACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCATACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCATGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGATGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTTGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGAGTTCGTGGTCGCTGCTAAACCACCTCAACACGCACTCCCGCTCCCACCGCTTCACCTGCAAGACGTGTGGCCTCCAGCTCAGCTCCAGAAGTGTTCTACAGAGGCATCAGCTGACCCACGGTCACGAAAAGAGTTTTGTCTGTGACCGTTGCCACAAACGGTTTAATCATCGCAACGGGCTCAGAGTTCATCTACGCACACATGAGAAGGAACGAGACAGTCCTGGCAGAAAGAAAGAGACGCAACATGTCATGGATTACTTCAAACATTATGGCCCTTTAGGAAATAACTATGGTTATGAGGCTAAAAAACTCTAA
Protein Sequence
KHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLMPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTCASSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLASSWSLLNHLNTHSRSHRFTCKTCGLQLSSRSVLQRHQLTHGHEKSFVCDRCHKRFNHRNGLRVHLRTHEKERDSPGRKKETQHVMDYFKHYGPLGNNYGYEAKKL

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00696912;
90% Identity
iTF_00696912;
80% Identity
iTF_00696912;