Basic Information

Gene Symbol
-
Assembly
GCA_943735975.1
Location
CALSER010000221.1:1-8629[-]

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 13 1 1.6e+03 -0.9 3.2 19 33 329 343 303 346 0.75
2 13 0.074 1.1e+02 2.7 0.5 1 30 386 419 386 422 0.68
3 13 4.2 6.4e+03 -2.9 0.1 20 29 445 454 440 455 0.66
4 13 1.4 2.1e+03 -1.3 0.0 17 30 495 507 491 511 0.70
5 13 0.025 38 4.2 0.0 16 29 534 547 530 551 0.87
6 13 0.015 22 5.0 0.1 15 29 614 628 611 632 0.84
7 13 0.015 22 5.0 0.1 15 29 694 708 691 712 0.84
8 13 0.025 38 4.2 0.0 16 29 777 790 773 794 0.87
9 13 0.025 38 4.2 0.0 16 29 858 871 854 875 0.87
10 13 0.025 38 4.2 0.0 16 29 940 953 936 957 0.87
11 13 0.015 22 5.0 0.1 15 29 1019 1033 1016 1037 0.84
12 13 0.028 42 4.1 0.0 15 28 1099 1112 1096 1116 0.86
13 13 0.031 46 3.9 0.1 16 29 1182 1195 1178 1197 0.87

Sequence Information

Coding Sequence
ATGTCTGAGGACAAATTCTGTCGGATATGCGTCCGCAAAGATGTGAAAATGTATGTATATAATCGATTCCACTTGAAGCAATTTTATAAAGAAATTACCGGATCGAAGGTTTCGAAGCGGGATTGCCTTCCGAAATGGTTCTGTTTTGAGTGTGCAGCGCTATTGCATAAATTCCATGAATTCAAGAAGAAATGTTACAATGGACAGAATATCTTCAAAAAACTTCTTGAGTCGAAAAGTACAACCAGGAACAATTTTAACAGATACAGCCAATTACCATATCTACAAATAATATCAGTCCATACCAACAATAGTATTAAAACATATACAATAAAACACGGGGAAATAAAACAAAGTATATCACAAAATAATTTTACTGAGGTTTATGTTGATGACGTAGATAATGATGAAGCAAGAAATGATAAGCCTATAGAACACAATGATGATGACAATGACAGTGATATTGATAATGATATACACAATGATTCTCTGTTTTCTAATAGAAATGCAGAAAATGATGAGGAATTTAAAATGGACAAAGCAGATAAGCAAACCAAACATAGAATTACAAATTATACAGAAGTAATAACAAATGAACTTGATAGTGACGAAAAACCAAATATATTTGACGAAAAAACAGGCAATTTCGTAATAGATGATGAAATAGAGAAAGAACAAACATATAAAATAGACAACTCAAAAATCAAAGAAAGAATCAAATCCGAAACAGATATAATAAATAAATTGAATCACGAATCAAACCTTGTAAATAAATTCCAAAGATCAGTCAAAAAGAAATTTCTTGATGAGAATCATTGGAATAAAATAACTTTAAGTGATGAGGAAGCTTCTAGAAGGTTTCAAGCGAAGGCCTTAGAACAGAAGTACATAAGGGCTGATTTTAAATGTACTGATTGTTATAGGACATTTTCTCAGGAAGATATGATGAAAAGGCATATTAAGCTGAGGCATTGTGAATCCCTAGGCCCCCACGAATGCCGGCACTGCCGCATGCGCTTCAAGTGGAAGTCCAGGCTACAGAAACATTTGAAGGAACACTACACCATCTACAAGTGCCTTAGATGTGAACTTTCCTTTCCTGTTGAAATTTCTGCATTCCAACACGATTATTCCCACAATGGCGTTACTTGGACATGTGCGCATTGCGGACAAAGATTTCGACACAGTTCCACCTACTACACACATCTTCGGAAGCACAAGAGTAAATACGTGTGTACCCTTTGCGGGGTCTCGTTCGTCAGCGAATTCGGACTGTTCATGCACAAGCGGGTCAAGCATGTTATTACTGAGGATTTTAAAGAAGAAAGTTCCAATACGTACTGCAATGTGTGTGATATAACGTTTGATACGCTAAAGGGCTACGAAGATCACTTCGCGGAGTCTGCTTTACACGTTACTGAAGTGTCGGAAGGCGCGACCAGTCTACCTCTGAAGAGCAGATGCACTAAGAAAAGAGTTCCGAGGATAGCTACTACTTGTAATATTTGTGGCCGTGCATTCTCCACCTACGCTGCGTTCAGCAAGCACCATACGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGGTACGTATCTACATATACAGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCGCGTTCAGCAAGCACCATGCGCACGCGCACCTCGACTCGCCGCTGCTGCCGACCAGAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCATACTCGCGGTACGTATCTACATATACAGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCATACTCGCGCGTCCAGCAAGCACCATGCGCACGCGCACCCCGACTCGCCGCTGCTGCCCAAGACCAAAGACAAGCACAACGGACAGAGACTAGTGTGCGAAGTATGCGGCATCACACTCGCG
Protein Sequence
MSEDKFCRICVRKDVKMYVYNRFHLKQFYKEITGSKVSKRDCLPKWFCFECAALLHKFHEFKKKCYNGQNIFKKLLESKSTTRNNFNRYSQLPYLQIISVHTNNSIKTYTIKHGEIKQSISQNNFTEVYVDDVDNDEARNDKPIEHNDDDNDSDIDNDIHNDSLFSNRNAENDEEFKMDKADKQTKHRITNYTEVITNELDSDEKPNIFDEKTGNFVIDDEIEKEQTYKIDNSKIKERIKSETDIINKLNHESNLVNKFQRSVKKKFLDENHWNKITLSDEEASRRFQAKALEQKYIRADFKCTDCYRTFSQEDMMKRHIKLRHCESLGPHECRHCRMRFKWKSRLQKHLKEHYTIYKCLRCELSFPVEISAFQHDYSHNGVTWTCAHCGQRFRHSSTYYTHLRKHKSKYVCTLCGVSFVSEFGLFMHKRVKHVITEDFKEESSNTYCNVCDITFDTLKGYEDHFAESALHVTEVSEGATSLPLKSRCTKKRVPRIATTCNICGRAFSTYAAFSKHHTHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAAQDQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRASSKHHAHAHLDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAAQDQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAAQDQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHLDSPLLPTRDKHNGQRLVCEVCGITLAVRIYIYSVQQAPCARAPRLAAAADQRQAQRTETSVRSMRHHTRAFSKHHAHAHLDSPLLPTRDKHNGQRLVCEVCGIILAVRIYIYSVQQAPCARAPRLAAAAQDQRQAQRTETSVRSMRHHTRASSKHHAHAHPDSPLLPKTKDKHNGQRLVCEVCGITLA

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00696904;
90% Identity
iTF_00696904;
80% Identity
iTF_00696904;