Basic Information

Gene Symbol
-
Assembly
GCA_932294385.1
Location
CAKOAM010000059.1:2513727-2520622[+]
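The Location field packs the contig accession, start, end, and strand into one string. A minimal sketch of splitting it apart is shown below; the function name `parse_location` and the reading of the coordinates as 1-based and inclusive are assumptions, not something this record states.

```python
import re

# Pattern for "contig:start-end[strand]" as it appears in the Location field.
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a Location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "contig": match["contig"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }

loc = parse_location("CAKOAM010000059.1:2513727-2520622[+]")
print(loc)
```

Under the inclusive-coordinate assumption the locus spans 6896 bases on the forward strand.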

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion and binds DNA. GATA transcription factors contain two GATA zinc fingers; however, several proteins contain only a single copy of the domain.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  9   0.034     85        3.3    0.2   18        32      52        66      48        69      0.82
2  9   0.1       2.5e+02   1.8    0.1   18        33      139       153     137       156     0.79
3  9   0.0012    3.1       7.9    0.0   19        33      165       179     159       182     0.81
4  9   0.0012    2.9       8.0    0.0   19        34      213       228     207       230     0.81
5  9   0.00062   1.5       8.9    0.0   19        33      261       275     255       278     0.81
6  9   0.0012    3.1       7.9    0.0   19        33      346       360     340       363     0.81
7  9   0.013     32        4.7    0.0   20        30      395       405     390       411     0.81
8  9   0.0064    16        5.7    0.4   6         33      435       462     433       465     0.81
9  9   0.68      1.7e+03   -0.8   0.3   23        27      565       569     561       577     0.62
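The domain rows above can be read into records and filtered by independent E-value. The sketch below assumes the thirteen columns are, in order, hit index, hit count, c-Evalue, i-Evalue, score, bias, HMM start and end, alignment start and end, envelope start and end, and accuracy. The column names and the cutoff `MAX_I_EVALUE` are illustrative choices, not values used by the database.

```python
# Column names are illustrative labels for the thirteen fields in each row.
COLUMNS = [
    "index", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]

ROWS = """\
1 9 0.034 85 3.3 0.2 18 32 52 66 48 69 0.82
2 9 0.1 2.5e+02 1.8 0.1 18 33 139 153 137 156 0.79
3 9 0.0012 3.1 7.9 0.0 19 33 165 179 159 182 0.81
4 9 0.0012 2.9 8.0 0.0 19 34 213 228 207 230 0.81
5 9 0.00062 1.5 8.9 0.0 19 33 261 275 255 278 0.81
6 9 0.0012 3.1 7.9 0.0 19 33 346 360 340 363 0.81
7 9 0.013 32 4.7 0.0 20 30 395 405 390 411 0.81
8 9 0.0064 16 5.7 0.4 6 33 435 462 433 465 0.81
9 9 0.68 1.7e+03 -0.8 0.3 23 27 565 569 561 577 0.62
"""

def parse_rows(text):
    """Turn whitespace-separated rows into dicts keyed by COLUMNS."""
    hits = []
    for line in text.strip().splitlines():
        values = [float(v) for v in line.split()]
        hits.append(dict(zip(COLUMNS, values)))
    return hits

hits = parse_rows(ROWS)

# Arbitrary illustrative cutoff; pick a threshold suited to your analysis.
MAX_I_EVALUE = 10.0
kept = [int(h["index"]) for h in hits if h["i_evalue"] <= MAX_I_EVALUE]

# The hit with the lowest independent E-value.
best = min(hits, key=lambda h: h["i_evalue"])

print("hits kept:", kept)
print("best hit:", int(best["index"]), "alignment",
      int(best["ali_from"]), "-", int(best["ali_to"]))
```

With this cutoff, hits 3 through 6 are kept, and hit 5 (alignment positions 261 to 275) has the lowest i-Evalue.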

Sequence Information

Coding Sequence
ATGACTCCCGATTTCGAAGATAAGTGTTTGGTTTGCTTAAAAACCGGCGGCAATTTGAATATCTTTACCACGTACAACATCAACAATGACAAAGAAGAATTATACGTTGAAGTCATTAAAAACTGCTTTAATGTGGAGATTACACGGGACTCTGGCCTCAACGCAATGTGCAATGAATGTAGCACTAAATTGAGAAAGGCATATACCTTCAAAAATGATGTTTTGATAGCTGTTAACAGACTAGTGGTTTTACAAGTGGAAAGGAAAGCTAAATCAAAGAAACCAGCCAAGAGTCAAAGTAAGAAGATTATAACAATAGAGGGTGAGCTGGTCAGTCCTACAGATCAAGAGGAGACTAATAAGAGGCCAAAGAAAGTGAAAGAAGCCAAGAAGCCCCCCTTCACTCTCCAGGCTGGCCAAACGTTGTGCGCACTCTGCGGCGCTCAGTTCGAAACGCACAAGGCGCTGAACACACACATGAACGCTCACTTCCCCGACCACGTCTGCAATGAGTGCGGGATGGCGTTCGCGAGTCAGAGCCGGCTGAGGATGCACAGATATCAAGCACACGTCGACGGGCCGATAAACTGCAAGTACTGTGACAAGGCGCTGAATACACACATGAACGCGCACTTCCCCGACCACGTCTGCAATGAGTGCGGGATGGCGTTCGCGAGTCGGAGCCGGCTGAGGATGCACAGATACCAAGCACACGTGGACGGGCCGATAAACTGCAAGTACTGCGACAAGGCGCTGAACACACACATGAACGCTCACTTCCCCGACCACGTCTGCAATGACTGCGGGATGGCGTTCGCGAGTCAGAGCCGGCTGAGGATGCACAGATACCAAGCACACGTGGACGGGCCGATAAACTGCAAGTACTGCGACAAGTCAGAGCCGGCTGAGGATGCACAGATACCAAACACACGTCGATGGGCCGATAAACTGCAAGTACTGCGACAAGGTATGTCACTCTGCGGCGCTCAGTTCGAAACGCACAAGGCGCTGAATACACACATGAACGCGCACTTCCCCGACCACGTCTGCAATGAGTGCGGGATGGCGTTCGCGAGTCAGAGCCGGCTGAGGATGCACAGATACCAAGCACACGTCGACGGGCCGATAAACTGCAAGTACTGCGACAAGGCGCTGAATACACACATGAACGTGCACTTCCCCGACCACGTCTGCAATGAGTGCGGGATGGTGTTCCCGAGTCAGAGCCAGCTGAGGATACACGGATACCAGGCACACGTCGATGGGCCGAAGAACTGCGATAAGAGTTTCACCAACGTGAACACCCGTAACGCGCACATGTGGCGCGTCCACACGAAGACCGACCCCTTCAAGTGTGCCCAGTGCGGCGCGCGCTTCAAGACGTATCTCAAGCGGCTGAGGCATATGCAGAGCGCGCACGGAGAAACAGTTGAATACCAGTGCCAGTACTGTGATAAGAAGTTCGCTTACGCTGCATACAGAACGAAGCACGTCCGCACGATCCACCTCGAATCCCGCCCCCACAAGTGTCCCAGCTGCCCCTACAGCTTCGCGCGCCGCCGCGAGCTGGACGCGCACCGCGCCAACAAACATGGCGACGGCGGGATGAAGTACCAATGCGATGCGTGCGACAAGAAGTTCTTCTTCAAGAGCGCGCTGAAGATGCACGTCAGTTTGCATAACAGATTCACATGCAACCAGTGCGACGCGAAATTCACCCACGCGCGAGAACTGGAAGCGCACCACCGCACACACGAGTTCGCGGACATGCCGGAGCTGCCCATCCTCGACCCCGCGCAGTTCGACGCGTTTGCGGACTACGCGCTGGACTGGGTGCTGGCTCCCTAG
Protein Sequence
MTPDFEDKCLVCLKTGGNLNIFTTYNINNDKEELYVEVIKNCFNVEITRDSGLNAMCNECSTKLRKAYTFKNDVLIAVNRLVVLQVERKAKSKKPAKSQSKKIITIEGELVSPTDQEETNKRPKKVKEAKKPPFTLQAGQTLCALCGAQFETHKALNTHMNAHFPDHVCNECGMAFASQSRLRMHRYQAHVDGPINCKYCDKALNTHMNAHFPDHVCNECGMAFASRSRLRMHRYQAHVDGPINCKYCDKALNTHMNAHFPDHVCNDCGMAFASQSRLRMHRYQAHVDGPINCKYCDKSEPAEDAQIPNTRRWADKLQVLRQGMSLCGAQFETHKALNTHMNAHFPDHVCNECGMAFASQSRLRMHRYQAHVDGPINCKYCDKALNTHMNVHFPDHVCNECGMVFPSQSQLRIHGYQAHVDGPKNCDKSFTNVNTRNAHMWRVHTKTDPFKCAQCGARFKTYLKRLRHMQSAHGETVEYQCQYCDKKFAYAAYRTKHVRTIHLESRPHKCPSCPYSFARRRELDAHRANKHGDGGMKYQCDACDKKFFFKSALKMHVSLHNRFTCNQCDAKFTHARELEAHHRTHEFADMPELPILDPAQFDAFADYALDWVLAP
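The relationship between the two sequences can be spot-checked by translating the coding sequence with the standard genetic code. The sketch below assumes the standard code applies to this record and translates only the first and last few codons as a check; `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last three codons of the Coding Sequence above.
cds_start = "ATGACTCCCGATTTCGAAGATAAG"
cds_end = "GCTCCCTAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MTPDFEDK` and `AP`, which agree with the start and end of the Protein Sequence field. Passing the full coding sequence to `translate` would extend the same check to the whole record.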

Similar Transcription Factors

Clusters of similar sequences computed with MMseqs2 at the identity thresholds below

100% Identity
iTF_00659666;
90% Identity
iTF_00659666;
80% Identity
iTF_00659666;