Basic Information

Gene Symbol
-
Assembly
GCA_905147745.1
Location
LR990523.1:17854172-17866707[-]

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 25 0.65 2.8e+03 -1.3 0.3 17 29 292 303 277 308 0.66
2 25 0.039 1.7e+02 2.6 0.1 23 35 325 337 315 338 0.87
3 25 0.039 1.7e+02 2.6 0.1 23 35 372 384 362 385 0.87
4 25 0.039 1.7e+02 2.6 0.1 23 35 419 431 409 432 0.87
5 25 0.039 1.7e+02 2.6 0.1 23 35 466 478 456 479 0.87
6 25 0.039 1.7e+02 2.6 0.1 23 35 513 525 503 526 0.87
7 25 0.039 1.7e+02 2.6 0.1 23 35 560 572 550 573 0.87
8 25 0.039 1.7e+02 2.6 0.1 23 35 607 619 597 620 0.87
9 25 0.039 1.7e+02 2.6 0.1 23 35 654 666 644 667 0.87
10 25 0.039 1.7e+02 2.6 0.1 23 35 701 713 691 714 0.87
11 25 0.039 1.7e+02 2.6 0.1 23 35 748 760 738 761 0.87
12 25 0.039 1.7e+02 2.6 0.1 23 35 795 807 785 808 0.87
13 25 0.039 1.7e+02 2.6 0.1 23 35 842 854 832 855 0.87
14 25 0.039 1.7e+02 2.6 0.1 23 35 889 901 879 902 0.87
15 25 0.039 1.7e+02 2.6 0.1 23 35 936 948 926 949 0.87
16 25 0.039 1.7e+02 2.6 0.1 23 35 1006 1018 996 1019 0.87
17 25 0.039 1.7e+02 2.6 0.1 23 35 1053 1065 1043 1066 0.87
18 25 0.039 1.7e+02 2.6 0.1 23 35 1100 1112 1090 1113 0.87
19 25 0.039 1.7e+02 2.6 0.1 23 35 1147 1159 1137 1160 0.87
20 25 0.039 1.7e+02 2.6 0.1 23 35 1194 1206 1184 1207 0.87
21 25 0.039 1.7e+02 2.6 0.1 23 35 1236 1248 1226 1249 0.87
22 25 0.039 1.7e+02 2.6 0.1 23 35 1283 1295 1273 1296 0.87
23 25 0.039 1.7e+02 2.6 0.1 23 35 1330 1342 1320 1343 0.87
24 25 0.039 1.7e+02 2.6 0.1 23 35 1377 1389 1367 1390 0.87
25 25 0.039 1.7e+02 2.6 0.1 23 35 1424 1436 1414 1437 0.87

Sequence Information

Coding Sequence
ATGAGAACATATACTCGTAAGCGGCCGCACTATGAGTTAATAGAAGGTGTCCATGATCTATGTAGACTGTGTCTGAACAAGGCTGGAGAAGGCTTGCCCATATTCACTGAAGACCCACACAATATTTGTGCTACATTAGCAATGAGGATCATGATATGTGTGGGACTGGAGGTGACAAAAGAAGAATGCTTACCAAATATAATCTGTGCGAAGTGCTTGACTGAACTCGATAAATATTATTCATTTAGAAAGCAATGTGAAGTAACATATCAGAAACTTAAATCTCATGTAATAGCCTTCAAAGAAAATTTATACAAAGAAAAGCAATTGAAAGAAGCAGAACAGAAAAAGAAGTTAGAAGAAGAAACCAAAAATGGAATGAAGTTTGTAGTTACATTTGAAAAGGATCAGTTTCCTGACCTCAATGTTTTGAATCTGAATGGTGTAGCACGCATTAACAATTTTGTAGACAAATTACCTGACCTTACAACAGTAGACAGCACAGAAATAGAAGTTAAAGTGATAGATGAAACTGACCAAAAACCACATGATGGAGACACATTTACCCCAGATGTAACAGCATTCTTGTCCACCATGCTCCTAGAATTGGGAATTATAGCCCAGCAGGAAGATGGGCTGGTGTACTCCAACCAAAACATTTCGTCGTTAGAGGTGGAAACTGGGGATGGCAGTCAGGTCACGTTTGAACTGGCGGAGGAAGATGATATAGAGGAACAGGAACAGGAAGAAGTAATTGAGGACCCTCAAAATATACCCAAAGAAGGAGATGAAACATCACAAGAAAACGGAGGCTGCAACATCAAATATGCTACTTCAAACTGGTCCAACAATTTAAATAAAAAGAGCCAGCCCAGCGAGGGCGCGTGGTGCGACGCGTGCGGCAAGCGCCTCGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGCCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTATACCCCACATAGCCTCGCGCGCCGCGCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCTTTCGCCTGCAACACCTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGAGGCATCAGCTGGTGCATGAAGGTACACCCCACATAGCCTCGCGCGCCACCTGCGCACGCACTCGGGCGAGCGGCCCTTCGCCTGCAACACTTGCGGCCGCCGCTTCGCGCAGAAGGAGGTCATGCTGA
Protein Sequence
MRTYTRKRPHYELIEGVHDLCRLCLNKAGEGLPIFTEDPHNICATLAMRIMICVGLEVTKEECLPNIICAKCLTELDKYYSFRKQCEVTYQKLKSHVIAFKENLYKEKQLKEAEQKKKLEEETKNGMKFVVTFEKDQFPDLNVLNLNGVARINNFVDKLPDLTTVDSTEIEVKVIDETDQKPHDGDTFTPDVTAFLSTMLLELGIIAQQEDGLVYSNQNISSLEVETGDGSQVTFELAEEDDIEEQEQEEVIEDPQNIPKEGDETSQENGGCNIKYATSNWSNNLNKKSQPSEGAWCDACGKRLASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRATCARTRASGPSPATPAAAASRRRSLARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGIPHIASRAALARHLRTHSGERPFACNTCGRRFAQKEVMLRHQLVHEGTPHIASRATCARTRASGPSPATLAAAASRRRRSC

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01192942;
90% Identity
iTF_01192942;
80% Identity
iTF_01192942;