Basic Information

Gene Symbol
-
Assembly
GCA_029228625.1
Location
WJQU01004293.1:1-10431[+]

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 13 0.008 32 5.8 0.0 23 32 20 29 14 33 0.86
2 13 0.0045 18 6.6 0.1 23 34 70 81 64 83 0.83
3 13 0.0055 22 6.3 0.0 23 33 120 130 114 133 0.84
4 13 7.3 3e+04 -3.7 0.1 23 31 170 178 170 179 0.64
5 13 0.0036 15 6.9 0.0 23 33 220 230 214 233 0.86
6 13 0.01 41 5.5 0.0 23 32 270 279 264 280 0.87
7 13 0.0065 26 6.1 0.1 23 32 320 329 314 332 0.86
8 13 0.0013 5.5 8.3 0.0 23 33 436 446 430 449 0.84
9 13 0.0045 18 6.6 0.1 23 34 486 497 480 499 0.83
10 13 0.0055 22 6.3 0.0 23 33 536 546 530 549 0.84
11 13 7.3 3e+04 -3.7 0.1 23 31 586 594 586 595 0.64
12 13 0.011 45 5.4 0.0 23 31 636 644 630 646 0.88
13 13 0.0065 26 6.1 0.1 23 32 686 695 680 698 0.86

Sequence Information

Coding Sequence
AGACTTGCCGCCAGTCATCTTGTTTCCACTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGACCACAACAATTGGCGAGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACAAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATTACACCATTTGGCGAATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCACTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATCACAACAATTGGCGAGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACATGCGGCCAATTATTTTGTTTCCTGTTTACTTATAGACGAAGAGCCTTTCAAATGTGACGACTGCCAAATGACTTTCAAACGATCGGACTATTTGGCGATTCATAAGCGCACCCATTCTGGTAAGAAGTTAATTTTACAATTGCAGAGACTTGCCTCCAGTCATCTTATTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAATGATTGCGGAATGACATTCAAACTATCACACGCTTTGGCGAATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAAAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGATCAGTCTATTTGGCGAGTCATAAGCTCACCCATTCTGGTAAGAAGTTAATTTTACAACAGCAGAGATTTGTCGCCTGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGATCACGCTATTTGGCGGATCATAAGCTCACCCATTCTGTCAAGGACGAGGAAAAAGCAGAACCTTGTGTTCCCGGAGCAGCGGTAAACCTTACAGTATCAATGATAGTCTTCGAGGACGGCCTACTTCAAAACGTAGAAGTCAAAATTAAATCAGAAGAACCTTTAGATATTTCATTCACAGAGGAAAATTTACTCACCGAGGACATTTCACTCAACGAGGACAATTCACTCCGGTGTACAATCTGCCAAAAGTCATTTCATAACAAACGCAATTTGAGAAGGCATAAATTAACCCATTTAGACGAAAAGCCTTTCAAATGTAACGACTGCGGATTGACATTTAAACGACCACATCAATTGCGGAGTCATAAGCTCGCCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACAAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATTACACCATTTGACGAATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATCACAACAATTGGCTTGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACTTGCGGCCAATTATTTTGTTTCCTGTTTACTTATAGATGAAGAGCCTTTCAAATGTGACGACTGCCAAATGACTTTCAAACGATCGGACTATTTGGCGTATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAAGGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGACCAGTCTATTTGGCGAGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAGAGACTTGTCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGATCACGCTATTTGGCGGATCATAAGCTCACCCATTCTG
Protein Sequence
RLAASHLVSTLLIDEKPFKCNDCGMTFKRPQQLASHKLTHSGKELILQLQRLAASHLVSSLLIDKKPFKCNDCGMTFKRLHHLANHKLTHSGKELILQLQRLAASHLVSTLLIDEKPFKCNDCGMTFKRSQQLASHKLTHSGKELILQLQRHAANYFVSCLLIDEEPFKCDDCQMTFKRSDYLAIHKRTHSGKKLILQLQRLASSHLISSLLIDEKPFKCNDCGMTFKLSHALANHKLTHSGKELILQLQRLAASHLVSSLLIDEKPFKCNDCGMTFKRSVYLASHKLTHSGKKLILQQQRFVACHLVSSLLIDEKPFKCNDCGMTFKRSRYLADHKLTHSVKDEEKAEPCVPGAAVNLTVSMIVFEDGLLQNVEVKIKSEEPLDISFTEENLLTEDISLNEDNSLRCTICQKSFHNKRNLRRHKLTHLDEKPFKCNDCGLTFKRPHQLRSHKLAHSGKELILQLQRLAASHLVSSLLIDKKPFKCNDCGMTFKRLHHLTNHKLTHSGKELILQLQRLAASHLVSSLLIDEKPFKCNDCGMTFKRSQQLACHKLTHSGKELILQLQRLAANYFVSCLLIDEEPFKCDDCQMTFKRSDYLAYHKLTHSGKELILQLQGLAASHLVSSLLIDEKPFKCNDCGMTFKRPVYLASHKLTHSGKELILQLQRLVASHLVSSLLIDEKPFKCNDCGMTFKRSRYLADHKLTHS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01266155;
90% Identity
iTF_01266155;
80% Identity
iTF_01266155;