Basic Information

Gene Symbol
-
Assembly
GCA_963855885.1
Location
OY979654.1:1745122-1756950[-]
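
The Location value above appears to follow a `sequence:start-end[strand]` layout. A minimal Python sketch for splitting it into fields is given here; the `Location` type, the `parse_location` helper, and the reading of the coordinates as 1-based and inclusive are assumptions made for illustration and are not stated in the record.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Assumed layout: <sequence accession>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into accession, coordinates, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OY979654.1:1745122-1756950[-]")
# Genomic span, assuming 1-based inclusive coordinates.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` arithmetic needs to change.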

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion and binds DNA. GATA transcription factors contain two GATA zinc fingers; however, several proteins contain only a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 24 0.54 1.7e+03 -0.7 0.0 7 29 268 289 266 293 0.79
2 24 0.54 1.7e+03 -0.7 0.0 7 29 312 333 310 337 0.79
3 24 1.4 4.5e+03 -2.0 0.0 7 28 356 376 354 379 0.68
4 24 0.89 2.8e+03 -1.4 0.0 7 29 400 421 398 424 0.76
5 24 3.6 1.1e+04 -3.3 0.0 9 28 489 507 488 509 0.63
6 24 0.54 1.7e+03 -0.7 0.0 7 29 531 552 529 556 0.79
7 24 0.54 1.7e+03 -0.7 0.0 7 29 575 596 573 600 0.79
8 24 0.0049 15 5.9 0.0 20 32 622 633 618 636 0.85
9 24 0.062 1.9e+02 2.4 0.2 21 33 674 686 670 688 0.82
10 24 0.048 1.5e+02 2.7 0.0 19 31 711 723 706 727 0.82
11 24 0.048 1.5e+02 2.7 0.0 19 31 758 770 753 774 0.82
12 24 0.048 1.5e+02 2.7 0.0 19 31 805 817 800 821 0.82
13 24 0.023 73 3.7 0.0 19 31 851 863 847 867 0.79
14 24 0.048 1.5e+02 2.7 0.0 19 31 898 910 893 914 0.82
15 24 0.048 1.5e+02 2.7 0.0 19 31 945 957 940 961 0.82
16 24 0.048 1.5e+02 2.7 0.0 19 31 992 1004 987 1008 0.82
17 24 0.023 73 3.7 0.0 19 31 1038 1050 1034 1054 0.79
18 24 0.048 1.5e+02 2.7 0.0 19 31 1085 1097 1080 1101 0.82
19 24 0.048 1.5e+02 2.7 0.0 19 31 1132 1144 1127 1148 0.82
20 24 0.048 1.5e+02 2.7 0.0 19 31 1179 1191 1174 1195 0.82
21 24 0.048 1.5e+02 2.7 0.0 19 31 1226 1238 1221 1242 0.82
22 24 0.048 1.5e+02 2.7 0.0 19 31 1273 1285 1268 1289 0.82
23 24 0.048 1.5e+02 2.7 0.0 19 31 1320 1332 1315 1336 0.82
24 24 0.045 1.4e+02 2.8 0.0 19 31 1367 1379 1362 1381 0.83
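
Each row of the table above carries 13 whitespace-separated fields: domain number, total domain count, conditional E-value, independent E-value, bit score, bias, the HMM, alignment, and envelope coordinate pairs, and the alignment accuracy. A minimal Python sketch for reading such rows and picking the strongest hit is shown here; the `DomainHit` type and the `parse_domain_row` helper are hypothetical names, and the three sample rows are copied from the table.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain hmmscan row."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]),
        int(fields[1]),
        float(fields[2]),
        float(fields[3]),
        float(fields[4]),
        float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Three rows copied from the table (domains 5, 8, and 9).
rows = """
5 24 3.6 1.1e+04 -3.3 0.0 9 28 489 507 488 509 0.63
8 24 0.0049 15 5.9 0.0 20 32 622 633 618 636 0.85
9 24 0.062 1.9e+02 2.4 0.2 21 33 674 686 670 688 0.82
"""

hits = [parse_domain_row(line) for line in rows.strip().splitlines()]

# The hit with the smallest independent E-value is the strongest one.
best = min(hits, key=lambda hit: hit.i_evalue)
print(best.index, best.i_evalue, best.ali_from, best.ali_to)
```

Sorting on the independent E-value rather than the bit score is a design choice for this sketch; for these rows both criteria select the same domain.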

Sequence Information

Coding Sequence
ATGATTTGCCTAGACACAGAGAGTAAACTGTATCCGCTCAACAAATACAATTTGGACACAAAGTTTGAATATCTGACAGGATTTTCtCTTCACGATGTAGAGAATTTTCTGCCACAGTTTTGCATTGAGTGCGCTCAGAGGCTGACCACTTGTAGTAGCTTCAGAGAAAAGGCCCTCAGAGCGTATCACTTGTTGCTAGAAGTAGCTGAGAACAGTCAAGAGGTGGTTAAAAAAGAGAAGGTTTCTGCTAAAAAGGAGGAGAATGGTACAAAAGGGGGGGATGCATTGGCGCAGTTTAAAGTGACATTGCTGAGTTTTGAAGAACAGTTGGCTGAAATAGAGAAGAGACGGGAGAGCGCTAACTTCAAATATTCTAGATATAAATGCAACAAGTGCTTCAAAGGATTTAGTAGTGTTCCTACATATGAAAGTCATATGGAGAAGCATACTAATAAATTCGGTGAATTCGAGTGCGAAGTGTGCAAGGTCCACGTGAAGAGCGACTACCTGCTGCGGCACCACGTGCGCCACACGCACAGCGTGCGCTACACGTGCGGCAGCTGTCCCTTTGTCACCAACCAGAAATTGTCAGCGATACGTCATGAGGGTTGGCATGCGGGCAAAACATTCAAATGCCCGCACTGCGATGAAGAGTATAGTAAAAGAACGTCCTACTTGTCCCACCTAAGGGTCTCACATCCCACCGATGTCGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACGTGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGAGGGTCTCACATCCCACCGATGTCGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCCCACTGAAGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGCACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTTGACGACGCGCAGGTGTGCACTACACTCCTCCTGCGGGTCTCACAGCGCACGGACGCGGTGTGCGCGttgtgcgggttctccttcatcaaCGAGTGGGGTCTCAACATGCACATGAACCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGACCCTCAAACACCGCTTCGACGACGCGCAGGTGAGCACTACACTCCTCCTGCGGGTCTCACAGCGCACTGACGCGGTGTGCACGttgtgcgggttctccttcatcaaCGAGAGGGGTCTCAACATGCACATGACCCTCAAACACCGCTTCGACGACGCGCAGAGCGCGGCGGGTCCGCTGTGCGCGCCGTGCGGCATCCGCTTCGCGTCGCAGACCGCCTACGCGCAGCACCTCGAGGTGTCGCCCAAACACACGTCAGCCGATAAATTGAAAGTGAACGCTCCGAAGAGACCTCGCAAAAACCGCCTCAA
ACCTTTGGACTGTGAGACGCTGGAGTGTGAACAATGCGGGGTTCAAGTGAGGAGTTACAAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACACACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAAATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAAATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGAGCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGGTGA
GCCGAGCCGCAGGCGGGTCATGTCACGAGATGTACAGCACGCACTTCAACAGATTCCACCCGGACAAGACTCGGACCCAGTACCCGGCGCACTCGCCGCAGCGCTTCCtgtgcgagcagtgtggacgagTCTTTAAGaatcaatgGGGGCTGAGAGACCACGTGCTGGTGAAACATTCCGGCGTAAAGGAATTCGTCTGCGACACTTGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGGTGAGTGAACGTAACAGAGTTCGTCTGCGACACGCGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGGTGAGTGAACGTAACAGAGTTCGTCTGCGACACGCGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATGTACTGAGTTCGTCTGCGACACGCGCAACAAGTCGTTCTGCCTCAAGGCGAGCCTCACAGCGCACATGAAGACGCACAGCGACTCGCAGCCGACGCACGCGTGCCCGATATGCGGCAAGCACTTCACCAGCAAGGCCAACACCAACAGGCATCTACTATCAACAACATCTTTCGAAGACAGCTACAGTGTTACGCACAGAGAGTCGCGCCCCTTCAAGTGCCACGCGTGCGAGAAGACATTCGTGAACGGCTCGTCGCGGCGCTACCACGAGCTGCACGCGCACCTCAAGCAGCCGTGGCCCAAGAAGAACCGCGGCCCGCGCCAGAGGGCCAGCCGCGCGCGCCACACCAAGGAGGCCGTGTACACCATGTGGCCCAAGGTGAGAGTCGAGAAGACATTCGTGAACGGCTCGTCGCGGCGCTACCACGAGCTGCACGCGCACCTCAAGCAGCCGTGGCCCAAGAAGAACCGCGGCCCGCGCCAGAGGGCCAGCCGCGCGCGCCACACCAAGGAGGCCGTGTAG
Protein Sequence
MICLDTESKLYPLNKYNLDTKFEYLTGFSLHDVENFLPQFCIECAQRLTTCSSFREKALRAYHLLLEVAENSQEVVKKEKVSAKKEENGTKGGDALAQFKVTLLSFEEQLAEIEKRRESANFKYSRYKCNKCFKGFSSVPTYESHMEKHTNKFGEFECEVCKVHVKSDYLLRHHVRHTHSVRYTCGSCPFVTNQKLSAIRHEGWHAGKTFKCPHCDEEYSKRTSYLSHLRVSHPTDVVCTLCGFSFINERGLNMHMNLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHVNLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHMNLKHRFDDAQVSTTLLLRVSHPTDVVCTLCGFSFINERGLNMHMNLKHRFDDAQVSTTLLLRVSQPTEAVCTLCGFSFINERGLNMHMHLKHRFDDAQVSTTLLLRVSQRTDAVWLCGFSFINERGLNMHMNLKHRFDDAQVCTTLLLRVSQRTDAVCALCGFSFINEWGLNMHMNLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHMTLKHRFDDAQVSTTLLLRVSQRTDAVCTLCGFSFINERGLNMHMTLKHRFDDAQSAAGPLCAPCGIRFASQTAYAQHLEVSPKHTSADKLKVNAPKRPRKNRLKPLDCETLECEQCGVQVRSYKMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKVSRAAGGSCHEMYSTHFNRFHPDKTRTQYPAHSPQRFLCEQCGRVFKNQWGLRDHVLVKHSGVKEFVCDTCNKSFCLKASLTAHMKTHSDSQPTHACPICGKHFTSKANTNRHVLVSERNRVRLRHAQQVVLPQGEPHSAHEDAQRLAADARVPDMRQALHQQGQHQQACTAHMKTHSDSQPTHACPICGKHFTSKANTNRHVLVSERNRVRLRHAQQVVLPQGEPHSAHEDAQRLAADARVPDMRQALHQQGQHQQACTEFVCDTRNKSFCLKASLTAHMKTHSDSQPTHACPICGKHFTSKANTNRHLLSTTSFEDSYSVTHRESRPFKCHACEKTFVNGSSRRYHELHAHLKQPWPKKNRGPRQRASRARHTKEAVYTMWPKVRVEKTFVNGSSRRYHELHAHLKQPWPKKNRGPRQRASRARHTKEAV

Similar Transcription Factors

Clusters of similar sequences, computed with MMseqs2 at each identity threshold

100% Identity
iTF_00354354;
90% Identity
iTF_00354354;
80% Identity
iTF_00354354;