Basic Information

Gene Symbol
-
Assembly
GCA_905340355.1
Location
HG996554.1:7346397-7348721[-]

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 19 4.3 2e+03 -0.1 0.0 21 46 70 95 52 101 0.69
2 19 0.57 2.6e+02 2.7 0.0 21 47 98 124 94 130 0.75
3 19 0.64 2.9e+02 2.5 0.1 21 51 126 156 122 159 0.82
4 19 2.7 1.2e+03 0.5 0.0 20 45 153 178 149 185 0.78
5 19 3.5 1.6e+03 0.2 0.1 26 45 213 232 211 235 0.86
6 19 2.6 1.2e+03 0.6 0.0 22 46 237 261 232 269 0.84
7 19 5.7 2.6e+03 -0.5 0.0 22 48 265 291 262 295 0.80
8 19 0.29 1.3e+02 3.6 0.1 21 46 292 317 285 324 0.80
9 19 0.022 10 7.2 0.0 21 46 348 373 338 376 0.90
10 19 0.062 28 5.8 0.1 22 46 377 401 374 407 0.86
11 19 0.056 25 5.9 0.1 21 46 404 429 400 435 0.82
12 19 0.15 68 4.5 0.0 22 47 433 458 428 464 0.82
13 19 2.3 1e+03 0.8 0.0 21 46 460 485 456 487 0.83
14 19 0.063 29 5.7 0.1 21 47 488 514 485 520 0.86
15 19 0.18 82 4.3 0.0 20 52 571 603 568 605 0.85
16 19 1.3 5.7e+02 1.6 0.0 22 46 601 625 598 633 0.73
17 19 1.4 6.4e+02 1.4 0.0 21 45 628 652 623 655 0.88
18 19 0.0095 4.3 8.4 0.1 21 44 656 679 652 686 0.87
19 19 0.0018 0.83 10.7 0.1 21 52 712 743 707 745 0.94

Sequence Information

Coding Sequence
ATGGAACGCTTAGTAATGGTAGCTTTGTTACAGGGTCCTATCTCGGATTCAAGTAACTACTCCAAAATTCAGACAAAAGACACAGAATTGAACTTGGAAGATTTGCATCATCATTCCATTAACATTCAAGAAGAGACTTTAGTCCAACGTCGGAAAGCAAAAACCAGTAGTAAATTGCCTAGCCGAAAAATTTCTAATTATAAATCAGGTAACAAACCATTTATGTGTGAAGTTTGTGACTATCAATCTGCACGAAAGAATAAATTAAAACAACATTTAAAAACTCATACAGGCGAGAAGCCGTTTAAGTGTGATATTTGTGATTACAAATGTGCACGAAAGGATACATTAAAAAAACATTTAAAAACTCATACAGGCGAGAAACCGTTTAAGTGTGCTATTTGTGTATATCAATGTGCAACAAAAGAAGCACTAAAACAACATTTAAAAACTCATACAAGCGAAAAACCGTTCAAGTGTGAAGTTTGTGGCCATCAATTTGCGCGCAAGAATACTTTACAATTTCATTTAAGAACTCATACAGGCGAAAAGCCGTTTAAGTGTGAAATATGTGACTTCAAATCTGCACGTAAAGATCAGTTAACCGATCATTTAAATACTCATACAGGCCTGTTTACTTGTGAAATTTGTGGTAACAAATATGCAAGAAGAAGTAATTTAAAAGTACATTTAAATACTCATACAGGCGAGAAGCTGTTTACTTGTGAAATTTGTGGTAACAAATTTGCAAGTAGAAGTAATTTAGAAGTTCATTTAAGAATTCATACGGGCGAGAGATCGTTTACGTGTCAAGTATGCGGTAGCAAATTTGCACTAAAAAGTACTTTAAACAAGCATTTAAAAATACATACAGGCGAAAAACCATTTAAATGTGAAATTTGTGATTACAAATGCATACAGAAGAACACTTTAAAGAATCATTTAAGAACTCATTCAGGTGACAAACCGTTTAAATGTGAAATGTGTGACTTCAAATGCATACTGAAGAACAGTTTAACGAAGCATTTAAGAACTCATTTGGGCGAGAAACCATTTACATGTGAAGTTTGTCATCGCAAATTTGCACAAAAATCTGATTTAAAAGATCATTTAACGATTCATACGGGCGATCAGCCGTTTGCGTGTAAAATTTGTGATAAGAAATTTAGATGTAAAAGAAGTTTACCGATTCATTTAAAAACCCATACAGGCGAAAAACCGTTTAAATGTGAAATTTGTGATTACAAATGCATACAGAAGATCAATTTAAAGAATCATTTAAGAACTCATTCAGGCGACAAACCGTTTAAATGTGAAATGTGTGACTTCAAATGCATACAGAAGAACAATTTAAAGAAGCATTTAAGAACTCATTCGGGCGAGAAACCATTTCCTTGTGAAGTTTGTCATCGCAAATTTGCACGAAAATCTGATTTAAAAGATCATTTAACGATTCATACGGGCGAAAAGCCGTTTGCATGTAAAATTTGTGATATGAAATTTAGATCTAGAAGAAAATTACCGATCCATTTAAAAACTCATACAGGCGAAAAACCGTACAAGTGTGAATTTTGTGACTATAAAGGTACTCATAAAGATTCTTTACAACTTCATTTAATAACTCACACGGGTAAAAATGAGTTTCAGTGTAAAAATTGTGACTTTAAATCTGAAGTTAAACAACTTTTAAATAATCATTTAAAAATTCATAAAAGTGAAAAATCGTTTACGTGTGGCATTTGTGACTTCGAATGTACACGTAAGCATGGTTTAAAAAATCATTTAAAAATTCATACAGGTGAAAAGCCGTTTCTGTGTGAAATTTGTGGCTACGGATGTGTAGTACTTCAAAATTTAAAAGATCATTTAAGAACCCATACCGGTGAAAAACCGTTTAACTGTGAAGTTTGCGGTTCCAAATTTACTTTAAAAATTTCTTTAAAAAGGCATTTAAAAACTCATACTGGCGAGAAACCAATAACGTGTGACATTTGTGATTACAAATGTGTAGATAAGCGACAATTACGACTACATTTAATAAAACATACAGGCGAGAAATTGTTTAAATGTGCAATTTGTGATCACCAATTTGCACGAAAAGCATCTTTAAAAGATCACTTAAAAATTCATACCGGCGAAAAGCCGTTTACTTGTAAAATTTGTGAAAGGAAATTCAGAACTTTAAGCATTTTAAGAGGTCATTTAAAAATTCATATGGGCGCCAAACCGTTTCAGTGTGCAATTTGTGGCTACAAATGTAGTCGTAAGTATCGATTAAAAAGTCATTTAATAACTCACACTGTCATGAAGGAGTTAAAGGGTTAA
Protein Sequence
MERLVMVALLQGPISDSSNYSKIQTKDTELNLEDLHHHSINIQEETLVQRRKAKTSSKLPSRKISNYKSGNKPFMCEVCDYQSARKNKLKQHLKTHTGEKPFKCDICDYKCARKDTLKKHLKTHTGEKPFKCAICVYQCATKEALKQHLKTHTSEKPFKCEVCGHQFARKNTLQFHLRTHTGEKPFKCEICDFKSARKDQLTDHLNTHTGLFTCEICGNKYARRSNLKVHLNTHTGEKLFTCEICGNKFASRSNLEVHLRIHTGERSFTCQVCGSKFALKSTLNKHLKIHTGEKPFKCEICDYKCIQKNTLKNHLRTHSGDKPFKCEMCDFKCILKNSLTKHLRTHLGEKPFTCEVCHRKFAQKSDLKDHLTIHTGDQPFACKICDKKFRCKRSLPIHLKTHTGEKPFKCEICDYKCIQKINLKNHLRTHSGDKPFKCEMCDFKCIQKNNLKKHLRTHSGEKPFPCEVCHRKFARKSDLKDHLTIHTGEKPFACKICDMKFRSRRKLPIHLKTHTGEKPYKCEFCDYKGTHKDSLQLHLITHTGKNEFQCKNCDFKSEVKQLLNNHLKIHKSEKSFTCGICDFECTRKHGLKNHLKIHTGEKPFLCEICGYGCVVLQNLKDHLRTHTGEKPFNCEVCGSKFTLKISLKRHLKTHTGEKPITCDICDYKCVDKRQLRLHLIKHTGEKLFKCAICDHQFARKASLKDHLKIHTGEKPFTCKICERKFRTLSILRGHLKIHMGAKPFQCAICGYKCSRKYRLKSHLITHTVMKELKG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01292006;
90% Identity
iTF_01292006;
80% Identity
iTF_01292006;