Basic Information

Gene Symbol
-
Assembly
GCA_963170105.1
Location
OY720628.1:37727284-37735701[+]
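
The Location value packs contig, start, end, and strand into one string. A minimal sketch of splitting it apart is shown here; the names `Locus`, `parse_locus`, and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# contig:start-end[strand], e.g. OY720628.1:37727284-37735701[+]
_LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a 'contig:start-end[strand]' string into its parts."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(locus: Locus) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("OY720628.1:37727284-37735701[+]")
print(locus, span_length(locus))
```
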

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
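
To make the 5'-GAGAG-3' consensus concrete, the sketch below lists every occurrence of the motif on both strands of a DNA string. It is an exact-match scan for illustration only, not a model of binding affinity; the function names and the toy input are hypothetical.

```python
# Exact-match scan for a consensus site on both strands (illustrative only).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase or lowercase A/C/G/T string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_consensus_sites(seq: str, motif: str = "GAGAG") -> list[tuple[int, str]]:
    """Return sorted (0-based start, strand) pairs, including overlapping hits.

    '-' hits are positions where the reverse complement of the motif
    appears on the given (forward) strand.
    """
    seq = seq.upper()
    motif = motif.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        pos = seq.find(pattern)
        while pos != -1:
            hits.append((pos, strand))
            pos = seq.find(pattern, pos + 1)  # step by 1 to keep overlaps
    return sorted(hits)


# Toy sequence, not taken from this record.
print(find_consensus_sites("TTGAGAGAACTCTCTT"))
```

Stepping the search by one position keeps overlapping sites such as those in a GAGAGAG run, which a non-overlapping search would merge into a single hit.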
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
1   24       1.3   1.9e+03   -0.1   0.1        23      46        94     117        91     124  0.71
2   24     0.078   1.1e+02    3.8   0.2        21      47       120     146       116     153  0.75
3   24      0.83   1.2e+03    0.5   0.0        21      52       148     179       145     181  0.84
4   24      0.19   2.7e+02    2.5   0.2        21      46       176     201       168     209  0.76
5   24     0.083   1.2e+02    3.7   0.0        21      47       344     370       333     382  0.84
6   24     0.014        20    6.1   0.2        21      48       400     427       396     433  0.86
7   24      0.49   6.9e+02    1.3   0.1        21      45       456     480       448     489  0.73
8   24       1.6   2.2e+03   -0.4   0.0        21      35       540     555       529     559  0.78
9   24       4.2   5.9e+03   -1.7   0.1        21      46       608     631       606     635  0.62
10  24   5.3e-05     0.075   13.9   0.1        21      44       636     659       628     665  0.89
11  24       0.1   1.5e+02    3.4   0.0        21      47       692     718       681     725  0.80
12  24       5.2   7.4e+03   -2.0   0.0        24      44       723     743       719     746  0.86
13  24     0.077   1.1e+02    3.8   0.0        21      46       748     773       744     784  0.82
14  24     0.047        67    4.5   0.1        21      46       855     880       847     886  0.84
15  24     0.046        64    4.5   0.4        22      44       884     906       879     910  0.89
16  24     0.001       1.4    9.8   0.0        21      51       911     941       907     942  0.85
17  24      0.43     6e+02    1.4   0.0        21      47       939     965       936     972  0.86
18  24         3   4.2e+03   -1.3   0.0        21      47       967     993       963    1000  0.71
19  24      0.62   8.8e+02    0.9   0.0        21      30      1023    1032      1015    1047  0.84
20  24     0.046        65    4.5   0.1        21      45      1079    1103      1066    1110  0.83
21  24      0.15   2.2e+02    2.8   0.1        22      44      1108    1130      1104    1134  0.89
22  24    0.0027       3.8    8.5   0.1        21      52      1135    1166      1131    1167  0.81
23  24     0.093   1.3e+02    3.6   0.1        21      47      1163    1189      1159    1196  0.77
24  24     0.094   1.3e+02    3.5   0.1        22      45      1192    1215      1187    1220  0.88
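
Each row of the domain table above has 13 whitespace-separated fields. The sketch below parses such rows into named records and keeps the hits under an independent E-value cutoff. The field names, the `DomainHit` type, and the cutoff of 10 are assumptions chosen for illustration; three rows from the table serve as sample input.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int        # domain number ("#")
    total: int        # number of domains reported ("of")
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]),
        int(fields[1]),
        *(float(x) for x in fields[2:6]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


def hits_below(hits: list[DomainHit], max_i_evalue: float) -> list[DomainHit]:
    """Keep hits whose independent E-value is below the cutoff."""
    return [h for h in hits if h.i_evalue < max_i_evalue]


# Rows 6, 10 and 16 copied from the table in this record.
sample_rows = [
    "6 24 0.014 20 6.1 0.2 21 48 400 427 396 433 0.86",
    "10 24 5.3e-05 0.075 13.9 0.1 21 44 636 659 628 665 0.89",
    "16 24 0.001 1.4 9.8 0.0 21 51 911 941 907 942 0.85",
]
hits = [parse_domain_row(row) for row in sample_rows]
# The cutoff of 10 is an arbitrary illustrative value, not a recommendation.
for hit in hits_below(hits, max_i_evalue=10.0):
    print(hit.index, hit.i_evalue, hit.ali_from, hit.ali_to)
```
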

Sequence Information

Coding Sequence
ATGAAACCTTTTGCTGATGTGCGTTTATTCCAGGAAACACCAGAACATTCCGATGGCTGCTTTCAAATTAAGACGGAGGTGGAAGAAGAAACGGGATACGAGTTGGGAGATTTGCATCATTCCATCGATATAAAAGAAGAGACCAATATCGAACCACAGCCGGGTAGAGAATGCAAACCGGAAACCAGCGTGAAACCGTCCATGTGCGAAGGCTGGGACTACGCGTATGCTCCCGAAGGAGATTTAAAAACTGACGTTGGAACTCCCGAAGGTAACAGGAAACCTTTTTTGTGTaacatttgtgattataaatgtgcACGTAAAGGCCAATTGACGAGACATTTAACCACTCATACAGACGAAAAACCATTTGCTTGCGAATTTTGTGTTTACAAATGTTCACGTAAAGAACTTTTACGGAGGCATTTGAAGACCCACACAGGCGAGAAGCCGTTCTCGTGCGAAACCTGTGGTCGCAAGTTCACGCAGAGCGCTCATTTTAAAGTTcatttaagaactcatacGGGCGAAAAGCCGTTCGTGTGCGAAATTTGTGGATACAAGTGCATACAGAATGCAAAGCTGAAAATTCACTTGAGAACCCATACGGGGGAAAAACCGTATTCGTGTGAATTTTGTGCGTATAAATGTACGACGAAGGGAGTTTTGACTACGCATTTAAGAACccataccggcgagaaaccgtaTTCGTGCGAATTCTGCGAGTATAAATGTGCGCACAAAGTAAGTTTGCAGGTTCATTTGAGAACTCACACCGGAGAGAAACCATTCATGTGTGAATTTTGCGATTACAAATGTGTGCAGAAGTCGCTGTTGAATCTTCATGTGAGAACTCAcaccggggaaaaaccgttcCTCTGTGAATtctgcgattataaatgtgcaCGTAAGGAAGGATTGAAAAGTCATTTGACCATCCATACCGGCGAGAAAACTTTCGAATGCGAATTTTGCGACTATAAGAGTGCCAAGAAGGAACGGTTGAAAATTCATCTACGAACTCATTCCGGGGAAAAACCGTATAAGTGTGATACTTGCGGTTTGACCTTTACGCAGAATGGAAACTTTAAGAAGCATTTGAGAACGCAcaccggggaaaaaccgttcATGTGCGATATCTGTGGATATAAATGCGGATGTAAAGCTCAGTTGTCGGGTCACTTGACAACCCATacgggggaaaaaccgttcTCGTGTGAAATGTGCAATCATAAATTTGCTCGTAAGCAACACTTGCGAAGGCATTTGAAAACCCACACCGGGGAGAAGCCGTTTTCTTGTGGATTTTGCGAATATAAATGTGCGACGAAGGGAGTTTTACGAAGTCATTTAAGAACGCACACCGGCGAAAAGCCGTTCGTctgtgaaatttgtggttaCAAATGCATACAGAAGGGAATGTTAAAAGTCCACTTGAGAACTCATACGGGGGAAAAACCGTACTCGTGTGAACTCTGCGACTATAAATGTACGATCAAGGGAGTTTTGAAAACTcatttaagaactcatacGGGGGAAAAACCGTATTCGTGTGAATTCTGCGACTATAAATGCGGACATAAGGGAAGCTTCAAGATTCATATCAGAACTCATACCGGAGAGAAGCCCCATACGTGTGAATTTTGCGGTTACAAATGCATACAAAAAGCGGAAACTCCGTGCCATTCCAACGATTTCGCACAAATTAAGACGGAAGAAGATGTGGAGTCTCAAATGGCAGGTCTGCATCATTCCATCGACATTAAAGAGGAGACTTCGATTATGGAATTTAAAGTAGATGTGGAGTGCAAACCTGAAATTAGCGAAAAACTGATTGCATGTGGAATTTGCGACTATAAATGTTCGCGGAAGGAACGGATGAAAATTCATTTGAgaactcatactggcgagaagccATTtacgtgtgaaatttgtggcTACACATGCGCCCAAAAGCAGAATTTAAAGAGGCATTTGTTGACTCATACGGGCAACAAGAG
ATTTAAGTGTGAATTATGCGACTGCAAGTacgaatttattggaaatttaaagGTTCACTTGAGAACTCACAGCGGCGAAAAGCCATttatgtgtaaaatttgcggTTACAAATGCACACAGAAAGGAAGTTTGAAGACTCATTTGAAAACTCATACCGGTTTAAAGCCGTTTTTCTGTGGAATGTGCGATTACACCGGCGCACAAAAGCAAAGCTTGCAGCGACACATAATGACACATACGGGGGAAAAACCTTTTACGTGtgaaatttgcgattataaatgcgCGAATTCGggagttttgaaaattcactTAAGAACTCACACTGGCGAGAAGCCGTTCGAAACGCCAGGCTACTCCCAAATTAAGACGGAAGAAATAGAATTCAAGTTGGAAGATCTGCACCATGCGATTGATATTAAAGAGGAGCAGACCTCGCTGCTGGCAACCCAACCGGAGAGAGATTGCAAACCGCAAACGAGCAGCAAAGCGTTTTCGTGTAAGATTTGTAGTTACAAATGTAATCGGAAAGGAATTTTCGAAACTCATTTACGCACTCACAGTGGAGAGAAACCATTTATGTGTGAATTCTGTAATTATACATGCGCGCAGAAACCGAGTCTAAAAAGACATTTAACGACTCACACAGGCGACAAACCTTTCACCTGCGAAATATGTGCTTACCGGTGTGCGCAGAGCGGCGATTTAAAATGTCACATTAGGACCCATACCGGTGAGAAACCATATACCTGTGAAATTTGTAACTATAAATTTGCAGATCGGAGCAATTTGAGAGGtcatttgaaaattcacaCTGGCGAAAAGCCATTTAAATGCGGATTGTGCAATTTTCAGTGCGCCGATGGCAGCAATTTAAAGAGTCACTTAAAAACTCATacgggggaaaaaccgttctcttgtgaattttgtgactataaatgtACACGTAAAGGAGTTTTGAGAGTTCATTTAAGAACTCACACAGGCGAGAAGCCGTTTATTTGTGAATTCTGCGATTTTAGAAGTGCGCACAAGCAAAGCTATAAACTGCATTTCCGGACTCACACTGGCGAAAAACCGTTCAcctgtgaaatttgtgattatGAATGTTCGCATAAAGGTAGTTTGTTGATTCACTTAAAAACCCATACTGGTGAAAAGTTGTTCGAGTGCGAGAGTTGCGATTACAAAAGTGCGCATAAAGGAAGTTTGGAGACTCACGTAAgaactcatactggcgagaaaccataTATGTGTGAGTTTTGCGACTACGCATGCGCGCAGAAGACGAGCTTGAAGAGACATTTGATGACTCACAGCGGCGACAAACCGTTCACTTGCGAATTTTGCGACTACACATGTGCGCAGAAGCCCAGTTTAAAAAGACACATGATGACTCATACCGGCGAGAAGCCATTTACGTGCGATATTTGTGACTACAAATGCGCACGCAAGGAAGTTTTGAAGAGACACCTAAGAATCCACAACGCCGAGAAGCCGTTCACGTGTGAAATTTGCGGCTACAAATGTACGGAAAATGGGAATTTTACAAGTCATTTAAGAACTCATGCTGGTGACAAACCGTTTGTTTGTGAAGTTTGTGAATTTAAATGTGCtcgtaaagaaaatttaaaaagacatCTGAAAAATCACTCGGGCGAGAAACAGAGTACCATTGAAACCGGTGGTTAA
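
The coding sequence above mixes upper- and lowercase letters and is wrapped across lines, so any translation step needs to normalise it first. The sketch below does that and translates with the standard genetic code (NCBI translation table 1). Using table 1 for this record is an assumption, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate_cds(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0.

    Whitespace is removed and case is ignored. Codons containing
    characters other than A/C/G/T are translated as 'X'. A trailing
    partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the coding sequence above.
print(translate_cds("ATGAAACCTTTTGCTGATGTGCGT"))
print(translate_cds("GAAACCGGTGGTTAA"))
```

Applied to the full coding sequence, the result can be compared with the Protein Sequence field as a consistency check.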
Protein Sequence
MKPFADVRLFQETPEHSDGCFQIKTEVEEETGYELGDLHHSIDIKEETNIEPQPGRECKPETSVKPSMCEGWDYAYAPEGDLKTDVGTPEGNRKPFLCNICDYKCARKGQLTRHLTTHTDEKPFACEFCVYKCSRKELLRRHLKTHTGEKPFSCETCGRKFTQSAHFKVHLRTHTGEKPFVCEICGYKCIQNAKLKIHLRTHTGEKPYSCEFCAYKCTTKGVLTTHLRTHTGEKPYSCEFCEYKCAHKVSLQVHLRTHTGEKPFMCEFCDYKCVQKSLLNLHVRTHTGEKPFLCEFCDYKCARKEGLKSHLTIHTGEKTFECEFCDYKSAKKERLKIHLRTHSGEKPYKCDTCGLTFTQNGNFKKHLRTHTGEKPFMCDICGYKCGCKAQLSGHLTTHTGEKPFSCEMCNHKFARKQHLRRHLKTHTGEKPFSCGFCEYKCATKGVLRSHLRTHTGEKPFVCEICGYKCIQKGMLKVHLRTHTGEKPYSCELCDYKCTIKGVLKTHLRTHTGEKPYSCEFCDYKCGHKGSFKIHIRTHTGEKPHTCEFCGYKCIQKAETPCHSNDFAQIKTEEDVESQMAGLHHSIDIKEETSIMEFKVDVECKPEISEKLIACGICDYKCSRKERMKIHLRTHTGEKPFTCEICGYTCAQKQNLKRHLLTHTGNKRFKCELCDCKYEFIGNLKVHLRTHSGEKPFMCKICGYKCTQKGSLKTHLKTHTGLKPFFCGMCDYTGAQKQSLQRHIMTHTGEKPFTCEICDYKCANSGVLKIHLRTHTGEKPFETPGYSQIKTEEIEFKLEDLHHAIDIKEEQTSLLATQPERDCKPQTSSKAFSCKICSYKCNRKGIFETHLRTHSGEKPFMCEFCNYTCAQKPSLKRHLTTHTGDKPFTCEICAYRCAQSGDLKCHIRTHTGEKPYTCEICNYKFADRSNLRGHLKIHTGEKPFKCGLCNFQCADGSNLKSHLKTHTGEKPFSCEFCDYKCTRKGVLRVHLRTHTGEKPFICEFCDFRSAHKQSYKLHFRTHTGEKPFTCEICDYECSHKGSLLIHLKTHTGEKLFECESCDYKSAHKGSLETHVRTHTGEKPYMCEFCDYACAQKTSLKRHLMTHSGDKPFTCEFCDYTCAQKPSLKRHMMTHTGEKPFTCDICDYKCARKEVLKRHLRIHNAEKPFTCEICGYKCTENGNFTSHLRTHAGDKPFVCEVCEFKCARKENLKRHLKNHSGEKQSTIETGG
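
The family description mentions a Cys2-His2 zinc finger core, and the protein above contains many repeats with that Cys/His spacing. As a rough sketch, the commonly used simplified pattern C-x(2,4)-C-x(12)-H-x(3,5)-H can be applied as a regular expression. This pattern is an assumption for illustration; it is not the Pfam HMM used for the hmmscan results and will not reproduce them exactly.

```python
import re

# Simplified classical C2H2 spacing: C-x(2,4)-C-x(12)-H-x(3,5)-H.
# An approximation for illustration, not the profile HMM from Pfam.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2_like(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for non-overlapping matches."""
    protein = "".join(protein.split()).upper()
    return [(m.start(), m.group()) for m in C2H2_PATTERN.finditer(protein)]


# Fragment copied from the protein sequence in this record.
fragment = "KPFLCNICDYKCARKGQLTRHLTTHTDEKPFACEFCVYKCSRKELLRRHLKTHTGEKP"
for start, text in find_c2h2_like(fragment):
    print(start, text)
```
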

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
iTF_00270106;
90% Identity
iTF_00270106;
80% Identity
iTF_00270106;