Basic Information

Gene Symbol
-
Assembly
GCA_947049265.1
Location
CAMRIQ010000220.1:1-21854[+]
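
The location string above packs contig, coordinates, and strand into one field. A minimal sketch of splitting it apart follows; the `Location` type and `parse_location` function are hypothetical names introduced here, and treating the coordinates as 1-based and inclusive is an assumption that the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the record).
        return self.end - self.start + 1


# contig:start-end[strand], e.g. CAMRIQ010000220.1:1-21854[+]
_LOCATION_RE = re.compile(
    r"^(?P<contig>\S+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into its four parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CAMRIQ010000220.1:1-21854[+]")
print(loc.contig, loc.start, loc.end, loc.strand, loc.length)
```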

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind the DNA consensus sequence 5'-GAGAG-3'. They contain a Cys2-His2 zinc finger core and an N-terminal extension with two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three bases (GAG) of the consensus, in a manner similar to other classical zinc finger-DNA complexes. The second basic region forms a helix that lies in the major groove and recognises the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A at the fourth position of the consensus [1].
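
To make the consensus concrete, the sketch below scans a DNA string for exact 5'-GAGAG-3' matches on both strands. It is only an illustration: exact string matching is a simplification of real binding-site recognition, the function names are hypothetical, and the example sequence is made up rather than taken from this record.

```python
from __future__ import annotations


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    complement = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(complement)[::-1]


def find_motif(seq: str, motif: str = "GAGAG") -> list[tuple[int, str]]:
    """Return (0-based start, strand) for every exact match, overlaps included.

    A '-' hit means the motif occurs on the reverse strand, i.e. its
    reverse complement was found in the given sequence.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)  # step by 1 to keep overlaps
    return sorted(hits)


# Made-up example sequence, not from the record.
example = "TTGAGAGAGCCCTCTCAA"
print(find_motif(example))  # [(2, '+'), (4, '+'), (11, '-')]
```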
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 21 0.0098 98 4.3 0.1 17 31 127 141 121 151 0.81
2 21 0.0099 99 4.3 0.1 17 31 236 250 231 260 0.81
3 21 0.0099 99 4.3 0.1 17 31 345 359 340 369 0.81
4 21 0.0099 99 4.3 0.1 17 31 454 468 449 478 0.81
5 21 0.0099 99 4.3 0.1 17 31 563 577 558 587 0.81
6 21 0.0099 99 4.3 0.1 17 31 672 686 667 696 0.81
7 21 0.0099 99 4.3 0.1 17 31 781 795 776 805 0.81
8 21 0.0099 99 4.3 0.1 17 31 890 904 885 914 0.81
9 21 0.0099 99 4.3 0.1 17 31 999 1013 994 1023 0.81
10 21 0.0099 99 4.3 0.1 17 31 1108 1122 1103 1132 0.81
11 21 0.0099 99 4.3 0.1 17 31 1217 1231 1212 1241 0.81
12 21 0.0099 99 4.3 0.1 17 31 1326 1340 1321 1350 0.81
13 21 0.0099 99 4.3 0.1 17 31 1435 1449 1430 1459 0.81
14 21 0.0099 99 4.3 0.1 17 31 1544 1558 1539 1568 0.81
15 21 0.0099 99 4.3 0.1 17 31 1653 1667 1648 1677 0.81
16 21 0.0099 99 4.3 0.1 17 31 1762 1776 1757 1786 0.81
17 21 0.0099 99 4.3 0.1 17 31 1871 1885 1866 1895 0.81
18 21 0.0099 99 4.3 0.1 17 31 1980 1994 1975 2004 0.81
19 21 0.0099 99 4.3 0.1 17 31 2089 2103 2084 2113 0.81
20 21 0.014 1.4e+02 3.9 0.3 17 31 2198 2212 2193 2234 0.80
21 21 0.0065 65 4.9 0.3 27 44 2298 2315 2295 2319 0.92
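
Each row above has 13 whitespace-separated fields. The sketch below parses a few of them into named fields, then reports the spacing between consecutive alignment starts and the hit with the lowest i-Evalue. The `DomainHit` type and `parse_hit` function are hypothetical names, and the field labels follow the column header as interpreted here (start/end pairs for the HMM, alignment, and envelope coordinates).

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# A subset of the rows listed in this record.
rows = """\
1 21 0.0098 98 4.3 0.1 17 31 127 141 121 151 0.81
2 21 0.0099 99 4.3 0.1 17 31 236 250 231 260 0.81
3 21 0.0099 99 4.3 0.1 17 31 345 359 340 369 0.81
20 21 0.014 1.4e+02 3.9 0.3 17 31 2198 2212 2193 2234 0.80
21 21 0.0065 65 4.9 0.3 27 44 2298 2315 2295 2319 0.92
"""

hits = [parse_hit(line) for line in rows.splitlines()]

# Distance between alignment starts, only for hits that are adjacent in the table.
spacing = [
    b.ali_from - a.ali_from
    for a, b in zip(hits, hits[1:])
    if b.index == a.index + 1
]
best = min(hits, key=lambda h: h.i_evalue)

print("spacing between adjacent hits:", spacing)
print("lowest i-Evalue: hit", best.index, "with", best.i_evalue)
```

Run on the full table, the same spacing calculation gives a quick check of whether the hits recur at a regular interval along the protein.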

Sequence Information

Coding Sequence
GACACGAGAACGGGGAAGGTCCAGCAACGGAACTATCGCAAGCTGAAGAACCTGCCGGCAGACTTGGTGGAGCTGTACACCATGACGGAGGAAGAAATGTGGGAGGTGCGGCAGCAGGACGTGGAGAGCGCCGAGTTCTGCGCGCTCAAGTACAAGTGTAGCGACTGCATCATCGGGTTCAACTCGGAGCGACTCATGCACGACCATATGCAAGGGAAACACGCGCCGAAAAGTCCCGACTGCCACCAATGCGACGTGTGCAAGGCTTACTTCCTGACGCGCGACAACGTGTCCTGCCACCGCGCGCTGCACCTCACAGCGTACCGCTGCAccgcgtgcggcgcgcgcgcaggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccg
catgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCG
CCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAACCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAATTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGG
GCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGGTCAGGGGCATACTGGAGAGCAGTggggcctcaagcgccgcatgcTGTCCCACTCGCACACGCACCGCGACCAGCCCGCCGCGTGCCCCACCTGCGGGGAGCAGTTCAGCACCAAGTCCAAGCTGTCGTACCACCGCGGCGTCTGCAACCAGGCGCGGCCGCAGTGCGACTGCTGCGGGAAGGTGTTCGCCAACAAGATGACGCTCAAGTACCATCTCAAAATACTGCCTCAGAATAAAGACGAAAAGCCGAAGGAGAAGCTCTATATAACCTGCAAGGGCTGCAACAAGGTGTTCCACTCCAAGAAGAGCTATCGAGCACACGTGGTGATTCACGATGGACTGACCTACCCTTGTCCTATTTGCGGGAAGCTGTTCCAGTGGAAACGGAACCTGGCCCGCCACACGAGGAACCACAGGGATCGCGACGCGGGCGCCACGCACGAGTGCCGCGACTGCCGCAAAACGTTCAGCAGCCGCGACTGCTACAACAACCACATGAAACTCAGCAAGCGACACGTGCAGGAAGACGCCTATGTGCACGAGTGCTCTTACTGTGGCAAGAAATTTGCGACCAAGTGGTGCATGGTCGACCACATAGATTGGGACCATCTCAAGCGGATCAAATACCAGTGCAGCGTTTGCTTCAAGGCATTCAAGACGGCGAAGATAATGGTGGCTCACATGAACAACATACACGAGGGCAAGAAGAACAGGGAGCCCGAGGGCGAGCACCTCTGCGAGATCTGCGGGAAGTCATACAAGTATTGCCGGCACAGCGGTCTCACCTCTGAGCTACCCAGACGTTCACTGATGAAGCGTATCGTTCCATCACATTTGCCGCCCCTGCTACCAGCGACCTCTCCGTATTTCCCTGAAAGGCTCACAGCCTTTAGGCCACTCTAA
Protein Sequence
DTRTGKVQQRNYRKLKNLPADLVELYTMTEEEMWEVRQQDVESAEFCALKYKCSDCIIGFNSERLMHDHMQGKHAPKSPDCHQCDVCKAYFLTRDNVSCHRALHLTAYRCTACGARAGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSNSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSNSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSNSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSNSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSNSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSNSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPTRRVPHLRGAIQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSNSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSG
AYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFRSGAYWRAVGPQAPHAVPLAHAPRPARRVPHLRGAVQVRGILESSGASSAACCPTRTRTATSPPRAPPAGSSSGQGHTGEQWGLKRRMLSHSHTHRDQPAACPTCGEQFSTKSKLSYHRGVCNQARPQCDCCGKVFANKMTLKYHLKILPQNKDEKPKEKLYITCKGCNKVFHSKKSYRAHVVIHDGLTYPCPICGKLFQWKRNLARHTRNHRDRDAGATHECRDCRKTFSSRDCYNNHMKLSKRHVQEDAYVHECSYCGKKFATKWCMVDHIDWDHLKRIKYQCSVCFKAFKTAKIMVAHMNNIHEGKKNREPEGEHLCEICGKSYKYCRHSGLTSELPRRSLMKRIVPSHLPPLLPATSPYFPERLTAFRPL
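
The protein sequence can be checked against the coding sequence by translating it. The sketch below assumes the standard genetic code and reading frame 1, neither of which the record states explicitly. It strips whitespace (the sequence is wrapped over several lines here) and ignores letter case, since the coding sequence mixes upper and lower case. `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace is removed and case is ignored; codons containing anything
    other than A/C/G/T become 'X'. A trailing partial codon is dropped.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the coding sequence above.
print(translate("GACACGAGAACGGGGAAGGTC"))  # DTRTGKV
print(translate("TTTAGGCCACTCTAA"))        # FRPL
```

Both fragments agree with the start (`DTRTGKV`) and end (`FRPL`) of the listed protein sequence; passing the full coding sequence to `translate` extends the same comparison to the whole entry.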

Similar Transcription Factors

Sequences clustered by sequence identity using MMseqs2

100% Identity
iTF_00656656;
90% Identity
iTF_00656656;
80% Identity
iTF_00656656;
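
For readers who want to reproduce clustering of this kind, the sketch below builds MMseqs2 `easy-cluster` command lines for the three identity levels listed above. The parameters this database actually used are not given in the record, so the file names, output prefixes, and the choice to vary only `--min-seq-id` are assumptions. The code only constructs the commands; running them requires a local MMseqs2 installation.

```python
import shlex


def easy_cluster_command(
    fasta: str, out_prefix: str, tmp_dir: str, min_seq_id: float
) -> list[str]:
    """Build an `mmseqs easy-cluster` command for one identity threshold."""
    if not 0.0 <= min_seq_id <= 1.0:
        raise ValueError("min_seq_id must be a fraction between 0 and 1")
    return [
        "mmseqs", "easy-cluster",
        fasta, out_prefix, tmp_dir,
        "--min-seq-id", f"{min_seq_id:.2f}",
    ]


# Hypothetical input and output names; thresholds mirror the listing above.
commands = {
    pct: easy_cluster_command("tf_proteins.fasta", f"clusters_{pct}", "tmp", pct / 100)
    for pct in (100, 90, 80)
}

for pct, cmd in commands.items():
    print(f"{pct}% identity:", shlex.join(cmd))
```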