Basic Information

Gene Symbol
-
Assembly
GCA_000648655.2
Location
NW:326035-333170[+]

Transcription Factor Domain

TF Family
zf-GAGA
Domain
zf-GAGA domain
PFAM
PF09237
TF Group
Zinc-Coordinating Group
Description
Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 18 0.16 2e+02 1.6 0.0 21 45 108 132 105 140 0.83
2 18 0.026 33 4.1 0.0 19 46 163 190 157 196 0.83
3 18 0.00011 0.14 11.7 0.1 21 46 193 218 189 224 0.87
4 18 0.0054 6.8 6.3 0.2 21 45 221 245 217 248 0.90
5 18 0.0042 5.4 6.6 0.0 21 45 249 273 245 281 0.86
6 18 1.7 2.1e+03 -1.7 0.0 21 45 277 301 273 309 0.82
7 18 1 1.3e+03 -1.0 0.0 21 44 305 328 299 337 0.75
8 18 0.06 77 2.9 0.1 21 46 389 414 383 422 0.85
9 18 0.37 4.7e+02 0.4 0.2 26 48 450 472 431 477 0.76
10 18 1.1 1.4e+03 -1.1 0.1 21 43 623 645 621 649 0.73
11 18 0.024 30 4.2 0.1 21 46 651 676 647 683 0.88
12 18 0.76 9.7e+02 -0.6 0.2 37 45 695 703 680 706 0.74
13 18 0.0014 1.8 8.1 0.0 21 46 735 760 721 766 0.88
14 18 0.042 54 3.4 0.1 21 46 763 788 759 791 0.88
15 18 0.93 1.2e+03 -0.9 0.1 21 44 819 842 807 851 0.75
16 18 0.022 28 4.3 0.1 26 45 880 899 869 902 0.89
17 18 0.005 6.3 6.4 0.1 21 45 903 927 900 931 0.90
18 18 0.13 1.7e+02 1.8 0.2 26 44 936 954 927 958 0.89

Sequence Information

Coding Sequence
ATGGCGTCGAATCACGGCTGCAACGAAGAGATTCGAGATTCGAGACAAAAAAGCAACGTATCCAGTTCCTTAACAATAAACGACAGTTACTGGAGTAAAAGAACTGGGGACAACGAAGCTACCAATAAGATTATAGAAACTTTCTCTTCGTGTTTGTCACACATCGAAGTCCAACTATTTATCAACTGTTCAGGGGCCATTATTAACGAAAATCCAACCATGAGATCGGCACTGAAAAAAACCTGTACGAGACTTTGCGAGATCTGCGGTAAATTATCTGCTAATCTTACAGATTTTAAAGAGCACCAACTCGTTCACACGGGCGAGAGACCGTTTGAGTGCTCAATTTGTTTACAAACGTTTAATCGAAAAATGACCCTTGTAAGACACAAACAAACTCATATGGGGAAAAAACCCTTCAAATGCGCAGATTGTTCTAAAACATTTGGGCAAAAAGATACTATCATACGACACATACGGACTGTTCACAACGAGGAGAAACCCTTTGAGTGCATAGtatgtcaaaaaagttttaaatcttcaGTTAACCTCTCAACGCACATGGGAACCCACACGAGCAAAAAACCCTTCAAGTGCACAGAATGTCTTAGATCATTTCGCCAAAAAGGTAATCTTAGGAGGCACATGCAGACCCACACAGAGGAAAAACCCTATAAGTGCACAGATTGTCCAAAATATTTCAGCCAAAAAAGCAATCTCAGGAGACACATCCAAAGCCACACAGGGGAAAAACCTTTTGAGTGCCCAACATGTCAAAAAGGTTTTGCATTatcttctaacctcaaaaaacacaTGATGACTCACACGGGCGAGAAACCCTTCGAGTGCTCAACTTGTTCTAAAACATTTAACCGAAAAACTAATCTCGTTTTACACATACGGACGCACACAGGGGAAAAACCCTACAAGTGCTCGGTATGCCAAAAAGGTTTCATATCTCGTAGTAATATCGCAAAGCACATGTGGACCCACAAAGAGGAAAAACCCTATAAGTGCGCAAATTGTCCAAAATCTTTCAGCCAAAAAAACCACCTCACAACTCATATGCGCGTCCACACTGGCATACCTTTTAGTCACTGCACCATCTGTCAAAAGCCATTTACGTACTCGAGTAATCTTATAGTACACATGAGGACCCACACAGGGGAGCGACCCTACAAGTGCTCGGTGTGTCAAAAAGGTTTTAGATCTTCTAGTTCACTCGCAGATCACATGCGGACCCATACCGGGGAAAAACCCTATAAGTGTGCAAATTGTCCAAACTCTTTTGCGCAAAAAGGTACTCTTAAAATTCACATGCGCATCCACACAGGCATACCTTACGGTTTTTGTACCATCTGCAAAAAGCCATTTAGGCACGCGAATAGTCTTAAAAATCATATGAGGACCCATACGAGGAAAAAACCTTCGAGAgaatccagtaaaaaaaaacaaaaacagatgATGTCCCACGAGATTAATGGCTATGTAGCACGTGTTAGTAATACTAAGTACGCTCTATCTATGTCTATTTCTCTCACGCACTTTACTGACACGTGCCATGTAGCTTATGCGTACAAGCCACAAAGACAAAAAAGCAACGCGTTCAGTTCCTCAACAATAAATGACAGTTACTGGAGTAAAAGAACTGGGGACAACAAAGCTACCACTAAGATTATAGAAACTTTCTCTTCGTGTTTACCACACATCGAAGTCCAATTATTTATCAACTGTACAGGGGCCATTATTAACGGAAATCCAATCATGACATCAGCACTGAAAAACACTTGTACGAGACTTTGCGAGGTCTGTGGTAAAACGTTTGCCAAAATTGCACATTTTAAATCGCACCAACTTGTTCACACGGGCGAGAAACCCTTCGAGTGTTCAATTTGTTCTAAAACGTTTAGTCGAAAAACTGATCTCGTATGTCACATACGGATTCATACAGAAGAGCGTCCTTATAGGTGCTCGGTATGTCGAATAGGTTTTACATTTTCCGGTAACCTCAAAAAACACATGAGGAtccatgcagaaaaaaaatcgtttaagtGCAAAGATTGTCCAAAATCTTTTAGTCTAAAAACCAATCTCAGGAGGCACATGCGCGTCCACACTGGCATACCTTATGGTAATTGTACTATATGCCAAAGGCCATTCACTGACCCGAGCAGTCTTAGAGTGCATATGAGGATCCATACAGGAGAGAAACCTTACACGTGCTCCGTATGTCAAAAAGGTTTTAGATCTTCCAGTGACCTCACACAGCACATGCGGACTCACACAGGGGAGAAACCCTATAAGTGTGCAGATTGTTCAGAATCTTTTAGTCTAAAAACCAATCTCAGGAGGCACATGCGCATCCACACTGGCATACCTTATGGGCCCTGTTTAATCTGCCAACAACCATTCATTGACCCGAGTAGTCTTAAAGTGCACATGAGGACCCATACAGGCGAACGACCCTACAAGTGCTCGGTATGCCAAAAAGGTTTCATATCTCATAGTAATATCGCAAAGCACATGTGGACCCACAAAGAGGAAAAACCCTATAAATGCACAGATTGTCCAGAATCTTTCAGCCAGAAAAAACTTCTAACAACTCATATGCGTGTCCACACCGGCATACCTTGCGGGCATTGTCCCATTTGCCAAAGGCCATTCATGGACATGGGTAATCTTAAAAGGCACATAAAGACCCATACGGGAGAAAGGCCCTATAAATGTCCAGTTTGTCCAAAATCTTTTGGACTAAAAAGCAATCTCAAGAGACACGTGCGTGTCCATACCGGTATACCTTCTTATCGCTGTTCCATCTGCCAAAGGCCATTCACCGACGCGAGTAATCTTAAAAGGCATATGAGGACCCACACGAGCGAGGGTTCTTCGAGTTCTTAG
Protein Sequence
MASNHGCNEEIRDSRQKSNVSSSLTINDSYWSKRTGDNEATNKIIETFSSCLSHIEVQLFINCSGAIINENPTMRSALKKTCTRLCEICGKLSANLTDFKEHQLVHTGERPFECSICLQTFNRKMTLVRHKQTHMGKKPFKCADCSKTFGQKDTIIRHIRTVHNEEKPFECIVCQKSFKSSVNLSTHMGTHTSKKPFKCTECLRSFRQKGNLRRHMQTHTEEKPYKCTDCPKYFSQKSNLRRHIQSHTGEKPFECPTCQKGFALSSNLKKHMMTHTGEKPFECSTCSKTFNRKTNLVLHIRTHTGEKPYKCSVCQKGFISRSNIAKHMWTHKEEKPYKCANCPKSFSQKNHLTTHMRVHTGIPFSHCTICQKPFTYSSNLIVHMRTHTGERPYKCSVCQKGFRSSSSLADHMRTHTGEKPYKCANCPNSFAQKGTLKIHMRIHTGIPYGFCTICKKPFRHANSLKNHMRTHTRKKPSRESSKKKQKQMMSHEINGYVARVSNTKYALSMSISLTHFTDTCHVAYAYKPQRQKSNAFSSSTINDSYWSKRTGDNKATTKIIETFSSCLPHIEVQLFINCTGAIINGNPIMTSALKNTCTRLCEVCGKTFAKIAHFKSHQLVHTGEKPFECSICSKTFSRKTDLVCHIRIHTEERPYRCSVCRIGFTFSGNLKKHMRIHAEKKSFKCKDCPKSFSLKTNLRRHMRVHTGIPYGNCTICQRPFTDPSSLRVHMRIHTGEKPYTCSVCQKGFRSSSDLTQHMRTHTGEKPYKCADCSESFSLKTNLRRHMRIHTGIPYGPCLICQQPFIDPSSLKVHMRTHTGERPYKCSVCQKGFISHSNIAKHMWTHKEEKPYKCTDCPESFSQKKLLTTHMRVHTGIPCGHCPICQRPFMDMGNLKRHIKTHTGERPYKCPVCPKSFGLKSNLKRHVRVHTGIPSYRCSICQRPFTDASNLKRHMRTHTSEGSSSS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00368958;
90% Identity
iTF_00368958;
80% Identity
iTF_00368958;