Gacu017228.1
Basic Information
- Insect
- Gonocerus acuteangulatus
- Gene Symbol
- pol
- Assembly
- GCA_946811585.1
- Location
- CAMPFD010000870.1:206336-219444[-]
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 16 0.00016 1.1 11.3 1.3 15 52 85 127 80 129 0.83 2 16 1.5e-05 0.098 14.6 1.1 22 50 125 153 121 156 0.91 3 16 0.025 1.7e+02 4.3 0.7 17 48 158 189 153 193 0.82 4 16 0.00042 2.9 9.9 0.3 26 48 196 218 192 221 0.93 5 16 0.002 13 7.8 0.1 14 48 372 406 367 411 0.90 6 16 0.00071 4.8 9.2 0.2 22 49 409 436 407 438 0.91 7 16 0.033 2.2e+02 3.9 0.1 22 49 556 583 545 587 0.86 8 16 0.25 1.7e+03 1.1 2.8 23 48 586 611 583 614 0.87 9 16 4e-05 0.27 13.2 0.3 20 49 883 912 878 916 0.90 10 16 4.4e-06 0.029 16.3 1.4 23 48 915 940 912 943 0.93 11 16 0.0031 21 7.2 0.6 26 48 1088 1110 1073 1114 0.90 12 16 4.3e-05 0.29 13.1 0.4 24 48 1115 1139 1111 1143 0.92 13 16 0.015 1e+02 5.0 0.1 25 49 1280 1304 1271 1309 0.86 14 16 0.00028 1.9 10.5 0.6 23 48 1307 1332 1305 1336 0.92 15 16 0.0015 10 8.2 0.5 22 49 1473 1500 1469 1504 0.90 16 16 0.0015 9.9 8.2 0.6 23 48 1503 1528 1500 1531 0.92
Sequence Information
- Coding Sequence
- ATGCATctcattaCTGAAGACGAATGCACTTCTATGTGGATCCCTGACCTATCTTCAGAACTCAATTGTACAGAAAAGAATGACATTAATTCAAACTTTACGAGAAATAATGAATTTGTTGGAGAAGATTCCTCAAGGTATGCTGATGAAAAGCACCTTCAGTCGTCTTTTGAAACCATTGAGCATGGCAACGTTAGTGGAGAACCAATTATTTATCCTTACTGtcaatataaaagtaaaaattctTCCTTCCCGAGGTCTCGCATCTTGAGTAAGCACTGCAAATATAAACCTTTCTCATGTCCTTTTTGCAACTACCGTTGCAATCAAACAAATAACTTGAAATCTCATATTGCATTCAAGCATTCATTTGAAAAACCGTTTCATTGCAGTCATTGCTCTTATCGGAGTTCTCAATCAAGCAATTTGACTCGACATGTAAAGCTAAGGCATTTTAAAGACAAAAGCCAGTTGAAATctgcaaagaaaaaaacatttgtttgcCCACACTGTCATTATAGATGTGCCCAATTAAGCAATTTGAAATCACATATAACCTTCAAACATTCTAAAATTAAGCTACATGCCTGCCCATTTTGTATCTATAAAACCCCTCTATCAAGTAACTTGAGAAGGCATATAACATTGAAACATTCCTTGCAGAGGCTTAACCAAAAAGATTTCTCTTCTTGTTCTGGAATTGGAGAGTTTGGTCAATCACATCTGTTCAATAAGCAATTCTTAGATTCTGATTCTGAGAATGGACATTCTATAGCAAACGAAAACAATTATGCAAATTATGACAAACCTGAATTGATGACTTCTAACTCTCAGGCAGCTTCTTTTAGCAATCAAAAGTCTTTTTTAAGTGTCTCAAAGCAACTACTCCAAATTGTTTCTGTACCCTCTGACCAAAACACCGACCAATCACAAAATTCATCAGAACGTGATCATTCCAGTTCCATTCCAAGAAAATCATCTTTTGATCTTGTTCAAAACCAAATATTTGGTATTAATGCTACAGGGCAGTTctcaaaacatataaataatttccccACAAATAATGAAGTTCAACAGCATTCTTCATTACCTTCCTCTAATATTGGAGAAATATACCCCCAGTCTCTTAGTTCTATTGATGAGCCGTTTCAATGCCCATACTATGAGCAAAAATGCTCACAGTCAGTCAATTTGAAATCACATATTGCACTTATGCACTCTAAGAATAAACCATTTTCTTGCCCTCACTGTCCATACCAAAGCTCCATGTCCAGTAATCTTAATAGGCATGTCACCCTTAAGCATTATGACATTTTGAGTTCATTCAGACAAACTGACTTCAAAAAGCAAACTTCCACAAATAGTGGTCCTGATCAGCCTTTTAATGGATTAAAACCTGCATATCTTGCTGACAACATACAGAATTTTCAAATCAATGAGGATTGTACATTGAATGGAAGAAGTAATTCTACTTTGAACATTGGCAATGCAGTTTTTCCTTCAAATCCCTTAGTAGATAGTAGTAACAAATATGATTTTGTTAAAAGTCAAAACAATTCTGACATATTTGATAGAAAGGATACCTTTAGTAAGCATAAGATGACACCTGTAAATATTGATGACACTCAAAGCCATTCACCGTACAGTCACACTCTAGAATCTAATGAGAAACATTTTCAGTGCCCGTACTGCAATTACAAATGCTCATCATCTGGGAATTTGAAATCTCATATTGCTTTTAAACACACTAAAATTAAACCTTTTTGTTGCCCTTACTGTTCATATAGAAGTTGTGTTACAAGCAACGTCAATAGACATATTAGGTTGAAACATTGTGGTATTGACACATTTTCAAACCTAAGAGAACATGAAAAAACTATCTCAAATTGTGCGAGTCTACAGATAAATGAATCAAATGACCTCTGTGAGTCTTCGTTTTTACGTCCTCGTGAGGAGCAGCGTACCGAAAGATTCTCTTTGAATGAGGCGCTTGAGCCCTCTTGTCCTGAAAATATTTCTACGTTTACTTGCATAGACCCTCAAAGTCAGTCCACTGGTGGGCAGCAATGGGAATCAAACCTAATTAGACCAATTCCTTCAATATTTAATGATGATTGGGGCTCTTGCATATCGCTTGAAAAAACACACTTAGAGCAGAAATTGTTCTCTGAAAGTATTATGAGTGACAGTAAATACAGAAGTTTGGAGCCCACATTTAACAAAGAATCTAAAATCCATATATCCCTCGATAAGTACAATTGTGGTAAGGAAGGTTTTTCAAAAGCTCAAACTTTGTCTAAATATTCTGATGAATTTTCCTCACTTCATAGTCCTCCACAAATAAATTCAGAAACGCAAGTGCTATGCACTCCACAGTGTACAACACTTCCTCAGGACGAAGAATTAGACATCTCTTGTTCTGTCATTTTAAACCAATCAATGTCTTCTTTGAACCAAATCTCTACTCTTCAAAATTCTTGTAATGAAATCAAATCATTATCATGTCTAGACTCATCTCTTTCTGTGGATAGCCAAAATTTAAAGTCTTGCAGTATAAAAGTTGAAAACCACATCAGTGAATTTTCAAGCTCAGATGTGATTGAAACATCTTTAGAGCAGCAAAACATTGACAGCCATCAAAGTAATTTTAGTGAGAAGCCATATCATTGTCCTCACTGTAAATACCAAAGTTCTCAATCTGGCAATCTAAAGATGCATATTGCTTTCAGACACTCCAAAGTTAAACCTTTCGGTTGTCCTTACTGCAATTATCGCAGCTCTCAATCCAGTAATGTTAGAAGGCATTTGAGACTCAGGCATAATCCATTTCATTCTATAAAGACTTTTGtatcaaataatataaattatcaaaagATTACTGATAACAGTAATGGCAAAAGTGGCTCTCTAATCGAATCAAGTCAATATAATTCAGGAATGAATCGTCCACTGGTAAATTCAATTCAAGGTGTTTATGCAAAAAACTTTGTTAATGACCAATCAAACTCTTTAACTCACGAAATACTTAATGAGGATCCCATTAGTTCATTCATTTCAGAAGTCAATGAGAGTTTAATCTCTCAGCCTGAGGAGATGAAATCTGAATCAACTAGTTATGAAAGTGGAAATGAGGAAGGTAAGATGAATGATTTGCCTCCAAATATGGGAAATCCCTTGGGCACTGCCTACGAGGATGAAATCATTAAAAACAATTCCAAAAGGAGAAGATCTGATTATAAACGTAGCTGTATAAGGATGTATCATTGTCCACACTGTGAGCACAAGACGACTCAGTCAAGTAATTTGAAGACTCACATTGCTTTTAGACATTCTAAAATTATGCCATTTGGCTGCCCTTTTTGTAACTATCGTAGTTCGCAATCCAGTAATGTAAAGAGGCATGTCAGATTAAGGCATCAATTTGACAATTGTGCTGGATTATCTGCTCCTTTTTTCAATCATTCAAATACTGAAATGCCCTTAAGAGAGACTGATGACAGTCAAAACTCATTATCTCAATCCCCGGCATCCCAAAATTCGCCGGTAAAGCCGTGTTTTAATATAAAGGAATGTAATAATCCCTACAGTGATTCTGTAAGCTATGTAAAAAAAGAGGTCAGCACAACAAAttctaatgaaataattttttctgaaaatgaaaGTAACCAATTGAAAAAGATGAAGAGCAAAGTTGATGATAGCACTTGCTCTAATAGTGAACAGAGTAACATGTTACAAACGGAGAATTTTTCAATTGTGAAGAAGTGTTTTGAGGAAAGTTCCACTTCATCCCTAACCAACTCGACATTTGAATCCACAGACTTGAAAGAAAAACTTTATCAATGTCCTTACTGTGATCACAAGACACTGCAGCCAATTAATTTGAAGACTCACATTGCTTTTAGACATTTAAAAGTCAAGCCATTTGGCTGCCCTTATTGTAACTATCGTAGTGCCCAATCCAGTAACGTCAAGAGACATGTCAGATTTAGACATCAAATAGATAATTTTGTTGGGTTTTCTGCTTCCTCTTTTAGTCAATCAAATAACCTAAATGTTTTTAGGGATGTTGAGGACAGTCAAAACTCAGCATCCGATTCCTTGCAATCCCAAAATTTGCTCTCAAACCCAAGCTTTAATTTAGAGAAACCTTTTGCGTCCATCGGTGAATCATCAAGCATAATTAAAAAAGAATCGAACACCACGGATTCAGTCGAATTCGTTTCTTCAGAGCCTTCTGCAAGTCTAAGCAGCCAGTTAGACAATGTTAAGGGTGAACCAGATGATAGCTCTTGCTCTAATAATCAACAGAGTGTAAAGTTTCAAAATTTAGAGAATCTTtcaaacataaaatacattGACAAAAATGCCCTTTCATCTGTAATTGAACCCTCATTTGAATACTCAAACTTGAAAGATAAACCATACCAATGTCCGCACTGTGATCACAAGACACAGCAGCCAAATAATTTAAAGACTCATATTGCTTTTAGACATTCAAAAGTCAAACCTTTTAGCTGTACCTACTGCCATTATAGAAGTTCTCAGTCTAGTAATGTTAAAAGGCATGTTAGGCTGAAACATTCCCTTGAACTTAGTGCACCAGTATTTAGGACATATACAGCTATACAAACTGCAGCTCATACAGATGAATGTAGTGGCTCTGGCACACAAATAAAAACTGAAATGGCAACTCCTTCCCAAGATGATGTATATTTGCTTggagaaaatgaaatgaatgtGTCAAGAAGTGTAGATTCAAACCTTTTGTTGGAAAGGTGCGGGTGGGTTCACGACGGCACGTTACGGAAAAAATTGCAAGTTGGAGCCATCGATTCTGAGAAGGCAAATTGGAGCGATCTCAAAGACGTCTTCACTGATGTGTTTTACCAGGAGGGAGAGCCCTTAGGAGTAACCGGAAAGACCCGCCATGAGATCGTTCTTAGCTCAGACTGCACAGTTTATGTGAAGGAGAGACGATATCCTCAGGCCTTAAAAACCGCCATGCGGGAAGAACTTCAAAAGATGATTGATCAGGGGATCATAGTTCCGAGCAAGTCAACCTATAATAGCCCACTGTGGGGGGAGCCTACGGCGGGATCCCCGGGGAAGTACAGAGTAGTTAAATGA
- Protein Sequence
- MHLITEDECTSMWIPDLSSELNCTEKNDINSNFTRNNEFVGEDSSRYADEKHLQSSFETIEHGNVSGEPIIYPYCQYKSKNSSFPRSRILSKHCKYKPFSCPFCNYRCNQTNNLKSHIAFKHSFEKPFHCSHCSYRSSQSSNLTRHVKLRHFKDKSQLKSAKKKTFVCPHCHYRCAQLSNLKSHITFKHSKIKLHACPFCIYKTPLSSNLRRHITLKHSLQRLNQKDFSSCSGIGEFGQSHLFNKQFLDSDSENGHSIANENNYANYDKPELMTSNSQAASFSNQKSFLSVSKQLLQIVSVPSDQNTDQSQNSSERDHSSSIPRKSSFDLVQNQIFGINATGQFSKHINNFPTNNEVQQHSSLPSSNIGEIYPQSLSSIDEPFQCPYYEQKCSQSVNLKSHIALMHSKNKPFSCPHCPYQSSMSSNLNRHVTLKHYDILSSFRQTDFKKQTSTNSGPDQPFNGLKPAYLADNIQNFQINEDCTLNGRSNSTLNIGNAVFPSNPLVDSSNKYDFVKSQNNSDIFDRKDTFSKHKMTPVNIDDTQSHSPYSHTLESNEKHFQCPYCNYKCSSSGNLKSHIAFKHTKIKPFCCPYCSYRSCVTSNVNRHIRLKHCGIDTFSNLREHEKTISNCASLQINESNDLCESSFLRPREEQRTERFSLNEALEPSCPENISTFTCIDPQSQSTGGQQWESNLIRPIPSIFNDDWGSCISLEKTHLEQKLFSESIMSDSKYRSLEPTFNKESKIHISLDKYNCGKEGFSKAQTLSKYSDEFSSLHSPPQINSETQVLCTPQCTTLPQDEELDISCSVILNQSMSSLNQISTLQNSCNEIKSLSCLDSSLSVDSQNLKSCSIKVENHISEFSSSDVIETSLEQQNIDSHQSNFSEKPYHCPHCKYQSSQSGNLKMHIAFRHSKVKPFGCPYCNYRSSQSSNVRRHLRLRHNPFHSIKTFVSNNINYQKITDNSNGKSGSLIESSQYNSGMNRPLVNSIQGVYAKNFVNDQSNSLTHEILNEDPISSFISEVNESLISQPEEMKSESTSYESGNEEGKMNDLPPNMGNPLGTAYEDEIIKNNSKRRRSDYKRSCIRMYHCPHCEHKTTQSSNLKTHIAFRHSKIMPFGCPFCNYRSSQSSNVKRHVRLRHQFDNCAGLSAPFFNHSNTEMPLRETDDSQNSLSQSPASQNSPVKPCFNIKECNNPYSDSVSYVKKEVSTTNSNEIIFSENESNQLKKMKSKVDDSTCSNSEQSNMLQTENFSIVKKCFEESSTSSLTNSTFESTDLKEKLYQCPYCDHKTLQPINLKTHIAFRHLKVKPFGCPYCNYRSAQSSNVKRHVRFRHQIDNFVGFSASSFSQSNNLNVFRDVEDSQNSASDSLQSQNLLSNPSFNLEKPFASIGESSSIIKKESNTTDSVEFVSSEPSASLSSQLDNVKGEPDDSSCSNNQQSVKFQNLENLSNIKYIDKNALSSVIEPSFEYSNLKDKPYQCPHCDHKTQQPNNLKTHIAFRHSKVKPFSCTYCHYRSSQSSNVKRHVRLKHSLELSAPVFRTYTAIQTAAHTDECSGSGTQIKTEMATPSQDDVYLLGENEMNVSRSVDSNLLLERCGWVHDGTLRKKLQVGAIDSEKANWSDLKDVFTDVFYQEGEPLGVTGKTRHEIVLSSDCTVYVKERRYPQALKTAMREELQKMIDQGIIVPSKSTYNSPLWGEPTAGSPGKYRVVK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00757581;
- 90% Identity
- iTF_00757581;
- 80% Identity
- iTF_00757581;