Alei098079.1
Basic Information
- Insect
- Aquatica leii
- Gene Symbol
- -
- Assembly
- GCA_035610365.1
- Location
- CM069438.1:9157678-9173044[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 51 4.6 1.1e+03 3.0 0.7 10 23 2 15 1 15 0.93 2 51 0.059 14 9.0 1.6 3 23 22 42 21 42 0.97 3 51 2.4e-05 0.0058 19.6 3.0 2 23 49 70 48 70 0.97 4 51 1.6e-06 0.00038 23.4 3.2 1 23 76 98 76 98 0.99 5 51 0.00074 0.18 15.0 0.4 2 23 119 140 118 140 0.97 6 51 9.2e-05 0.022 17.8 4.2 1 23 146 169 146 169 0.97 7 51 0.0002 0.049 16.7 1.4 1 23 175 197 175 197 0.98 8 51 0.016 3.8 10.8 3.0 3 23 205 226 203 226 0.93 9 51 0.00011 0.025 17.6 1.1 1 23 231 253 231 253 0.97 10 51 0.021 5 10.4 4.1 2 23 264 285 263 285 0.97 11 51 0.052 12 9.2 0.2 2 23 291 312 290 312 0.96 12 51 3.4e-06 0.00082 22.3 1.7 1 23 318 340 318 340 0.99 13 51 6.9e-06 0.0016 21.4 3.4 1 23 346 368 346 368 0.99 14 51 3.6 8.7e+02 3.4 0.2 1 14 374 387 374 391 0.84 15 51 0.00074 0.18 15.0 0.4 2 23 402 423 401 423 0.97 16 51 9.2e-05 0.022 17.8 4.2 1 23 429 452 429 452 0.97 17 51 0.0002 0.049 16.7 1.4 1 23 458 480 458 480 0.98 18 51 0.016 3.8 10.8 3.0 3 23 488 509 486 509 0.93 19 51 0.00011 0.025 17.6 1.1 1 23 514 536 514 536 0.97 20 51 0.021 5 10.4 4.1 2 23 547 568 546 568 0.97 21 51 0.052 12 9.2 0.2 2 23 574 595 573 595 0.96 22 51 3.4e-06 0.00082 22.3 1.7 1 23 601 623 601 623 0.99 23 51 6.9e-06 0.0016 21.4 3.4 1 23 629 651 629 651 0.99 24 51 8.4 2e+03 2.2 0.4 1 12 657 668 657 670 0.88 25 51 7.8e-05 0.019 18.0 3.6 1 23 708 731 708 731 0.95 26 51 0.0025 0.6 13.3 0.8 1 23 737 759 737 759 0.98 27 51 0.0065 1.6 12.0 1.0 3 23 767 788 765 788 0.95 28 51 6.3e-05 0.015 18.3 3.1 1 23 793 815 793 815 0.98 29 51 0.0051 1.2 12.3 2.1 2 23 826 847 825 847 0.97 30 51 1.4e-05 0.0033 20.4 0.7 2 23 853 874 852 874 0.96 31 51 0.00045 0.11 15.6 1.4 2 23 881 902 880 902 0.97 32 51 3.7e-05 0.0088 19.1 5.1 1 23 908 930 908 930 0.98 33 51 0.00074 0.18 15.0 0.4 2 23 1123 1144 1122 1144 0.97 34 51 7.3e-05 0.017 18.1 2.3 1 23 1150 1173 1150 1173 0.97 35 51 0.0002 0.049 16.7 1.4 1 23 1179 1201 1179 1201 0.98 36 51 0.012 2.9 11.1 2.3 3 23 1209 1230 1207 1230 0.92 37 51 0.00011 0.025 17.6 1.1 1 23 1235 1257 1235 1257 0.97 38 51 0.021 5 10.4 4.1 2 23 1268 1289 1267 1289 0.97 39 51 0.052 12 9.2 0.2 2 23 1295 1316 1294 1316 0.96 40 51 3.4e-06 0.00082 22.3 1.7 1 23 1322 1344 1322 1344 0.99 41 51 6.9e-06 0.0016 21.4 3.4 1 23 1350 1372 1350 1372 0.99 42 51 8.4 2e+03 2.2 0.4 1 12 1378 1389 1378 1391 0.88 43 51 7.8e-05 0.019 18.0 3.6 1 23 1429 1452 1429 1452 0.95 44 51 0.0025 0.6 13.3 0.8 1 23 1458 1480 1458 1480 0.98 45 51 0.0065 1.6 12.0 1.0 3 23 1488 1509 1486 1509 0.95 46 51 6.3e-05 0.015 18.3 3.1 1 23 1514 1536 1514 1536 0.98 47 51 0.0051 1.2 12.3 2.1 2 23 1547 1568 1546 1568 0.97 48 51 1.4e-05 0.0033 20.4 0.7 2 23 1574 1595 1573 1595 0.96 49 51 0.00045 0.11 15.6 1.4 2 23 1602 1623 1601 1623 0.97 50 51 4.4e-06 0.0011 22.0 3.2 1 23 1629 1651 1629 1651 0.99 51 51 0.0012 0.28 14.3 1.1 1 23 1657 1680 1657 1680 0.98
Sequence Information
- Coding Sequence
- ATGTTCGAAAGTAATCGATCGTTAAAGTATCATCTTAAAACGCACACTGATGATAAGCCGATTTGTCACATTTGTGGAAAATCCGTGTCGTGCACAGCGAGTTTGGCACAACACATATTACGACACGATAACGTGAAACCTCTCAAATGCCAGTTCTGCGAAAAGCATTTCAGAAGCTCCAGTAATTTGGCTGTTCATCTCCGATCTCACTCGCACGTGAAGCTATACCAATGCAACGTTTGTTCTAAATCGTTTACTCAACGATCTACTTTGGTTATTCACATGAGACATCATACTGGAGAAAAGCCatttgtTCGTGAAAAAACAACGGTCATTCAGCTGGGATTTTTAAACCAATGCGATATATGCTTAAAAATTTTTCCCGGAAAAAGAAACCTGAAAAAGCATAGAATTGTTCATAGCGAAGAGAGGCGATACGCGTGTAACCATTGTCCGAGACGGTTTAAGCAAAAAATCCATCTTACTGTGCACGTAAAACGATTACATCAAAGCAGAGAGTCGCATAAGTGCGATGTATGTAATAAAAGTTACGTGTCGGATGCAAATTTGAAGTTACACCAAGAACGGCATACGAtgaaatacaaaacattttgttcTTTATGCGATAAAGGGTTTTTTAGGCGTTGCGAACTGACGAAGCACGTTTCGAACGTGCACGAAAAAATCTTGCACATATGCAAAATTTGCCAAAAACCACTTTCGTCAGCatatcaattaaaaatgcaCATGGAAACACACAACGGCAACTAtttgacaaagaaaaaattaaaatgcagtttGTGCAACAGAGTTTTTCAAAACAACCAATCGTTAAAGTGTCATATTAAAACACACACTGATGATAAGCTGATATGTCACATTTGTGGAATATCTGTGCTGAGCAAGAAAGGTTTAGAACAACATATATTAAGACACGATGGAGTGAAACCTTACAAATGCCAGTTTTGCGAAAAGCTTTTCAGAAGTGCAAGTAATTTGACAGTTCACCTTCGAACTCACTCGCGCGTGAAGCTATACCAGTGCAATGTTTGTTCTAAATCTTTTACTCAACGTACCACTTTGGTTATTCACATGAGGCACCATACTGGAGAAAAGCCTTATAAATGTGATTTGTGCGATAGTGCTTTTGTAAGCAGatTCGTTCTTGAAAAAGCAACGGTCACTCAGCTGGGATTTTTAAACCAGTGCGATATATGCTTAAAAATTTTTCCCGGAAAAAGGAACCTGAAAAAGCATAGAATTGTTCATAGCGAAGAAAGGCGATATGCGTGTAACCATTGTCCGAGACGGTTTAAGCAAAAAATCCATCTTACTGTGCACGTAAAACGTTTACATCAAAGCAGAGAGTCGCATAAGTGCGATGTATGTAATAAAAGTTACGTGTCGGATGCAAATTTGAAGTTACACCAAGAACGCCATACGAtgaaatacaaaacattttgttcTTTATGCGATAAAGGGTTTTTTAGGCGTTGCGAACTGACGAAGCACGTTTCGAACGTGCACGAAAAAATCTTGCACATATGCAAAATTTGCCAAAAACCACTTTCGTCAGCGtatcaattaaaaatgcatatggAAACTCACAACGGCAACTAtttgacaaagaaaaaattaaaatgcagtttGTGCAACAGAGTTTTTCAAAACAACCAATCGTTAAAGTGTCATATTAAAACACACACTGATGATAAGCTGATATGTCACATTTGTGGAATATCTGTGCTGAGCAAGAAAGGTTTGGAACAACATATATTAAGACACGATGGACTAAAACCTTACAAATGCCAGTTTTGCGAAAAGCTTTTTAGAAGTGCAAGTAATTTGACAGTTCACCTTCGAACTCACTCGCGCGTGAAGCTATACCAGTGCAATGTTTGTTCTAAATCTTTTACTCAACGTACCACTTTGGTTATTCACATGAGGCACCATACTGGAGAAAAGCCTTATAAATGTGATTTGTGCGATAGTGCTTTTGTAAGCAGatttatCAATGAAAAATTAACGATCATTCAGCTTGGGAATTTCAACCAATGCGAAAGTTTTCCTGAAGTGAAGTACCCGAAAAAACACGGAAATGTTAATAACGAAGAAAGGCGATTTGCCTGTGACCTATGTTCAAAacgctttaaacaaaaaagccATCTTAATACACACGTTAAACAATTGCATTTAATCAAAACATATTACCAATGTGATGTATGTAATAACACTTACGCGTcggaaaaaactttaaagttacaCCAAGATCGtcatttattgaaatacaaGACATTTTGTTCTTTATGCCAGAAAGGATTTTATCGAAATTTCGATTTATCGAACCACGTCTTAAAGGTGCACGAGAAAATCTTGCACGAAtgcaatatttgtaaaaaaccacTTTCgtcaaaaagatatttaaagaaaCACATGGAAACTCACAATGGCAACTCTTcgtcgaagaaaaaaattaaatgcagtaTGTGCGACAAAGTTTTTCAAAGTAACGAGTCGTTAAACTGCCATCTTAAAATACACACTGATGATAAGCTGATATGTCACATTTGCGGAAAATCTGTGTCGAGGAAATCGAGTTTGGAAGATCACGTATTACGACACGATAACGTGAAGCCTCTCAAATGCCAAATTTGCGAAAAGCATTTCTTAGATAGAAGCAATTTAGCAGTTCATCTCCGATCTCATTCGCACGTAAAGCTACACCAATGCAACGTTTGTTCAAAATCTTTTACTCAACGTACCAGTTTAGTTATTCACATGAGGCACCACACTGGAGAAAGGCCTTATAAATGTGATTTGTGCGATTCCGCATTCGCGTTTTTTATGCTAAGCCACGCCGTTAGTAAAGTCGCTTCGGCTTTTCTAACTGTGTGTTATAAAACCAAGAGTCCAATGACAAGGGAAACTGTTCCACTGCATATCCATCTCCTTTCCTCTATGCTGGGTGTCGTACTTTTAGATTTGTATTTGGCCTGCGTAATATACTGTCAACACCAGGTGCAGCGTAATACGACGTCAATACCAGGTTCAGGTAACGTGTTAAAGGATTGCGCACGTCTACTTGGTCTACTGTATTCTTTACCGGGCGTCATATTTGCAGTGTTGGATTTTGGCCAGCGTAACACGCCATCAATACCAGGTTTAGGTAACGTGTTGAGAGATTTTCTGTTGGACTTTGGCCAGCGTAATACGCCGTCAATACCAGATCCAGGTAACGTGTTGAGAGATTTTAGCACGTCTACTTGGTCTGCTGTTTTCTTTACCGGGCGTCCTATTTGTAGTGTTGGGTTTTGGCCAGCatTCGTTCTTGAAAAAGCAACGGTCACTCAGCTGGGATTTTTAAACCAGTGCGATATATGCTTAAAAATTTTTCCGGGAAAAAGAAACCTGAAAAAGCATAGAATTGTTCATAGCGAAGAGAGGCGATATGCGTGTAACCAGTGTCCGAGACGGTTTAAGCAAAAAATCCATCTTACTGTGCACGTAAAACGCTTACATCAAAGCAGAGAGTCGCATAAGTGCGATGTATGTAATAAAAGTTACGTGTCGGATGCAAATTTGAAGTTACACCAAGAACGCCATACGAtgaaatacaaaacattttgttcTTTATGCGATAAAGGTTTTTTTAGGCGTTGCGAACTGACGAAGCACGTTGCGAACGTACACGAAAAAATCTTGCACATATGCAAAATTTGCCAAAAACCACTTTCGTCAGCGTATCAATTAAAAATGCACATGGAAACTCACAACGGCAACTAtttgacaaagaaaaaattaaaatgcagtttGTGCAACAGAGTTTTTCAAAACAACCAATCGTTAAAGTGTCATATTAAAACACACACTGATGATAAGCTGATATGTCACATTTGTGGAATATCTGTGCTGAGCAAGAAAGGTTTAGAACAACATATATTAAGACACGATGGAGTGAAACCTTACAAATGCCAGTTTTGCGAAAAGCTTTTCAGAAGTGCAAGTAATTTGACAGTTCACCTTCGAACTCACTCGCGCGTGAAGCTATACCAGTGCAATGTTTGTTCTAAATCTTTTACTCAACGTACCACTTTGGTTATTCACATGAGGCACCATACTGGAGAAAAGCCTTATAAATGTGATTTGTGCGATAGTGCTTTTGTAAGCAGatttgtCAATGAAAAATTAACGATCATTCAGCTTGGACATTTCAACCAATGCGAAAGTTTTCCTGAAGTGAAGTACCCGAAAAAACACGGAAATGTTAATAACGAAGAAAGGCGATTTGCCTGTGACCTATGTTCAAAacgctttaaacaaaaaagccATCTTAATACGCACGTTAAACAATTGCATTTAATCAAAACATATTACCAATGTGATGTATGTAATAACACTTACGCGTcggaaaaaactttaaagttacaCCAAGATCGtcatttattgaaatacaaGACATTTTGTTCTTTATGCCAGAAAGGATTTTATCGAAATTTCGATTTATCGAACCACGTCTTAAAGGTGCACGAGAAAATCTTGCACGAAtgcaatatttgtaaaaaaccacTTTCgtcaaaaagatatttaaagaaaCACATGGAAACTCACAATGGCAACTCTTcgtcgaagaaaaaaattaaatgcagtaTGTGTGACAAAGTTTTTCAAAGTAACGAGTCGTTAAACTGCCATCTTAAAATACACACTGATGATAAGCTGATATGTCACATTTGTGGAAAATCTGTGTCGAGGAAATCGAGTTTGGAAGATCACGTACTACGACACGATAACGTGAAGCCTCTCAAATGCCAAATTTGCGAAAAGCATTTCTTAGATAGAAGCAATTTAGCAGTTCATCTCCGATCTCACGCGCACGTGAAGCTATACCAATGCAACGTTTGCTCTAAGTCTTTTACTCAACGTACCAGTTTAGTTATTCACATGAGGCACCACACTGGAGAAAGGCCTTATAAATGTGATTTGTGCGATTCCGCATTCGTAAGCAAGAATTTACTAAACCTTCACAAGAAGACTAAACATTGTATAGTTCCGTGA
- Protein Sequence
- MFESNRSLKYHLKTHTDDKPICHICGKSVSCTASLAQHILRHDNVKPLKCQFCEKHFRSSSNLAVHLRSHSHVKLYQCNVCSKSFTQRSTLVIHMRHHTGEKPFVREKTTVIQLGFLNQCDICLKIFPGKRNLKKHRIVHSEERRYACNHCPRRFKQKIHLTVHVKRLHQSRESHKCDVCNKSYVSDANLKLHQERHTMKYKTFCSLCDKGFFRRCELTKHVSNVHEKILHICKICQKPLSSAYQLKMHMETHNGNYLTKKKLKCSLCNRVFQNNQSLKCHIKTHTDDKLICHICGISVLSKKGLEQHILRHDGVKPYKCQFCEKLFRSASNLTVHLRTHSRVKLYQCNVCSKSFTQRTTLVIHMRHHTGEKPYKCDLCDSAFVSRFVLEKATVTQLGFLNQCDICLKIFPGKRNLKKHRIVHSEERRYACNHCPRRFKQKIHLTVHVKRLHQSRESHKCDVCNKSYVSDANLKLHQERHTMKYKTFCSLCDKGFFRRCELTKHVSNVHEKILHICKICQKPLSSAYQLKMHMETHNGNYLTKKKLKCSLCNRVFQNNQSLKCHIKTHTDDKLICHICGISVLSKKGLEQHILRHDGLKPYKCQFCEKLFRSASNLTVHLRTHSRVKLYQCNVCSKSFTQRTTLVIHMRHHTGEKPYKCDLCDSAFVSRFINEKLTIIQLGNFNQCESFPEVKYPKKHGNVNNEERRFACDLCSKRFKQKSHLNTHVKQLHLIKTYYQCDVCNNTYASEKTLKLHQDRHLLKYKTFCSLCQKGFYRNFDLSNHVLKVHEKILHECNICKKPLSSKRYLKKHMETHNGNSSSKKKIKCSMCDKVFQSNESLNCHLKIHTDDKLICHICGKSVSRKSSLEDHVLRHDNVKPLKCQICEKHFLDRSNLAVHLRSHSHVKLHQCNVCSKSFTQRTSLVIHMRHHTGERPYKCDLCDSAFAFFMLSHAVSKVASAFLTVCYKTKSPMTRETVPLHIHLLSSMLGVVLLDLYLACVIYCQHQVQRNTTSIPGSGNVLKDCARLLGLLYSLPGVIFAVLDFGQRNTPSIPGLGNVLRDFLLDFGQRNTPSIPDPGNVLRDFSTSTWSAVFFTGRPICSVGFWPAFVLEKATVTQLGFLNQCDICLKIFPGKRNLKKHRIVHSEERRYACNQCPRRFKQKIHLTVHVKRLHQSRESHKCDVCNKSYVSDANLKLHQERHTMKYKTFCSLCDKGFFRRCELTKHVANVHEKILHICKICQKPLSSAYQLKMHMETHNGNYLTKKKLKCSLCNRVFQNNQSLKCHIKTHTDDKLICHICGISVLSKKGLEQHILRHDGVKPYKCQFCEKLFRSASNLTVHLRTHSRVKLYQCNVCSKSFTQRTTLVIHMRHHTGEKPYKCDLCDSAFVSRFVNEKLTIIQLGHFNQCESFPEVKYPKKHGNVNNEERRFACDLCSKRFKQKSHLNTHVKQLHLIKTYYQCDVCNNTYASEKTLKLHQDRHLLKYKTFCSLCQKGFYRNFDLSNHVLKVHEKILHECNICKKPLSSKRYLKKHMETHNGNSSSKKKIKCSMCDKVFQSNESLNCHLKIHTDDKLICHICGKSVSRKSSLEDHVLRHDNVKPLKCQICEKHFLDRSNLAVHLRSHAHVKLYQCNVCSKSFTQRTSLVIHMRHHTGERPYKCDLCDSAFVSKNLLNLHKKTKHCIVP
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -