Basic Information

Insect
Sisyra nigra
Gene Symbol
-
Assembly
GCA_958496155.1
Location
OY292315.1:16661201-16686001[+]

Transcription Factor Domain

TF Family
zf-C2H2
Domain
zf-C2H2 domain
PFAM
PF00096
TF Group
Zinc-Coordinating Group
Description
The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 51 0.00037 0.03 15.0 1.7 1 23 63 85 63 85 0.95
2 51 0.23 19 6.2 0.1 3 23 94 114 93 114 0.92
3 51 0.0011 0.088 13.5 1.3 1 23 174 196 174 196 0.95
4 51 0.13 11 6.9 0.2 2 23 204 225 203 225 0.91
5 51 0.36 29 5.6 5.8 1 23 287 309 287 309 0.98
6 51 8.6e-05 0.007 17.0 2.4 1 23 315 337 315 337 0.97
7 51 0.13 11 6.9 0.9 1 13 344 356 344 358 0.90
8 51 1.8e-05 0.0014 19.1 3.6 1 23 372 395 372 395 0.97
9 51 0.018 1.4 9.7 5.0 1 23 400 423 400 423 0.97
10 51 0.00033 0.027 15.1 0.7 2 23 433 454 432 454 0.97
11 51 0.00066 0.053 14.2 0.0 1 23 457 479 457 479 0.97
12 51 1.3e-05 0.0011 19.5 3.7 2 23 486 507 485 507 0.97
13 51 2e-06 0.00017 22.1 0.9 1 23 513 535 513 535 0.99
14 51 2.1e-05 0.0017 18.9 3.4 1 23 638 660 638 660 0.99
15 51 2.4e-05 0.002 18.7 2.1 1 23 695 718 695 718 0.97
16 51 0.0011 0.09 13.5 2.4 1 23 723 746 723 746 0.98
17 51 0.00042 0.034 14.8 6.2 1 23 754 776 754 776 0.98
18 51 2.9e-05 0.0023 18.5 0.5 1 23 779 801 779 801 0.97
19 51 0.00018 0.015 16.0 2.6 2 23 808 829 807 829 0.97
20 51 1.3e-06 0.00011 22.7 2.5 1 23 835 857 835 857 0.99
21 51 0.0037 0.3 11.8 0.3 1 23 863 886 863 886 0.97
22 51 0.004 0.33 11.7 0.7 1 22 927 948 927 948 0.95
23 51 0.17 14 6.6 2.5 1 23 967 989 967 989 0.98
24 51 0.013 1 10.2 0.8 1 23 995 1018 995 1018 0.92
25 51 0.00021 0.017 15.7 2.2 1 23 1025 1047 1025 1047 0.96
26 51 1.3e-05 0.0011 19.5 1.2 1 23 1052 1075 1052 1075 0.95
27 51 1.2 1e+02 3.9 5.1 3 23 1082 1103 1081 1103 0.96
28 51 9.4e-06 0.00076 20.0 1.1 2 23 1113 1134 1112 1134 0.97
29 51 4.9e-05 0.004 17.7 0.1 1 23 1137 1159 1137 1159 0.97
30 51 0.0099 0.81 10.5 4.1 5 23 1169 1187 1166 1187 0.93
31 51 3.3e-06 0.00027 21.4 0.8 1 23 1193 1215 1193 1215 0.99
32 51 0.043 3.5 8.5 0.4 1 12 1221 1232 1221 1234 0.88
33 51 6 4.9e+02 1.7 0.0 2 23 1389 1410 1388 1410 0.93
34 51 0.045 3.6 8.4 1.9 2 23 1438 1459 1437 1459 0.92
35 51 0.0048 0.39 11.5 4.4 1 23 1464 1486 1464 1486 0.97
36 51 0.059 4.8 8.1 0.9 1 23 1520 1542 1520 1542 0.92
37 51 4.2e-05 0.0034 18.0 0.3 1 23 1548 1571 1548 1571 0.97
38 51 1.3e-05 0.0011 19.5 1.7 2 23 1583 1604 1583 1604 0.97
39 51 0.006 0.48 11.2 2.2 3 23 1612 1632 1611 1632 0.98
40 51 3.3e-05 0.0027 18.3 2.1 1 23 1638 1660 1638 1660 0.99
41 51 7.6e-06 0.00061 20.3 0.4 1 21 1666 1686 1666 1687 0.95
42 51 0.036 2.9 8.7 8.6 1 23 1734 1756 1734 1756 0.98
43 51 2.6e-05 0.0021 18.6 3.4 1 23 1762 1784 1762 1784 0.99
44 51 0.65 52 4.8 0.9 1 14 1791 1804 1791 1808 0.85
45 51 2.3e-05 0.0019 18.8 2.4 1 23 1819 1842 1819 1842 0.96
46 51 0.00041 0.033 14.9 3.1 2 23 1848 1870 1847 1870 0.97
47 51 0.00016 0.013 16.2 2.6 1 23 1879 1901 1879 1901 0.99
48 51 1.7e-05 0.0014 19.2 0.4 1 23 1904 1926 1904 1926 0.98
49 51 0.0014 0.12 13.1 2.9 2 23 1933 1954 1932 1954 0.97
50 51 4.3e-07 3.5e-05 24.2 0.5 1 23 1960 1982 1960 1982 0.98
51 51 0.00048 0.039 14.6 0.3 1 23 1988 2011 1988 2011 0.96

Sequence Information

Coding Sequence
ATGAGTCCAGGTCCACTACAACTCACCGATTGGgaaagACCAATCAAAATCAAAACAGAGAAAATCAAACAAGAATTGAACAGCGAttcaattgatttcaattacgaTTGGGTACCAACAAAATTTAGTGAATCGAGCACGAATAACGTGGAAAAATCTTCAGAAAATCCTAGCAAAAGGCCAACCTATAACTGTGATTTGTGTGATTTAGTTTTTACGCAAAAGAAAGACTTTAACAAACATAAAACGTGGCACAAGACGGGTGTGACAACAAGAAACTGTGTCATTTGTGATCAAGATTTCGACAGTTTAGAGCTTCTACGTGAACATGATCGCATTCATTTTAATGGAACGACTTATGAATGTAGTTCACCAATCATATTCAAAACAGAGAATATCAAACAAGAATTGAACAGCGAGTCAAATGATTTCGAGTACGATTGGGTACAAACAAAAGCTAGTGGATCGAGCATGAAAAACGTGAAAAAATCATCTAAAAATCCCATCAAAAGACCATCATATAACTGTGATTTGTGTGATTCAGTTTTTACGCAAAAGAAAGAACTTTCCAAACACAAAACGTGGCACAAGACGGGTGTGAAAACATGGAACTGTTTCATTTGCGATCAAGACTTCGACAGTTTAAAGCTTCTACGTGAACATGACCGCATTCATTTTAATGGAACGACTTATGAATGTAGTTCGTGCGATAAAAAGGCAATCGCTTGGATTAAAGAAGATCCAGACTTGGATGAGAGCATACCTTCAAACGAGACCGTAAAGAAATCAAACAGTAATGCatcacataaaaaaataaaattcgatgAAGATGAGGAATATGAACAGGTCCTTGATAAATTCCGTTGTCTTAATTGTCTCAAAATGTTTAACCGGAAAGACAAATGTAAAGATCACTATCGCGTACACTTAGATGTACGGCCGTTTGAATGCAACTACTGTGATAAGAAGTTCAAGCAAGGAGCAACGCTAAGGACGCATAAAAAATTACACGACGACATTCGACGCACATATCAatgtaaaatttgtagaaaagaGTTCAAATCGGCCAATTATTCAAAACGATGCGAAAAAATACACGCCGGTATCACTGAACATAAATGCACGGAGTGCAATAaagaattttatgataaaagctCATTGACGATTCATTTGCAAAAAACACACAATGCAATCGAGCATCAATGCCACGTATGCAGCAAAGTCTATCCAAACAAATACAGTTTTGACTATCATTTTCGATCGGTTCACAATGCCAAACCTTCTGAAGCCACTTTAAAATGTGAACTTTGTGGAAGAGTTTATAACAATGTGAAGTTGTATCGAAAACATGTTCGATCTCATGAAGGTTTTGCTTGTGAGGTGTGCGGTAAAATGTTTGGATATGGCCAAGCATTACGCAATCATATATTGATTCATACTGGGGAAAAACCTTGGCAGTGTACTGTTTGTAGTAAAAGTTTTAGAAAAAAGCATATGCTCACCATTCACTTGCGAACACATACCAATGAAAAGCCGTATGAGTGCAAAGAATGTGGAAAGCGTTTTACACAGCGCACTCCATTAGTAATACACACTAGGACTCATACTGGAGAACGGCCCTATCTCTGCTGTTTGTGTTCAGAAGCTTTCATCAGCTCCGATTTCAGGGACCCGAAATGGATTAAAGATGAATCGTTCTCTGACTCTGGTGAATCAGAAGAAAACGACATTGAGGACGTTGAGATCGATGAAATATCAATAAAAGAAGAACTGGTCACAAAAATAAGATTTGTGGAACACGAACATTATAATCGCATCGATGATAAATTCTGCTGTCTGCACTGTGATAAACAGTTTTTAACAAAAGGCCACTGCAAAGAACACTACCGACGACATTTAGCTTTGAGACCATTTCAATGTAGTAAGTGTGAGAAGTCCTTTAAAGTGAGAACTTCGCTAAAAAAGCACGAACAAACTCATGAGTCAATTCGAAAAAGCTATAATTGTAATTCTTGCATCAAGTCGTATAGAGCGGCCAAAGACGCCAAACTTTGTGAAAAATCTCATGTAAATGATTTTAGTTACAAATGTACACTATGTGATAAAGGATTTCACTATAAAAGTTTACTAACTGCTCATATGGACGTAAAGCATAGTACGAACGAGTATCAATGTACCATTTGCGACCAGCTCTTTGCGAGTAAATACAAATACGAATATCATTATCGAAAGATGCATAATCCAAAAGGCCGTCGTCATCACACATGTGAAATATGTAATAAAACTTATGAAACAACGCATAACTACAATAGACACCTTAAATCGCATGAAGGATTCGCTTGTGATATTTGCGGGAAAATCTTTAGATATGAGCACGGATTGAAAAGTCATTTATTGATACATCGCGGGGAACGACCCTGGCAATGTACGCTTTGTGAcaaaacttttaacaaaaaacatattCTCACCGTGCATATAAGGACTCATACAAATGAAAAACCTTATGAATGTAAGGAATGTGGAAAGTGTTATAGTCAGCGAACGCCTTTGGTAACGCACATGCGAACTCATACCGGCGAGAGACCGTATCCGTGCCGTTTATGTACTGGAGCTTTCATTAGTAAAAATTATCTAAGACTTCATATGAACAGTGCACATAATTGTTCAGATTTGACCAACAACGATAATGATACTTGGGAAAACAAAGCGGATCTTAAAATGAACGCAACAGACATTAAAACTGATAAATCGCAAGACATTAATTTCAGTATCTACGAATATCAATGCGAAAAATGCACGACAATATTTACAagtgaaaaaatgttatttgaccATCGCTTAACTTGTGATATCTCTGTGATTGATGTTGGTGACGGCTATGTTCTTAAAACAGATGGATATCATTGTAAAGGGTGTAAAGATAAATTCCAAAGCAAGGATGAATGGTTATTACACTATCGCATACACTCGTTTAAGAAACGTTATTTTTGTGAAGTATGTTCCTTGTATTTTAGATATAGTTGGGACTTGAAGAAACATAATATCTCGGTACATGAAATGCAAGGGAATTTTCATATTTGCGATGTGTGTGGAACTGGATTTCgtttgaaaaaatctttgaagaaacaccaagaaaaacatttaaatcaagGATATAAATGCAATAAATGTAATGAGGCTTTCGTGACGAAGTCTGAATTGGAACAGCATAAAAACGAAAAGCATGTTAAATTAGACAATTTTTGTGATATATGCAACAAATTATATACAACTAAATGCAACTACCAATTCCATTATAAAACGTTCCACGATAAAAGTTATGTGCCTCCAAATTTACGATGTGAAGTGTGCGAAAAAGTGTATGATTCGCGTTCGAAGCTTAAAAGACATTTACGAAGCCATGAAGGCTTTGCATGTTATATTTGTGGAAAAGTTCTTTCGACCAAGGCGAGTCTTCAAGACCATTTACTTGTGCACAGTGGTGATAAACCAATCTCGTGTATATTTTGTGATAAGCATTTTAGAAAAAAGAGGACTTTACAAATTCATATTAGGCAACACACCGGCGAAAAGCCTTACCAATGCAACGAATGCGGCAAGCAATTCGTGCAACGCGTTAGTCTTGTAAGCCATAGGCGAAACCATACGGGCGAAAGGCCTTATCTGTGTAATATTTGTGGCAAGTCTTTCAAAAATGTGTACGGAAGCTACAAGAATGTAGCGATTAAACAAGACCCGGGTGATTTTGAACCTATTGTTACAATATTGGAACTGCCTGTAGAAAAAGAGCTTGAGAATCAAAAGAAACGCACTGTTGGTTCGGGCGATGAGGCTGATTTAACGTCAGCCAAACTGAATCAAAAACGCAAACCAAAAACTAGGGCTGgtcgaaaaaagaaaaaaattcaaccgaCTGAAGACGAGTCAGATGAGGATGATGGTGAAGATGAGGACGATACTGATTTCAAGCCAACTAAGTCGACAAGAAAAGTTAGAGCTAAAAAGACGAAAAAACTTGAGAAGGACGAAGAAGACGATGATGGATATCGAAATCCTTTTAACTATGGCTTTGATATTCCCACAGCCCCAACCAGAAAACGAAGACCATTCACAAAAAGTCCGTCTACGCCAAACGCAGCGACGAAACCATCTAAATCAAAATGGTCGTGCTTGATTTGTTTAGCTGATTTGGCATCGAAAGAAGCTCTAATTGCTCATTATGCAGTTCACAAAGATGACCCGGTTAAACAAGAACAGGAAAACTCCGAAAAAACGGGATCTTCAACGCAACATGTTTCCATAAACGAGTTACAATGTCAACTGTGCGGTCAAATATTTGAAACTGATAAGTTGTTAAGAAAACATTGTGAAACGCATATTGAAGAACGTTTTCATTGTGACCTTTGTAACAAGTTCTTTAAACGAGCTTATGATTTTTCGATGCATTCGAAGGTCCATTCAAATGAACCCAAAATAAAATGTACTTCGTGTGATTTCGAGACAATTTATCAATGTGCCATGAACGTTCATAATCAACGCCACGAgtcaaattataaatacaaatgtgAAATTTGTAATAAAGGATTCTTCGCTTTAAATTGGTATGAGGAACATAAAAATTTCCATACTGGAGCTAAACCGTTTGTATGTGAAATCTGTGACAAACCATTTGCATATACTCGCTATTTAACAGCTCATAAACGCATAATGCATCCAGAAAGCTATCGAGATGAACCTATCGTAAATGAATGCGACATATGCAATAAACGCTTTGCCCACCAAAAAAGCTTAATGTTACACATGCGAGGTCACACTGGTGAAAATACAGTTCTTTGTGATATTTGTGGCAAACGTTTATCAAGTAATGAGCATTTGAAATTTCATCGTAGAATTCATACGGGGTATAAGCCGTACACATGTGATGTTTGCAGTAAAGGCTTtgcaaaaaaatgtaatttaacgtTACATTTACGAGTCCATAGCGGCGAAAAACCATATGTTTGCGATGTTTGTGGGAAATGTTTTTCGCAACGTTCGACGTTAGTTATTCATGAACGGGAGGTCTCTTGGATTAAAGAAGAAACACATGATATTGAAGAAAATGAAACGAACGAGCGTGACAATAAAAAGCTCGACGAAGATCGCGTTGGTAATATAAAATTCGACGAAAACGAAGAGTATTCACGCGTAGGCAAAAAATTCAGTTGTTCGCATTgtaaaaagacttttattaaGAAAGCACAGTGCAAAGAACATTATCGAACACATCTAAATATGCGGCCGTATCAGTGCAACAGCTGTGATAAGACGTTCAAACTGAAAACAACATTAAAATCACATCAAAAAATTCACGATACATCTCGACCAACGTTCGAATGTAGTGAATGCTATAAATCGTACAAATCAGCCAACTATGCGAAACGATGCGAGAAAGCTCATACgggtgtatttaattttaaatgtccaGATTGTGATCGTGGATTTCATCACAAAAGTTCCCTCAGCGTTCACATTGAGATCAAACATAACGGTACTGAACAGCAATGCCACGTGTGCagcaaaattttttcaaacaaagtCAACTTCGAGTATCATTTTCGAAGAATACACAATCCCAAACCATCTAATATCGACTTCAAATGTGAATTGTGCGGTAAATCTTacaacaatattaaattttataaaagacaTGTTAGATCTCACGAGGGTTTCCCTTGTGATATATGTGGAAAAATATTCAGATATGGACACGGCTTAAAGAATCATATGTTAATTCATCGCGGTGAAAGGCCGTGGACTTGTTCTGTTTGTAGcaaaacatttaacaaaaagCATATACTTACCGTCCATTTACGCACTCATACCAATGAAAGACCGTACGAATGTAAAGTGTGTGGAAAACGATTCACACAAAGAAGTCCATTAGTTATCCATATGAGACAGCACACTGGAGAAAGACCATATCCATGTCGGTTATGTTCAGAAGCATTTgttagcaaaaattatttacgaCTTCATATGCAAAATATTCATGATTGTCCAAATGTATAA
Protein Sequence
MSPGPLQLTDWERPIKIKTEKIKQELNSDSIDFNYDWVPTKFSESSTNNVEKSSENPSKRPTYNCDLCDLVFTQKKDFNKHKTWHKTGVTTRNCVICDQDFDSLELLREHDRIHFNGTTYECSSPIIFKTENIKQELNSESNDFEYDWVQTKASGSSMKNVKKSSKNPIKRPSYNCDLCDSVFTQKKELSKHKTWHKTGVKTWNCFICDQDFDSLKLLREHDRIHFNGTTYECSSCDKKAIAWIKEDPDLDESIPSNETVKKSNSNASHKKIKFDEDEEYEQVLDKFRCLNCLKMFNRKDKCKDHYRVHLDVRPFECNYCDKKFKQGATLRTHKKLHDDIRRTYQCKICRKEFKSANYSKRCEKIHAGITEHKCTECNKEFYDKSSLTIHLQKTHNAIEHQCHVCSKVYPNKYSFDYHFRSVHNAKPSEATLKCELCGRVYNNVKLYRKHVRSHEGFACEVCGKMFGYGQALRNHILIHTGEKPWQCTVCSKSFRKKHMLTIHLRTHTNEKPYECKECGKRFTQRTPLVIHTRTHTGERPYLCCLCSEAFISSDFRDPKWIKDESFSDSGESEENDIEDVEIDEISIKEELVTKIRFVEHEHYNRIDDKFCCLHCDKQFLTKGHCKEHYRRHLALRPFQCSKCEKSFKVRTSLKKHEQTHESIRKSYNCNSCIKSYRAAKDAKLCEKSHVNDFSYKCTLCDKGFHYKSLLTAHMDVKHSTNEYQCTICDQLFASKYKYEYHYRKMHNPKGRRHHTCEICNKTYETTHNYNRHLKSHEGFACDICGKIFRYEHGLKSHLLIHRGERPWQCTLCDKTFNKKHILTVHIRTHTNEKPYECKECGKCYSQRTPLVTHMRTHTGERPYPCRLCTGAFISKNYLRLHMNSAHNCSDLTNNDNDTWENKADLKMNATDIKTDKSQDINFSIYEYQCEKCTTIFTSEKMLFDHRLTCDISVIDVGDGYVLKTDGYHCKGCKDKFQSKDEWLLHYRIHSFKKRYFCEVCSLYFRYSWDLKKHNISVHEMQGNFHICDVCGTGFRLKKSLKKHQEKHLNQGYKCNKCNEAFVTKSELEQHKNEKHVKLDNFCDICNKLYTTKCNYQFHYKTFHDKSYVPPNLRCEVCEKVYDSRSKLKRHLRSHEGFACYICGKVLSTKASLQDHLLVHSGDKPISCIFCDKHFRKKRTLQIHIRQHTGEKPYQCNECGKQFVQRVSLVSHRRNHTGERPYLCNICGKSFKNVYGSYKNVAIKQDPGDFEPIVTILELPVEKELENQKKRTVGSGDEADLTSAKLNQKRKPKTRAGRKKKKIQPTEDESDEDDGEDEDDTDFKPTKSTRKVRAKKTKKLEKDEEDDDGYRNPFNYGFDIPTAPTRKRRPFTKSPSTPNAATKPSKSKWSCLICLADLASKEALIAHYAVHKDDPVKQEQENSEKTGSSTQHVSINELQCQLCGQIFETDKLLRKHCETHIEERFHCDLCNKFFKRAYDFSMHSKVHSNEPKIKCTSCDFETIYQCAMNVHNQRHESNYKYKCEICNKGFFALNWYEEHKNFHTGAKPFVCEICDKPFAYTRYLTAHKRIMHPESYRDEPIVNECDICNKRFAHQKSLMLHMRGHTGENTVLCDICGKRLSSNEHLKFHRRIHTGYKPYTCDVCSKGFAKKCNLTLHLRVHSGEKPYVCDVCGKCFSQRSTLVIHEREVSWIKEETHDIEENETNERDNKKLDEDRVGNIKFDENEEYSRVGKKFSCSHCKKTFIKKAQCKEHYRTHLNMRPYQCNSCDKTFKLKTTLKSHQKIHDTSRPTFECSECYKSYKSANYAKRCEKAHTGVFNFKCPDCDRGFHHKSSLSVHIEIKHNGTEQQCHVCSKIFSNKVNFEYHFRRIHNPKPSNIDFKCELCGKSYNNIKFYKRHVRSHEGFPCDICGKIFRYGHGLKNHMLIHRGERPWTCSVCSKTFNKKHILTVHLRTHTNERPYECKVCGKRFTQRSPLVIHMRQHTGERPYPCRLCSEAFVSKNYLRLHMQNIHDCPNV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-