Ccar011720.1
Basic Information
- Insect
- Catonia carolina
- Gene Symbol
- -
- Assembly
- GCA_035578175.1
- Location
- JAQMRL010000001.1:205152527-205199882[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 57 6.1 9.8e+02 2.3 0.5 2 23 47 67 47 67 0.92 2 57 3.7e-05 0.0059 18.7 4.1 2 23 71 93 71 93 0.96 3 57 0.00058 0.092 15.0 1.6 2 23 101 123 101 123 0.96 4 57 0.0021 0.34 13.2 0.4 1 23 130 153 130 153 0.94 5 57 0.014 2.2 10.6 2.8 3 23 162 183 160 183 0.94 6 57 7.6 1.2e+03 2.0 0.2 2 11 190 199 189 209 0.79 7 57 0.00029 0.046 15.9 0.3 3 23 217 238 215 238 0.94 8 57 0.00042 0.066 15.4 2.0 1 23 245 267 245 267 0.99 9 57 0.00016 0.025 16.8 1.5 1 23 273 295 273 295 0.99 10 57 0.084 13 8.2 2.1 1 21 301 321 301 322 0.95 11 57 0.35 56 6.2 0.1 2 23 361 382 361 382 0.97 12 57 0.14 23 7.5 0.9 3 23 423 443 422 443 0.97 13 57 0.00093 0.15 14.3 0.8 2 23 447 469 446 469 0.97 14 57 0.0009 0.14 14.4 2.0 1 23 476 499 476 499 0.97 15 57 0.015 2.3 10.6 0.4 1 23 505 528 505 528 0.95 16 57 2.3e-05 0.0037 19.4 1.3 2 23 536 558 535 558 0.96 17 57 1.1 1.7e+02 4.7 0.5 2 23 565 584 564 584 0.82 18 57 0.0056 0.9 11.9 0.2 2 23 591 613 590 613 0.92 19 57 0.28 44 6.6 0.0 2 23 656 677 655 677 0.94 20 57 0.05 7.9 8.9 0.4 2 23 681 703 680 703 0.95 21 57 9.7e-06 0.0015 20.6 0.7 2 23 711 733 711 733 0.96 22 57 0.0015 0.23 13.7 0.7 3 23 742 763 740 763 0.93 23 57 0.00024 0.038 16.2 1.1 2 23 771 793 770 793 0.96 24 57 0.00043 0.068 15.4 0.4 2 23 827 849 826 849 0.93 25 57 0.0062 0.99 11.7 2.9 1 23 856 878 856 878 0.98 26 57 0.00012 0.019 17.2 1.0 1 23 884 906 884 906 0.96 27 57 0.003 0.47 12.8 0.3 2 21 937 956 936 957 0.93 28 57 0.0021 0.33 13.2 0.1 2 23 990 1011 989 1011 0.97 29 57 1.5 2.3e+02 4.3 2.5 2 23 1015 1037 1014 1037 0.94 30 57 1.4e-05 0.0023 20.0 2.3 2 23 1045 1067 1045 1067 0.96 31 57 0.009 1.4 11.2 1.0 3 23 1076 1097 1074 1097 0.92 32 57 0.00071 0.11 14.7 3.6 2 23 1105 1127 1104 1127 0.95 33 57 0.0042 0.67 12.3 0.4 2 23 1142 1164 1142 1164 0.96 34 57 0.00018 0.028 16.6 1.6 2 23 1172 1194 1172 1194 0.96 35 57 0.00043 0.068 15.4 0.8 1 23 1204 1227 1204 1227 0.97 36 57 0.00018 0.028 16.6 1.3 2 23 1235 1257 1234 1257 0.95 37 57 0.14 22 7.5 0.2 2 23 1299 1320 1299 1320 0.96 38 57 0.00014 0.023 16.9 3.9 2 23 1324 1346 1323 1346 0.96 39 57 0.00018 0.028 16.6 0.3 2 23 1354 1376 1354 1376 0.94 40 57 0.0039 0.63 12.4 1.6 1 23 1383 1406 1383 1406 0.95 41 57 0.00074 0.12 14.7 0.5 2 23 1413 1435 1412 1435 0.94 42 57 3.3 5.2e+02 3.2 0.3 2 23 1443 1462 1443 1462 0.71 43 57 0.46 74 5.9 0.2 2 23 1469 1490 1468 1490 0.93 44 57 0.00011 0.017 17.3 0.4 1 23 1497 1520 1497 1520 0.96 45 57 3.8e-06 0.00061 21.9 0.5 1 23 1526 1548 1526 1548 0.99 46 57 0.045 7.1 9.1 0.3 1 20 1554 1573 1554 1575 0.91 47 57 5 7.9e+02 2.6 0.7 2 23 1612 1634 1612 1634 0.83 48 57 0.017 2.6 10.4 1.0 2 23 1638 1657 1637 1657 0.89 49 57 0.015 2.5 10.5 0.2 2 23 1825 1846 1825 1846 0.96 50 57 0.02 3.2 10.1 3.9 2 23 1850 1872 1849 1872 0.94 51 57 1.3e-05 0.002 20.2 0.4 2 23 1880 1902 1879 1902 0.97 52 57 0.0015 0.24 13.7 0.8 3 23 1911 1932 1909 1932 0.96 53 57 0.00064 0.1 14.9 0.4 2 23 1939 1961 1938 1961 0.94 54 57 1.6 2.5e+02 4.2 0.5 2 23 1969 1988 1969 1988 0.72 55 57 1.2 1.8e+02 4.6 0.2 2 23 1995 2016 1995 2016 0.93 56 57 8.2e-05 0.013 17.7 0.2 1 23 2023 2046 2023 2046 0.97 57 57 0.018 2.9 10.3 3.1 1 23 2052 2074 2052 2074 0.99
Sequence Information
- Coding Sequence
- ATGGATTTTGCCAAAAGAACCATTGTGAAAGAAGAATTAACATTTTGGGGCGGTGAACAAATTGAGGAAATTGTAGTACAACAAGCACCGATTCGTGTATCTCGTCGAAAGAGTAGAAAGAACAAAGACATTAAACGAACTTGTGACATATGTGGAGTCGTAGTCGAAAGTCATATGTACAGAACACATAAAGCTACTCATGTTGACAGGTCATGTCATATTTGCGGCAAGACATTTCGGAAACTATATTATCTACGAAAGCATAAACGAAATGTCCACTCAATCGGTCAAGTTTTTGAAAGTTGTAAAATCTGCGGTAAAGTATTTAAAGGTGAACAACATGTGAAAGATCACATGAGTCGAATGCATACATCTAGAGAAAAAACATATGAATGTGAAATATGTAACCGAAAATTTACGTCCAAGATAAATGCTGTTGCCCACAAGACTGTATATCATGCAGATGATTCTAATGCGGTTTGTTGCACGATTTGTGGCAAAACGTTAAAACATCGCCGGTTATTATTAGGGCATATTAAACAAGTTCACTTTAAACCTGCTGAAATTGTTTGCGAAATATGCGGGAAACCTGTTAAAGATATCATCCGCCATCATAAGATTATGCATTTAGAACGTACTTTCATTAACTGCGATATATGCGGTAAACAATACGCTaataagagaactttaaaagatcACCAATTAAACGTTCATTCAGAGGTAGAAAGAAAATACACTTGTCAGACATGTGGCGCTTCATTTAAAGCCAGTTGTACGTTAAAACTTCATATGATAACGCACCAAACGGAACGGAATTTTAAATGCGATATTTGCAATAATACATTCAAATTAAAAAGCGTTTTGAAAACTCATCTCAGAGTTCATAATTCAGATGAACCATATAAGTGCGATAAGTGTAGTGAAACGTTCAGGTGGAAATCTACGTACGATAAACATTTGAAAAAGTGCGTGAAAGAAGAATTAACATTTTGTGGCGATGAACAAATTGATGGAAAGGTAATAAAAACAGCTTCAGTTTTTGTGCCACATATAAAATTTGAAGCCAACGATAAAGAACGAACTTGTGAATTATGCGATATTGTAGTCGAAAATTATGTAATGTACAGAAAACATAAAGCTACACATGTTAACAAGACATGTGAAGTAAAAGAAGAATTGACATTTTTTGGCGATGAACAAATTGATGGGAGTGTAATGGAAACAGTTCCAAATGTAAATTTTGAAATGGGTAACAAAATACGAATTTGTGAATTATGTGGAATGGTAGTCGAAAATCATAAAATGTACAGAAAACATAAGGCTACTCATGTGAACAAGACATGTGAAGTTTGTGGCAAGATATTTCTGAAATTATACCTTCTTCAACGTCATAGAAGAACCGTCCATGACATCGATCCAGTATATCATAGTTGTCAGATTTGTAACAAAGTGTTTAAAAGTGATCAATATGTAAAAGATCATATAAGACGAATTCATAGGTCGATCGAGCGATTTCAATGCGAAATATGCTATAAAAAACTTACATCTAAATTATTACTTGAGGCCCATAAAACTGTCGTTCATGCCGATGGTTCGAAAGAAGTTGTTTGTGATGAATGTGGTAAAACATTTAAACATAAAGGCTTATTGCTACAACATTCGAGAAGACTTCATATGACCTCTAACGGCTCTGTTTGTGAAATATGTGGCAAACGTGTGAAGGGTTTGAATCGTCATCGTAAGCGTGTCCATTCAAAACGTCCAATTCTTAGCTGTGATTTGTGCGGTAAATCGTATACCAGCGAAGGAGGTTTTAgagaacataaaataaatattcattcgaATTTAGAAAGAAAACACGTGAAAGAAGAGATTATTGATGATGAAAATGATGAATACTGGTTAGGAAAGTCACCGATTATTGCGAGCATGATAGAAACATCGATTTcgtttaataaagaaataacttGTGATATTTGTGGATTGatattagaaaataatttgttactAGAAAAACATAAAGCTTATCATAAGAAAAAGACATGTGACATATGTAATATAACATTACCGGATATATATAAGCTACAACGGCATAAAAAATATGTCCACTCAATAGATTTGGTGTATGAAAGTTGTCAGGTATGTGGTAAAGTATTTAAAAGTTCAATTTATCTACGTGATCATATGAAACGCCTTCATTCTGTCAGAACGAAACCATTCGGATGCGATATTTGTCATAAAAATTTTTCTAGAAAATCATTAGTCGTTACTCATATTGATCTGGTTCACAATGACGAAGCTGAATCACTCCGTTGTGATTTATgcggaaaaatatttaaacaccgCATAACATTGGCTGATCATTTAAAAAGAATTCATTCGTCAGTTTGCCAGGTTAAAGTTTGCGAGATATGcggtaaaaatgttaaatatattgaAGAACATATTCAACGCATGCATACGGATCGGTCTAACAAACCATGCGATATCTGTGGTAAATCGTATGCGAGTGAAAAAAGTTTGAAAGTTCATAAGAAATTTGTACATGGTTCAGTGAATCGAAAATACGTTTGTTCAATATGCAATGCTGCATTTAAGTGCAGTTCTTCTCTAAAACATCATTTATCCGTACACAAAGATGAACGTAAATTTAAATGTGATATCTGCAATCGTACTTTTAAATCAAAAGGTGTTCTAAAAACTCATAGTATAGTTCATACTACCGAAGAATTAAACGTGATAAGCGATAAGAATGTTAAACACAATGAAGAGGATGTTAAACACATGAATTCTGATCGCTCTAACACAACGTGTAATATTTGCGGGAAATCATATGCGAGCGAACGAAGTTTAAAAGGGCATAAGAAGTATGTGAATGAAGAAAACAGTGATGAGGAATACTGGTTAGGAAAGACACCAATTATTCCGGAAGTGATAGAAACGGCAATTTCCATTGAGAAAGAAATAACTTGTGATGTTTGTGGAATGAGTTTAGAAAATAATGCTTTATTAAAAAAGCATAAAGCTACACATAAGAGTAAAACTTGTGACATATGTAACAAAACATTATGGGATATACATAAACTACGTAGACATAAAAAATATGTCCATTCGGTTGACTTGGTATACGAAAGTTGTTCGATATGTAGTAAAGTATTTAAAAGTTCAATTCATCTGCGAGATCATATGAGACGAGTTCATTCGATCAGAAAAAAACCATTCGGATGCaatatttgtcataaaaaattttctactaaAGCGTTTGTCGTTACACATATAGATTTAGTTCACAATGATAATGCTGAATCTCTCTGCTGTGATATGTGCGGTAAAAAATTCAAACACCAAATAACGTTGGCAGATCATTTAAAAAGGATTCATTTATCAGATTGTCAGGTTAAAgaagaagaagaagaagcaagaACATGTGATATATGTGGAAAAATAAGTGCCAAGAGAGAATATTTACTTAAACATAAAAGAAATGTTCATTCAAAAACTACAGCACATCGAAGTTGCAAATTCTGTGGTAGAATATTTAAGAGTCTTGAATATCTTCGACAGCATACCAGGAGAGTTCATTTACCGCAAAAACAACCAAAAAAGAGTTACCAGTGTGATTCATGCGGTAAAATTTTTGCGAAAAAATCGTACTTTGATGCTCACAAGAGCACCGTTCATATTGGAAGTCGCAAAGCAGTTGTCTGTGATAAATGTGGTAAAACATTTAAGCATCGTTTAGTGTTGCAAAAACACATTCAGAATGTTCATATTAAAGATAAACAAGTGAAAGAAGAAATAAGGAATTCAAGAGAAGAAACTGAATGGAATGTTCAATATAAGTCAACTACTCCAGAAATGATGGTAAACATAAAACTCCATGAAATTGGAAGAACTTGTGAAATATGTGGGATTGTAGTTGGCGATTACAAAATGTTTAGAAAACACAAAACAATCCATGTGGACAAGACATGTGACATCTGTGGGAGATCATTCACGAAGTTATGTCTTCTTCGAAAACACCAAAGAAATATGCATCCGGCTTATTCCGATAATCGCCGTTGTGATTTGTGCGGCAAAATATTTAGGTCGCTAATAAATCTTAAGCAACATGTGAAAGCGGTTCATTCAAACAGAGAAAAGAAATTCAATTGCGATTTATGCGAGAAAAAATTTGTGACcaacttttttcttaaaaatcataaaataagtgCACACGATGGTTCTAATGTTTTATCGTGCGATAAATgcgataaaacttttaaattaaaaattcttctgAAAACACATTACGCAGTAGTGCACTCGAAGACTGCCGTTAAACCTGTCTGTGAAATCTGTGGCAAAAAAGTTAGGAAACTTGCTACTCATATTAGACTTTTACATTCTGATCGTCCAACTAAGAAATGTGATGATTGCGGAAAAATAGTTTTTGCCGATACACTTGAATTGCATAAATTAAGAGTTCATTCTGATCGGGAAAGAAATTACGTTTGTCATTTATGCGGTGCATCATTTAAAGACAGTACTGTTTTAAAAAACCATGTGCGTGGCGTACATTCGGacaatagaaattttaaatgtaaaatctgCGGTAGGGGTTTTAAATCGGCGGATGTTATGAAAAATCATATGAGAGTTCATAGTGATGATAAACCTTACACATGTAATGTTTGTGATGAATCTTTTAAGTGGAGAAGTACATTTGGTAAACatGTGAATGAAGAATTAAGGAATTCAAGAGAAGGAACTGAATGTTTTGTACAATATAAGTCAACGATTCCAAAAATGATGGtaaacgaagaagaagaagaatctcaTGAATTAATAAGAACTTGTGAAATATGCGGTAAATTTGTACTTGGGAACAATGAAATGTACAAAAAACACAAATCAACCCATGCGGCAAAACCATGCGATATTTGCGGAAAAACATTTTCGAGTCTTCAAAGGCACAAGAAAAATATGCATTCCATGCGTTCAGATGCTCAGTGTTGTGATGTTTGTGGCAAAGTGAACGAAGAATTAAGAAATTCAAGAGAAGGAACTGAATGGAACGTACAATATATGTCAACGATTCCAGAAACTATGGTAAACGTAAAAGTAGAAGAATCTCATGAATTAGTAAGAACTTGTGAAATATGTGTTGGTGAGGCGGTCAGAGTCGAATCTGTAATTTTGTATTACTGGATCGCTTGTCGCCGCGACAAAGAATGGACCGCGTGGGTAGACCACGTGCTGTTTGACAGAGATGATGCATTGTTGCGTATTACAACTAGACCGCCTGACCGCTTCGGTCACAGATCGCTGACCGCCGCGACAGACCGCTTTCTGTGTGCCCAAGCCTTTAGCCCACCAAGTAAGGTGGATGAAGACCTAACAGATTCAAGAATTGAAATTCACCAATCTAATAGAAATGCATCCGGTAATCCAAATATTACTGTTGAAACAATGTCTAAAACCGGAGGAAGAACTTGTGAAATATGTGGATTGGTTGTATCCGATTACAAAATGTACAGGAGACATAAATCTGTTCATTTAGATAAAACGTGTGAAATTTGTAATAAGACATTCGTATTGCGGTGTCTGTTAGTGCGGCACCAACATAATGTACATTCTATTAACTCAAAACATCAGCGTTGTAATATATGTGGCAGAGTGTTCAATTCTGGAGACTATCTTAGACAACATATTAGACGAGTTCACACATCACGAGAAAGAGTGTTCGGGTGTGATTTATGTGATAAAAAATTTACGAGTAAAGATTACGTTAAAATTCATAAACAAAGTGTTCATGATGAGTGTTCTTTATTGCCATGTGATATATGCGAACAACAATTTAAGTATCGCAATCAATTGAAAAGACATTATTCAGTCGCTCATGGACAAACCTCCGTCACACCAGTTTGTGAAATATGCGGTAAAGAAGTTACAAGACTTAAGAATCATATGAAACGCTTACATTCAGATCGTAAAACTGAGAAATGTGATTTATGTGGAAAAATAGTGTTCGCTGATTTACTTGAGAACCATAAAAGTAGAGTTCATTCTGATCGAGAACGAAAATATATTTGCGATATATGTGGCGCATCTTTTAAAGATAACAAAGTTCTAAGGAATCATACAAAGATGGTACATTCGAATGACAGAAAATTCAAATGTGGTATATGTAGTAGCACGTTTAAATCAGCTTGTGTGTTAAAAACTCATAAAAGAGTCCATAGTGATGACAAACCTTTCATTTGTACGTGTGGTGAAATGTTTAAATGGAAAACTACTTTCAATAAACATACTCAGAAATGTAAAAATATCGAAAAATCTGATACGTAA
- Protein Sequence
- MDFAKRTIVKEELTFWGGEQIEEIVVQQAPIRVSRRKSRKNKDIKRTCDICGVVVESHMYRTHKATHVDRSCHICGKTFRKLYYLRKHKRNVHSIGQVFESCKICGKVFKGEQHVKDHMSRMHTSREKTYECEICNRKFTSKINAVAHKTVYHADDSNAVCCTICGKTLKHRRLLLGHIKQVHFKPAEIVCEICGKPVKDIIRHHKIMHLERTFINCDICGKQYANKRTLKDHQLNVHSEVERKYTCQTCGASFKASCTLKLHMITHQTERNFKCDICNNTFKLKSVLKTHLRVHNSDEPYKCDKCSETFRWKSTYDKHLKKCVKEELTFCGDEQIDGKVIKTASVFVPHIKFEANDKERTCELCDIVVENYVMYRKHKATHVNKTCEVKEELTFFGDEQIDGSVMETVPNVNFEMGNKIRICELCGMVVENHKMYRKHKATHVNKTCEVCGKIFLKLYLLQRHRRTVHDIDPVYHSCQICNKVFKSDQYVKDHIRRIHRSIERFQCEICYKKLTSKLLLEAHKTVVHADGSKEVVCDECGKTFKHKGLLLQHSRRLHMTSNGSVCEICGKRVKGLNRHRKRVHSKRPILSCDLCGKSYTSEGGFREHKINIHSNLERKHVKEEIIDDENDEYWLGKSPIIASMIETSISFNKEITCDICGLILENNLLLEKHKAYHKKKTCDICNITLPDIYKLQRHKKYVHSIDLVYESCQVCGKVFKSSIYLRDHMKRLHSVRTKPFGCDICHKNFSRKSLVVTHIDLVHNDEAESLRCDLCGKIFKHRITLADHLKRIHSSVCQVKVCEICGKNVKYIEEHIQRMHTDRSNKPCDICGKSYASEKSLKVHKKFVHGSVNRKYVCSICNAAFKCSSSLKHHLSVHKDERKFKCDICNRTFKSKGVLKTHSIVHTTEELNVISDKNVKHNEEDVKHMNSDRSNTTCNICGKSYASERSLKGHKKYVNEENSDEEYWLGKTPIIPEVIETAISIEKEITCDVCGMSLENNALLKKHKATHKSKTCDICNKTLWDIHKLRRHKKYVHSVDLVYESCSICSKVFKSSIHLRDHMRRVHSIRKKPFGCNICHKKFSTKAFVVTHIDLVHNDNAESLCCDMCGKKFKHQITLADHLKRIHLSDCQVKEEEEEARTCDICGKISAKREYLLKHKRNVHSKTTAHRSCKFCGRIFKSLEYLRQHTRRVHLPQKQPKKSYQCDSCGKIFAKKSYFDAHKSTVHIGSRKAVVCDKCGKTFKHRLVLQKHIQNVHIKDKQVKEEIRNSREETEWNVQYKSTTPEMMVNIKLHEIGRTCEICGIVVGDYKMFRKHKTIHVDKTCDICGRSFTKLCLLRKHQRNMHPAYSDNRRCDLCGKIFRSLINLKQHVKAVHSNREKKFNCDLCEKKFVTNFFLKNHKISAHDGSNVLSCDKCDKTFKLKILLKTHYAVVHSKTAVKPVCEICGKKVRKLATHIRLLHSDRPTKKCDDCGKIVFADTLELHKLRVHSDRERNYVCHLCGASFKDSTVLKNHVRGVHSDNRNFKCKICGRGFKSADVMKNHMRVHSDDKPYTCNVCDESFKWRSTFGKHVNEELRNSREGTECFVQYKSTIPKMMVNEEEEESHELIRTCEICGKFVLGNNEMYKKHKSTHAAKPCDICGKTFSSLQRHKKNMHSMRSDAQCCDVCGKVNEELRNSREGTEWNVQYMSTIPETMVNVKVEESHELVRTCEICVGEAVRVESVILYYWIACRRDKEWTAWVDHVLFDRDDALLRITTRPPDRFGHRSLTAATDRFLCAQAFSPPSKVDEDLTDSRIEIHQSNRNASGNPNITVETMSKTGGRTCEICGLVVSDYKMYRRHKSVHLDKTCEICNKTFVLRCLLVRHQHNVHSINSKHQRCNICGRVFNSGDYLRQHIRRVHTSRERVFGCDLCDKKFTSKDYVKIHKQSVHDECSLLPCDICEQQFKYRNQLKRHYSVAHGQTSVTPVCEICGKEVTRLKNHMKRLHSDRKTEKCDLCGKIVFADLLENHKSRVHSDRERKYICDICGASFKDNKVLRNHTKMVHSNDRKFKCGICSSTFKSACVLKTHKRVHSDDKPFICTCGEMFKWKTTFNKHTQKCKNIEKSDT
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -