Cimp080952.1
Basic Information
- Insect
- Cordilura impudica
- Gene Symbol
- Zfx
- Assembly
- GCA_963682025.1
- Location
- OY821520.1:84918033-84924988[-]
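The Location value packs the scaffold accession, start, end, and strand into one string. A minimal sketch of unpacking it, assuming the `accession:start-end[strand]` layout shown above and assuming the coordinates are 1-based and inclusive (neither is stated in this record); `parse_location` is a hypothetical helper name.

```python
import re

# Assumed layout: accession:start-end[strand], e.g. OY821520.1:84918033-84924988[-]
LOCATION_RE = re.compile(
    r"(?P<accession>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(location):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "accession": match["accession"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Span length, assuming inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("OY821520.1:84918033-84924988[-]")
print(loc["accession"], loc["strand"], loc["length"])  # OY821520.1 - 6956
```

Under these assumptions the gene spans 6956 bp on the minus strand of OY821520.1.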
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets give the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
  #   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
  1   44  0.019     5.8       10.3   0.0   2         22      45        65      44        67      0.89
  2   44  0.0006    0.18      15.0   0.4   1         23      73        95      73        95      0.99
  3   44  2.7e-05   0.0082    19.2   4.4   1         23      106       128     106       128     0.97
  4   44  4.7e-05   0.014     18.5   1.7   1         23      134       156     134       156     0.98
  5   44  0.00095   0.29      14.4   3.5   1         23      163       185     163       185     0.96
  6   44  0.0054    1.7       12.0   1.1   2         23      219       240     219       240     0.97
  7   44  2.5e-06   0.00075   22.5   1.0   1         23      246       268     246       268     0.98
  8   44  0.25      75        6.8    0.8   5         23      302       320     299       320     0.93
  9   44  6.7e-06   0.0021    21.1   1.9   1         23      326       348     326       348     0.98
  10  44  0.0085    2.6       11.4   0.3   5         23      448       466     445       466     0.93
  11  44  1.2e-05   0.0038    20.3   2.0   1         23      472       494     472       494     0.98
  12  44  0.00063   0.19      14.9   1.4   1         23      620       643     620       643     0.96
  13  44  9.2e-05   0.028     17.6   1.4   1         23      649       671     649       671     0.99
  14  44  0.57      1.7e+02   5.6    3.1   2         23      678       699     677       699     0.95
  15  44  3.8e-05   0.012     18.8   4.4   1         23      705       728     705       728     0.97
  16  44  0.0016    0.49      13.7   0.3   5         21      734       750     733       751     0.92
  17  44  0.0023    0.71      13.1   1.3   2         23      901       922     900       923     0.94
  18  44  0.0052    1.6       12.1   2.2   2         23      930       951     929       951     0.97
  19  44  0.0035    1.1       12.6   6.4   1         23      958       980     958       980     0.98
  20  44  0.14      42        7.6    3.3   2         23      1016      1037    1015      1037    0.95
  21  44  2e-05     0.006     19.7   6.1   2         23      1044      1065    1043      1065    0.97
  22  44  0.0021    0.64      13.3   4.8   1         23      1072      1094    1072      1094    0.98
  23  44  0.0038    1.2       12.5   2.9   1         21      1100      1120    1100      1121    0.95
  24  44  0.0012    0.35      14.1   0.1   3         23      1128      1148    1126      1148    0.96
  25  44  0.01      3.1       11.1   1.6   2         23      1152      1174    1151      1174    0.95
  26  44  1.1       3.4e+02   4.7    1.3   1         21      1180      1200    1180      1201    0.93
  27  44  9.2       2.8e+03   1.8    1.8   2         23      1207      1228    1206      1228    0.93
  28  44  0.0052    1.6       12.0   1.1   2         23      1232      1254    1231      1254    0.96
  29  44  0.33      1e+02     6.4    7.5   1         23      1260      1282    1260      1282    0.98
  30  44  2.4e-05   0.0073    19.4   1.5   2         23      1521      1542    1521      1542    0.98
  31  44  0.00014   0.043     17.0   3.4   1         23      1570      1592    1570      1592    0.96
  32  44  0.0047    1.4       12.2   2.2   1         23      1598      1620    1598      1620    0.98
  33  44  0.36      1.1e+02   6.2    2.8   1         23      1628      1650    1628      1650    0.95
  34  44  0.00095   0.29      14.4   1.1   1         23      1656      1678    1656      1678    0.97
  35  44  0.002     0.61      13.4   0.3   1         23      1688      1710    1688      1710    0.96
  36  44  1.3e-05   0.0041    20.2   1.6   1         23      1716      1738    1716      1738    0.98
  37  44  0.012     3.6       10.9   5.0   1         23      1771      1793    1771      1793    0.96
  38  44  8.2e-05   0.025     17.7   2.4   1         23      1799      1821    1799      1821    0.98
  39  44  0.35      1.1e+02   6.3    3.3   1         23      1829      1851    1829      1851    0.95
  40  44  0.012     3.5       10.9   4.8   1         23      1857      1879    1857      1879    0.99
  41  44  0.43      1.3e+02   6.0    3.4   1         23      1888      1910    1888      1910    0.95
  42  44  0.00019   0.057     16.6   0.3   1         23      1916      1938    1916      1938    0.97
  43  44  0.00094   0.29      14.4   0.3   1         23      1948      1970    1948      1970    0.97
  44  44  4.4e-06   0.0014    21.7   1.0   1         23      1976      1998    1976      1998    0.98
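The zinc finger pattern in the Description can be written as a regular expression. The sketch below is one possible reading of it, not the Pfam model itself: the `#` positions are treated as any residue (an assumption, since the pattern only says they matter for the fold), and the test fragment is residues 71-100 copied from the Protein Sequence of this record.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], one element per line.
C2H2_PATTERN = re.compile(
    r"""
    .        # fold-stabilising position, treated here as any residue (assumption)
    .        # X
    C        # first zinc-coordinating cysteine
    .{1,5}   # X(1-5)
    C        # second zinc-coordinating cysteine
    .{3}     # X3
    .        # fold-stabilising position
    .{5}     # X5
    .        # fold-stabilising position
    .{2}     # X2
    H        # first zinc-coordinating histidine
    .{3,6}   # X(3-6)
    [HC]     # final position: His or Cys
    """,
    re.VERBOSE,
)

# Residues 71-100 of the protein sequence in this record.
fragment = "KAFKCADCSEVFSKIGTLVSHRKTHYDLSK"
fragment_start = 71  # 1-based position of the first residue in `fragment`

match = C2H2_PATTERN.search(fragment)
first = match.start() + fragment_start    # 1-based start in the full protein
last = match.end() + fragment_start - 1   # 1-based inclusive end
print(match.group(), first, last)  # FKCADCSEVFSKIGTLVSHRKTH 73 95
```

The matched span, residues 73-95, coincides with the alignment coordinates of the second hmmscan domain hit. A plain regex has no notion of score or E-value, so it is only a quick sanity check alongside the HMM results.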
Sequence Information
- Coding Sequence
- ATGaaaagtaagttttctaaaaTAGTGTGTGTTGAAGATTCAACAAAGTCGCAACAAACACCGAGTCGCGAGAATAACATTAGGAATGAGAGCCAAAGACCACATAGTGAAGAGCTTCTGCAGACGGACAACCAATGTGCAGTTTGCGATAAGATATTCGATTCGCAGCAGTTACTTATTATACATGCCATAGACGACCATCCCAATGATAAGGCTTTTAAGTGTGCTGATTGTAgtgaagttttttcaaaaataggTACTTTAGTCAGTCATAGAAAAACTCATTACGATTTGTCAAAGAATTTGTCAAAGAATCATGTTTGTGAATTTTGttcgaaaaactttaaaaataactaCGAATTGAGACGTCACTTAAGGTCACATACTACAAAACATATCTACGCTTGTCCTTTATGTGAAAAGACTTGTAAAAACGAGTACCTTTTGCAAGTCCATCAGAAGACCCACTTTAAGTTGAAATCGACTCATTCTTGTAAACTATGTTCCAAATCCTTTAATTCTCTTAACACTTTAAAGAATCATTTAATGGAACACAATGCAGAACGTGTTCCTATAAAAAAGCCCCAAAGAGCAGAAAATCATGCTTTCTCTAAGAGTCAAGATGGTGTGAAATCCGAGGgagaaaagaaaaatgtttgtAACTTTTGTTCCAAGTCTTTCTTACGCAAGGATGTTTTGAAACGTCACTTAATGGTACATTCTGGAGAACGTCCCTACCCATGTCCTATATGTAATAAGGCCTATAAAGATAAACGTAATTTAAGTCATCACCTACAGGTTCATTCGAAAGAGACAACGAAAGATTGTTTTAATAACGCGAATAATGATAAAATGCtgCAAGATGGTTTGAAATCCGAGagagaaaagaaaaatgtatgtattttttgttCCAAGTCTTTCTTACGCAAGGATGTTTTGAAACGTCACTTAATGGTACATTCTGGAGAACGTCCCTACCCATGTTCTATATGTAATAAGGCCTATAAAGATAAACGTGATTTAAGACATCACCTAAAGGTTCACTCGCAAGAGACATCGAAAAATTGtgaccaaaatttaaaattaaaaaattttaacatttcaaaATCGCATGAAGAAAACCTTTCTATGGAATCTAATAACTGTAACACCAAGGACGTTTCAATGCAATTCGAAGAATATATTCTTCCGCATGTTCCAATATCTTTAATAGATAAAGAAGAAGCTATTAATTTAGCATATGATTCGGATACTGAAAAGCCAGAAAGACCACAAAATCATGCTTTCTCTAAGAGTcaaGATGTTTTGAAATCCGAGGgagaaaagaaaaatgtttgtatattttgttcCAAGCCTTTCGTACGCAAGGATGTTTTGAAACGTCACTTAATGGTACATTCTGGAGAACGTCCCTACCCATGTTCTATATGCAATAAGTCCTATAAAGATAAACGTGATTTAAGCCATCACCTAAATGTACACACGAAAGAGACATCGAAAACTTgtgaaacaatttttaacatttcaAAATCGCATGAAGAAAACCTTCTTTCTATGGAATCTAAGAACTATAAAACCACGGACATGTCAAAGCAATTCGTGGACTATATTCTTCCGCATGTTCCAATATCTTTAATAGATAAAAAAGAAACTATTAATTTAGCACTTTGTTCGGATACTGACTCAATAGATGGTTTGAATAGAAAAAATGAGTCTCAAAATAAATACACATATGAAGAAGAAACAAAACCTACAGTTTCTGACCATTTagaagttaattctattttaaaaGAACATGCCGCTGAAAATGGTGATATAACAGAGAGTCAAGCAAATAGTAATGACAAACAACTTAGCTACAAGTGCGCTGTTTGTGATAAGatgttCAACTACGAGAGAGATTTTATTAAACATGTAAAACAATTTCATAACAATAGTAAAACTTTTAAATGTCGTTATTGTACTGCAAGCTTTTCGAATGCAAGAACTTTAGCGAAT
CATAGAAGAATCCACAAATGTAGAAACCGACTTATTTGTAGGTTTTGTTCCAAGTCCTTTACTTGCATTTCATCTTTAAAGTATCATGTAAAAGTTCATTCTGTTCAACGTCCTTATACTTGTTCGCTTTGTGAGAAAACCTATAAATATAACAGTCATTTAAAAAGACACGAAAGAATTACCCACTTAAAATTGAATTCTTTGTGTGGAAAATCCTTCAAGCACAACATTAATTTAACgactcatcaaaatatttgcctGCAGTCCACATCATCATTGTATGAAAACCCCATAGAAAACCATAAAATGATAGACTTAGAAGAGGATGAGTTCATCACTGTTAAATGTGAGGCAAATAAAGAATCAAATAATGTTGAAAGTCCACTTCGTTTCGATCTTTCTAACAAGTTTTGCCAAAATGACGAACTTTCTCTAAAAGATAATGAAAACCACGAAAGCTCCATTATAGAAGACAGCAAAATAATATATTCGGATAATGACTCTATCGACGGAAAGAATCTCTTGGAAATATGTAAAACCGAATGTATATCAACCGAATTAGTTCATCCTGCTGTTAAGCGTTCAAAAGATACCACCACTAATACTAAATTCCAACAAAAACCGATTACTGACAATGATAATCGCATCAACAAGAAAACAACAAAACGACACGGTGTGAAAAGACAAATACAGACAAACAACAAATGTGCAGTTTGTAATAAGATGTTCATTTTCCAGCGTGATTTAATCAAACATGCCCAAGAACATCATCATAGTCGCAAGTCATTAAAATGTCGTTATTGTAAAGAACGATTTTccaatatacaaaaattattaagaCATAGTCGAATCCATaacgaaataacaaaaaaacataaatgtgAATTTTGTTTGAAACATTTCAGTAGCAAAGAAGATTTAAAAATGCATCTTAACAAACATTATGAAGATGTAGATGAAGAGTCATTCTATAAAAACACATcatcaaatgaaaataaaaacagtaaaaagTCCAATGCTACTGGTGGTGTTGCAAAAAAGAATTGTATTTGTGAGTTATGTTCAAAAAGGTTCCCTACCATGCATcgattaaatttacatttaatctCTCATACTGTTGACCGACCTCTCGTTTGTACATTTTGTGGAAAAACCTTTAAACATAAATCACATTTGCGTGACCATGAAAAGACCCACATGGATATTGAGAAAAGGTTTACTTGTGAAATTTGCTCCAAGTCCTTTCGAATCCATTCAAAACTTATTCACCATTTGACATCTCATACTAATGCACGTCCTTACTCCTGTTTTCAATGTAAAAAAACCTATAAACATAAGGCTGATGTAAAAAAACACCAACAGATTTGCCTGCAGTCGATTGCGTGTTCTTTGTGTGAAAAAATCTTTGCAGACACTGAAAGTTTAGAAACTCACCTAAGGCTTCACTATGTAAACAAATGTGCAGTCTGTAATAAGatctgcaattCCCAGCGAGATCTGGCTAAACATGCTAGAGAATTTCATCCCAAGGGTAAAGCTTTTCAATGTCAAACTTGTAGTGCTGTTTTTTGCGGTAAAACGTCTTGGATTTTACATAAAAAGAATTGCTTGAAATTAAAATCTTGTGCTTATTGCAAGAAAACCATGGCCGACGATGAGTTTTTCAAACATCATTTAGTGATGCACTATGTCAATAAATGTGCAGTTTGTGAGaagGTGTGTAAATCACAAAACGATCTGATTGCTCATGCTCAAAAATTCCATGCCAACGCAAAAGCTTTTAAGTGCCCTATTTGTGATAAACGTTATGCCAGTAGTACACGGTTCTGTTGTCATAAGAGAACTCATACAGACAAAGTCAAAAAAGATTTCTATATGTATCTATATGAAACaaagtataaacaaaataaagatttaggatttttaaaatttatggaaGAAAGTGGATCCAATACAAATCTAAAACCGATCAATAATGATAAACAAATTAATCAAAATCA
ATATGTAACGAAAAGTTATGAGGTAGAAGACGATTCAATGATTGCTGAATATCTAAGTGATTACGAATACATGGTAGATGAGGTTAGCCAAAACGATGAAGATAACGCAAAAGAAACCATGAGTACAATAAACTCCCTCGATATAGAAATTCTTTGTAATAACACTCAAGACGATGATGATTTCGATGTTAGGAATGAATTGAATTTGTCGAATACCGAACTTAAAGACGTTGAAGCCGTTGTGCTACCAGTTGATTACTTAGAAGACAATTCTGGAAAAAATGACACAAAACAAGAACAGGTCGTCGAGGAAAGTCCTCCACCTACAGTAATTTCACCCAAACCCGGGCGCCTGTACCAACCAATTGGTGTGAAAAGGAAACTCCTAGCAGATAACAAATGCGCAGTTTGTCATAAGCGAGATTTAATTAAACATGGTCAACAATTTCATCGTAATGCAAGAGCTTTTAAATGTCAATGTTGTAAAGCAAGATTTTCAAACAAACAAGCCGTTTCAATTCATAATGGAATATATCATACAAGCAACTTGTCCAAGAAGCGTACCTGTAAGATTTGTTCGACATCTTTTGCACGCCAACGTGATTTAAGAGTTCATTTAAAGACTCACTTGACAGACCCACTTGCTGCTAAATACCCACCTGCAGCTATACAGATAACTAATTCTTCCGATGGCGATTCAAAAAGAACACATTTTTGTGACATATGTTCTAAAGGCTTTTCTAACAAATACCGTTTAAATCGACATTTAAAAGTACATACCATAGAACGGCCTTACTCTTGTTCTGTATGTGGAAATGACTATAAACATGAGTATACTTTGAGGGAACATGAAAAGACACACTTggatttttcaaatagtaatcaCGCTTGTGAGTTGTGTTACCAAACTTTTTCTTGTGCCTCTCGCTTGAATCGACATTTAATTGTTCACTCTAAACAACGTCCATACTCTTGTTCAATCTGTGGCTTAGCATTTAAACATATGAAAAATATGAAGGTGCATGAAGCTAGACACTTAGGATTAGATCTAACAGAGAGAACACATGTTTGTGAAATTTGTTCCAAGGCCTTTGCTGCCGAAAGATATTTGAAACGACACTTAGGTGTTCACACGGGAGAACAGCTCTACGCTTGTTCAAAGTGTGAAAAAGTCTTCAAATATAAAAACGGTTTGAACAAGCATCTACAGACCCATTCAACTGATGAATTGAACGGGCAAGAACAGGTCGTAGAGGAAAATCCTCCATCTACAGTTATTCCAGCCAAACCCGATGGCGAttcaaaaagaaaacatatttgtGACATATGTTCTAAAGGCTGTTCTACGAAATACCGTTTAAAGCAACATGTAAAATCACATACCGCAGAACGGCCTTACTCCTGTTCGGTGTGTggcaaaaactataaatataagaatactttGAATCAACATGAAAAGACACACTTGGATTTTTCAATTAGTAAACATGCTTGTGAGTTGTGTTTCCAATCCTTTTCCTGTGCCTCTCGCTTGAAACGACATTTAATTGTTCACACTAATGAACATTCATACACTTGTCGTATATGTGGCAATAACTATAAACATGAGCATACTTTGAGGGTACATGAAAAGACGCACTTGAATGACTTTTCAAGGAGTGAACATGCTTGTGAGTTGTGTTTCCAAACCTTTTCTTGTGCCTCTCGCTTGAAACGACATTTAATTGTTCACACTAAGGAACGTCCATACTCTTGTTCAATCTGTGGCAAAGCTTTTAAAAATATGGAATATATGAAGGTGCATGAAGCTAGACACTTTGGATTAGATCAAATAGAGCGAACATATGTTTGTGAAATTTGTTCCATGTCCTTTGCGGCCGAAAGACATTTGAAACGACACTTAGGTGTTCACACGGGAGAACGGCCCTATGCTTGTTCTATGTGTgaaaaagtctttaaaaataaaaacggttTGAACAAGCATCTTCAGACCCATCCAA
CTGATGAGTTAAATGAGCAAAATAAAACAGAAgtaaaagaattaaaagaatCTACCGTAAATAGCGAACGTATAACAAGGAGGTCCACAAAACTCCCAGCTAATCAAAATGATTTACAAGAATCTACACCAACCAATATCAGCGATGATATAGATGTTGTAGGAACTGATGACAATCCATTTGTATCTGAGTCTTATGAAACGAACGAAGATATTATTGTAGTGGACGAAGATAACGTTGATGCAACCACAAATGCAATAAATTCCCAGCGTATTATAGAAGATCTTTTTATTCGGATAGTTACTCCTTAG
- Protein Sequence
- MKSKFSKIVCVEDSTKSQQTPSRENNIRNESQRPHSEELLQTDNQCAVCDKIFDSQQLLIIHAIDDHPNDKAFKCADCSEVFSKIGTLVSHRKTHYDLSKNLSKNHVCEFCSKNFKNNYELRRHLRSHTTKHIYACPLCEKTCKNEYLLQVHQKTHFKLKSTHSCKLCSKSFNSLNTLKNHLMEHNAERVPIKKPQRAENHAFSKSQDGVKSEGEKKNVCNFCSKSFLRKDVLKRHLMVHSGERPYPCPICNKAYKDKRNLSHHLQVHSKETTKDCFNNANNDKMLQDGLKSEREKKNVCIFCSKSFLRKDVLKRHLMVHSGERPYPCSICNKAYKDKRDLRHHLKVHSQETSKNCDQNLKLKNFNISKSHEENLSMESNNCNTKDVSMQFEEYILPHVPISLIDKEEAINLAYDSDTEKPERPQNHAFSKSQDVLKSEGEKKNVCIFCSKPFVRKDVLKRHLMVHSGERPYPCSICNKSYKDKRDLSHHLNVHTKETSKTCETIFNISKSHEENLLSMESKNYKTTDMSKQFVDYILPHVPISLIDKKETINLALCSDTDSIDGLNRKNESQNKYTYEEETKPTVSDHLEVNSILKEHAAENGDITESQANSNDKQLSYKCAVCDKMFNYERDFIKHVKQFHNNSKTFKCRYCTASFSNARTLANHRRIHKCRNRLICRFCSKSFTCISSLKYHVKVHSVQRPYTCSLCEKTYKYNSHLKRHERITHLKLNSLCGKSFKHNINLTTHQNICLQSTSSLYENPIENHKMIDLEEDEFITVKCEANKESNNVESPLRFDLSNKFCQNDELSLKDNENHESSIIEDSKIIYSDNDSIDGKNLLEICKTECISTELVHPAVKRSKDTTTNTKFQQKPITDNDNRINKKTTKRHGVKRQIQTNNKCAVCNKMFIFQRDLIKHAQEHHHSRKSLKCRYCKERFSNIQKLLRHSRIHNEITKKHKCEFCLKHFSSKEDLKMHLNKHYEDVDEESFYKNTSSNENKNSKKSNATGGVAKKNCICELCSKRFPTMHRLNLHLISHTVDRPLVCTFCGKTFKHKSHLRDHEKTHMDIEKRFTCEICSKSFRIHSKLIHHLTSHTNARPYSCFQCKKTYKHKADVKKHQQICLQSIACSLCEKIFADTESLETHLRLHYVNKCAVCNKICNSQRDLAKHAREFHPKGKAFQCQTCSAVFCGKTSWILHKKNCLKLKSCAYCKKTMADDEFFKHHLVMHYVNKCAVCEKVCKSQNDLIAHAQKFHANAKAFKCPICDKRYASSTRFCCHKRTHTDKVKKDFYMYLYETKYKQNKDLGFLKFMEESGSNTNLKPINNDKQINQNQYVTKSYEVEDDSMIAEYLSDYEYMVDEVSQNDEDNAKETMSTINSLDIEILCNNTQDDDDFDVRNELNLSNTELKDVEAVVLPVDYLEDNSGKNDTKQEQVVEESPPPTVISPKPGRLYQPIGVKRKLLADNKCAVCHKRDLIKHGQQFHRNARAFKCQCCKARFSNKQAVSIHNGIYHTSNLSKKRTCKICSTSFARQRDLRVHLKTHLTDPLAAKYPPAAIQITNSSDGDSKRTHFCDICSKGFSNKYRLNRHLKVHTIERPYSCSVCGNDYKHEYTLREHEKTHLDFSNSNHACELCYQTFSCASRLNRHLIVHSKQRPYSCSICGLAFKHMKNMKVHEARHLGLDLTERTHVCEICSKAFAAERYLKRHLGVHTGEQLYACSKCEKVFKYKNGLNKHLQTHSTDELNGQEQVVEENPPSTVIPAKPDGDSKRKHICDICSKGCSTKYRLKQHVKSHTAERPYSCSVCGKNYKYKNTLNQHEKTHLDFSISKHACELCFQSFSCASRLKRHLIVHTNEHSYTCRICGNNYKHEHTLRVHEKTHLNDFSRSEHACELCFQTFSCASRLKRHLIVHTKERPYSCSICGKAFKNMEYMKVHEARHFGLDQIERTYVCEICSMSFAAERHLKRHLGVHTGERPYACSMCEKVFKNKNGLNKHLQTH
PTDELNEQNKTEVKELKESTVNSERITRRSTKLPANQNDLQESTPTNISDDIDVVGTDDNPFVSESYETNEDIIVVDEDNVDATTNAINSQRIIEDLFIRIVTP
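The coding and protein sequences can be cross-checked by translating one into the other. A minimal sketch, assuming the standard genetic code applies and that lowercase letters in the coding sequence are masking rather than a different alphabet (both assumptions); only the first 39 nucleotides and the last 12 are used here, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    seq = cds.upper()  # fold lowercase letters to uppercase
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGaaaagtaagttttctaaaaTAGTGTGTGTTGAAGAT"  # first 39 nt of the CDS
cds_end = "GTTACTCCTTAG"                               # last 12 nt of the CDS

print(translate(cds_start))  # MKSKFSKIVCVED
print(translate(cds_end))    # VTP*
```

Both results agree with the start (MKSKFSKIVCVED) and end (VTP) of the protein sequence listed above, with the final TAG read as the stop codon.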
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -