Pven004597.1
Basic Information
- Insect
- Pachypsylla venusta
- Gene Symbol
- -
- Assembly
- GCA_012654025.1
- Location
- CM022878.1:2212451-2220500[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets indicate the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   47  0.022     2         9.2    2.7   2         23      5         26      4         26      0.96
2   47  0.00029   0.027     15.1   0.3   1         23      61        84      61        84      0.96
3   47  6.7       6.3e+02   1.4    0.4   2         9       141       148     140       149     0.88
4   47  0.38      35        5.3    0.0   2         23      171       193     170       193     0.92
5   47  2.3       2.1e+02   2.9    7.1   1         23      287       310     287       310     0.93
6   47  1.5       1.4e+02   3.4    2.5   1         23      367       390     367       390     0.97
7   47  0.00093   0.087     13.5   1.0   1         19      396       414     396       419     0.89
8   47  0.0021    0.19      12.4   3.3   1         23      437       460     437       460     0.96
9   47  0.0004    0.038     14.7   0.1   1         23      491       514     491       514     0.93
10  47  0.028     2.6       8.9    2.0   1         23      520       543     520       543     0.97
11  47  0.0027    0.25      12.1   0.1   1         23      574       597     574       597     0.97
12  47  0.00082   0.077     13.7   2.6   1         23      603       626     603       626     0.97
13  47  1.8e-06   0.00017   22.1   0.8   1         23      641       664     641       664     0.97
14  47  0.00023   0.021     15.4   0.5   1         23      699       721     699       721     0.98
15  47  0.28      26        5.7    0.9   1         19      729       747     729       752     0.88
16  47  1.4e-05   0.0013    19.3   0.9   1         23      772       795     772       795     0.96
17  47  1.6e-07   1.5e-05   25.4   0.5   1         23      800       822     800       822     0.98
18  47  0.048     4.5       8.1    7.0   1         23      828       850     828       850     0.95
19  47  9.6e-05   0.009     16.6   2.9   1         23      856       878     856       878     0.99
20  47  0.0022    0.21      12.3   1.4   3         23      887       907     885       907     0.95
21  47  0.0054    0.51      11.1   5.8   3         23      963       984     961       984     0.95
22  47  6.7       6.3e+02   1.4    0.4   2         9       1041      1048    1040      1049    0.88
23  47  0.0002    0.019     15.6   1.2   1         23      1073      1096    1073      1096    0.97
24  47  0.0004    0.038     14.7   0.1   1         23      1127      1150    1127      1150    0.93
25  47  0.00057   0.053     14.2   1.3   1         23      1156      1179    1156      1179    0.98
26  47  0.0028    0.27      12.0   0.1   1         23      1192      1215    1192      1215    0.97
27  47  0.00014   0.013     16.1   0.8   1         23      1221      1244    1221      1244    0.95
28  47  9.1e-06   0.00085   19.8   1.2   1         23      1259      1282    1259      1282    0.98
29  47  0.0016    0.15      12.8   2.5   2         23      1301      1322    1300      1322    0.96
30  47  5e-05     0.0047    17.5   1.9   1         23      1360      1383    1360      1383    0.97
31  47  2.9       2.7e+02   2.5    0.3   10        23      1404      1418    1394      1418    0.79
32  47  0.76      71        4.4    0.4   3         23      1470      1491    1468      1491    0.89
33  47  3         2.8e+02   2.5    2.8   6         23      1510      1528    1501      1528    0.83
34  47  6.7       6.3e+02   1.4    3.6   1         20      1583      1602    1583      1606    0.88
35  47  7.5e-05   0.007     17.0   1.0   1         23      1659      1682    1659      1682    0.97
36  47  0.086     8         7.3    1.4   1         23      1688      1711    1688      1711    0.89
37  47  0.0018    0.17      12.6   0.1   1         23      1736      1759    1736      1759    0.98
38  47  0.0058    0.54      11.0   5.9   1         23      1766      1789    1766      1790    0.96
39  47  1.4       1.3e+02   3.5    3.0   2         23      1827      1849    1826      1849    0.94
40  47  0.0057    0.53      11.0   6.0   1         23      1856      1879    1856      1879    0.98
41  47  0.0002    0.019     15.6   0.3   1         23      1904      1926    1904      1926    0.95
42  47  1.4e-05   0.0014    19.2   1.0   1         23      1933      1955    1933      1955    0.98
43  47  2.2e-06   0.00021   21.7   0.1   1         23      1960      1982    1960      1982    0.98
44  47  0.043     4         8.3    2.3   1         23      1988      2010    1988      2010    0.85
45  47  0.00011   0.0099    16.5   0.3   1         23      2016      2038    2016      2038    0.98
46  47  2.1e-05   0.002     18.7   0.3   1         23      2043      2065    2043      2065    0.98
47  47  0.00063   0.059     14.0   1.6   1         23      2071      2093    2071      2093    0.99
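
Each hit in the hmmscan output above carries 13 values: hit index, total hits, c-Evalue, i-Evalue, score, bias, HMM from/to, alignment from/to, envelope from/to, and acc. The following is a minimal Python sketch of how such rows might be read and filtered by i-Evalue. The names `DomainHit`, `parse_hits`, and `confident_hits` are hypothetical, and the 0.01 cutoff is an arbitrary illustrative value, not a threshold used by this database. The sample rows are hits 1, 3, 13, and 17 copied from this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One hmmscan domain hit (hypothetical container, field order as in the record)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# One converter per field, in the same order as DomainHit.
CONVERTERS = (int, int, float, float, float, float,
              int, int, int, int, int, int, float)
FIELDS_PER_HIT = len(CONVERTERS)  # 13


def parse_hits(text: str) -> list[DomainHit]:
    """Split whitespace-separated values into groups of 13, one group per hit.

    Works whether the rows are laid out one per line or run together,
    as long as the header line has been removed.
    """
    tokens = text.split()
    if len(tokens) % FIELDS_PER_HIT != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for start in range(0, len(tokens), FIELDS_PER_HIT):
        row = tokens[start:start + FIELDS_PER_HIT]
        hits.append(DomainHit(*(conv(tok) for conv, tok in zip(CONVERTERS, row))))
    return hits


def confident_hits(hits: list[DomainHit], max_i_evalue: float = 0.01) -> list[DomainHit]:
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Hits 1, 3, 13 and 17 from this record.
SAMPLE = """
1 47 0.022 2 9.2 2.7 2 23 5 26 4 26 0.96
3 47 6.7 6.3e+02 1.4 0.4 2 9 141 148 140 149 0.88
13 47 1.8e-06 0.00017 22.1 0.8 1 23 641 664 641 664 0.97
17 47 1.6e-07 1.5e-05 25.4 0.5 1 23 800 822 800 822 0.98
"""

for hit in confident_hits(parse_hits(SAMPLE)):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```

With this cutoff only hits 13 and 17 of the four sample rows are kept. A looser or stricter cutoff may be more appropriate depending on how the weak, partial hits (for example hit 3, which aligns to HMM positions 2 to 9 only) should be treated.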
Sequence Information
- Coding Sequence
- ATGCTaagGTGGTCCTGTTGCATTTGTAGTTGTAAATTTGGTTCCCTGTCAGATTTTGAGATTCATTTACGTATACATAACAACTCTCTCCTGCCTTTAAGTGATTCTGATGAATGTGTCAAATGTCGAAAAAGTATTCTAGCGCACATTGTTATAATGGAAGAAGATTCCGATGATACTTTCATTTGTGGCCTTTGCCAGTACGGCTTTGATGATAAACATCAATTGGAAGTTCATATTAAAATTGAGCATGGTCAGATTGTGTATGGTGTGAGTGATTGGTATGCTTGTAAATTGTGTTTGAATCCGTTGTATAGTCGTAGTCAGGATATTGTAAAGCATATGCGTAATGTTCACAATATATTGGCTCCTGTTCAGTCTGAGGAGTACTTGGTCAAATCAATCACTGATGTCAATATCCCTTGCCATGTCTGTGGTAAAACAAGTTTGTTATCTAAATTTTGTACAGCTCATCTGGATAGTTTAGTTGTCACAAGAAAACTTGACATCCCGTGTGAGTTGTGCCATGAATCTTTCCCAGATGACGGCATTCTTGAGGCGCACGTTAATATCTCTCATGGAAAAATGATCCACAACAATCCTAATTTCAAATGTTACATGTGTAACATTTGTCGCGATCAACTTTACATCGAGGGTGACACCTTGGCAACACATCTACAAGAGCTTCACAAAGACTTTCGGTTTGATGAAAGTGCTCTCGAGGAATTTGTTATAGAATCTCTGGATGACATTTTAAAATCGTGTCGCTTTTGTGGAGAAAGAagtcttttttcaatgttttgtaGCCAGCACGATGCAAAGATAGTAATTTTGAATTGTAATAAACATGAAAAGCGTCACACTTGTAAACATTGTGAGGGTGATTTTAAGAATTGTAAGGATTTATGGGATCACAAATTTTCATCACATGCGAAAGATAAACTTGTTTGTAATCTTTGTGATGAATCAAAGGTCACCGTCATAAAGTGTCCGAGCAATTTCATGGCACACTGTCGGAAAGTTCATCGTAATACTCAATTTTGGGATGAAGCTAATTTGCTTGAAAAGTCGAAAGTTGTAAATGAGGATATGATCAAATATCAATGTCCCGAATGCTGTTCGttgtttaaaaatttcaatgatatGAAAGTGCACTTGAAATCTGAACATGATTTAATCAATAAATTTGTTTGTAGTGTTTGTTCGAAATCATTCAgactttcaaaaaatttgcgTGATCATGATAGAAACGTTCACAAGCTGAAGAGTCAGGGTGACACATCGGAGACAGATGTGATCAACAACAAGCATGTGTGTAATATATGTTCGAAATGTTTTGGACTTGCAAAAAGTTTGCGGGATCACAGGAGGAATGTGCATAAGATTTTCAGTGAAACGCGCAAATCACAGACAGCTGTCAATGAAGTTCTTAAGAAGAAGTGTATGGTTGTTCATgagggaaaaatcaaatttaaatgcaCCGAATGTGATAAAATAGTGTCACGACTACAAACACTCAGAGAGCACATAGCTGCTGCGCATGAAGATATTAAAGATTATGTTTGCTCTTCCTGTGGAAAAGGCTTTTCAGCACACAGAAGATTGGCGGAACATTTTCGGAGGAAACATACGCCACTAATTGTTACCAAAGAAGAGGCCATCTTAGAGAAATGTAGCGCCGAGGTCTCGGTCAAAGATGTTAATAAAGATTGGATAAAATATCGCTGTCCTGAATGCAGTGCATTGTATTATAATGCAAATGATTTGAAAGACCACATGGAATCTGAACATGATTTGATCAAGAAGTTTGTTTGTGATGTGTGTTCCAAATATTTTCGACTTCAAAGACATTTGCGTGATCACGAGAGAAGAGTTCACAAGATGAGACAAACTAACACAGAGACTGATGAGAACACAGAGTATGTGTGTGAcgtttgttcaaaatcttttaGACGTCCAAATAATTTGCGAGATCACAAGAGAAATGTTCACAAGATA
TTCAGTGGAAAGCGCATATCACAGGCAACAACTGTCAATGTCTCAGTGGATGTTGAAGAAGATGAAGTGCAAATTGTTGTCGGTGAAGAATTGAATTATAAATGTCCTATCTGTGACCACATCTATTCCCAATTTTCCGATGTAAAAGAACATTTGATCGTCCACAAtggagaagaaaacaaaaactttgCTTGTACCTATTGTGATGTAATACTTTCCCGGAAAAGTGATGTTGCGACGCACTGCAAGCAAGCACATCAAATTGAAGTTGTTGAAGACGTTAACGACGGGAGCTTTGAAACGGTAGAGCGATCTTTTTCTTGTGACATGTGTGGAAAATCGTTTCCTTTTAAAAGTCGTTTGATACGACACTTGACCAGGATTCATTTGGGTATAAATTTTACATGTAATATTTGTGGACGTGTTTTTTCTGATAAGACCAATTTGAAAGGCCATATTCGCGATCATATGGGAGAATATAAATACATGTGCGACATATGTGGTAAACATTTCAATCATTGGTCCTCCCATTACTATCATATGTTTAGTCATAGTAAGGAGCAAAGCTTTCAATGTCCCTATTGTGATTCAAAGTTTAAAAGTCCAAAGACGTTGTACTCACATAAGAAGACGCATGCTACAACTGAAGCTTCGCATTTATGTGACGTGTGTGGTGCTAATTTTAAATCACGAAAAAATTTATTAGGTCATAAATTAACTCACACTGTTGTAGGTAAACCTCATGCTTGTGACAAGCGGTCCTGTTGCATTTGTAGTTGTAAATTTGGTTCCCTGTCTTTATATGATTCTGATGAATGTTTCAATTGTCGAAATAGCATTCTAACGCCCACTGTAAAAATAGAAGTCGATTCTGATGATCCTATCATTTGTTGCCACTGTCAACACAGCTTTGATGATAAGCACCAATTGGAAGttcatattaaaattgaacatgGTCAGATTGTGTATGGTGTGAGTGATTGGTATGCTTGTAAATTGTGTTTGAATCCGTTGTATAGTCGTAGTCAGGATATTGTAAAGCATATGCGTAATGTTCACAATATATTGGCTCCTGTTCAGTCTGAGGAGTACTTGGTCAAATCAATCACTGATGTCAGTATCCCTTGCCATGTCTGTGGTAAAACAAGTTTGTTATCTAAATTTTGTACCTATCATTTGAGCGATATCGAGGGAAAATTTCACAAGATGACGAATGAATATGTGTGTAATGTTTGCTCGAAATCTTTTcgacttttaaaatatttgcgtCAACATGAGAGAAATATACATAAGATATTCAGTGAAACGCGCCAAGTACAGGCAGCTGTCAATGAAGTTCTTAAGAAGAAGTGTATGGTTGTTCATgagggaaaaatcaaatttaaatgcaCCGAATGTGATAAAATAGTGTCACGACTACAAACACTCAGAGAGCACATAGCTGCTGCGCATGAAGATATTAAACATTATGTTTGCTCTTCCTGTGGGAAAGGCTTTTCAGCACACAGAAAATTGGCGGAACATATTCGAAGGAAACATACACCAGTCAAAGATGTTATTAAAGATTGCATAAAATATCGCTGTCCTGAATGTAGTGTACTATATTATAATGCCAATGAATTAAAAGATCACATGGAATCTGAACATGCTTTGATAAAAAAGTTTGTTTGTGGTGTTTGTTCGAAATCTTTTCGACTTAAAGGACATTTGCGTGATCACGAGAGAGGGGTTCACAAGATGAGACAAACTGACGCTCAGACAGATGCGACCACCGAGTATGTGTGTGACGTATGTTCGAAATCTTATAGACAAAGAAAAAATTTGCGTGATCATGAGAGAAGAGTTCACAAGATGAAACGTCAGGTTCACACATCACAAACTGAGATGATCAACAACGAGTGGTCCTGTTGCATTTGTAGTTGTAAATTTGGTTCCCAGTCAGATCTTGAGATTCATTTACGTATACATAACAACTCTCTCCCGCCTTTAAGTGATTCTGA
TGAATGTGTCAACTGTCGAAAAAGTATTCTAGCGCACTCCTTTAAAATAGTTGTCGATTCTAATAATACTTCAGAATCCTATATTTGTAATCATTGCCAGTACGGCTTTGATGATAAACATCAATTGGAAGTTCATATTAAAATTGAGCATGGTCAGATTGTGTATGGTGTGAGTGATTGGTATGCTTGTAAATTGTGTTTGAATGCGTTGTATAGTCGTAGTCAGGATATTGTAAAGCATATGCGTACTATGCATGAAATGTTGACCGTTGATCCTGTCCATCATTTAGTTCGAACATCAAACGATGTAAATTTCCGCTGCAAAGTCTGTGCCAAACAgagtttgttttcaaaatattgtaaaatgcaTGGCCGGAATGGGATTGAACGGAGAAACGTGGAAACGTGTTGTACTTTTTGTGAGGAATCCTTTTCGAATGAGATCATTCTGGATGCGCATTTAAATATTGTTCACGGGAAAGATATTCATAATGATTCCAATTTTAGATGTTTAGTATGTTCAATCTGTTGCAACCAGCTTTACATTGATAGCAAGCATTTGATCGCTCACATGAGCAATGTCCATGGGGAATCCCATATAAATGTCGATGATTTTATTGTTGAATCTCTCAACGAGATTTTAGTTCCGTGTAAAGTCTGCGGAGAAGTTGGtttgttttctgtattttgtAAAGAACACGATGTAAGTATATCTACACTTGCTTGTATTGAACATGTTGAACATTGTCATACATGTTCATTTTGCGAGGCCTCTTTTGAGGATTGTAAAGTTTTATGGAACCACAACTTTTTACGACATAGTGATGAGCAATTAAGTTGTAATCTTTGTTCGACATCCAATGTTCCTTTCCAATTAAAATCATCTGCCGATCTTAAAATCCATTGGAATAAACAACATTCAAGTTGGTCTGATTTGCGTTCACAAGCAACAATTACAGTTGATGGTGAGACAAAGTATGTTTGTCCAAAGTGTGGTAAAACGGTATGTCTGTTTTCTTCTCTTAAAGATCATATGTTGAGTGTACACGGCGAAGATAAGAAGTTTGCTTGTGAGATATGTCCTAAAGTATATTCTCTTAGAAGAAGACTTACCGAGCATTGTAAAAGAGCTCACGGGATAGGTGTCAGTGGGGATTTGAGAGATCTTAGAAAGAAATCCGTTGTTTTTACTCCGGAAGGAGAACGTTTTAAATGTCCAGTTTGTGATAAAGTTATGACAACGTTGGTAGGTGTACAAGAACATATGAAAGTGATGCATTGTGACTCTCACACAGAGTTCTTTTGCTCTGTTTGTCATGAAATGTACACAAATAGAagagctctcagagaacattatCGACGGAAACATCATGGGAAGTTTGGAATTATTAGAAATGAGACTGATGACCGGAATGGATTAGAAGGTTTAAATATTGACACATCTGAGGTTCAAGTTTGGGAGGGCAATGCattgaaattgaaatgtttATCCTGTGACAAACTGTTCAAAACAATGCATGGTCTGAAATGTCATATGGCTGTCGTTCATTTGCAGGGAGAAAAGAATTTCGTTTGTTCGTACTGTGCGAAAGCATTTTTCTTAAAGAAACATCTCCATCGGCATTACAAAACAGTTCACGGAATTAATTCTacggacaaaaataaaattctagaaGACCAAGTCGCCTTTGTCACAGACGGTGTCACTCATTACAAGTGTCCAGATTGTGAATTAGTGTTTCCCCAGTTTGACAGTTTACGGCAACATTTTGTGTCGCATTCCACAAGCGAAGAACAATTTACGTGTGATGTTTGTGGTAAAATATTCGATCGAGAAGATCGTTTGCGAAGACATTCGAAAACTCATTCGGGATCTGGCTACCAGTGTCCCGTGTGTGGGAAAGTGATCAACGATCCTGCTAATTTCAAAGTTCATATGAGATTTCATACTGGGGAGAAACGATACGTTTGTGAAATTTGTGGCAAACGATTTGTTCAGT
TTGCATCGCATTACTATCATATGTTTAATCATAGAGAGAGTTGTGACTATCAGTGTTCGGTCTGTGATAAGAAATACAAGTCGCCCGCTGTACTTCGAGAACATGAGAAAATACACAAAGGAGAATCGTATCCTTGTCCTACGTGTGGCAAAGAATTTCCAGTTCATGGAAAGTTGCAGCAACATCTGAAAATACATTCGACAGATAGACCTTACGTGTGTACTTATTGTAAtggttcatttaaaataagaaaatatcttGTGCAACATTTGAAAACTCACGGTGCAATTAAACTTAGATAA
- Protein Sequence
- MLRWSCCICSCKFGSLSDFEIHLRIHNNSLLPLSDSDECVKCRKSILAHIVIMEEDSDDTFICGLCQYGFDDKHQLEVHIKIEHGQIVYGVSDWYACKLCLNPLYSRSQDIVKHMRNVHNILAPVQSEEYLVKSITDVNIPCHVCGKTSLLSKFCTAHLDSLVVTRKLDIPCELCHESFPDDGILEAHVNISHGKMIHNNPNFKCYMCNICRDQLYIEGDTLATHLQELHKDFRFDESALEEFVIESLDDILKSCRFCGERSLFSMFCSQHDAKIVILNCNKHEKRHTCKHCEGDFKNCKDLWDHKFSSHAKDKLVCNLCDESKVTVIKCPSNFMAHCRKVHRNTQFWDEANLLEKSKVVNEDMIKYQCPECCSLFKNFNDMKVHLKSEHDLINKFVCSVCSKSFRLSKNLRDHDRNVHKLKSQGDTSETDVINNKHVCNICSKCFGLAKSLRDHRRNVHKIFSETRKSQTAVNEVLKKKCMVVHEGKIKFKCTECDKIVSRLQTLREHIAAAHEDIKDYVCSSCGKGFSAHRRLAEHFRRKHTPLIVTKEEAILEKCSAEVSVKDVNKDWIKYRCPECSALYYNANDLKDHMESEHDLIKKFVCDVCSKYFRLQRHLRDHERRVHKMRQTNTETDENTEYVCDVCSKSFRRPNNLRDHKRNVHKIFSGKRISQATTVNVSVDVEEDEVQIVVGEELNYKCPICDHIYSQFSDVKEHLIVHNGEENKNFACTYCDVILSRKSDVATHCKQAHQIEVVEDVNDGSFETVERSFSCDMCGKSFPFKSRLIRHLTRIHLGINFTCNICGRVFSDKTNLKGHIRDHMGEYKYMCDICGKHFNHWSSHYYHMFSHSKEQSFQCPYCDSKFKSPKTLYSHKKTHATTEASHLCDVCGANFKSRKNLLGHKLTHTVVGKPHACDKRSCCICSCKFGSLSLYDSDECFNCRNSILTPTVKIEVDSDDPIICCHCQHSFDDKHQLEVHIKIEHGQIVYGVSDWYACKLCLNPLYSRSQDIVKHMRNVHNILAPVQSEEYLVKSITDVSIPCHVCGKTSLLSKFCTYHLSDIEGKFHKMTNEYVCNVCSKSFRLLKYLRQHERNIHKIFSETRQVQAAVNEVLKKKCMVVHEGKIKFKCTECDKIVSRLQTLREHIAAAHEDIKHYVCSSCGKGFSAHRKLAEHIRRKHTPVKDVIKDCIKYRCPECSVLYYNANELKDHMESEHALIKKFVCGVCSKSFRLKGHLRDHERGVHKMRQTDAQTDATTEYVCDVCSKSYRQRKNLRDHERRVHKMKRQVHTSQTEMINNEWSCCICSCKFGSQSDLEIHLRIHNNSLPPLSDSDECVNCRKSILAHSFKIVVDSNNTSESYICNHCQYGFDDKHQLEVHIKIEHGQIVYGVSDWYACKLCLNALYSRSQDIVKHMRTMHEMLTVDPVHHLVRTSNDVNFRCKVCAKQSLFSKYCKMHGRNGIERRNVETCCTFCEESFSNEIILDAHLNIVHGKDIHNDSNFRCLVCSICCNQLYIDSKHLIAHMSNVHGESHINVDDFIVESLNEILVPCKVCGEVGLFSVFCKEHDVSISTLACIEHVEHCHTCSFCEASFEDCKVLWNHNFLRHSDEQLSCNLCSTSNVPFQLKSSADLKIHWNKQHSSWSDLRSQATITVDGETKYVCPKCGKTVCLFSSLKDHMLSVHGEDKKFACEICPKVYSLRRRLTEHCKRAHGIGVSGDLRDLRKKSVVFTPEGERFKCPVCDKVMTTLVGVQEHMKVMHCDSHTEFFCSVCHEMYTNRRALREHYRRKHHGKFGIIRNETDDRNGLEGLNIDTSEVQVWEGNALKLKCLSCDKLFKTMHGLKCHMAVVHLQGEKNFVCSYCAKAFFLKKHLHRHYKTVHGINSTDKNKILEDQVAFVTDGVTHYKCPDCELVFPQFDSLRQHFVSHSTSEEQFTCDVCGKIFDREDRLRRHSKTHSGSGYQCPVCGKVINDPANFKVHMRFHTGEKRYVCEICGKRFV
QFASHYYHMFNHRESCDYQCSVCDKKYKSPAVLREHEKIHKGESYPCPTCGKEFPVHGKLQQHLKIHSTDRPYVCTYCNGSFKIRKYLVQHLKTHGAIKLR
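
The C2H2 pattern quoted in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be approximated with a regular expression and run against a protein sequence such as the one above. The sketch below is an illustration only: it treats the # positions as any residue (the Description does not say which residues are allowed there), the names `C2H2_PATTERN` and `find_c2h2` are hypothetical, and a plain regex is far less sensitive than the profile HMM search reported under Hmmscan Out. The test fragment is a short stretch copied from the protein sequence of this record.

```python
import re

# Regex rendering of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' (fold-stabilising position) is matched as any residue.
C2H2_PATTERN = re.compile(
    r"(?=("      # lookahead + capture group, so overlapping candidates are reported
    r"."         # '#'
    r"."         # X
    r"C"         # first zinc-coordinating Cys
    r".{1,5}"    # X(1-5)
    r"C"         # second zinc-coordinating Cys
    r".{3}"      # X3
    r"."         # '#'
    r".{5}"      # X5
    r"."         # '#'
    r".{2}"      # X2
    r"H"         # first zinc-coordinating His
    r".{3,6}"    # X(3-6)
    r"[HC]"      # final His or Cys
    r"))"
)


def find_c2h2(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for every candidate C2H2 finger."""
    return [(m.start() + 1, m.group(1))
            for m in C2H2_PATTERN.finditer(protein.upper())]


# Short fragment taken from the protein sequence of this record.
FRAGMENT = "RSFSCDMCGKSFPFKSRLIRHLTRIHLG"

print(find_c2h2(FRAGMENT))
# -> [(3, 'FSCDMCGKSFPFKSRLIRHLTRIH')]
```

Because the pattern contains variable-length gaps, a single start position can in principle satisfy it in more than one way; the regex reports only the first alternative found for each start position.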
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -