Pven011491.1
Basic Information
- Insect
- Pachypsylla venusta
- Gene Symbol
- -
- Assembly
- GCA_012654025.1
- Location
- CM022874.1:12193959-12209420[+]
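The Location value above can be split into its parts with a short script. This is a minimal sketch that assumes the field follows the layout `<sequence id>:<start>-<end>[<strand>]` and that the coordinates are 1-based and inclusive; the record does not state either convention, and the names `LOCATION_RE` and `parse_location` are hypothetical.

```python
import re

# Assumed layout of the Location field: <sequence id>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into sequence id, start, end, strand, and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Span length under the assumption of 1-based, inclusive coordinates.
        "length": end - start + 1,
    }


loc = parse_location("CM022874.1:12193959-12209420[+]")
print(loc)
```

Under the inclusive-coordinate assumption, the locus spans 15462 bases on the plus strand of CM022874.1.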
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and two conserved histidines coordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either histidine or cysteine. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm coord from  hmm coord to  ali coord from  ali coord to  env coord from  env coord to  acc
1   34  0.015     1.4       9.7    0.3   3               23            16              37            14              37            0.91
2   34  0.0056    0.53      11.0   0.0   1               23            43              66            43              66            0.91
3   34  0.00049   0.045     14.4   2.1   1               23            77              99            77              99            0.97
4   34  0.064     6         7.7    5.0   1               23            105             127           105             127           0.97
5   34  1.8e-05   0.0017    18.9   0.4   3               23            160             181           159             181           0.96
6   34  0.0098    0.91      10.3   1.6   3               23            189             210           189             210           0.96
7   34  0.00069   0.064     13.9   0.3   1               23            222             244           222             244           0.98
8   34  0.00024   0.022     15.4   2.1   1               23            249             271           249             271           0.98
9   34  0.00019   0.017     15.7   4.2   1               23            284             307           284             307           0.93
10  34  3.2       3e+02     2.4    1.2   3               23            314             332           312             332           0.89
11  34  0.37      35        5.3    5.5   1               23            338             358           338             358           0.97
12  34  2.7e-06   0.00025   21.5   0.5   1               23            364             386           364             386           0.98
13  34  3.1e-05   0.0029    18.1   0.4   1               23            392             414           392             414           0.98
14  34  0.00091   0.085     13.5   1.2   3               23            422             443           420             443           0.94
15  34  0.038     3.6       8.4    2.1   1               23            501             524           501             524           0.96
16  34  0.092     8.6       7.2    2.8   3               23            531             552           530             552           0.93
17  34  1.5e-05   0.0014    19.1   2.3   1               23            557             580           557             580           0.96
18  34  8.9e-06   0.00083   19.9   2.3   1               23            585             607           585             607           0.97
19  34  0.00021   0.019     15.6   0.4   1               23            614             636           614             636           0.98
20  34  1.4e-06   0.00013   22.4   0.5   1               23            642             664           642             664           0.98
21  34  0.093     8.7       7.2    4.9   3               23            1122            1143          1121            1143          0.96
22  34  0.0089    0.83      10.4   4.2   2               23            1248            1270          1247            1270          0.97
23  34  0.17      16        6.4    0.2   1               20            1307            1326          1307            1328          0.94
24  34  0.59      55        4.7    0.2   3               23            1364            1385          1363            1385          0.89
25  34  0.058     5.4       7.9    0.1   1               21            1611            1631          1611            1632          0.93
26  34  0.00077   0.072     13.8   1.9   1               23            1808            1831          1808            1831          0.97
27  34  0.0035    0.33      11.7   2.7   1               23            1837            1859          1837            1859          0.97
28  34  1.3       1.2e+02   3.6    0.4   3               23            1867            1888          1866            1888          0.92
29  34  7.8e-05   0.0073    16.9   2.4   1               23            1893            1915          1893            1915          0.95
30  34  3.4e-06   0.00032   21.2   3.3   1               23            1922            1945          1922            1945          0.97
31  34  0.065     6         7.7    0.7   2               23            1951            1972          1951            1972          0.98
32  34  3.3e-06   0.00031   21.2   0.1   1               23            1978            2000          1978            2000          0.98
33  34  2.5e-06   0.00023   21.6   1.0   1               23            2006            2028          2006            2028          0.99
34  34  0.0079    0.73      10.6   4.8   1               23            2034            2057          2034            2057          0.93
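The per-domain hits in the Hmmscan Out field can be read into records and filtered by significance. The sketch below is illustrative only: it assumes each hit is a run of 13 whitespace-separated values in the column order of the header, it uses just the first five hits from this record as sample data, and the `DomainHit` class, the `parse_hits` function, and the 0.01 i-Evalue cutoff are hypothetical choices rather than settings used by the database.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # '#': ordinal of this domain hit
    total: int       # 'of': total number of domain hits
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hits(text):
    """Parse whitespace-separated hit rows; lines without 13 fields are skipped."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(
            DomainHit(
                int(f[0]), int(f[1]),
                float(f[2]), float(f[3]), float(f[4]), float(f[5]),
                *(int(v) for v in f[6:12]),
                float(f[12]),
            )
        )
    return hits


# First five hits copied from the record above.
SAMPLE = """
1 34 0.015 1.4 9.7 0.3 3 23 16 37 14 37 0.91
2 34 0.0056 0.53 11.0 0.0 1 23 43 66 43 66 0.91
3 34 0.00049 0.045 14.4 2.1 1 23 77 99 77 99 0.97
4 34 0.064 6 7.7 5.0 1 23 105 127 105 127 0.97
5 34 1.8e-05 0.0017 18.9 0.4 3 23 160 181 159 181 0.96
"""

hits = parse_hits(SAMPLE)

# Illustrative cutoff (an assumption, not a database setting).
I_EVALUE_CUTOFF = 0.01
confident = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in confident:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, i-Evalue {h.i_evalue}")
```

With this cutoff only the fifth of the five sample hits passes on its own. Many weak individual hits in a long tandem array can still be meaningful in aggregate, so a per-domain cutoff like this is best treated as a rough filter.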
Sequence Information
- Coding Sequence
- ATGGATATTGATTGGGACATGCTGTTGAACCAAAAATCTACCGCCTGCTTCATATGTAACAAAAACTTCTCCTCTGCCCGAGTTATGAAAAGACACGTTGAAGAAGTTCACACAATGGACCGAAACTATCCTTGTGATGAATGTGGAAAGATGTTTGTTTCGGCCGCCCGAGTCGGTATTCACAAGAGAAATGTACACATGGGATTCAAACCAGAATCACAAACAAACTATGTGTGTCACTATTGTAATGCGATCTATACGAATAAAGTGTTACTTCGAGATCATATTTGTGCACATATGGGCGTGAAACTTCATCAGTGTGACATTTGTCAAAACTATTATTCCTCTGCAAAATGTTTGAAAAGACACAAACTTGTACATAGCAAAGTTGCTAAAACCAAATCCTTTGTAGTCCCTGACATAGAGAATTTATCTGCCGATCAGTATAAGGATTTAGTAAAAAGTAAAGTGAAATTCTGTCCTGTATGTGGTAAAGAATTTTCTACAGCAAGATATATGAGGAAACATTTGAGAGAGATACATTCAACAGATAAAAACAATTGGTGTGATCAGTGCggaaatcaatacaaatctaAAAGTAGTCTCACTGAACACAAGAAATATATGCATTCAACCAAACACATTGAacgcaaagaaaagaaatttgaatGTACGATATGTGGTGCCATGCAAGCGACAAAATATACGCTAAGAGATCATATGAATACACACACCGGGAACAAGTACACGTGTGAAATATGTGGAAATAGTTATGCATCTGTGAAACATTTGAGACGCCATAAATTGAAACATCGGAAAGAGGAAGGCACGTTACCCGATTATATGATATACAAGTGTGACGTGTGTCATCGAGAGTTTACACAAAATcatatgttaaaaaaacataaagaatGGGTCCATGGTGACAAATACCATTTGTGCAAAATATGTGGCGCCAAAATCAAAGGAAGTTTACGTCAACACATGGTGACACATAcgggagagaaaaaatattgttgtcaTATATGTGGCAAGAAATTACGcggaaaattaaaagaacataTGTTGACACACACGGGAGAAAGACCCTATGCTTGTGAAGTATGCGGCAGTACGTTCAAAGATAAATGGTATTTGAGAATACATATGAGGAAGCATAATGGTGAAAGACCTTATATGTGCAGCGAGTGCGGACAAAGCTTTGCCGCAAGAAGTGCATTCACGTTACATTTGAGAAAACATAACGCACCTGTACCAATTGCTTGTAATTATTGTCAACtcacttttacatcaaagccgCTTTTGAAAGAACATATTGTTAAATGTCATTTGATgGCTcagacaaaacaaaagaaaaaacgcACAACCGGTGCAAGTAAAAAGTCAAGATCAAATAAAGCATACATGGACGAATTACTGAAAACACTTGATATTGAAAGGAAAattgttaaaagaaaatatgaaaaacgaCCTAAAACTGAACCGTCTGACGAGAATCTTCATTCGTGTGATCAGTGCGACTTTGttacaaacaacaaaaaaactttaGGCTCTCATATTCTTCGTCGCCATGTTGGATACACCGTATTTTGTAAAGAATGTAACGTTGGGTTTTATAGTAGCACACATCTTCACTCACATGGTATAAAAGTTCACGGGTTACCTCCGTTCATGTGTGAACATTGTTCAAAACCGTTCACTTCGAAAGGAAATTTAACAGTtcatgtaaaatattatcacgCCAAAACATTCTTTGCTTGTGAAGTTTGTGGTAAaacgtttaattttaaaacaagttATAAACGTCATTTGCGTCAACATTCTGATTCTGTGACGTACTATCCGTGTACAGTTTGTCAGAAGAATTTAAGTTCGCCCTATCGATTAAAAACGCATATGTTAATTCATGCAAATAATCGATCGTTTACTTGTGAGATTTGTGGAAAAGGGTTTATACAGAAACGATATTTGGAAGAACATAGACGTGTACACACGGGA
TACAAACCGTATGATGGGTACGAACAAGATACAAGACATTCAAATACAGAAAAtactcgtaacgattccaattGTGATTTACTCATTAAAAATTATGATTGCTCTGAAATagaaaatcaatcaattattgcaaatttcaataaaatgaaatcaaaagacAGTGTACCTCAATTCAGTAATATGGATTTAAAAATGAGAGATGGTTTCATGCTTAATTCTAAACACATTTGCACTAATTTTAAAGTATCTACAAAAAATTTGGCCCAGCCTCAGATAGAACGAGATGAAGCCAATAATAATGTAACCCTACTTGAGAAAGGGACGCATGAAAGTACTAAACAAGCTGTAGATGAAGAGATGGCCAATAAATATTCACAAACAAATAGAACTGAAGAAATTGCTTACAGTCAAAATGAACGAATTGAAGATGTGCACAGGGAAGCTTTCCTCGACGACGGGGCGCATGAGAGTACAGAAAAAGCTAAAGAAGTGGTCCTTAAATCTTAtgaagatatgaaaaaaaataattcggaACCAAATCTAACGGAAGAAATTGCTTACAGTCAAAGAGGACGAATTGAAGATGTGCAGAGTGAGTCTGTCCTCGACGGAGGGACGCCTGAAAGTATAGAACAAGCTAAAGAAGTGGTCCGTAAAACTAATAAAGAGATGACCAAAAATAATTCGGAACCAAATCTAACTGAGGAAATTTTTTATAGTCAAAGAGGACAAATTGAAGATGTGCAGAGTGAGACAGTCCCCGACGGAGGGACGCATGAAAGAATAGAACAATCTGAGGAAATGGTCGGTAAAACTGGTAAAGAGATGACTTATACTAATTCGGAACCAAATCTAACAGAAGATTTTGCGTATAGTCAAAGAGAAAGAGTTGAAGATGTGCAGTGTGAAACAGCTCACTTCGAAGAAACGACGCACGAAGATAAGATCGAAGAAGCTAAAGAAGAAGTCCTCATAACTGAACATGCAAATGAAGAAGAATGTATAAGTACTTCTAAACGAAAAAGACAACGAAAACAATCTATGCATAGTCGAAAATGTTCAGAATTAATAATTaaagatagaaatgataatattcTAGATTCTATTCCATATTCTATTATTAAACCATTCTTCAAAACCCAATTTAATAGTTCTATCCATACaatcgaaaaatttaaaaaaatggtacagaaaGCTAACGAATGGATATCCAAAGAAGAGAGGCGACTGACCGACAGTAGGCCAGAACTAACTGTTAAACTTACTACCAGTGTTCAAAAtgtaaatgaggaaaaaaattctgataGTGTTGATGAAAAATATCATCTAGACGTGATTACGCAAAATAAGAAAAGACAACAAGCATGTCCAGCAtgctcaaaattgttttttcataaaaaatctttaaaaacgCACATTAAGAGTTGTCATATAATAtgcaaattttgttttgaaccCAAAATTGATTTCCCAACATTTTTATGTCATCTTTCAGACTCACATGATGTTTTACCGAATATCTATTTAAATCAGTTGGAAAGACACGATGTTgaccattttatatttttacgtgagtctccaaaatataaattcacCTGTGTTTTATGTAGTCAAAAGTTTACCgcagaaaaaagttgtattGACCATATTTTAAAATGTCACTATCATAAAATCTATATCTGGAAAGACAAAGGATTAATTGAAAAACAtaacaatcaattttttatccaaaaattCAGAAAGAAATGTCCTTACTGTGATATGTATTTCTATCAAGAtgtacattttcatttacaCATGCGCAGAGATCATTATTCATACTATTTTTGTAAATACTGTTTACATTACATTTCTCAATATGATTTTGAAATGCATTGGAAAATATGCAATCAGAATAAGTTACCTTTACAGGATATAGAGTATACATGTGATCTCTGTGATGACATATTTCAAAGTGCGGAAGAATTAACCCATCATAGTCAAAATCGTTATTTGGGTAA
CTGTTTGTTCTGTGTACAAGATAAGTTTTGTAGTGCAAAAGATTTTAGTGAACATGAATGGAAGCGGCACGCCGATCAAGTGTACGGTATGTGTGCAAGTTGTCAAGTTATGTTTTACAACAAACAAAGTTTACTTGTTCACGTTGATCTGTACCATTGTAATCGAGCGCATCGTTGGGATTTGGTTTCCTGTAACTCCTGTTCTTCATTACCATATACCCGTTTCTTCATTTACGAATCGTTCATTGATCATTTCCTTTCTTCTCACTGTCTTCTTATTGACAGCCATTTAGGATCTCactcagaaaataataaattctattttgatGTCGTCCCCCGTGCGTTTACCAATATGTTTATTAATATTGAATGTGGTTTTTGTGATAATACTAACAAATCATCTTCCATTAAAATTGATAGCCTTGAAGATTttgttgaacattttggaatttTTCATTCACAAGCTTCAGTTGGTCTATGCGTACCTTGTAATTACACGCCTCATCCTATTGATTCATTGGATTTTCATCTATTTTTAGAACATGTTTCATTTGATGATAATAGACCTCAGTGTTTGGTGTGCCGCTCTTATTTTAATTCTTACCAAACGATCGTCACACACATGGTTATGAGTCATGGCATTTTCATCGGTGTCATCCCAGATCATGTTATGAATGACATTGTGAACCAGAGTTCACCATCTTCATGCAGAATGAGCTGTGGAGTATTGAATACCAGGGCGTACATTTGTAAATATTGCTCCTTGACTGTTCATGGTATATTAGTATACCTTGTTCATTTGAATGTAAATCATGGTATTATAGAGACACAGTTTGTGTGTCAAATATGCAAAATCAGTTTTGCTGCAATCAGTACCTTTTTAAAACACCTAATCAGACAACGATTATTTGCAATAGTGTCCAGTGACCAAGTCTCTAAAAATATAGATTGTTTAAATGAACATAACTATTGCCTGCAGGAGATTGCTAAAGATATTGAACCAAGAGACCAAGGTAGCTCTGAAGGTGGTGACCATAGGGAACTGTGTAGCTCTGAAGATGGCGACCACAGGGAACTGTGCAGCACTGAACATAATGTGCTTATGGAACATGACTATTCGTATTATCCAAGAAGACCGTTTAATAGAAAATCTTCCAGAATATCTTGTCAAAGGCTGTCCTCCAAAAAGAAACCGACCCGAAAAAAACGAAATACCAACCAAGTGATTGAGGCAGAGATGTccaagaaaaagaggaaaagtTCAAACTTGGAGGAACCTTCCTTAAAATTAGAGGATTTAAAATCAGAATACGATCTTGTGGAGAAGAAAGACCTGCCGAGTGATGACGACTCCGACGCGGACGAGGGGAATGAAACAGAAGACAATGACAATTCCGACAAGATTTTGTTCCCCGACATGAAATTTCCCTGTAGATTATGTGACCAAACCTTTGATAGGAAAAGCCAACGAAAGACACACGAGAGAAAAGGTCACCAAACACAAGGAACATTTAGCTGTGATATTTGTAGCTATACAACGaatcagaaatattatttgGTTAAACATAAAAGCCGCCATATCAAAGAGTATACGGTGTTCTGTAAAGAATGTCAACTTGGATTTTTATCAAAGAATGAACTCAATGTTCACAATATTAAAGTCCATGACGCTCAACCTCACGCATGTCCTGTGTGCAAGAAAATATTTGTCAATAAGTTCAACTTAACAACGCATAAAAAGTTACATTCAGAACTTAGCCGGAGTCATCAGTGCGACACTTGTGGAAAGTCCTTCACTGGGAACAATCATTTGAAACGACACATTTTGAGTGTTCACATGAAAAGAGGCGCTCAATGTAAAGTTTGTTTAAAAGTACTCAACTCTCACGAAGCGCTTTATGCTCATAGTAGAATTCATAATGATGAACGACCGTTCATTTGTCCGACTTGTGGGAAAGCATCTCGTACAAGTAGTGCTCTGGTGGTGCATTTGAGAACAC
ACACAGGAGAAAAACCTTACAAATGTGAATATTGCTCCAAAGGATTCACGCAGACAAGTTCTCTAGCTGTTCATCGTCGAATACACACGGGACAGCGGCCACATGCTTGTCACATTTGTCCGGATAAATTTATTACGAAATCTTTACTCAATCATCATATGACCAATAGACACAAGAATGCCAATATAAACGTTGACCCTGTCCCGCATGCAGTACCGGAGCTTTTGAACGATGTAAAGATAGGAACTAGTTAG
- Protein Sequence
- MDIDWDMLLNQKSTACFICNKNFSSARVMKRHVEEVHTMDRNYPCDECGKMFVSAARVGIHKRNVHMGFKPESQTNYVCHYCNAIYTNKVLLRDHICAHMGVKLHQCDICQNYYSSAKCLKRHKLVHSKVAKTKSFVVPDIENLSADQYKDLVKSKVKFCPVCGKEFSTARYMRKHLREIHSTDKNNWCDQCGNQYKSKSSLTEHKKYMHSTKHIERKEKKFECTICGAMQATKYTLRDHMNTHTGNKYTCEICGNSYASVKHLRRHKLKHRKEEGTLPDYMIYKCDVCHREFTQNHMLKKHKEWVHGDKYHLCKICGAKIKGSLRQHMVTHTGEKKYCCHICGKKLRGKLKEHMLTHTGERPYACEVCGSTFKDKWYLRIHMRKHNGERPYMCSECGQSFAARSAFTLHLRKHNAPVPIACNYCQLTFTSKPLLKEHIVKCHLMAQTKQKKKRTTGASKKSRSNKAYMDELLKTLDIERKIVKRKYEKRPKTEPSDENLHSCDQCDFVTNNKKTLGSHILRRHVGYTVFCKECNVGFYSSTHLHSHGIKVHGLPPFMCEHCSKPFTSKGNLTVHVKYYHAKTFFACEVCGKTFNFKTSYKRHLRQHSDSVTYYPCTVCQKNLSSPYRLKTHMLIHANNRSFTCEICGKGFIQKRYLEEHRRVHTGYKPYDGYEQDTRHSNTENTRNDSNCDLLIKNYDCSEIENQSIIANFNKMKSKDSVPQFSNMDLKMRDGFMLNSKHICTNFKVSTKNLAQPQIERDEANNNVTLLEKGTHESTKQAVDEEMANKYSQTNRTEEIAYSQNERIEDVHREAFLDDGAHESTEKAKEVVLKSYEDMKKNNSEPNLTEEIAYSQRGRIEDVQSESVLDGGTPESIEQAKEVVRKTNKEMTKNNSEPNLTEEIFYSQRGQIEDVQSETVPDGGTHERIEQSEEMVGKTGKEMTYTNSEPNLTEDFAYSQRERVEDVQCETAHFEETTHEDKIEEAKEEVLITEHANEEECISTSKRKRQRKQSMHSRKCSELIIKDRNDNILDSIPYSIIKPFFKTQFNSSIHTIEKFKKMVQKANEWISKEERRLTDSRPELTVKLTTSVQNVNEEKNSDSVDEKYHLDVITQNKKRQQACPACSKLFFHKKSLKTHIKSCHIICKFCFEPKIDFPTFLCHLSDSHDVLPNIYLNQLERHDVDHFIFLRESPKYKFTCVLCSQKFTAEKSCIDHILKCHYHKIYIWKDKGLIEKHNNQFFIQKFRKKCPYCDMYFYQDVHFHLHMRRDHYSYYFCKYCLHYISQYDFEMHWKICNQNKLPLQDIEYTCDLCDDIFQSAEELTHHSQNRYLGNCLFCVQDKFCSAKDFSEHEWKRHADQVYGMCASCQVMFYNKQSLLVHVDLYHCNRAHRWDLVSCNSCSSLPYTRFFIYESFIDHFLSSHCLLIDSHLGSHSENNKFYFDVVPRAFTNMFINIECGFCDNTNKSSSIKIDSLEDFVEHFGIFHSQASVGLCVPCNYTPHPIDSLDFHLFLEHVSFDDNRPQCLVCRSYFNSYQTIVTHMVMSHGIFIGVIPDHVMNDIVNQSSPSSCRMSCGVLNTRAYICKYCSLTVHGILVYLVHLNVNHGIIETQFVCQICKISFAAISTFLKHLIRQRLFAIVSSDQVSKNIDCLNEHNYCLQEIAKDIEPRDQGSSEGGDHRELCSSEDGDHRELCSTEHNVLMEHDYSYYPRRPFNRKSSRISCQRLSSKKKPTRKKRNTNQVIEAEMSKKKRKSSNLEEPSLKLEDLKSEYDLVEKKDLPSDDDSDADEGNETEDNDNSDKILFPDMKFPCRLCDQTFDRKSQRKTHERKGHQTQGTFSCDICSYTTNQKYYLVKHKSRHIKEYTVFCKECQLGFLSKNELNVHNIKVHDAQPHACPVCKKIFVNKFNLTTHKKLHSELSRSHQCDTCGKSFTGNNHLKRHILSVHMKRGAQCKVCLKVLNSHEALYAHSRIHNDERPFICPTCGKASRTSSALVVHLR
THTGEKPYKCEYCSKGFTQTSSLAVHRRIHTGQRPHACHICPDKFITKSLLNHHMTNRHKNANINVDPVPHAVPELLNDVKIGTS
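The C2H2 pattern given in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be tried directly against the protein sequence as a regular expression. This is a rough sketch under stated assumptions: the positions marked # are treated as any residue (a simplification, since the pattern does not say which residues they admit), only the first 70 residues of the protein above are scanned, and the names `C2H2_PATTERN` and `excerpt` are hypothetical. A plain regex is much cruder than the profile HMM behind the hmmscan results, so the two are not expected to agree everywhere.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], with '#' and 'X' both matched by '.'
C2H2_PATTERN = re.compile(
    r"""
    ..        # '#' position followed by X
    C         # first conserved cysteine
    .{1,5}    # X(1-5)
    C         # second conserved cysteine
    .{3}      # X3
    .         # '#' position
    .{5}      # X5
    .         # '#' position
    .{2}      # X2
    H         # first conserved histidine
    .{3,6}    # X(3-6)
    [HC]      # final histidine or cysteine
    """,
    re.VERBOSE,
)

# First 70 residues of the protein sequence in this record.
excerpt = (
    "MDIDWDMLLNQKSTACFICNKNFSSARVMKRHVEEVHTMD"
    "RNYPCDECGKMFVSAARVGIHKRNVHMGFK"
)

# Report non-overlapping matches as 1-based, inclusive coordinates.
matches = [(m.start() + 1, m.end()) for m in C2H2_PATTERN.finditer(excerpt)]
for start, end in matches:
    print(start, end, excerpt[start - 1:end])
```

On this excerpt the regex reports two candidate fingers, at residues 14-37 and 43-66, which coincide with the envelope coordinates of the first two hmmscan domains.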
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -