Pser006916.1
Basic Information
- Insect
- Polyplax serrata
- Gene Symbol
- -
- Assembly
- GCA_037055365.1
- Location
- JAWJWE010000036.1:14353224-14359074[-]
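The Location field packs contig, coordinates, and strand into one string. A minimal sketch of splitting it into typed fields follows; the `GenomicLocation` type and `parse_location` function are hypothetical names introduced here, and the format `contig:start-end[strand]` is inferred from this single record rather than from any documented specification.

```python
import re
from typing import NamedTuple


class GenomicLocation(NamedTuple):
    """Hypothetical container for the fields packed into the Location string."""
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], with strand '+' or '-'.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> GenomicLocation:
    """Split a Location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomicLocation(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("JAWJWE010000036.1:14353224-14359074[-]")
print(loc.contig, loc.start, loc.end, loc.strand)
```

Whether the coordinates are 0-based or 1-based, and whether the end is inclusive, is not stated in the record, so any length computed from `start` and `end` should be checked against the assembly before use.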
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers indicate the number of residues (a range in parentheses gives the minimum and maximum). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 42 0.0096 0.56 10.2 1.2 5 23 59 78 56 78 0.92
2 42 0.0006 0.035 14.0 0.9 2 23 88 109 87 109 0.97
3 42 0.55 32 4.7 2.0 1 23 115 137 115 137 0.96
4 42 0.088 5.1 7.2 0.5 1 23 143 166 143 166 0.97
5 42 0.00019 0.011 15.6 2.2 1 23 170 192 170 192 0.99
6 42 1e-06 5.8e-05 22.8 3.2 1 23 202 224 202 224 0.98
7 42 0.0005 0.029 14.3 1.2 1 23 231 253 231 253 0.97
8 42 3.6e-05 0.0021 17.9 0.1 2 23 260 281 259 281 0.97
9 42 2.5e-05 0.0014 18.4 0.9 1 23 287 309 287 309 0.97
10 42 1.5e-05 0.00089 19.0 1.5 1 23 315 337 315 337 0.99
11 42 6.3 3.7e+02 1.4 0.1 2 19 427 443 426 446 0.85
12 42 0.0006 0.035 14.0 1.1 2 23 455 477 454 477 0.91
13 42 0.39 23 5.2 5.8 1 21 480 500 480 503 0.88
14 42 0.45 26 5.0 1.2 2 23 512 534 511 534 0.93
15 42 0.00074 0.043 13.7 1.2 2 23 543 564 542 564 0.97
16 42 0.0016 0.094 12.7 1.0 1 23 570 593 570 593 0.93
17 42 4.2e-06 0.00024 20.8 1.8 1 23 597 619 597 619 0.98
18 42 1.9e-06 0.00011 21.9 1.4 2 23 631 652 630 652 0.97
19 42 4.6e-05 0.0027 17.6 0.2 1 23 659 681 659 681 0.97
20 42 3.3e-06 0.00019 21.1 1.2 1 23 687 709 687 709 0.98
21 42 8.2e-07 4.8e-05 23.0 5.2 1 23 715 737 715 737 0.98
22 42 5.4 3.1e+02 1.6 0.2 8 23 801 817 795 817 0.92
23 42 3.2e-06 0.00019 21.2 2.1 1 23 821 844 821 844 0.98
24 42 0.057 3.3 7.8 5.0 2 23 852 873 851 873 0.96
25 42 3.2e-05 0.0018 18.0 0.6 1 23 879 902 879 902 0.94
26 42 3.6e-05 0.0021 17.9 0.5 1 23 906 928 906 928 0.99
27 42 0.44 25 5.0 1.3 1 23 937 961 937 961 0.65
28 42 0.00023 0.013 15.4 2.3 1 23 968 990 968 990 0.99
29 42 3.8e-05 0.0022 17.8 1.2 1 23 996 1018 996 1018 0.98
30 42 7.4e-05 0.0043 16.9 1.0 1 21 1024 1044 1024 1045 0.96
31 42 0.26 15 5.7 2.8 2 23 1129 1149 1128 1149 0.97
32 42 0.018 1 9.4 0.9 3 23 1158 1178 1156 1178 0.91
33 42 0.0051 0.29 11.1 2.6 1 23 1181 1204 1181 1204 0.96
34 42 0.082 4.8 7.3 3.7 1 23 1211 1233 1211 1233 0.96
35 42 0.28 17 5.6 1.9 3 23 1241 1261 1240 1261 0.97
36 42 0.00099 0.058 13.3 0.3 1 23 1267 1290 1267 1290 0.97
37 42 5.9e-06 0.00034 20.3 1.0 1 23 1297 1319 1297 1319 0.97
38 42 1.2e-05 0.0007 19.4 2.9 1 23 1329 1351 1329 1351 0.96
39 42 0.4 23 5.1 2.1 1 23 1357 1379 1357 1379 0.96
40 42 5.1e-06 0.0003 20.5 1.8 2 23 1386 1407 1385 1407 0.96
41 42 1.9e-06 0.00011 21.9 1.3 1 23 1413 1435 1413 1435 0.98
42 42 8e-05 0.0046 16.8 0.9 1 23 1441 1463 1441 1463 0.98
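The pattern quoted in the Description field can be written as a regular expression. The sketch below is one possible reading of it, not the method used to produce the Hmmscan output: it assumes that both `X` and `#` may be any residue (the record only says `#` positions matter for the fold, without restricting them), and the names `C2H2_PATTERN`, `is_c2h2_like`, and `find_c2h2_candidates` are hypothetical. A regex hit is only a candidate; the profile HMM scores in the table remain the stronger evidence.

```python
import re

# Pattern from the Description field:
#   #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# Assumption: '#' and 'X' are both treated as "any residue" ('.').
C2H2_PATTERN = re.compile(
    r"."        # '#': fold-stabilising position
    r"."        # X
    r"C"        # first zinc-coordinating cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second zinc-coordinating cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first zinc-coordinating histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def is_c2h2_like(fragment: str) -> bool:
    """True if the whole fragment fits the pattern from start to end."""
    return C2H2_PATTERN.fullmatch(fragment.upper()) is not None


def find_c2h2_candidates(protein: str) -> list[int]:
    """Return 1-based start positions of all candidate matches, overlaps included."""
    lookahead = re.compile(f"(?={C2H2_PATTERN.pattern})")
    return [m.start() + 1 for m in lookahead.finditer(protein.upper())]


# A 23-residue stretch copied from the Protein Sequence field of this record.
finger = "YKCQICNKGFVTKTLLSGHRKTH"
print(is_c2h2_like(finger))
print(find_c2h2_candidates("GERP" + finger))
```

Using a lookahead in `find_c2h2_candidates` reports every start position rather than only non-overlapping matches, which matters in a protein like this one where fingers sit back to back.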
Sequence Information
- Coding Sequence
- ATGGAAAAATATCTCCtggaaaatagaaaggaaagACAGGggaaaaaaccaaagaagGGGCGAAGGAACAACGCTTCTGGAACTCTTGATAATACTAAGAAGGATGACAGAAACTCTTCTGAAACCAGTGACAATTCAAGAGTTTTCAGCGATCAAATCCCTGGGCACTGTATTATTTGTAATCGTGACTATGTTACACAAGAGAACTACAACAACCATATGAAAAACGTTCATAATAAAGGATCAAAATTTCCGCCGGTGTCTTGTCCTGTCTGCCTGAAGACTTTAAAGAACAAGTACTTTCTCAAGACTCATTTAAAAATCCATGGAGGCGCAGCAGACTATGAGTGTCCTCACTGCGAATATAAGaacaacaatttgttttatatgaAGATACATATTGACAGGCATAAAAAACAGCCATCATATCAGTGTGAGATATGCAGCGACAATTTCTACGAGAAGTCTATTTTGCAGACGCATATTCAAGTGAAACATGGCCGGGGATATGAGTGCAAGACTTGCGACAAAAGGTATGGAACCAAGCAAAGATTGAGGGAACACGAGAGGACTCATGACCCTAGCTATATGCAAGTAAACAAACATCAGTGTGAGGATTGTGGAAAGACCTACAAGTACAGGGCGCAGCTGAAGACTCACATGCTCAAGCATCGGGGCTTGGATGTCAAATACGACTGTAACATTTGCGGCAAGGTGGTCACCACCAAAAAGTCGCATGCGAATCACATGAAGATACACACGGGAGAGAAGTCAAGCGTTTGTGATATTTGCGGGAAAGCATTCACGATCGAAAAGTACCTAGTTGTGCATAGGAGAACTCACACGGGAGAAAAGCCTTACATGTGTCAAGTGTGTGGGAAACGCTTCACACAAAGAGTGACTTTGGTGATACACAATCGCTACCACACTGGTGAGAGACCATACAAGTGTCAGATCTGCAATAAAGGTTTTGTAACCAAGACTCTCCTCTCTGGTCATCGCAAAACACATGGAAAGAATGGAAAGAAGTCTGCGTTTAGGCCCAAGTTGGGTAGGATGAAACTGCATGGGACCAGGAAAAATCAAGCTAAGAGGATATGGCGCCACGAAAACGAATCGACCAAGCCGAAGACAGGAGAAGTTGAATGTGAAAGCTTGTCCTACTCGTGTTTTgaaccagaaataaaaatagagtcAGGTACGGGAAATACAGAAAACGCAACAACAAACAATCAATCGTCGCCCGATCACTACTTGTCAGAGTCGTTACCCGCGGTACGATGCACCCTCTGTGGCGAAATGGTTCCTGCAAGGAATTTTGAAAAGCACAAATTCGTTGAAGAGGTGTACCTATCATCCATGAAATGTCGGATATGTAATATAACGTTCAAGAAGAAATCGGCCTTCAGGATTCACATTTACGACCAACACAAAATCCACCACTGCAAGCTCTGCTCTGAGGTGTTTACAAAGCGACAGCATTATCGCATGCACAAGGGACTCGTACATGACTTGAAGAAAAAGCCACGGGTTCCGTGCGATCTCTGCTGCAGAACTTTTGCTAGCAAACAGATATTGAAACGGCACCAGAATGAGGTTCATTTTGGTCAGAGAGAACCAGAAACTAAGTGTCCCCAATGCGATTATAAATCTTATAATCGCTCTAACATGAAAGTTCACATTGACAAGCATAACAACACACCTGCGTTCGTTTGCGACCTGTGTGGAAAGGGCTGCTTTACGAAGACCTCATTGAACGACCACACAGCAAACGCACACGGGCAAGGATTTAAGTGTGATATTTGTGGGAAATTCTACAAGAACGAGAGCAGTTTGAAGACACACTCAAAAATCCATTTACCTGGTTTTGATCCGGCTGCACTGAAGCTCCAATGTGAAGAATGCGGGGAAATATTCAATCATAAGAGTTCGTTGAACAAGCACTTGCTCAAGCATCGTGGCCTAGACAAAACATACGACTGTGATGTGTGCGGAAAA
AAGCTCAGTTCCAAAGGGTCCTACAGGTCACATATGGCAATACATTCGGGGTACAAACCTTACGCTTGCGAATACTGTGACAAGAGATTTGGCGATAAGCAGTACCTGACGCAGCACAGAAGGGTGCACACAGGCGAGAAGCCATTTAAGTGCGATGAATGCGGCCAGTGCTTTAGCCAGAGATCTAGTCTTAATAGACATAAAAGGTATCATATTGGAATGCGACGGAatgtatcaaattttaaatccgGGCAAACAGAAGAAAACCGTGAGATACgagaagttcaaaattttgtttctgaagTAAACGTGGACTTTGTAAAGACGGAAGCACAAACGTATTCTAATTTGGATGACAACTGTCAAGAGGAGAGTAATCAGGAGCAATGTGAACTGCGCAGAAAGACCTTTGCTGCAAGACAGGCGCTGCAAGTACACCAAACGACGTCACATAAGGAGAAATACGTTTGTGATGTGTGCGAAAAGACGTACATTTCCCGCTACCACTTAAAACAGCACATGAACAAAGTTCACCCTCAGAATGCAATGCACTTGAAGTGTCATCTCTGCGACTATCGAACGCACAACAGGTTCAACCTAAAAGGTCACATTGACCGGCATAACAAAGCGGCTTCCTTTATTTGTGATGAGTGCGGAAAAGGCTTCTATTCGAGGAGCACACTGGAGGAACACAAAAAAGGCAAACATGGGCCTGGTTTCCGGTGCGACGTCTGCGGTGCCGTCCTCACGAGTATGTCAAATCTCAGGCAGCATAAAAAGACTCACGAACCCAAGTCAGCAGGTGTCAAATACCAGTGCGACATTTGCGGACAGGGGTATCGGGCGAGGTCAAAATGCGCTTTTCGATTGCACGTGCTCAAGCATCAGGGCATAGTCAAACAATTCAAGTGTTCGGACTGCAACAAAGTCCTGACGTCGGACAACAGCTACCGACAGCATTTGAGGAAACACTCTGGAGAGAAACCATTCATTTGCGAATTTTGCAGCAAGGCATTTTCAGAGAAAAAATACTTGCAGGTGCACAGGCGGATACATACAGGAGAAAAACCTTACGAGTGTGACATCTGTGGTGTTTGTTTTAATCAGCGGTCCACTATGACGTCACACAGGAGGGAATTTTTGGATTCTTCCGATCTCAAAATTGAGAATTGGATTCTTACTGAGAACGTGACGGAAGCGTTCAGTAGACTTATCGAGCCCGGTTCAACCAGTGAGGCACATGCGTCTGACATGCAGGCAGAGCAAACGGAggagacgaaagaaaaaataacggaGCGGAAAAAAATAGTGGAAAGGGATAAGAAGATGAGAGTGAAGAACTCGCGACTTGGACCACTCTCGGAACTATTCTCGAATCCTATAATGAAATGCGAATTCTGTGGGGCCATTTATCATGAATCAGAGATGATACACCACAGGAAAACTCACCCAGAAATGCGGGAAGCTCTTGAATGCGCCCCCTGTAACAAATCGTTCTCGAGCCAAACACAATTTGATAGTCACAGCAATAGGCACAAAATATTCTTCTGCGAGGCCTGCAACAAAGACTTCAACCCTAGGGAGAAGTACACGAAGCATATGAAGCGGAAACATGGTAATCTTCCCAAGACGCATATGTGCGACATCTGCAAGAAAACCTTCCTCGAAAAGTTCGAATTAAATATGCACAAGAAGAATCACTCAGGAGTGAAGGACAAGCATTGTCCACTTTGCTCGTACGCAACAAACAGCACGACCTACATGAACCTGCACATTAAAAAGCACAACAACGAATACGTATTTCAGTGTTCTACCTGTGGGAAAGGTTTCCTCGCTAATAACCAACTTCAAGCGCACATAAATTCTAGACACGGCGATGGTGGTCAGCTGTTCCCTTGTGAGGTCTGCAACAAGACCTATTCAAGCAAGGGAAACCTTAATGAGCACAAAAAACAACACGAACCTGGCTACAAGCCAGAGAAGAGTCACCAGTGTGAGGA
GTGCGGGAAGGCCTTTCCGAGGAAACATAAACTGTTGAAACATGTGCTGCAACACCGCGGTGTCTTCAGTTTTAAGTGTGTACCCTGCAGAAAAGGATTTTCTTGCAAGGTGTCTCTGGACAATCATATGAAAATACACACGGGAGAAAAGAGCTGCGTTTGCGACGTTTGCGGGAAGTCTTTCACTGTGAAAAGGTATCTGGAGGTGCACAGGCGCTCACATACCGGCGAGAAACCTTACAAATGCGACCAGTGCGAAAAGGCGTTCACGCAACAATCCTCCTTGGTTGTGCATAAGAGGTACCATACGGGGGATCGGCCTTACTCTTGCAACGTCTGCAACAAGGGATTCGTCACGGGTACCTTGCTTAAGAGTCATCAAAAGAACCATTTCTGCTGA
- Protein Sequence
- MEKYLLENRKERQGKKPKKGRRNNASGTLDNTKKDDRNSSETSDNSRVFSDQIPGHCIICNRDYVTQENYNNHMKNVHNKGSKFPPVSCPVCLKTLKNKYFLKTHLKIHGGAADYECPHCEYKNNNLFYMKIHIDRHKKQPSYQCEICSDNFYEKSILQTHIQVKHGRGYECKTCDKRYGTKQRLREHERTHDPSYMQVNKHQCEDCGKTYKYRAQLKTHMLKHRGLDVKYDCNICGKVVTTKKSHANHMKIHTGEKSSVCDICGKAFTIEKYLVVHRRTHTGEKPYMCQVCGKRFTQRVTLVIHNRYHTGERPYKCQICNKGFVTKTLLSGHRKTHGKNGKKSAFRPKLGRMKLHGTRKNQAKRIWRHENESTKPKTGEVECESLSYSCFEPEIKIESGTGNTENATTNNQSSPDHYLSESLPAVRCTLCGEMVPARNFEKHKFVEEVYLSSMKCRICNITFKKKSAFRIHIYDQHKIHHCKLCSEVFTKRQHYRMHKGLVHDLKKKPRVPCDLCCRTFASKQILKRHQNEVHFGQREPETKCPQCDYKSYNRSNMKVHIDKHNNTPAFVCDLCGKGCFTKTSLNDHTANAHGQGFKCDICGKFYKNESSLKTHSKIHLPGFDPAALKLQCEECGEIFNHKSSLNKHLLKHRGLDKTYDCDVCGKKLSSKGSYRSHMAIHSGYKPYACEYCDKRFGDKQYLTQHRRVHTGEKPFKCDECGQCFSQRSSLNRHKRYHIGMRRNVSNFKSGQTEENREIREVQNFVSEVNVDFVKTEAQTYSNLDDNCQEESNQEQCELRRKTFAARQALQVHQTTSHKEKYVCDVCEKTYISRYHLKQHMNKVHPQNAMHLKCHLCDYRTHNRFNLKGHIDRHNKAASFICDECGKGFYSRSTLEEHKKGKHGPGFRCDVCGAVLTSMSNLRQHKKTHEPKSAGVKYQCDICGQGYRARSKCAFRLHVLKHQGIVKQFKCSDCNKVLTSDNSYRQHLRKHSGEKPFICEFCSKAFSEKKYLQVHRRIHTGEKPYECDICGVCFNQRSTMTSHRREFLDSSDLKIENWILTENVTEAFSRLIEPGSTSEAHASDMQAEQTEETKEKITERKKIVERDKKMRVKNSRLGPLSELFSNPIMKCEFCGAIYHESEMIHHRKTHPEMREALECAPCNKSFSSQTQFDSHSNRHKIFFCEACNKDFNPREKYTKHMKRKHGNLPKTHMCDICKKTFLEKFELNMHKKNHSGVKDKHCPLCSYATNSTTYMNLHIKKHNNEYVFQCSTCGKGFLANNQLQAHINSRHGDGGQLFPCEVCNKTYSSKGNLNEHKKQHEPGYKPEKSHQCEECGKAFPRKHKLLKHVLQHRGVFSFKCVPCRKGFSCKVSLDNHMKIHTGEKSCVCDVCGKSFTVKRYLEVHRRSHTGEKPYKCDQCEKAFTQQSSLVVHKRYHTGDRPYSCNVCNKGFVTGTLLKSHQKNHFC
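The Coding Sequence and Protein Sequence fields can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state, and treats the lowercase stretches in the coding sequence as ordinary bases (they may be soft-masked regions, but that is also an assumption). `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon, 'X' an unknown codon."""
    # Remove whitespace and line breaks, and ignore the case of the bases.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First 24 bases and last 12 bases of the Coding Sequence field above.
print(translate("ATGGAAAAATATCTCCtggaaaat"))
print(translate("CATTTCTGCTGA"))
```

Applied to the full coding sequence, the result with its trailing `*` removed can be compared against the Protein Sequence field; a mismatch would point to a different genetic code or a copy error rather than being silently accepted.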
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -