Lsal015890.1
Basic Information
- Insect
- Leucoma salicis
- Gene Symbol
- -
- Assembly
- GCA_948253155.1
- Location
- OX411826.1:4686897-4692720[+]
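The Location field above packs the sequence accession, the coordinates, and the strand into a single string. A minimal Python sketch for splitting it apart follows. The names `Location`, `parse_location`, and `span_length` are hypothetical, and reading the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # sequence accession, e.g. "OX411826.1"
    start: int   # first coordinate as written in the record
    end: int     # second coordinate as written in the record
    strand: str  # "+" or "-"


# Layout assumed from the single example in this record: seqid:start-end[strand]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into its four parts."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(loc: Location) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("OX411826.1:4686897-4692720[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

If the database uses 0-based or half-open coordinates instead, only `span_length` needs to change.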
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The finger follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, numbers in parentheses give the allowed number of residues, and positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm-from  hmm-to  ali-from  ali-to  env-from  env-to  acc
1   45  0.077     7.2       8.0    0.0   2         23      11        33      10        33      0.93
2   45  0.28      26        6.2    0.2   2         23      54        76      53        76      0.92
3   45  0.011     1         10.7   3.1   2         23      82        104     81        104     0.94
4   45  0.011     1.1       10.6   0.8   1         23      109       132     109       132     0.96
5   45  1.5       1.4e+02   3.9    0.6   2         23      135       157     134       157     0.91
6   45  0.13      12        7.3    2.1   2         21      164       183     163       184     0.93
7   45  0.013     1.2       10.5   0.7   1         23      244       267     244       267     0.90
8   45  0.3       28        6.2    0.1   3         23      296       317     294       317     0.94
9   45  0.054     5.1       8.5    0.2   2         23      340       363     339       363     0.95
10  45  0.0012    0.11      13.7   0.6   3         20      370       387     368       391     0.90
11  45  0.018     1.7       10.0   0.3   1         23      396       419     396       419     0.97
12  45  0.0018    0.17      13.1   0.4   2         23      425       447     425       447     0.94
13  45  0.00028   0.027     15.7   4.7   1         23      451       474     451       474     0.97
14  45  0.00038   0.035     15.3   0.6   2         23      481       502     480       502     0.96
15  45  0.7       66        5.0    0.4   1         23      562       584     562       584     0.95
16  45  0.64      60        5.1    0.2   2         23      612       634     611       634     0.93
17  45  0.14      13        7.2    1.4   2         23      655       677     654       677     0.96
18  45  7.7e-05   0.0072    17.5   1.4   1         23      682       705     682       705     0.97
19  45  0.2       19        6.7    1.3   1         23      709       732     709       732     0.90
20  45  0.39      36        5.8    0.9   2         23      735       757     734       757     0.94
21  45  5         4.6e+02   2.3    0.1   2         19      763       780     762       783     0.92
22  45  0.33      31        6.0    2.9   1         23      832       855     832       855     0.89
23  45  0.0016    0.15      13.3   0.1   1         23      928       951     928       951     0.96
24  45  3.7e-05   0.0035    18.4   0.2   1         23      956       979     956       979     0.96
25  45  5.1e-05   0.0047    18.0   2.5   1         23      985       1008    985       1008    0.95
26  45  0.0025    0.23      12.7   0.5   1         23      1013      1036    1013      1036    0.97
27  45  0.002     0.19      13.0   2.2   1         23      1040      1063    1040      1063    0.97
28  45  0.0097    0.91      10.8   0.7   2         23      1070      1091    1069      1091    0.97
29  45  0.063     5.9       8.3    1.7   1         23      1097      1119    1097      1119    0.96
30  45  0.00098   0.092     14.0   2.5   1         23      1173      1195    1173      1195    0.96
31  45  0.55      51        5.3    0.0   2         23      1223      1245    1222      1245    0.87
32  45  0.33      31        6.0    3.0   2         23      1266      1288    1265      1288    0.94
33  45  0.00025   0.023     15.8   0.5   1         23      1293      1316    1293      1316    0.96
34  45  0.011     1         10.7   1.1   1         23      1321      1344    1321      1344    0.95
35  45  0.00065   0.061     14.5   0.9   2         23      1347      1369    1347      1369    0.96
36  45  0.0079    0.74      11.1   1.0   2         21      1377      1396    1376      1397    0.94
37  45  0.0088    0.82      11.0   3.1   1         23      1449      1472    1449      1472    0.97
38  45  9.8       9.1e+02   1.4    0.0   3         23      1501      1522    1499      1522    0.90
39  45  4.2       3.9e+02   2.5    0.0   2         23      1545      1568    1544      1568    0.88
40  45  2.7e-05   0.0026    18.9   0.2   3         23      1575      1596    1574      1596    0.94
41  45  0.0063    0.59      11.4   3.4   2         23      1601      1623    1600      1623    0.94
42  45  0.0033    0.31      12.3   2.9   1         23      1628      1651    1628      1651    0.94
43  45  0.00012   0.011     16.8   5.4   1         23      1655      1678    1655      1678    0.97
44  45  0.00039   0.037     15.2   0.3   2         23      1685      1706    1684      1706    0.96
45  45  3.5e-05   0.0033    18.5   1.4   1         23      1712      1734    1712      1734    0.97
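The pattern quoted in the Description above, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be approximated with a regular expression. The sketch below is illustrative only: `C2H2_PATTERN` and `find_c2h2` are hypothetical names, and treating the # positions as "any residue" is a simplifying assumption, since the description says only that they matter for the fold. A literal pattern is far cruder than the profile HMM behind the hmmscan table, so its hits need not agree with the 45 domains listed there.

```python
import re

# Literal transcription of #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# ASSUMPTION: both '#' and 'X' are matched by '.', i.e. any residue.
C2H2_PATTERN = re.compile(
    r"."       # '#'  fold-stabilising position
    r"."       # X
    r"C"       # first zinc-coordinating cysteine
    r".{1,5}"  # X(1-5)
    r"C"       # second zinc-coordinating cysteine
    r".{3}"    # X3
    r"."       # '#'
    r".{5}"    # X5
    r"."       # '#'
    r".{2}"    # X2
    r"H"       # first zinc-coordinating histidine
    r".{3,6}"  # X(3-6)
    r"[HC]"    # final His or Cys
)


def find_c2h2(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, text) for non-overlapping matches.

    Coordinates are 1-based and inclusive, relative to the input string.
    """
    return [
        (m.start() + 1, m.end(), m.group(0))
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# A short fragment copied from the protein sequence of this record.
fragment = "RDGGYQCELCPQIFTSLNILRKHRNNRHLTR"
for start, end, text in find_c2h2(fragment):
    print(start, end, text)
```

Because `finditer` reports non-overlapping matches, tightly packed fingers can be undercounted; a lookahead-based scan would be one way around that if overlapping candidates matter.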
Sequence Information
- Coding Sequence
- ATGGAAATCAAAATAGATGTATCGGATATTACTTGCGAAATCTGTTCAGAACCATTTCCGTCCTTGGATGAAGTTGTTAATCATTTATTTGTGAAACACAAGTTAGAATATGATAAAGGGGTAGAAATGGCTATAGAGGAGTACAGAGTTGTTGATCTTAGTTGTTTGGCTTGTGATGAAAAGTTTACATACTTCGGCTACTTAGTTTCGCACGTTAACAATAGTCATCCGAAAAAATGTCTCATTTGTGATAAGTGTAATCAGAAATTCAATAAAAAGAGAGATTTGTTTTCTCACAAGAAAAACTATCACAGAGACGGTGGGTATCAATGCGAATTGTGCCCTCAGATTTTTACATCGCTAAATATCCTAAGAAAACATAGGAATAATAGGCATTTGACTAGGTGCAATATCTGCCATCTGAAATTACCTTCCGCGGCCCTAAAACAGAAACATCTAGATTTAGAACATCCAGACGATGGGTCTCTACAGTGTGACACTTGTTTCAAAGAGTTTCATACTAAACAGGGTCTCCGAATGCACAATAGGAAATGCAAAGGTACAGATATTTTCGAAATTGCAATTAAGAAAGAGGAATACGTAGCTATGGATTTGGATCAAAGTTATGACGATCAAGTTAAGAGGCCCAGTGTGAAACAAATTCGCGAAAATATTGTTATTGTTATAAATATGTCTACTGCTATACCTTTTAACTTTTATAAAAATAAATTTAATTGCTTTTATTGTTCGAAAGATTTTGTTGATTCTGATTCGATGAGGGATCACACAGTTTTGGAACATCCCATATGTGATGTCAAACAAAAATGCATCAGGAAATGTAGGGAATCGGTCGCTTGTGTCAAAATCGACATATCTAATCTAGCTTGCAAAATATGTTTTGAGTCTATGAATGATTTAGATAATCTCCTTGAACATTTGATAGTCAAGCATAATGCTAATTATGATAAATCTATAACTACATGTCTGCAACCATACAGACTAATTAAAGACCATATGGTTTGTCCAAATTGCCCCGGAGAAGTCTTCAGATTTTTCGGAACTTTACTTAAACATATGAACAAAAAACATACAAACAATAATATAATATGTGTATACTGCGGTCAAACGTTCCGTCGAGACCAAAACCTTCGCGTACACATATGGAGACATCATCGAGATGGTAGGTTTAAATGTAACATTTGCGGTGCTGAGTGCAATATTCCGTCTAGACTGTACATGCATATGGCTAGAGCTCACGGAGTTAAAGCTGCAAAGTGTCCTAAATGTTCTGAAAGTTTCGCTACTCAGTACTTAAGACAGAAGCACTTAATAGAAGCCCACGATTGCGGTCATAAATGCTCTTACTGTGGGAAGCTTTTTACTAGAAACTCCTTCATGAGGGATCACGTGAGAAGGACCCATCTGAAAGAAAAAAACGTGGAGTGTTCTATAtgtaacatgaagttctttaataatattcttctacggagacatatggtcaaacatagtgAATTAGATAATAAAACCTCCCTGCGCTCATTGCTGACGAAATACAATAGAGAACATTTCTTTTCAGATTCCATAAGAGAGCCAACAAAGAAAAGCGCTAATTTCTTAAGGAGGAGGAATCTTCTTGCATTGTTCAACAACACGACATTGATGCCGTTTAAATGGCGAGGGAAATATCTGTGTTTCTATTGCGGAGATGCAGTTGACAATTATCAAATGCTCCGTAAGCATACGGAAAGTCATGGCCAGTGTTCTGATAAAGACCGAGCCTTGAGGCTTGTAAAGTCAGCAGATACCGAAACCAAAATCGATGTGTCATACATAACATGCAATTTGTGCTCAAACTCCTTCGATTATCTAGATGATATCATCAACCATTTAGTGTCTAAACACGAATTACCTTACATAAAAAGTGCTAAGCTAATGATAACGGCCTACAGGCTTGTAGATTTAAAATGCCTGCTCTGTGCAAAAATCTTCGATTATTTC
CCAAAATTAGTCTCACATATGAATACTTGCCACCCTAAAGGATGTTTCTCTTGCGAGGAATGTGATCAAACATTTAACAAAAAACGAGATTTAGACACTCACATCCGAAACTATCACAGAAAAGAATATTCGTGTTTAAAATGCAGTGAAGTCTTCGTTTCGAATGCAACGCTAAGATTCCATAGATTACACGCGCATCCATCTACGTGTAACATATGCTTGCAATCCTTTTCGTCTGATATAAAACGTTTAAATCATATGAAAAATGATCACACCTCTGATCATGTCAGATGTGGGTTTTGTGAACGAGATTTGTCAACCAAACAAGCAATGTTACGTCACGCCTCGAATTGTAAAGCAAAATTAGACAATATATTTGAAACTGTCGTTGTGGATGATGAAAAAGATAAAGTTTCTACTAAAGATATCCGCAATAGTATTGCCACAGTAATAAATCTGTCTACTGCTTTACCTTTTAAGTTTTTCATGAGTAAATTTCGATGTTTCTATTGCTCTAAGGATTTCTCAACTTGTAACATTTTAAAAGAACATACTATTGCAGAACATCCTCAATGTGACGTTAGCGAAAAATGCATGAAATTACGTAATAGATACGACGGGAGCAAAATCAAAGTCGATACCTCGGCTCTTTCATGCAAGTTATGTCTAGAAACGATGTCTGATTTAAATATTTTAGTCGAGCATTTAACTACGGAACACAAAGTTCAATGCAATAGATCTGTCGAAAACTACTTGCAGTCGTTTAAATTAATGAAAGATAACTATCCTTGTCCTTTGTGTGATGAAGTTTATAGATATTTCGGTTTGCTACTGAAACATGTTAGCAAAATACATACTGGCAACAAATTCATTTGCGTTTACTGTGGAAAACCATTCAGGACAGATCCTAACCTTCGCGCGCACGTCATGAGATACCACCCGGCCGCCAATAAATATAAATGTACCCATTGCGATACGATTTTCACTACTTACAATAATTTAAAGATCCATTTAGGTAGGGATCACGACGTCAAAATGTACAAGTGTGTGGAATGCGCGGATAAATTTACAACACAGTACTGGATGCAACGGCACATGCTTATGGTGCACGGCGAAGGACATAAATGCGCTTACTGCGATAAAGCCTTTATTAAATACTCCTTTATGGTGAATCATGTGAGGAGACTGCACTTAAAAGAGAGAaatgtgaagtgtaaagtatgcagtgaaggattcttcgatcgtcagagactaaaggtgcatatggtcaagcacgttggagaaaggaatttccattgtgatgtttgtaacaagaagtttctctggaagagaaatttaagggcacatatCGCTTCCCACGCTAGGAATATCAATAATCAAGTTGCTGTCACCAAGAAACATTTCTTTTCAGATGAATTGCCAGGCCCGTACGGAGGCTGTATCTCTGAGCGGAGACGAAAGAATCTGAAAATACTTTTCGACAACACCTCGATACTACCATTCAAATGGCGGGGAAAATATCTTTGTTTCTATTGTGGCAAGAATTATACGGAATATCAAGAATTAAAAAAGCATACTAAATCTCACGGTTTATGCAATACTAAAGACTATGCTTTAAAACTAATCAAAGGAAATAATATTGAGATTAAAATTGATGTTTCTGAAATCGTCTGCGATATATGCAATGAAAATTTTGATCGCTTTGAGAAAATAGTTGATCATTTGATCGCAGAACACCATTTGGATTACAACAAAAGCATAGATATTCCTTTTCAAGAATACCGCCTGGTAGATTGTAGGTGCTTACATTGCGAAGAGAAATTCGCTTACTTTGGTTATTTAGTAACACACGTAAATAATATGCATCCTCAAAATTGTTTTATATGCAATGACTGTGGTGGTCGATTTAATAAAAAGAGGGATTTAGCTATACATTTAAGAAATTATCACCGCGAAGGAGGGTATCCGTGTGATTCGTGCCTTCAAAGTTTTGAAACACT
GCAATCTTTAAGACGACATAAGAATAATACCCATTTTAGACAATGCAAAAGTTGTAATTTAAGGTTTGCGTCTCTATCGCTTTTACAAAAACATCTACAGATCTCACATCCTCACGATGACGGTAACATGAAGTGCACTTACTGCTCAAAAGAATTCCATTCATCTATTGGTTTACGACAGCATATTAGCAAATGTAAAATCAAAATTAATTCCCAAGTCGAAATTGAATCATTTACGGAAAATAAACTAGAACCTCGTAAGAAGCAGAATATTCAACAAATACGTCAGAATATTCAGTGCATATTGAATATGTCTACAGCTGTACCTTTTAAATTCTTCTCCAAGTACTCTTGTTTTTATTGTTCAAAGAAATTCTTTGAATTCGATGATCTTCGACAGCATACGAGTACTGAGCATCCAGTGTGTGATCTAAAACAAAAATGCATGCAAAAGTGTAAAGGTGAAAGAATCACAGTTAAAATAGATATATCAACCTTAGCTTGTAAAATTTGTTCACTTCCAATGGCAAGTTTAGAAACTTTAATCGATCATTTAATACTCGAACACAAGGCAAATTATGATAAATCGATTAAAGGCAGTCTTGAACCGTTTAAGATCATCAACGACAATATGCCTTGTCCTATATGCCCTGATAAGGTTTTCCGATATTTTGGTATTTTGCTGCGGCATGTGAATGCCGAGCATAGTAACAATAACAGGATTTGTGATTTCTGCGGTCGGAGCTTTCGCAATGCAGCAAATTTGAATGTTCACATAACATATTCTCATACAGGATCTTGCGAATGTGACGTTTGCGGCATGAAATATAAAAATCAATGGTGCTTAGCTAGACATAGAGCTAAAACACATAACGCGAAAGACTATAAATGTCCAAAATGTCCAGATACTTTCAAGTCGCAGTACCATAAGCAGAAGCACTTGATTGAAATGCACAACATCGGCCATAAGTGCGACTACTGCAACAAGATGTTTACCCGAAACAGTTTTATGAAAGATCACGTGAGAAGAACACATTTGAAGGAAAAAAATGTTCCATGTTCGATTTGTAACGAGAAATTCTTTGACAATTATCTTTTGAGGATGCATATGGTGAAGCACGAGGGTTTTAGGAAGTTTTGTTGTACAGTTTGCGGTAAGGCGTTTCTGCGAAGGAGTAATTTGGCGTCCCATATGGAAATGCATAAAAAATACGGACACGTGTCTTTGGAATCTCTGGCATAG
- Protein Sequence
- MEIKIDVSDITCEICSEPFPSLDEVVNHLFVKHKLEYDKGVEMAIEEYRVVDLSCLACDEKFTYFGYLVSHVNNSHPKKCLICDKCNQKFNKKRDLFSHKKNYHRDGGYQCELCPQIFTSLNILRKHRNNRHLTRCNICHLKLPSAALKQKHLDLEHPDDGSLQCDTCFKEFHTKQGLRMHNRKCKGTDIFEIAIKKEEYVAMDLDQSYDDQVKRPSVKQIRENIVIVINMSTAIPFNFYKNKFNCFYCSKDFVDSDSMRDHTVLEHPICDVKQKCIRKCRESVACVKIDISNLACKICFESMNDLDNLLEHLIVKHNANYDKSITTCLQPYRLIKDHMVCPNCPGEVFRFFGTLLKHMNKKHTNNNIICVYCGQTFRRDQNLRVHIWRHHRDGRFKCNICGAECNIPSRLYMHMARAHGVKAAKCPKCSESFATQYLRQKHLIEAHDCGHKCSYCGKLFTRNSFMRDHVRRTHLKEKNVECSICNMKFFNNILLRRHMVKHSELDNKTSLRSLLTKYNREHFFSDSIREPTKKSANFLRRRNLLALFNNTTLMPFKWRGKYLCFYCGDAVDNYQMLRKHTESHGQCSDKDRALRLVKSADTETKIDVSYITCNLCSNSFDYLDDIINHLVSKHELPYIKSAKLMITAYRLVDLKCLLCAKIFDYFPKLVSHMNTCHPKGCFSCEECDQTFNKKRDLDTHIRNYHRKEYSCLKCSEVFVSNATLRFHRLHAHPSTCNICLQSFSSDIKRLNHMKNDHTSDHVRCGFCERDLSTKQAMLRHASNCKAKLDNIFETVVVDDEKDKVSTKDIRNSIATVINLSTALPFKFFMSKFRCFYCSKDFSTCNILKEHTIAEHPQCDVSEKCMKLRNRYDGSKIKVDTSALSCKLCLETMSDLNILVEHLTTEHKVQCNRSVENYLQSFKLMKDNYPCPLCDEVYRYFGLLLKHVSKIHTGNKFICVYCGKPFRTDPNLRAHVMRYHPAANKYKCTHCDTIFTTYNNLKIHLGRDHDVKMYKCVECADKFTTQYWMQRHMLMVHGEGHKCAYCDKAFIKYSFMVNHVRRLHLKERNVKCKVCSEGFFDRQRLKVHMVKHVGERNFHCDVCNKKFLWKRNLRAHIASHARNINNQVAVTKKHFFSDELPGPYGGCISERRRKNLKILFDNTSILPFKWRGKYLCFYCGKNYTEYQELKKHTKSHGLCNTKDYALKLIKGNNIEIKIDVSEIVCDICNENFDRFEKIVDHLIAEHHLDYNKSIDIPFQEYRLVDCRCLHCEEKFAYFGYLVTHVNNMHPQNCFICNDCGGRFNKKRDLAIHLRNYHREGGYPCDSCLQSFETLQSLRRHKNNTHFRQCKSCNLRFASLSLLQKHLQISHPHDDGNMKCTYCSKEFHSSIGLRQHISKCKIKINSQVEIESFTENKLEPRKKQNIQQIRQNIQCILNMSTAVPFKFFSKYSCFYCSKKFFEFDDLRQHTSTEHPVCDLKQKCMQKCKGERITVKIDISTLACKICSLPMASLETLIDHLILEHKANYDKSIKGSLEPFKIINDNMPCPICPDKVFRYFGILLRHVNAEHSNNNRICDFCGRSFRNAANLNVHITYSHTGSCECDVCGMKYKNQWCLARHRAKTHNAKDYKCPKCPDTFKSQYHKQKHLIEMHNIGHKCDYCNKMFTRNSFMKDHVRRTHLKEKNVPCSICNEKFFDNYLLRMHMVKHEGFRKFCCTVCGKAFLRRSNLASHMEMHKKYGHVSLESLA
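The Coding Sequence and Protein Sequence fields above can be cross-checked by translating the former. The sketch below is a minimal pure-Python translator; `translate` is a hypothetical helper, and the use of the standard genetic code is an assumption for this nuclear gene. The input is upper-cased first because the coding sequence in this record mixes upper- and lower-case letters.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (ASSUMED to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing characters outside ACGT are rendered as 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the coding sequence in this record.
print(translate("ATGGAAATCAAAATAGATGTATCG"))  # MEIKIDVS
# Last five codons, ending in the TAG stop.
print(translate("GAATCTCTGGCATAG"))           # ESLA
```

Both results agree with the start (MEIKIDVS) and end (ESLA) of the protein sequence listed above; running the full coding sequence through `translate` would extend the check to every residue.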
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -