Lcur022132.1
Basic Information
- Insect
- Lamproptera curius
- Gene Symbol
- -
- Assembly
- GCA_029286875.1
- Location
- JAGSMT010000099.1:1802490-1815244[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 48 0.084 13 7.7 0.2 1 23 46 68 46 68 0.97 2 48 0.054 8.5 8.3 0.2 2 23 92 114 91 114 0.95 3 48 0.00068 0.11 14.2 1.7 1 23 136 158 136 158 0.98 4 48 0.0021 0.34 12.7 0.1 2 23 163 184 162 184 0.95 5 48 0.0025 0.4 12.4 0.4 2 23 192 214 191 214 0.96 6 48 0.0013 0.21 13.4 4.9 1 23 220 243 220 243 0.98 7 48 0.31 49 5.9 1.7 1 23 249 272 249 272 0.96 8 48 0.011 1.8 10.4 3.5 1 23 278 300 278 300 0.98 9 48 0.00031 0.05 15.3 3.1 1 23 306 328 306 328 0.99 10 48 0.00032 0.051 15.3 3.0 1 23 334 356 334 357 0.95 11 48 0.3 47 5.9 1.3 1 23 425 448 425 448 0.94 12 48 0.0052 0.82 11.5 0.1 3 23 471 491 470 491 0.97 13 48 0.061 9.6 8.1 3.3 1 23 513 535 513 535 0.97 14 48 0.00028 0.045 15.4 0.6 1 23 539 561 539 561 0.98 15 48 0.66 1e+02 4.9 0.1 1 23 566 589 566 589 0.95 16 48 0.0014 0.22 13.2 1.2 1 23 595 617 595 617 0.96 17 48 0.43 68 5.4 1.2 1 23 624 647 624 647 0.93 18 48 0.077 12 7.8 3.0 1 23 653 675 653 675 0.93 19 48 2.1e-06 0.00034 22.1 1.0 1 23 681 703 681 703 0.99 20 48 3.5e-05 0.0056 18.3 3.3 1 23 709 731 709 732 0.96 21 48 0.002 0.31 12.8 0.2 1 23 799 822 799 822 0.89 22 48 0.03 4.7 9.1 1.5 3 23 852 873 852 873 0.95 23 48 9.5 1.5e+03 1.2 2.5 1 23 895 917 895 917 0.89 24 48 0.0023 0.37 12.6 0.1 1 23 921 943 921 943 0.97 25 48 0.0025 0.39 12.5 0.6 1 23 948 971 948 971 0.95 26 48 0.043 6.8 8.6 2.5 1 23 977 1000 977 1000 0.94 27 48 0.00013 0.02 16.6 0.2 1 23 1007 1030 1007 1030 0.98 28 48 0.00016 0.026 16.2 0.8 1 23 1036 1058 1036 1058 0.95 29 48 1.9e-06 0.0003 22.3 1.9 1 23 1064 1086 1064 1086 0.99 30 48 5.5e-05 0.0087 17.7 5.5 1 23 1092 1114 1092 1115 0.96 31 48 0.17 27 6.7 0.6 1 23 1246 1269 1246 1269 0.95 32 48 0.12 19 7.2 1.1 3 23 1295 1316 1293 1317 0.94 33 48 0.091 14 7.6 0.2 1 23 1340 1362 1340 1362 0.98 34 48 0.00074 0.12 14.1 0.2 1 23 1366 1388 1366 1388 0.98 35 48 0.00045 0.072 14.8 2.3 1 23 1393 1416 1393 1416 0.96 36 48 0.021 3.3 9.5 0.8 2 23 1423 1445 1423 1445 0.93 37 48 0.28 44 6.0 7.4 1 23 1452 1475 1452 1475 0.96 38 48 0.0002 0.032 15.9 5.7 1 23 1481 1503 1481 1503 0.97 39 48 0.0013 0.21 13.4 0.3 1 21 1509 1529 1509 1530 0.97 40 48 0.0017 0.27 13.0 0.6 1 23 1691 1713 1691 1713 0.97 41 48 1.4 2.2e+02 3.9 2.2 2 23 1739 1760 1738 1760 0.96 42 48 0.00029 0.046 15.4 0.2 1 23 1782 1804 1782 1804 0.97 43 48 1.4e-05 0.0022 19.6 0.2 1 23 1808 1830 1808 1830 0.99 44 48 6.7e-05 0.011 17.4 0.2 1 23 1835 1858 1835 1858 0.96 45 48 0.0016 0.25 13.1 3.0 2 23 1865 1887 1865 1887 0.96 46 48 0.0012 0.19 13.5 1.3 1 23 1893 1916 1893 1916 0.98 47 48 0.013 2.1 10.2 4.0 2 23 1923 1944 1922 1944 0.96 48 48 1.7e-05 0.0028 19.3 1.1 1 23 1950 1972 1950 1972 0.98
Sequence Information
- Coding Sequence
- ATGAACGAAGggACCACGGATAATGATAACGGTGATACGTGGAAACCGAGGCGGAAATTTAACGATCACAGGGATAACGCGATAATTATCTTTGAGTGTTCGAACGCGTGCCCCTTTCGTTTGAGTCGGGGCGAGTTCACGTGCGCCGTTTGCTCTACCACGTTCGCGGACTTTACCGGTGTGAGGGAGCACACTGAAAGCCATCCTAACAAGCTCGAGGCTATACGTTCGTATCGGATTCTCGAAAATATCAAAGCGGATATCGCAGAGCTGCAATGCAATCTGTGCTTGCTCCATTTTAACAATCTGGAGGATCTCGCGGATCACTTGGTGACTGTGCACGACAGGCCCCTGATTAAAGAATGGGGTCTCGGTATCACTCCGTTTCTTCTAGGCGGTGGCGCGTATGTCTGTTCGCACTGCGGCGAGCGTTTCGACCTGTTCACAAGTCTCAATACCCATATAAACCAACATTCTAAGAACATTGTGTGTTTTACATGCGGGAAAGCATTCGCCAACGAGCAACGTCTCACGCAACACTTGGTTATGCACAACGATACCGAGGGCGGCTCCGTTTGCCCCAAATGCGGTGTCACCTTTGCCAGTCGAAGGCTAAAGAACAGTCACGTCAGAAGAGTGCACGGACCAGAGCTTAGGTACAGGTGTCCATATTGCAATGAGAAGTTTAAACACTACCGCGAAAGGACAAAGCATTTAAACAAAGTGCACGGCAGGGTGGTACAGTATCCGTGCCACTTGTGCAGCGCTGTCTTCTCAATATTCTACCGGAGGACTAGACACATCCGACAGGTACACGAAAATATTAAACCGTTCAAATGCGGGCATTGCCCGTACAAGTCGTCCACCACGGCAGACTTACACAGCCACATGGTCAAGCACAGCGGCACAAGGAGGTATGAGTGCGAAATCTGCAAAAAGTTGTACTGCCGCATGAAAACCTTACGCGAGCACATGCGCATACACAACAGTGACAAGCGATTCGCCTGCGAGCACTGCGACGCGACGTACGTGCAGGAGAGTAGTTTAAAGAGCCACACGCGTAACCACCACTCCAACGCAACCACACACCAAAAGCAAAAAGAACTAGAACCTATTACTGAAGTCGGCTCCGACACGGAGAGTGTGGTCGCGCTGTCGGGTGCGGTCGCGGTGCGCGACGCGGGCAACGAGAGACGGGCCGCTTTCCGGGATAATATTAGCATTATTATGGAGACGTGTACCGCTTGCCCGTTCAAATACAGGAAAGGGGCGTACCTCTGCTTCTTCTGTAAGACCTCGTTTATGGAGCCGGAAAGGCTGCGCGAGCACACCCGCTTACAGCACGGCGATGTAATGCAGCTGAAACCACGAAAGTACGAGCCGCTTAAAATGGACTTTGCGTTAGCGGCGTGCAAACTATGTGGAGTGTCCGTCAGCGATTACGAAACTCTAAAGGCACATCTAAACGAACACGGGAAGCCGTTGGACTGCACATACGGCGACAGCGTTCTACCGTACAGCCTCACTAGGGAGTCGCACCAATGCCAGATCTGCGGTGGCCGTTACGAGACCTTCCTGACCCTGCACAAGCATATGAACGAGCACTACGAGCGTTTCATATGCGAGACGTGCGGCAAACGGTTTGCGACTTCCCAACGGATGGTCAATCACGCACGGACTCACGAACGTGGTGCGTTCCCCTGCAAACGATGCGGAGATACATTACCGTCCCACGCGTCATTATACGCCCACGTGGCGAGAGTGCACCGCTCAAACAAGCGCTACAAGTGCCCCGTCTGCGACGAGAAGTTCGCCTCGTACAAGCACCGTCTGAAACACCTAGCGGCGCATGGAGTACGAACTGCCCTATTCCCCTGTCCGTCCTGCCCTCGGGTGTTTGACTTGTGTAGCAGACGCACCGCCCACGTGCGCTTCCAGCATTTACAGGAGCGTAATCATTCCTGCCCGCTGTGCTACATGAAGTTCTTCACGCGATACGAGCTCAGGGAGCACTCGGTCAAACACGGCGGCGAAAGGATATACCAGTGTGACGTGTGCAAGAAATCGTACGCGCGTCTCAAGACCCTGCGAGAGCATATGCGGATACATAATAACGATAGGCGCTTCTCGTGTCAGGTGTGCGGCCAGGCGTTCGTTCAGAACTGTAGTCTGAAACAACATCTGAGGGTACATCATCCGACTCAGGGGAAGAGCGGTTGTGATAAAGCGTGGAACGAGGACGAAAACATATCAGCCGAGAATGCAAATCATCAAGGCGTGAAGATCGAAACTGGTCTATTAAACGACAATGATGCTATTGCCGCTCGCAGGAGAAACGCTACGTTGATCCTGGAATGTTCAAAGATAAGCCCGTTCAGGTGGACGAAGAACCTGTACATCTGTTTCTACTGTGACCAGCAGTTCACCGACCCGGCGGCTCTTAGGGACCATAACTCTCTCGACCACGCTACGACGTCTGCCGCGCAGATAAAATACGAGCTATCAAAGCAAAAGAAGCACGAACTTGTAAAGGCGGATGTAACCGACGCGGGCTGCAAACTATGCAGCGAGAGCTACCGTGACATTTCAACTCTCAAATCTCATTTGTTGCACGCACACGGGAAGAGTATCGAAACAGGTTCCTGTGATGGTTTGCTACCATTCAAGCTTAGCCGTGACAGTTTCGAGTGCGCCATTTGTAATCAGCTTCACGATGAGTTCAAATCTCTCAATCACCACATGAACGTGCACTTCCAACACTTTATATGCGAGCAATGCGGTGCCGGCTTCATAACGCCCGACCGTCTCCGCACCCATTCGCAGTCACACGGCACCGGTTCGCATCAGTGCGAGGGCTGTGACAAGGTGTTCCGTTCGACAGTCGCCAAGAACGAGCACTACGCGACGGTGCACAAGAGGGTCAAACGTCACCGGTGCCCGCACTGCGCCGAGGCCTTCCGTAACTACTTCCAGCGAAACAAACACGTCGCCTCCGCCCACGGACTTAAGCTGAAAGAGTTCAAATGTACTCTGTGTCCAAAAGTGTTCACTTTGAGCGGGAAACTGGGTGTACACGTTAGGACTGTGCATCTAAAGTTGAAGCGACATGCGTGCGAAGTGTGCGAATGGAAATTCTACTCTAGATCGGAGCTTAGAGAGCACATGGTCAGACATGGCGGCGAGAGAAAGTTCCAGTGTAGTGTTTGCAGGAAATCCTATGCCAGGAAGTACACTCTACGGGAGCATATGAGAATACACGAGAACGATAGACGATTCGTGTGCCAGGAGTGTGGGAGATCGTTTGTCCAGAACTGCAGTTTGAAGCATCATGCAAGGGTTCACCACCCGTCTAGCGCACATCTTCACACGATTTGTTTGTTTCTAGGCTCGGACGTAATGCCGCAAGCGCCGGCCGAACGGATAGTGCCGCTGTTCCAAGTGCAATACGATCGCAGCCTGTGTAAACCGTTGGGCACCGTTGCCGATTTCAGTAAGCTGCTCGCCATGTCGAAGCCGTCCAGGGCAGTCGCGATAGCCTCTGACAATCGCTCCCCGCTATGCTCACGTAGCCCGTCTCCTCTGTCCTTCATGAACGATAGCCCGTCGTTGGATATAGACAACTCCGCACTTATGTTGTCGAAGCGGGTACCTAGAAAGCGCGCAGATATTAGACAGAACGCATTAACACTTTTCGAGTTTACCACCGTCTACCCTTTCATATACggtaataataagtttaaatgtttCGTGTGTTCCCAACCGTTCTTAGAGACGGCGCTACTACGGCGACACATGCAAGACACCCACACTTTCGCACCTCTGAAGCGTCTTCTAAATAACCGGAGGGAGAATGTGATCAAAGTCGATGTTAGAGAGATGAACTGTAAATTGTGTCCTATCCAGCTGAGGGACCTGCAGCAACTGAAACAACATCTTAAGGAGGACCACCATAAACTGATAGATCCCGAGCTGAAGGACAACATAATACCGTTCATACTCGAGGCCGAAGAGGAAGGCTACAAGTGCGTGATGTGCGAAAGCAATTTCATCAAAGTGCGCACACTAGTCATCCACATGAGCGTGCACTTTAACAACTATAGCTGCGAGATTTGCGGCTCCGGATTCATGACGCTCCGTCTACTAAAGAAGCACTTGGAAGTTCACGAGAGTGGCAACTTCCACTGTGACAGATGCAGCAAAGTGTTCAACACATCGTACAAGCTGTCGTTGCACATTAGGGGTGTGCACCTAAAACAGTTTCCCAGACGCTGCCCGATGTGTCCCGAACGCTTCAACTCGAACTATCGCCGTACGAAGCATCTACAGGATGTTCACAATCAGTCCACCCGCGTGCACAAATGCAAGACTTGCGGTCGCGGCTTCAATCTGAAGTACCATCTAGTGTGTCACACGCGATCCGTACACCTGCAAGAGAGGAACCACCAATGCAAGGTTTGCTCGCAGAGGTTCTGCAACAAGGAAACACTGAAACGACATATGGTTATACACACGGGCGAGAAGAATTACAAGTGCGACTTGTGTGGGCTGGCGTTCTTGCGTCGAAAGAATTTGAAGGACCATTTGAGATTGGTGCCGACTATTTCGGACCTATCGGGGGTCCTTCCATCGATTTTAAGGTTCCTTCAAGTATCAAACACATCAATAACCCTGAAAGGTGAGGATGGCTCCGATATTAAATTGGCAGTACTGAAGAAGCCAATATCAATCGATAAATTGGAATATGAACACGTCGTCGCAGACAATACATATAATCTAGATTTTGATGATATCAGCGTGAAGAGCAACGGGGAAATGGTTGAAGAGGTTAAACGTTATTTGCTTCAGGATAAGCTACCTCTGACGTTTCTAAAGGTGCCTATCTCCTTGGATGAAATTGGTAATGGAAAGGAGCTGTCTCTTATGGATACAGAGTCTGTGAATAAGCGTTATGTAGAAACTGGACCGTCGCGCAGAAAGTGCGCTAGTCGAGACGACCAGACTTGGAAGCAGAACGCGTTGACACTTTTCGAGTTTTCTTACGTGTATCCGTTTATCCACGCGGGTAACAAATTTAAGTGTTACATCTGTTCCAAACCATTcgttgaaagtaatttattgaGGGAGCATACCCTGCAACATAGCTCACAAGAACTAAAAATTGAGATGAATAAGAAGGTTCGCGATAAGATTCTGAAGGTAGACGTCGTTCAGTTGCAATGCAGACTATGTCACGGGAACTGTCCGAATTTGACCGGCCTTAAAGTACATTTACTGCGTCATGGGAAGCGAATAGACCCTAACTATCAGGATAATATAATTCCATTTAAACTTGGTGGGGATTCGTTTGACTGTCAAATTTGTGGAGAAGGTTTTCAACGGTTGCGCCTCCTTATAATACACATGAGTAAACATTTCAACAACTACAGCTGTGAGATCTGCGGAGGAGTGTTCATATCTATGAGCTTACTAAAACGACATCTCCAGACCCACAAGGTTGGCAGCTATCCGTGCGATAAATGTGACAAGGTCTTTACAAATTCTGCAAAACGGATCAGTCATATACGAGGCGTGCATTTGAAACAATGCACCAGATCTTGTCCACATTGTTCGGAGCGCTTCAGCTCTAACTACCAACGCAGCAAACATCTACGCATATATCACAACCAAACAAATAGGTATAGATGCGAGATATGCGACCGACAATATGACCTAAAGTACCAGTTGTTTCTCCACAAAAGATCGGTACATTTACAAGAGCGGAACCAGGAGTGTAATATATGTCATTCTAGATTCTTTTCGAAGTACTGTCTGGCGAGGCACATGGTCATACACACAGGTGACAAGAACTTCAAGTGTGGAGTTTGTGGGAAAGCTTACACTAGAAAGAAGGGTCTACAGGAACACTTTAAATCTCACGAGGCAGTGCGCACTTCTTGTACCCGTTGTCTGCGAAGCCTGAGGCCTTATTTTAGTCTCTCAACAGGAGCTGGTCAGCCACCGCTGTTTGATCCATCCGAAGAGTGTGCACACACGCGGCTAATGTTCGTACTTATTAACCAGCAAGCTGACATTACGCTACCACGTGTTCAAAAAACACGATCCCAAGAACGCCGGGAAGTCGAGGCCGCTGATGTGCCCTCACTGCGCGGAGCTACCAATAAGCAACATACGCGTAGCATATAG
- Protein Sequence
- MNEGTTDNDNGDTWKPRRKFNDHRDNAIIIFECSNACPFRLSRGEFTCAVCSTTFADFTGVREHTESHPNKLEAIRSYRILENIKADIAELQCNLCLLHFNNLEDLADHLVTVHDRPLIKEWGLGITPFLLGGGAYVCSHCGERFDLFTSLNTHINQHSKNIVCFTCGKAFANEQRLTQHLVMHNDTEGGSVCPKCGVTFASRRLKNSHVRRVHGPELRYRCPYCNEKFKHYRERTKHLNKVHGRVVQYPCHLCSAVFSIFYRRTRHIRQVHENIKPFKCGHCPYKSSTTADLHSHMVKHSGTRRYECEICKKLYCRMKTLREHMRIHNSDKRFACEHCDATYVQESSLKSHTRNHHSNATTHQKQKELEPITEVGSDTESVVALSGAVAVRDAGNERRAAFRDNISIIMETCTACPFKYRKGAYLCFFCKTSFMEPERLREHTRLQHGDVMQLKPRKYEPLKMDFALAACKLCGVSVSDYETLKAHLNEHGKPLDCTYGDSVLPYSLTRESHQCQICGGRYETFLTLHKHMNEHYERFICETCGKRFATSQRMVNHARTHERGAFPCKRCGDTLPSHASLYAHVARVHRSNKRYKCPVCDEKFASYKHRLKHLAAHGVRTALFPCPSCPRVFDLCSRRTAHVRFQHLQERNHSCPLCYMKFFTRYELREHSVKHGGERIYQCDVCKKSYARLKTLREHMRIHNNDRRFSCQVCGQAFVQNCSLKQHLRVHHPTQGKSGCDKAWNEDENISAENANHQGVKIETGLLNDNDAIAARRRNATLILECSKISPFRWTKNLYICFYCDQQFTDPAALRDHNSLDHATTSAAQIKYELSKQKKHELVKADVTDAGCKLCSESYRDISTLKSHLLHAHGKSIETGSCDGLLPFKLSRDSFECAICNQLHDEFKSLNHHMNVHFQHFICEQCGAGFITPDRLRTHSQSHGTGSHQCEGCDKVFRSTVAKNEHYATVHKRVKRHRCPHCAEAFRNYFQRNKHVASAHGLKLKEFKCTLCPKVFTLSGKLGVHVRTVHLKLKRHACEVCEWKFYSRSELREHMVRHGGERKFQCSVCRKSYARKYTLREHMRIHENDRRFVCQECGRSFVQNCSLKHHARVHHPSSAHLHTICLFLGSDVMPQAPAERIVPLFQVQYDRSLCKPLGTVADFSKLLAMSKPSRAVAIASDNRSPLCSRSPSPLSFMNDSPSLDIDNSALMLSKRVPRKRADIRQNALTLFEFTTVYPFIYGNNKFKCFVCSQPFLETALLRRHMQDTHTFAPLKRLLNNRRENVIKVDVREMNCKLCPIQLRDLQQLKQHLKEDHHKLIDPELKDNIIPFILEAEEEGYKCVMCESNFIKVRTLVIHMSVHFNNYSCEICGSGFMTLRLLKKHLEVHESGNFHCDRCSKVFNTSYKLSLHIRGVHLKQFPRRCPMCPERFNSNYRRTKHLQDVHNQSTRVHKCKTCGRGFNLKYHLVCHTRSVHLQERNHQCKVCSQRFCNKETLKRHMVIHTGEKNYKCDLCGLAFLRRKNLKDHLRLVPTISDLSGVLPSILRFLQVSNTSITLKGEDGSDIKLAVLKKPISIDKLEYEHVVADNTYNLDFDDISVKSNGEMVEEVKRYLLQDKLPLTFLKVPISLDEIGNGKELSLMDTESVNKRYVETGPSRRKCASRDDQTWKQNALTLFEFSYVYPFIHAGNKFKCYICSKPFVESNLLREHTLQHSSQELKIEMNKKVRDKILKVDVVQLQCRLCHGNCPNLTGLKVHLLRHGKRIDPNYQDNIIPFKLGGDSFDCQICGEGFQRLRLLIIHMSKHFNNYSCEICGGVFISMSLLKRHLQTHKVGSYPCDKCDKVFTNSAKRISHIRGVHLKQCTRSCPHCSERFSSNYQRSKHLRIYHNQTNRYRCEICDRQYDLKYQLFLHKRSVHLQERNQECNICHSRFFSKYCLARHMVIHTGDKNFKCGVCGKAYTRKKGLQEHFKSHEAVRTSCTRCLRSLRPYFSLSTGAGQPPLFDPSEECAHTRLMFVLINQQADITLPRVQKTRSQERREVEAADVPSLRGATNKQHTRSI
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -