Lhal015064.1
Basic Information
- Insect
- Lobophora halterata
- Gene Symbol
- -
- Assembly
- GCA_932525835.1
- Location
- CAKOAV010000037.1:258208-268937[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 34 0.0011 0.062 13.9 1.6 3 23 303 323 299 323 0.94 2 34 0.0018 0.1 13.2 0.2 1 23 331 354 331 354 0.95 3 34 0.22 13 6.7 0.0 2 23 357 379 356 379 0.93 4 34 0.00022 0.013 16.1 4.3 1 23 385 407 385 408 0.95 5 34 0.00032 0.019 15.6 3.9 1 23 446 469 446 469 0.96 6 34 8.2 4.8e+02 1.7 1.5 5 23 604 621 603 621 0.82 7 34 0.26 15 6.4 0.2 2 23 629 650 628 650 0.93 8 34 0.056 3.3 8.5 1.0 1 23 658 681 658 681 0.91 9 34 0.32 19 6.1 0.0 2 23 696 718 695 718 0.93 10 34 0.0004 0.024 15.3 2.4 1 23 724 746 724 747 0.95 11 34 2.8 1.7e+02 3.1 0.6 2 23 754 775 753 775 0.95 12 34 0.0016 0.093 13.4 4.1 1 23 785 807 785 807 0.98 13 34 0.0073 0.43 11.3 1.5 1 23 810 834 810 835 0.92 14 34 0.0025 0.15 12.8 0.2 1 23 844 867 844 867 0.96 15 34 0.023 1.4 9.7 0.3 2 23 970 991 969 991 0.95 16 34 0.0045 0.27 11.9 1.4 1 23 999 1022 999 1022 0.95 17 34 0.32 19 6.1 0.0 2 23 1037 1059 1036 1059 0.93 18 34 0.00031 0.018 15.6 4.3 1 23 1065 1087 1065 1088 0.95 19 34 0.0089 0.52 11.0 2.0 1 23 1094 1116 1094 1116 0.98 20 34 1.4e-05 0.0008 19.9 1.3 1 23 1126 1148 1126 1148 0.98 21 34 0.0028 0.16 12.6 0.6 1 21 1151 1171 1151 1175 0.92 22 34 0.028 1.7 9.4 2.6 1 23 1183 1206 1183 1206 0.96 23 34 2.8 1.6e+02 3.2 1.0 2 23 1212 1233 1211 1233 0.95 24 34 5.7e-05 0.0033 17.9 1.2 2 23 1255 1276 1254 1276 0.97 25 34 0.0045 0.26 12.0 0.3 2 23 1283 1304 1282 1304 0.94 26 34 4.3e-06 0.00025 21.4 0.2 1 21 1310 1330 1310 1331 0.95 27 34 9.8e-06 0.00058 20.3 1.7 2 23 1533 1554 1532 1554 0.97 28 34 9.5e-05 0.0055 17.2 6.3 1 23 1560 1582 1560 1582 0.98 29 34 0.00021 0.012 16.1 2.5 1 23 1617 1639 1617 1639 0.97 30 34 1.6e-06 9.6e-05 22.8 1.3 1 23 1645 1667 1645 1667 0.99 31 34 0.00012 0.0072 16.9 3.3 2 23 1674 1695 1673 1695 0.97 32 34 0.023 1.4 9.7 8.4 1 23 1700 1722 1700 1722 0.96 33 34 5.3e-05 0.0031 18.0 2.2 1 23 1728 1750 1728 1750 0.98 34 34 3.3e-06 0.00019 21.8 2.0 2 23 1757 1778 1756 1778 0.98
Sequence Information
- Coding Sequence
- ATGAAAAGGACTATGCAGAATCATTTGAAGCAATGTAAAGACTTGAAGGAGTCTCTAATAAAACTAAAGAAAAAGACAAGCAAAGTGAGACAGGTAAAAAAAAATAATGAAGAAAAAGATGGATGTCATGTTGGAAAATTGTTTGTGGAAAATAATGAGCCATTGCAAGAACTATACAAAGTGCGATGGAAAGAAGAGGTGGACGAAACAACGGAATTCATTTATGAAACTTATGAACCAGAAAAATATCCACGCGCAATTAATGTTAAACCAACTGAACCACTAACCAATGATCCATTTGATGAACCAATGATTGAACCCAACGAAGAACCGAATGAATCAGCTGAACCCCTCATTGATCCATTTGATGACCCAACTATTGAACCTAAAGAAGAACCAAGTGATGAAGTACTTATTGAACCAAAAGAAGAACTATATGATGAATTAAGTTTTGAATCTGATCAAGAACCAAATGATGAATTTACTGAAGATGTATTAAGTGAATTAACTGAAGAAATAACTATTGAACCTAAAGAAGAACTATATGATGACTTGAGTATTGAAACTGATGAAGAACCAAGTGATAAATTGACTAAAGATGAACCAAATGAATTAACTAAACAAATAACGATTGAACCTAAAGAAAATCTAAATGATAAATTAAGAGTTGAATCTGATGAAGAACCAAGTGATCAATTCACTAAGGATGAACCAAGTGAATTTACTAAAGATGAACTAAGTGATGACTTAACTGTAGAACCTAGAGAAAAACCTAATAACCATTTAACTATTGAATCTAAAGATAAACAAATTTGTAAATCAACCGACAAATCATTTGTTAAAAGAATTAGTGCTAAAAAACAAGGGAAAGCAAATAAATTGGTTAATAAAGTAAAATGCAAAATCTGTGATCTATGTGGCCGCTCATTCCCATCGAAAGTTAGACTGTTGAAACACCTGAATTGGCACATTAGGGCAGAGCACTTCCCGTACCCTTGTGCTGTGTGTCCCAAGAAGTTCACCGCAAAATGGAGATTCCAGAAACACATGGAACTTATACATGGTCTGAAATGTAAAGCATGCGAAGACTCCTTCGCCGATGAAGATAGCCTAGAAGCTCACGTGATGGAAATGCATCCGGAGCTAAGCATATACCCTTGCAAACGATGTCCGTCAAGCTTCTCAAACAGCCACAACTTAAACAAACACGTGATAAAACATCACCCTGATTTAAACCCATTGGAATGTAATAAATGCAAACATATTTCGGGATTAATTGGCCATTGGAAGGTGCATCTAATCACGCACTTGGAAGCGGAAAAGAAACCAAAACCTTTCATGTGTGATTACTGTTCAAGACAATTTACTTTTAAATACGAATTGAAGAGACACACATTGAAAACTCACAATGAATTTAAGTCTGCAATGTGTGGGGAATCGGTTTTGGACAGTATTTCGTTAAGAGGTCACAGAAATGCTACCGAACTCGATAACCGTCTTATCAGTACGAGTAGTTATATTTGTGATAAGCCCGGAAAAAGGATGATAAATAAAAAATGTTTGAGAATCAACCTCAAAGAACAACACACACCCCGTGAAAGCACTTGTAACGATTTCAGCGAAAAACTCTATGATAAGATTCAGGAAAGGGTACATAAGAAACACCATTCAATTCCTAAGACTAAGAAAGAACCAAATAACGAATTGACTATTGAATCTAAAGGAAAAACAATTCGTGAATCAACCGTTAAAACAAAAAAAAATAGTGCTAAAACACGAGGAAAAGCGAATAAACTTGCTAAAAGCAAAATCTGTGATAGACAGTTTTGTGAAACATACCTCAAAACACATATGAAGCGTCACAATTACAAACCTAAATCGATGAAGTGTGATCAATGTGGCCGCATTTTCCTTAAGAAAATGAAACTGGTAGAACACCTGGCTTGGCACACTCGGACAGAGAGCTTACCGTTCCATTGTAACGTGTGTCCCAAGAAGTTCGCCATAGAATTGAGTTTTCAGAGACACGTGACACTTATACATGGTCCAAGTTGCAAAATAATAAATAAAAAGAGACTTGAACTGAAATGTGAAACTTGCCAAGACTCATTCGATGGTGAAGATAGCCTAGAAGCTCACGTGATGGAAATGCATCCGGAGCTAAGCATATACCCTTGCAAACGATGTCCGTCAAGCTTCTCAAACAGCCTCAACTTAAAAAAACACGTGATAAAACATCACCCTGATTTAAACCCATTAGAATGTAATAAATGCAAACATATTTCGGGGAAAGTTGGCCAATGGAAGGTGCACCTAATCACGCACTTGGAATCGGAAAAGAAACCAAAACCTTTCATATGTGATTACTGTTCAAGAAAATTCACTTTGAAATGTGGTTTGATGACTCACATGAAATCTCACAATGCATTTAAGTGTGCAGTGTGTGGTGAATCATTTTTGGACAGAGTGTCGTTAAGAGCTCACAGAAAATCTTTCAAACATCATCGTCATCGCTCAAGTGCGAGTAGTTACATTTGTGATGCGTGCGGAAAAGGATTGAAAGACAAAAAAGGTTTGAAAGCCCACCTCGAAAAATTCCATACACCCCGTGAAAGCACTTGCGACGTTTGCGGCGAAAAACTCTTTGATAGGTTTCAGGAAAGGGTACATCAGAAACACCATATAATTCCTAAGCCTAAGAAAAAACCAAATAACGAATTGACTATTGAATCTAAAGGAAAAACAATTTGTGAATCAACCGTTAAATCATTTGTAAAAAAAACTAGTGCTAGAACACGAGGAAAAGCGAATAAACTTGCTAATAAAGAAAAATGCAAAATCTGTGATCTACAGTTTTGTGAAACATACATCGAAACACATATGAAGCGGCACAAAACAAAATCGATGAAGTGTGATCAATGTGGCCGCTCTTTCCTTACGAAAATGAAACTGGTAGAACACCTGGCTTGGCACACTCGGACAGAGAGTTTACCGTTCCATTGTAACGTGTGTCCCAAGAAGTTCGCCATAGAATGGAGTTTTCAGAGACACGTGAGATTTATACATGGTCCAAGTTGCAAAATAATAAATAAAAAGAGACGTGAACTGAAATGTGAAACTTGCCAAGACTCATTCGATGGTGAAGATAGCCTAGAAGCTCACGTGATGGAAATGCATCCGGAGCTAAGCATATACCCTTGCAAACGATGTCCGTCAAGCTTCTCAAACAGCCACAACTTAAAAAATCACGTGATAAAACATCACCCTGATTTAAACCCATTTGAATGTAATAAATGCAACAAAATTACGGTACATTACACCGATTGGAAGATGCACCTAATAACGCACTTGGAATCGGAGAAGAAACCAAAACCTTACATTTGTGATTACTGTTCAAGACAATTTACTAAGAAAGGCGGTTTGAAGAAACACATACTGACTCACATAGAATTTAAGTGTAAAGTGTGTGGTGAATCGTTTTTGGACAGGATTTCATTAAAAGCTCACAGAAAAGCCTCCAAACATCGTCGTCGCTCAAATACGATTTACATTTGTGATTCGTGCGGAAAAAGATTGAAAAGTAAAGACTATTTGAAATTCCACCTCAAAAAATTCCACACACCCCGTGAAAGCACTTGCGATGTTTGCGGTGAAAAACTATTTGATAAGTTTCAGGAAAGGTTACATCAGAAACGCCATTTTATTTCAAAAGTAAAAGCAAAAGTTAAGGTTAATTACTATAAAGGTGCGGCCTCTAAGGTCACATGCCACAAATGTGGAAAAATATTGGCTTCAAAACCAAATTTGAAAAATCACCTGAGATTACATGATAACGATAAGCCCTGCCAATGTGCACGGTGTGGGAAACGATTCGTGGATGCCGCGCGGCTCCGTCTACACGCTGCCGTCCACTTCGAAGAAAAACCACACGTTTGTCAAGGATGTGGCAGAGCTTTCTCCCAGAAGGGCAATCTCAAGGCTCACCTGAAGTTATGTCGCAAGTTTGTGGACTATTCAGAGAAAAACATCGCAACCGAAAGTCTAATACGAACGCAAAAGGTTGTCGCACACGAAAATATTCCTTCGCATGGTATGATTATAGAACCAGAGAAACAAGATGAAGCAATCGATGAAACAGTGCATAAAGAAGATGCCTTTGACGTGATAGCACAAAAGTTTGATGTTGAATTCAAAGAAGAAACGAACGATGATCCTAAAAATGAAGTGATTGAGAACAGTCCCCGAGATAAAACATTCGAAGTCGGAATTAACGAAACATACAAACTAGAAATAACTATTGACAAAGATGAAGCATCTAATGACGGTTCATATTTCGACCCTGGTGAACACCATAACAGTAAGGAAATTGAACACATTTCTTTTCAAAACGAAGATAACGAAACTCTCAAAGAAGGAACCAAAGAGGAAGATGAAGATCCTACGAATGACACAAACGTTGGACTAGAAGTGAACCTCGAAGATCCGTTAGATGAATCGGGCGAAGATACACGCCGTCAGGAAGAAACCAGTGTCGTACCAGTCGGGAGTAAGCCACTAACTGACAAACCTGATGTGAATGCTGTGTACCAAAACCTTACCTGTGACGTTTGCGACAAACTGTTCAAGAGTCGAAATACCTTAACACGCCACATGAAAATGCACAGCGATGGCAAAAAACACCAATGTGAAAAATGCGGCCACGCGTTCCACCAAAAGTCTAACTTGAAAAGCCACCTGCAAATACACACGGGGGATAAATTAGACCTCATTATCTCTTGCCAAAGTAGCTCTATAGGATTATCTGATAATAAGACCACAGGCCACAAAAAATTGCAGTCAGGCGAAAGACACGTTTGTCTCGATTGTGGAAAGTCTTACAAAGCATACAAATATTTGAAACTTCATATGCGAAATCATACAGGAGAAAGGCCTTACGTTTGCCACGCATGTGGGCGAAGTTTTACAAAAAGCGGCTTGCTAAAAGTCCATATGAACACTCACACAAATGAAGGCAAAGTAGAGTGTCATTATTGTGGTAAGAGTTGCGCCAGTAAGCTTACGTTATCCCAACATATGCTCACTCACGGGGAGAAAAAACACGCGTGTCATCTTTGTGAGAAAAAATTCTGTAGGAAAGATCTTTTGGACGCGCATTTGCATCGCCACTCGGGAATCAAACCGCATAAATGTAAATTATGCGAAAAGTCGTTTGCTTTCACTATCAACTTGAATCGTCACATGGAGGTGCACGCTGATATACAACCTCAAAAATGCGAAATATGCAACAAAGTTTACAGCCGCAACATATATTTTAAAAAGCACATGCGGATGCATAAGAAATCTAAACTAAAGAAAAAAACTTAG
- Protein Sequence
- MKRTMQNHLKQCKDLKESLIKLKKKTSKVRQVKKNNEEKDGCHVGKLFVENNEPLQELYKVRWKEEVDETTEFIYETYEPEKYPRAINVKPTEPLTNDPFDEPMIEPNEEPNESAEPLIDPFDDPTIEPKEEPSDEVLIEPKEELYDELSFESDQEPNDEFTEDVLSELTEEITIEPKEELYDDLSIETDEEPSDKLTKDEPNELTKQITIEPKENLNDKLRVESDEEPSDQFTKDEPSEFTKDELSDDLTVEPREKPNNHLTIESKDKQICKSTDKSFVKRISAKKQGKANKLVNKVKCKICDLCGRSFPSKVRLLKHLNWHIRAEHFPYPCAVCPKKFTAKWRFQKHMELIHGLKCKACEDSFADEDSLEAHVMEMHPELSIYPCKRCPSSFSNSHNLNKHVIKHHPDLNPLECNKCKHISGLIGHWKVHLITHLEAEKKPKPFMCDYCSRQFTFKYELKRHTLKTHNEFKSAMCGESVLDSISLRGHRNATELDNRLISTSSYICDKPGKRMINKKCLRINLKEQHTPRESTCNDFSEKLYDKIQERVHKKHHSIPKTKKEPNNELTIESKGKTIRESTVKTKKNSAKTRGKANKLAKSKICDRQFCETYLKTHMKRHNYKPKSMKCDQCGRIFLKKMKLVEHLAWHTRTESLPFHCNVCPKKFAIELSFQRHVTLIHGPSCKIINKKRLELKCETCQDSFDGEDSLEAHVMEMHPELSIYPCKRCPSSFSNSLNLKKHVIKHHPDLNPLECNKCKHISGKVGQWKVHLITHLESEKKPKPFICDYCSRKFTLKCGLMTHMKSHNAFKCAVCGESFLDRVSLRAHRKSFKHHRHRSSASSYICDACGKGLKDKKGLKAHLEKFHTPRESTCDVCGEKLFDRFQERVHQKHHIIPKPKKKPNNELTIESKGKTICESTVKSFVKKTSARTRGKANKLANKEKCKICDLQFCETYIETHMKRHKTKSMKCDQCGRSFLTKMKLVEHLAWHTRTESLPFHCNVCPKKFAIEWSFQRHVRFIHGPSCKIINKKRRELKCETCQDSFDGEDSLEAHVMEMHPELSIYPCKRCPSSFSNSHNLKNHVIKHHPDLNPFECNKCNKITVHYTDWKMHLITHLESEKKPKPYICDYCSRQFTKKGGLKKHILTHIEFKCKVCGESFLDRISLKAHRKASKHRRRSNTIYICDSCGKRLKSKDYLKFHLKKFHTPRESTCDVCGEKLFDKFQERLHQKRHFISKVKAKVKVNYYKGAASKVTCHKCGKILASKPNLKNHLRLHDNDKPCQCARCGKRFVDAARLRLHAAVHFEEKPHVCQGCGRAFSQKGNLKAHLKLCRKFVDYSEKNIATESLIRTQKVVAHENIPSHGMIIEPEKQDEAIDETVHKEDAFDVIAQKFDVEFKEETNDDPKNEVIENSPRDKTFEVGINETYKLEITIDKDEASNDGSYFDPGEHHNSKEIEHISFQNEDNETLKEGTKEEDEDPTNDTNVGLEVNLEDPLDESGEDTRRQEETSVVPVGSKPLTDKPDVNAVYQNLTCDVCDKLFKSRNTLTRHMKMHSDGKKHQCEKCGHAFHQKSNLKSHLQIHTGDKLDLIISCQSSSIGLSDNKTTGHKKLQSGERHVCLDCGKSYKAYKYLKLHMRNHTGERPYVCHACGRSFTKSGLLKVHMNTHTNEGKVECHYCGKSCASKLTLSQHMLTHGEKKHACHLCEKKFCRKDLLDAHLHRHSGIKPHKCKLCEKSFAFTINLNRHMEVHADIQPQKCEICNKVYSRNIYFKKHMRMHKKSKLKKKT
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -