Lcup011281.1
Basic Information
- Insect
- Lucilia cuprina
- Gene Symbol
- -
- Assembly
- GCA_001187945.1
- Location
- JRES01000780.1:211898-221346[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 46 0.022 1.4 9.2 1.5 2 23 79 101 78 101 0.94 2 46 6.5e-05 0.0043 17.1 0.5 1 23 108 131 108 131 0.96 3 46 0.00025 0.016 15.3 0.1 1 23 136 158 136 158 0.98 4 46 0.0019 0.12 12.5 0.7 1 23 164 187 164 187 0.88 5 46 8.2e-06 0.00055 19.9 2.3 1 23 193 215 193 215 0.99 6 46 0.00011 0.0071 16.4 3.6 3 23 223 243 222 243 0.97 7 46 3.8 2.5e+02 2.1 0.5 1 12 249 260 249 265 0.84 8 46 0.017 1.1 9.5 6.8 1 23 371 394 371 394 0.95 9 46 7.7e-06 0.00051 20.0 1.2 1 23 401 424 401 424 0.98 10 46 0.0017 0.11 12.7 0.1 1 23 429 451 429 451 0.97 11 46 0.00075 0.05 13.8 0.5 1 23 457 480 457 480 0.90 12 46 0.0003 0.02 15.0 1.6 1 23 486 508 486 508 0.99 13 46 8.6e-06 0.00057 19.9 1.4 3 23 516 536 515 536 0.97 14 46 0.15 10 6.5 0.4 1 21 542 562 542 562 0.95 15 46 0.046 3 8.1 3.7 1 23 709 731 709 732 0.94 16 46 0.004 0.27 11.5 0.6 1 23 739 762 739 762 0.96 17 46 0.00037 0.025 14.7 0.2 1 23 767 789 767 789 0.98 18 46 0.00075 0.05 13.8 0.5 1 23 795 818 795 818 0.90 19 46 0.0003 0.02 15.0 1.6 1 23 824 846 824 846 0.99 20 46 7.1e-06 0.00047 20.1 1.7 2 23 853 874 852 874 0.97 21 46 0.02 1.3 9.3 1.9 1 23 880 903 880 903 0.97 22 46 6.4e-05 0.0043 17.1 0.8 1 23 926 949 926 949 0.97 23 46 0.025 1.7 8.9 1.9 1 23 954 976 954 976 0.96 24 46 0.0025 0.17 12.1 0.7 1 23 982 1005 982 1005 0.92 25 46 0.0019 0.13 12.5 1.8 1 23 1011 1033 1011 1033 0.97 26 46 3.5e-05 0.0023 17.9 2.8 3 23 1041 1061 1040 1061 0.97 27 46 0.00044 0.029 14.5 3.0 1 23 1067 1090 1067 1090 0.97 28 46 0.00027 0.018 15.2 1.2 2 23 1200 1222 1199 1222 0.95 29 46 0.079 5.2 7.4 0.3 1 23 1229 1252 1229 1252 0.90 30 46 0.0012 0.082 13.1 0.0 1 23 1257 1279 1257 1279 0.98 31 46 0.0048 0.32 11.2 3.2 1 23 1285 1308 1285 1308 0.94 32 46 0.0005 0.034 14.3 1.6 1 23 1314 1336 1314 1336 0.99 33 46 0.00012 0.0079 16.3 3.7 3 23 1344 1364 1343 1364 0.97 34 46 0.022 1.5 9.1 1.6 1 23 1370 1393 1370 1393 0.97 35 46 3.6e-05 0.0024 17.9 0.7 1 23 1514 1537 1514 1537 0.97 36 46 0.004 0.26 11.5 0.5 1 23 1551 1574 1551 1574 0.93 37 46 0.00027 0.018 15.2 0.8 1 23 1580 1602 1580 1602 0.99 38 46 0.00013 0.0089 16.1 4.9 3 23 1610 1630 1609 1630 0.97 39 46 0.022 1.5 9.1 1.6 1 23 1636 1659 1636 1659 0.97 40 46 0.00087 0.058 13.6 2.4 2 23 1775 1797 1774 1797 0.95 41 46 0.0087 0.58 10.4 1.1 1 23 1804 1827 1804 1827 0.94 42 46 0.00044 0.03 14.5 0.0 1 23 1832 1854 1832 1854 0.98 43 46 0.0037 0.25 11.6 0.6 1 23 1860 1883 1860 1883 0.94 44 46 0.0038 0.25 11.5 3.5 1 23 1889 1911 1889 1911 0.99 45 46 0.00012 0.0079 16.3 3.7 3 23 1919 1939 1918 1939 0.97 46 46 0.022 1.5 9.1 1.6 1 23 1945 1968 1945 1968 0.97
Sequence Information
- Coding Sequence
- GCTGAAGCGAGGCATAAAGCTCTATCCGATCTAAGCCAAACATTATTAAGTGAACACAATATAGATTTTAGCAAAGAAGTCTTGTCAAATGCCATTGAACAATTCCAATCTTGGTATTACGATACGAAACGGCGTAAAGGAAACCGATATAAACTGACAAGTAGTATcgaagataaatatttaaaaatgtgcaAATTCATGCCTTCGAAACAATTGAAACTGAGGCAAAGGATGATGTGTGAATTGTGTAATACACTCTTTTATACGGAACACAACTTAAAGCTCCATCTGTATAAAGCTCACCAAATAGGAGAACTACCGTTCGAGTGTGAGCAATGTGGAAAGAAATTCGACTCAAAATTGTATTTGAGAGTTCACGTACTTCGCGTACACGTTGGCAAGCAGTTCCCCTGTGAATATTGTGGCAGAAAGTTTGCCATTATGCCAGAACTGAACACTCATATCAAGATTCACACAGCTGAAAAGCCTCACGTATGTGAGCAATGTGGCAAATCATTTAGGTTAAAAACTCAATTGGGCTATCATGTTACAGCGATACACACTAACATTCGCGCTTTCAAATGTACCATGTGTCCCAAGGACTTTAAAAAGAAGCGCGATCTGACTGATCACATCAAGTCGCACTTAAATATACGCGACAAGATTTGCGAGACATGTGGTAAAGGATTTTCCAATTGTCATTCCTTGATTCGTCACCGTCAAATACATTCGGCGGTGAAAAAGTACGAATGCAAATTGTGcgatgccaaatttcatcagtTTAAACCATTAACGCTCAAACAATGGGATGACAATACTTCGAAATTCATCGATGCTTACGCAAAATTCCCCCTTCTATACGACGGAGAGCATCCATTGTATGGTAATTTCGAGGAAGTACAAAAAGCTAATAACGATCTAACAAATACACTTTTAACTGAACACAATATCGATTTTAAAGAAGAAACCCTGTCCTCGGCAATAAATCAATTGCATTCTTGGTACTACGATGCCAAGCAACGCAAGGCAAAACAAAATAGATTGAAAAAAGAGGAAgaggaatattttaaaaagtgtaacTTTTTACCGGCAAGACgattaaaaacgaaattttattgtcatgtttgtaagaaaaacttcaaaatggAACACAATCTAAAGGGCCATCTCTTTAGATATCATCAGATAGGCGACCTACCATTCCAATGCGAGCAATGTGGAAAGAAATTCGAAATCAAATCCACACTGCAAACACACATTCAAAGAATGCACGTCAGTAGAAAATTCCCCTGCGGTTTTTGTGGAAAACTCTTTGCCATAAAATCAGAGTTAAAAGTTCACAACATGGTTCATACGGCGGAAAAGCCGCATGTCTGCGAATTATGCGGAAAAGCCTTTAGAATAAAGACTCAACTACGCTACCATGTGACAGCAATACACACCAAGATACGTGCCTATAAATGCACCATGTGTCCCAAGGATTTCCTTAAGAAGCGTGACTTGACCGACCACATTAAAACGCATCTAAATATTCGCGACAAGATATGCGAAACGTGTGGAAAGGGTTTTTCGAATAGTCATTCGTTAATTAGACATCGTCAGATACATTCGGAAGTAAAACGATATGCCTGCAAATTATGCGATGCTAAATTTCACCAGTTCGTTGGTCTAAACGGTCATATGAAACcaattattttacaactattatttccttttattgtctttattttccttaataaaaaaCAGGAACTTAATAAAAAACAGGAACGTAACAAGAACCGGGAACAGGatcaaaatctattaaattttaaaaagttggaTGAAAATACTTTCAGTTTCATAGAGACTTATGAAAAATTTCCTCAACTATATGATGGAAATCATCCTTTGTACGAAAATGTTGAAGAAAAATTGAAATCTTTAACTGAATTTGCAGATCGTCTTTTACTATTTAACGGCTTAAATTTCTCAATTCAGACTCTAACTATCGCCATCAGACAATTACAGTCGTGGTATTATCATTCTAAGACACGCAACGAAAAccgaaagaaaaaattaaatgaccACGAAATGGaatactttagaaaatgttcGTTTTTCCCTCCAAAGCGCCTAATTGATAAATACTTTTGTACAATTTGTAAACAACAATTTACTTTGAAAAGTAGTTTGGACGGCCATCTCTTCAGACACCATCAAATCGGCGATTTGTCCTTTGAGTGCGATAAGTGTGGCAGAAAGtatgcacaaaaaaatatacttcgAGAGCATGTACAACGAATGCATGGCAGCAAAGAATTTTCCTGCGGTTTCTGTGGAAAACTCTTTGCCATAAAATCGGAGCTAAAAGTTCACACCATGGTACATACGGCGGAAAAGCCACATGTATGCGAACTATGCGGCAAAGCTTTTAGAATAAAGACTCAACTACGCTACCATGTAACAGCAATACACACCAAGATTCGTGCCTATAAATGCACCATGTGTCCCAAGGATTTTCTTAAGAAGCGTGACTTGACCGATCACATTAAAACGCATCTAAATATTCGCGACAAAGTATGCGAAACGTGTGGAAAGGGCTTTTCGAACAACCATTCGTTAATACGACATCGTCAGATACATTCGGAAGTAAAGCGATATGCCTGCAAATTATGCGATGCTAAATATCACCAGTTTGTTGGCCTAAACAGTCACATGAAACGTACGCATAACATTGTAAAAAAAGATTCATTATCGAATgaaatttcattATACCATCAAATCGGCGATTTGTCCTTTGAGTGCGATGAGTGTGGTAAAAAGTTCTCACAAAAAACTATACTCCAACAGCACGTACAACGAATGCACGCCAGCAGAAAATATCGCTGCGAATTTTGCGACAAACTATTTGCCACAAAAtcggaacaaaattttcacaccAGAATACACACGGACGAAAAACCACATGTCTGTGAATTTTGTGGAAAATCTTTTAGAATTAAAACTCAATTGGGTTATCATGTCAACGCTGTCCACACGAAAATACGTGCCCACAAATGTACCATGTGTCCCAaggattttttaacaaaacgtgATCTAACGGACCACATTAAAGCTCATTTGAATATACGTGACAAGATATGTGAAACTTGTGGAAAAGGGTTTTCGAATAATCATTCATTTAGACGACATCGTCAAATACATTCGGAAATCAAAAGATATGAGTGTAAAATGTGTGACGAGAAGTTCAATCAATTAATAAGTTTACACAGACATAAAAAACGTATTCACAAGAATAAACCACTCAATCTTGACCAATGGGATGATACAACGtgcaaatttatcgaaatttatgaaaaatttccatTACTTTATGACAAGAAAAACCCATTGTACGGCAATTTAGAAGATAAACAAAAGGCTTTAACAGACATTAGCAGTACCCTGAAAAGTCAGCATAATATTGATTTTAGCGAAGAATTAATAGCCTTTGCCATAGATGAATTGAAATCTTGGTTAATACGAGTCAAGCGACGAAAAGGTAAACTTGACAAAACAACTgagaagtatttaaaaatgtgcaaatttATGCCGGCTAAGCGATCAAAAGAAAAGATTCCGTGTAGCATGTGCGAAAAGGTTTTCTACACCGAGCATAATTTAAAATACCATTTAACAAAAGTTCATAAAGTAGGAGACTTTTCGTTTCAATGTGATTTGTGCGGAAAAAAACTTGTCTCAAAAAATGCTGTCCTATATCACAATTTAAGGGTACATGTTGAGAAGAAATATCCATGTGAGTTTTGTGGAAAACTATATGCTATTCCTGCTGAACTGAAGGTCCATACTCGGATACACACGGCAGAGAAGCCTCATGTATGCGAATTATGTGGTAAATGCTTTAGACTGCGCAATCTTTTGAGTCTCCATGTCACACGTATGCACACCAAGATTCGTGCCTTCAAATGCACAATGTGCCCCAaagattttctcaaaaatacTGATCTAAGGGATCACATGAAATCCCATTTAAATATTCGAGACAAAATATGCGAAACCTGTGGCAAAGGTTTTACAAATTGTCATTCTTTGATACGACACCGTCAGATACACTCAGATATAAAGAAATTCGCATGTAAACTGTGTGACGCTAAATTCCATCAATTTGTTGGACTCAATGGTCATATGAAACGCACACATAACATAAACCCACTCACTCTCGACAAATGGAATGAAACTACGTGCAAATTTATCGATATTTATGCGAAATTTCCATTGCTATACGATCCAGATCATCCATTACATGgcaaaaatgtagaaaaacaGAAAGCCTTGTCTGAGATTCGCAGCACCCTACTGAGTCAACATAACATAAATTTCAGCGACGAAATCGTAACCGAAGCCATAGATCAACTGCGATCTTGGTACTATTCAAGCAAACGTCGACAACATGTGaccaataataatgatgatcttaaaaaatgttattcaaaatatttaaaattgtgcaaatttttgccagaaaaacaaaaaaggGTTCATCAAATTGGGGAGATATCCTTTCAATGCACTCAATGTGGAAAAAAACTTGTTTCGAATTCAGCTTTACAAAGTCATACCCAAAGGGTTCACTCTGGAAAGGAATATCCATTACACACGAATGAGAAACCGCATGTCTGTGAATTATGCGGTAAAGGTTTTAGATTGCGAAATAATTTAGGTCTTCATGTTACCAGTATGCACACCAAAATTCGTGCCTTTAAATGCACCATGTGTCCCAAAGATTTTCTGAAAAATGCCGATCTGAAGGATcacataaaaacacatttaaatatacGCGATAAAATATGCGACACGTGTGGCAAAGGATTTACAAATTGTCATTCCCTTAGAAGACATCGTCAGATACACTCAGAGATAAAGAAATTCGCCTGTAAACTATGTGACGCAAAATTCCATCAATTTGTTGGACTAAATGGTCATATGAAACGTACTCATAATATAAAGCCACTCATACTTGATAAATGGGACGATACTACGTGTAAATTTATTGACATATATGCGAAATTCCAATTACTCTACGATGGAGAGCACCCATTATATGGGAAATATGAGGATAAACAGAGGGCCATAACTGATATACGCAACGCTCTGCTCACTCAACAtgaaatagattttaatgagGAACTTGTATCGAATGCTATAGATCAATTACAATCTTggtattatttaacaaaacggCAAAAACATAAGAATAAAGATGATGCCAAACTTGCCAAGAGTTGCGAAAACTATTTACAAATGTGCAAATTTTTGCCGGAGAAGCGATTAAAAGTGAAGATTGTGTGTAgcatttgcaataaaaatttttatacggAACATAATTTCAAATATCACTTAACGAAAGTCCATCAAGTGggagatttttcatttcaatgcgATCTGTGtggaaaaaaacttatttcgaAGTCTGCTTTACAATGTCACGCTCAAAGGGTTCACTCTGAAAAGAAATATCCATGtgaattttgtggaaaattattTGCCATACCTTCTGAACTGAGGATCCACACACTGGTGCACACCGCTGAAAAGCCACATGTTTGTGAATTGTGCGGCAAAGGTTTTAGATTGCGAAATAATTTAGGCCTTCATGTTACACGTATGCATACCAAGATACGTGCCTTCAAATGTACCATGTGTCCCAAAGATTTTCTGAAAAACTGCGATCTAAAGGATCAcataaaaactcatttaaataTACGTGATAAAATCTGTGAAACTTGCGGTAAAGGTTTTACCAATTGTCATTCTCTAATACGACATCGTCAAATACACTCAGAGGTTAAAAAATTTGCCTGTAAACTCTGTgatgcaaaatttcatcaatttgtTGGACTTAATGGTCATATGAAACGTACCCATAATATAGTAAAAAAAGacacaaacaattaa
- Protein Sequence
- AEARHKALSDLSQTLLSEHNIDFSKEVLSNAIEQFQSWYYDTKRRKGNRYKLTSSIEDKYLKMCKFMPSKQLKLRQRMMCELCNTLFYTEHNLKLHLYKAHQIGELPFECEQCGKKFDSKLYLRVHVLRVHVGKQFPCEYCGRKFAIMPELNTHIKIHTAEKPHVCEQCGKSFRLKTQLGYHVTAIHTNIRAFKCTMCPKDFKKKRDLTDHIKSHLNIRDKICETCGKGFSNCHSLIRHRQIHSAVKKYECKLCDAKFHQFKPLTLKQWDDNTSKFIDAYAKFPLLYDGEHPLYGNFEEVQKANNDLTNTLLTEHNIDFKEETLSSAINQLHSWYYDAKQRKAKQNRLKKEEEEYFKKCNFLPARRLKTKFYCHVCKKNFKMEHNLKGHLFRYHQIGDLPFQCEQCGKKFEIKSTLQTHIQRMHVSRKFPCGFCGKLFAIKSELKVHNMVHTAEKPHVCELCGKAFRIKTQLRYHVTAIHTKIRAYKCTMCPKDFLKKRDLTDHIKTHLNIRDKICETCGKGFSNSHSLIRHRQIHSEVKRYACKLCDAKFHQFVGLNGHMKPIILQLLFPFIVFIFLNKKQELNKKQERNKNREQDQNLLNFKKLDENTFSFIETYEKFPQLYDGNHPLYENVEEKLKSLTEFADRLLLFNGLNFSIQTLTIAIRQLQSWYYHSKTRNENRKKKLNDHEMEYFRKCSFFPPKRLIDKYFCTICKQQFTLKSSLDGHLFRHHQIGDLSFECDKCGRKYAQKNILREHVQRMHGSKEFSCGFCGKLFAIKSELKVHTMVHTAEKPHVCELCGKAFRIKTQLRYHVTAIHTKIRAYKCTMCPKDFLKKRDLTDHIKTHLNIRDKVCETCGKGFSNNHSLIRHRQIHSEVKRYACKLCDAKYHQFVGLNSHMKRTHNIVKKDSLSNEISLYHQIGDLSFECDECGKKFSQKTILQQHVQRMHASRKYRCEFCDKLFATKSEQNFHTRIHTDEKPHVCEFCGKSFRIKTQLGYHVNAVHTKIRAHKCTMCPKDFLTKRDLTDHIKAHLNIRDKICETCGKGFSNNHSFRRHRQIHSEIKRYECKMCDEKFNQLISLHRHKKRIHKNKPLNLDQWDDTTCKFIEIYEKFPLLYDKKNPLYGNLEDKQKALTDISSTLKSQHNIDFSEELIAFAIDELKSWLIRVKRRKGKLDKTTEKYLKMCKFMPAKRSKEKIPCSMCEKVFYTEHNLKYHLTKVHKVGDFSFQCDLCGKKLVSKNAVLYHNLRVHVEKKYPCEFCGKLYAIPAELKVHTRIHTAEKPHVCELCGKCFRLRNLLSLHVTRMHTKIRAFKCTMCPKDFLKNTDLRDHMKSHLNIRDKICETCGKGFTNCHSLIRHRQIHSDIKKFACKLCDAKFHQFVGLNGHMKRTHNINPLTLDKWNETTCKFIDIYAKFPLLYDPDHPLHGKNVEKQKALSEIRSTLLSQHNINFSDEIVTEAIDQLRSWYYSSKRRQHVTNNNDDLKKCYSKYLKLCKFLPEKQKRVHQIGEISFQCTQCGKKLVSNSALQSHTQRVHSGKEYPLHTNEKPHVCELCGKGFRLRNNLGLHVTSMHTKIRAFKCTMCPKDFLKNADLKDHIKTHLNIRDKICDTCGKGFTNCHSLRRHRQIHSEIKKFACKLCDAKFHQFVGLNGHMKRTHNIKPLILDKWDDTTCKFIDIYAKFQLLYDGEHPLYGKYEDKQRAITDIRNALLTQHEIDFNEELVSNAIDQLQSWYYLTKRQKHKNKDDAKLAKSCENYLQMCKFLPEKRLKVKIVCSICNKNFYTEHNFKYHLTKVHQVGDFSFQCDLCGKKLISKSALQCHAQRVHSEKKYPCEFCGKLFAIPSELRIHTLVHTAEKPHVCELCGKGFRLRNNLGLHVTRMHTKIRAFKCTMCPKDFLKNCDLKDHIKTHLNIRDKICETCGKGFTNCHSLIRHRQIHSEVKKFACKLCDAKFHQFVGLNGHMKRTHNIVKKDTNN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -