Hcor061138.1
Basic Information
- Insect
- Hymenopus coronatus
- Gene Symbol
- lin-13
- Assembly
- GCA_030762935.1
- Location
- CM060895.1:16296273-16320022[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 36 0.0002 0.086 16.0 2.8 1 23 106 128 106 128 0.97 2 36 0.024 10 9.5 3.2 1 23 156 179 156 179 0.93 3 36 0.00018 0.079 16.1 1.9 2 23 186 208 185 208 0.97 4 36 0.0017 0.74 13.0 0.9 1 23 272 294 272 294 0.97 5 36 0.023 9.9 9.5 0.1 1 23 402 424 402 424 0.96 6 36 0.16 67 6.9 3.1 1 23 441 464 441 464 0.93 7 36 0.0021 0.89 12.8 0.1 1 23 527 550 527 550 0.96 8 36 0.35 1.5e+02 5.8 1.1 1 23 591 613 591 613 0.97 9 36 0.0078 3.3 11.0 0.2 3 23 680 701 678 701 0.94 10 36 3.6 1.6e+03 2.6 1.9 1 23 723 746 723 746 0.94 11 36 0.74 3.2e+02 4.8 1.7 2 19 753 772 752 777 0.84 12 36 1.5 6.5e+02 3.8 0.0 2 23 786 808 786 808 0.91 13 36 0.069 30 8.0 0.4 2 23 815 837 814 837 0.94 14 36 1.1e-05 0.0048 19.9 2.8 1 23 845 867 845 867 0.97 15 36 2.1 9.1e+02 3.3 0.1 8 23 940 956 932 956 0.83 16 36 0.00092 0.39 13.9 1.9 1 23 959 982 959 982 0.97 17 36 0.0002 0.087 16.0 0.7 2 23 989 1011 988 1011 0.96 18 36 0.014 6 10.2 6.2 1 21 1040 1060 1040 1061 0.95 19 36 0.00014 0.058 16.5 2.5 1 23 1080 1103 1080 1103 0.96 20 36 5.5 2.4e+03 2.0 0.9 3 23 1108 1130 1106 1130 0.87 21 36 4.9 2.1e+03 2.2 0.8 2 23 1168 1190 1168 1190 0.88 22 36 3.5 1.5e+03 2.6 0.2 2 23 1197 1219 1197 1219 0.95 23 36 1.3 5.7e+02 4.0 3.5 2 23 1228 1249 1227 1250 0.92 24 36 0.89 3.8e+02 4.5 0.2 1 23 1254 1277 1254 1277 0.94 25 36 1.1 4.9e+02 4.2 0.3 2 14 1336 1348 1335 1358 0.81 26 36 7.2e-05 0.031 17.4 0.2 1 23 1387 1409 1387 1409 0.98 27 36 0.00015 0.062 16.4 2.3 1 23 1420 1444 1420 1444 0.91 28 36 2.7 1.2e+03 3.0 2.0 2 23 1552 1574 1551 1574 0.92 29 36 2.2 9.5e+02 3.3 0.5 2 23 1581 1603 1580 1603 0.94 30 36 3.3 1.4e+03 2.7 1.2 1 23 1614 1636 1614 1636 0.92 31 36 0.018 7.8 9.8 5.4 1 23 1652 1674 1652 1675 0.95 32 36 0.032 14 9.1 2.0 1 23 1692 1714 1692 1714 0.97 33 36 3e-05 0.013 18.6 1.2 1 23 1797 1819 1797 1819 0.98 34 36 8.4e-05 0.036 17.2 0.1 1 23 1852 1875 1852 1875 0.97 35 36 1 4.5e+02 4.3 0.0 1 23 1885 1908 1885 1908 0.94 36 36 0.0062 2.7 11.3 2.9 1 22 1912 1934 1912 1934 0.96
Sequence Information
- Coding Sequence
- ATGCACGTGCCCACTTGCACAGGTGTAGCAATGGACGGCAACAATCAGCTACCAGTACCATCGGAGGCAACCAATGGGAGATCAAATAGCCCTGTATCAGGCCCAGCAACTCCATTAGAAGACGATATAGATGAAGAGGTGATGGGCCTAGAAATGGAAGACAGCCAGGATGTCATTATTGATAGCAATTCTGTGGAATGTGAGGTCCTAGAGGATGTGGTGATGGAGGATTCTCTTAGTTCCAAAAAGGACAGCAAAGCCCCAGCAGAACAAAAGCCAATTAGGTTTGAGGATGGCCAGGAAGAGGGCAATTTGCACACATGCAGCATCTGCAGCAAGGCATTCTCCAGTAAGGGTCACCTTGCACTACACAACAAGATTCATAGCAAGGAATCTGCATCTTCCAAGTCAGTGCCAGATGCCTCTACTGCTGGAACTGCCTGGAAGAGTGGTTACTATCGCCCATATCAATGTgacctctgcaacaaatcatattCAACAGCTAAGCATCGCTGGGGCCATGTCAGTGTGTGTCATCGTGGTGATCCTCTGGTGACATGTCCAGTGTGTTCTAGAGTCTTTTCTACCTGCTATAATCTTGGAGAACATAAAAGGACAAAACATGGTATGGAGGAGGACAATGAAGTTCCAGAAGAAAGTGCAGCAATTGTTAAATCAAAGCCACTATCCAGCACTCAACATGTGTCGCCAACACCCTTCCCAAGCAATAACGTAGTAAAACCATTATCACCCACTGCCAACACACAAGAGGCGCCACATATCTctttggatgaagtaaatacagCGGTTCTACACACTTGCGCACAGTGCCTCAAAGTTTTCAGAACAGAAGGTGAACTGGAACAACATTCTCTTACACATTTACAGTCTTCTGCAAAATGTTCATCCCCAGAACCCACTCGAAAATCGCTTTCTGgttttacaaatgaaatgagtTTATTAAAGCAAACGCTGTTACAAAATCGTCCTTCTATAACTGAAGGAAAAAAGAGACAATCATCTGTGGAGGAAACAGAGTGCTCTACAAATAAGAAAGCTGCAGTTGAAAGTGCTTCTGAGGCAACTCCAACAGGTGGCAGTAGACGAAAAAATTCAAAGCCACGAAAGATTGAGCAACGCAATGAAGAGAATGcaagagaaaatttagaactaaGTAGTGAAGACTCTATGGCAAATCAAGGTTATGTATGCTCTGTTTGTTCACAGATGTTTTCATCTATTCTCGAGCTTCAAGAACACAGTATTGTCCATGAAGAGGAGGCAGCTGCCACAACTGCAGAAAACAGTGTTGTCAAGCCTTTCTCTTGCCTACTCTGTGAAAAGGATTTCTCGCTGCGCACATCACTGAGTCGTCATTTTAATGCATGTCATGGCATTGATCCTGCTGaaataatggatatctcaaaaTATCAACGTTCACCAAAGAAAATAGCTCCCCGGATCAAAGAGGAGGAACAGGTTCCTGTTATGGATGAAGAAGATCTCTTTGATGTTACTGCAGAAGATGAAGAAAATGATACTAGAGGAACAGAAAACAAATCACAACAGGGTGGCAGCTTTGCCTGTGAGGTATGTAGCCGAGAATTTGGTGATCGTGCTTCACTATGGCTACACTTGCGTTATACGCACAAAGAATATGCTGCATATGCCTGTGGAGTTTGCTTACAGATTTGTGGGAGCAATACGCACATGTATGAACATTGGACTACCCAGCATCCTCCAGATCGTGCATCCACAGAACAAAGAAGGTACAGTTGTCAGATGTGTGGCCGACAACATGATTCACGCAAGAAACTTTTGGCTCATGTTCTTGTCCACAACTTGGATGATGGTTCAGGTGGCACGTATGATCCAGAACTGCTCGTTACTTTGAACACTGGCTTCTATAACTTTGAAGCACGAGAAAACCATGACAGTCCTGCAACAGTCAATGACTTTAATTCTACATTAGAAACAGACTTTGGCTCTGCAGAATGTAGTGGAGGAGAATATGCTGGAAATCCAGCTTTGTTTGGCTGCGAGTTGTGTTATAAATCTTTTCCTACTGAAGATGGTCTTGTGAAACATAAAAAGGGAGCTCATCGTTTGGACCCTAGAGCAAGCTCTTCAACACCAAACAATTCAAAGGGTTCATATCAGTTGTATTTTGTTTGCGAGCTGTGTGGAAGCTCACATAGAAGCAAATCTGAGCGCTGGAGACATGTCTTCAGGACACATGGGGGAGAGCAAGCACTAACCTGTGACAGACCAGGATGTGGAAAGGTGTTTCCAACTAGGGCTTTAAGGCAAGACCACTGCACCAATCATCATCGTCTTCAGGGAGCAACTCCCAATGTCTGTGAGATATGTGGTAAACTTTGGGGTACTCGTGTTGACTTCTGGAAACACCTCATGGGTGTACATCCAGATTGTGTTCCTTTGACCTGTGGTGTCTGCCTCAAGATTTTTTGTTCAGTTCCAGAACTTCAGAATCATGTAAGTTCAAACCATATGCCTCTGACAGGTGGCGACTTCTGCTGTGATATTTGTGGTCGTCCTTACTCCAACAGATCAAAGCTTTCGCGCCACCGCAGAATTCATTTGGTTGGTGAATccggagatattccatgttacccTGAAATGCTGCTTGGCACTCCTGGAACAGTGACAAAAAAGTCTGCCTTTCCTACTTTAACACCAAAATTGGGCCAAGATACTGCAGGCAAGTTTGACAAGTCTCAGGCTGAGATGCAAGGGCCTAAAGGCAAGCACAAATCTCTGGCATTACCACCCCTGTACTGTGATGCCTGTCCAGATGcaatttttacaaacatatttaatttagCCGAACATCGCCGCACAATCCATGGCCTTTTTCCTTGCGATCTTTGCTCTAAGTTTTATGGTCGCACATCACATTTATGGAAGCATGTCAAACGAGTACATAATAACCATCCTGAATTAACTTGCCCCATATGTAAACGTATATCGGCAAGTAAATCCCATTTGGAAACACACATTGCTACCAAGCATCGATCTTTGGAATCAAAAACTTCTACTCCTAAGACACAAGAAATAGAATTTCCTTCATTGAAAGGACCAAGTATACCAACTCTATATCCTTGCCACAAATGTTCGAGGCGGTTCTATAAACGACATTTACTGACAAAACATCAACTTCATTGTCCAGCAAAACGTCATCGTGAAAAACCAAAGTCAGACGATACCATAACCGAACACCCTTGTGATACCTGTGCAAAAGTATTTTCTTCTCATTCTAAGTTACGAGATCATCAACGAGTAATCCATGCCCCACAGAGTTGTGAACTATGTTCCAACGTAACATACAATTCCAAAATAGATCTCATGCATCACATAAAAGATTCTCATGCAGATGATCCTAATTTTGCATGTTCTCATCCTGGATGCAGTCGTACAATGAGATCACGCACAGATCGTGACCGTCATGTAGCAATGCACCGCTCATTCAAGTATCCACCAACATGTTCCCTTTGTGGTGAGAACTGTGCAAATCGTGTTAAGGCATGGAGTCATTACTCAGCTCGTCATAAATCTTCAATTTCAAATGTTTGTGGTGTTTGTTATGATCATTTCCCCTCTACCAGTGTTCTAGTGGAGCATGTACGTACACTTCATACTAGGGTTCTCAGAAATAAGGATACTTGCAAAATCTGTGTGAAAACATACTCTAGTTTTTATAAGGTACTACTGCATCATGCAAATCATCACCCAGATTACCATATCTGTCGTCAGTGTCTTGAGGTATTTGCTGAGCAGCAGGAACTAGACAGTCATGTTGAAAAGGAACATGCaaaaaaagaggaagaagaggaggagggggaggaggaggaggtagataatgaagaggaggaagaagaagataaATTACAACAGTCACAGGAGAATGCTAGTGTTTCAGGGGAGAAAAATGAACTTCCATCTCCAATAGTTATAATGATTAAGGAAGAGAAAAAAGATGAGATAGAATGCAGTCATTGTGGGGAAGAGTTTGCCACAGAAAATGAACATGACCAGCATTTTGCATCAATACATGCTGCTGATGGGACTGCAGCCAAACGCCAACATACTCCTGATCCTGGGGGCAATCCTAGTGCTGTAAAACGTCCACGTCGTTCATACGATTGTGACAAGTGTCCTGAGATTTTCCGTTCTCCATCAGACCTAGTGGAGCACAAGAAGCGACATCAAAAGACACCAGTAGCTGAAACTAAGCCTTATCATTGTAGTCAGTGTGAGAAATATTTTACCAACAAATCATCTTATTGGAGACACATAAACTCCCCAGCACACCATGCAGCAAAATTACAGGTGCAACAGAAATTGTCTGAAGAACAATCCACTCCAAGAAGTGAAAGTCCAGCAAATAAGAGTGCTACTATGGTGTACATCCCCGAAACCAACCTTAGGGAAACATTGATGAGAAGAGTAAAAATGGAGTTGGACATGTTGGATGATAGCGACCATGATAGCAAGAGCTTCACTGAGAGCGAAGTTGATAAGGATGCAAGCATAGAGAATGAAAGTGTCAAGGAGTCACCAGTAACTTTAAAACCTACGAGAAAGCGTCCAGAATCACGTAAAGTGTACCGTGGAGGACCAGATGTGCCATGCCAATGTCAGTTGTGTGGTAAACGATGGCCTGCAATGAAGCATTTGTGGCAGCATCTGATTCGTAATCACCGTCATGAGGCTGCAGTAACATGTGGAGTTTGCCTTGAGGTATGCCAAGACTATCGCAGTCTATCTGCACACTTAACCTCACAGCATCCTGATAACTTTAAAGGGGAAGGAAACAACTTCACTTGCCGCATCTGTGGACGCTATCACAATGCAAGATCAAAGCTTATTCAGCATGCTACCATCCATATTATTCTAGGGTCAGCTCCAGAGCAGCAACCACAATCAAATGTTCATTCGTGTCGCTTCTGTTTCAAAGCTTTCACCAATGAAAACACACTCCAAGACCATGTTAAAATGCACCATTCATCTGGGAGCATATCAAAGACTGTACGTCCTGCTGAACTTTCCCCACATACATGTGATATATGCTTCAAAGTGTGTGGTAATGAAGGAGCATTGGTCAGTCACAAGAAAAGCCACCAGAGGGGAACAACAGAATACCGCCGTCCAAAAGAAGATGTTGAAGTGGTAGAGACAGATGAGGACGAAGATGATAATGACAACGAATCTGCTACAGAACATGGTAGCAAAGAAGAGAATGATAAtgatgaagaggaagaagatgaggacaatgaagatgatgatgatgatgatgatgacgatgatgatgatgaagaggaTGAACAAAGTGAACAAACGTCTGTTACCAGTGAACCCAAAGAAGTTTTTAGTTGTGATATGTGCCCAAAGACCTTTATAGATAAAAATGATTTCAAAAGTCATGTTAAAAGCCATCCTCAGCTTGCTGACTTGCATTTGCAGATGAAGAGGAAAACACCTGTAGTGGCTATAGCAGAGTTGCCAAAACAAGAAGTAGAGAAACAAACCTTTTATGTGTGCAGCATTTGTAGTCAGGTATTCAAAACACCGGCAGCACTTTCAGTGCATCAGACCAGAGACCACCAACAAAGAGCACCTAGCACTGGTAAATTCCCTTGTGCTTCTTGCAGCGAGGGCTTTCCTACTATTGCTGCACTCTCAGAGCATGTTAAAGCAAGCCATGCCAAAATGTTCACCTGCACCAGGTGCAACAAGAAATCCTTTTCTTCCTATGTCAATCTGACCAAACACTATCGCACATGTCAACCGCAGGTAATGtattctcaaaacaaaagtcAGCTCGAGAGCTCAACTAACTTGTCTGAAGATAGCCAGGACTCTTCTGCAGACGAAAGTAGCTGCACCAAAAGCAGCAGACATACAGATCCAACATATTCTAACACAAGTAATAACATGAATATTGTGGACCAAGGAGCCAAAAGGGCTGATGCCCAAAAAAATTTGGATTTAGTGATTGGAATCCCAAATATTGGTTTGAGAAGATTAAGTAGCAGTTCTGACGATTCAGATGAGGAGAACATCACAGCACAGACttccattaaaagcaatattgGAAACACAGCAGAAGTGGAATCTGTCAACAAGTTGCCAGAAAACAGCAATCCTGACTCTTCTGATCATGAAATCAGAGCTTCTATTAATAGTGTAGTATCCTCAGAACCCAGGGAGGATTCTGCAAACGTGAGCCCGGAAGAATGCGCAACAGTCCCTGATAGTGCTAGTAGCACCCAAGAAATTGCAAAATCCGTGATTACATTAAGTGATGATGAATCCCAAGCATCTGTGAAATCTATAAAACAGGAGGAAAAGGACATATTTGTGGTGTCGCTGAGTGATGATTCAGAAATGTCCATTAAGGACTGTAAAGAAGATGCAGCAAACGAAAACTGTGTTGTTTTGGAAGTAATTAACAGTGAAGTAAAGCAGGACCTGACAGCTCCACAGGAAACTGAACCTGAGCTTGAACCTGAGTTTGTGTCTGTTAGTGATAGGACTGAAAGTGATACGGATGTAGTGGAGTTGGCAGCACCTGATGCTGATGAACAAAATACTGTGTGTGAAGTTAAACCAGAAACAGTGAACGAACCAAAAGCAGAAGGAACAAAGGGTAAAGACTATTTGTTTTCTTTAGAAGATGCACTGGATAAGTTAGAGGCTGAGAATACTGAAAAGTGA
- Protein Sequence
- MHVPTCTGVAMDGNNQLPVPSEATNGRSNSPVSGPATPLEDDIDEEVMGLEMEDSQDVIIDSNSVECEVLEDVVMEDSLSSKKDSKAPAEQKPIRFEDGQEEGNLHTCSICSKAFSSKGHLALHNKIHSKESASSKSVPDASTAGTAWKSGYYRPYQCDLCNKSYSTAKHRWGHVSVCHRGDPLVTCPVCSRVFSTCYNLGEHKRTKHGMEEDNEVPEESAAIVKSKPLSSTQHVSPTPFPSNNVVKPLSPTANTQEAPHISLDEVNTAVLHTCAQCLKVFRTEGELEQHSLTHLQSSAKCSSPEPTRKSLSGFTNEMSLLKQTLLQNRPSITEGKKRQSSVEETECSTNKKAAVESASEATPTGGSRRKNSKPRKIEQRNEENARENLELSSEDSMANQGYVCSVCSQMFSSILELQEHSIVHEEEAAATTAENSVVKPFSCLLCEKDFSLRTSLSRHFNACHGIDPAEIMDISKYQRSPKKIAPRIKEEEQVPVMDEEDLFDVTAEDEENDTRGTENKSQQGGSFACEVCSREFGDRASLWLHLRYTHKEYAAYACGVCLQICGSNTHMYEHWTTQHPPDRASTEQRRYSCQMCGRQHDSRKKLLAHVLVHNLDDGSGGTYDPELLVTLNTGFYNFEARENHDSPATVNDFNSTLETDFGSAECSGGEYAGNPALFGCELCYKSFPTEDGLVKHKKGAHRLDPRASSSTPNNSKGSYQLYFVCELCGSSHRSKSERWRHVFRTHGGEQALTCDRPGCGKVFPTRALRQDHCTNHHRLQGATPNVCEICGKLWGTRVDFWKHLMGVHPDCVPLTCGVCLKIFCSVPELQNHVSSNHMPLTGGDFCCDICGRPYSNRSKLSRHRRIHLVGESGDIPCYPEMLLGTPGTVTKKSAFPTLTPKLGQDTAGKFDKSQAEMQGPKGKHKSLALPPLYCDACPDAIFTNIFNLAEHRRTIHGLFPCDLCSKFYGRTSHLWKHVKRVHNNHPELTCPICKRISASKSHLETHIATKHRSLESKTSTPKTQEIEFPSLKGPSIPTLYPCHKCSRRFYKRHLLTKHQLHCPAKRHREKPKSDDTITEHPCDTCAKVFSSHSKLRDHQRVIHAPQSCELCSNVTYNSKIDLMHHIKDSHADDPNFACSHPGCSRTMRSRTDRDRHVAMHRSFKYPPTCSLCGENCANRVKAWSHYSARHKSSISNVCGVCYDHFPSTSVLVEHVRTLHTRVLRNKDTCKICVKTYSSFYKVLLHHANHHPDYHICRQCLEVFAEQQELDSHVEKEHAKKEEEEEEGEEEEVDNEEEEEEDKLQQSQENASVSGEKNELPSPIVIMIKEEKKDEIECSHCGEEFATENEHDQHFASIHAADGTAAKRQHTPDPGGNPSAVKRPRRSYDCDKCPEIFRSPSDLVEHKKRHQKTPVAETKPYHCSQCEKYFTNKSSYWRHINSPAHHAAKLQVQQKLSEEQSTPRSESPANKSATMVYIPETNLRETLMRRVKMELDMLDDSDHDSKSFTESEVDKDASIENESVKESPVTLKPTRKRPESRKVYRGGPDVPCQCQLCGKRWPAMKHLWQHLIRNHRHEAAVTCGVCLEVCQDYRSLSAHLTSQHPDNFKGEGNNFTCRICGRYHNARSKLIQHATIHIILGSAPEQQPQSNVHSCRFCFKAFTNENTLQDHVKMHHSSGSISKTVRPAELSPHTCDICFKVCGNEGALVSHKKSHQRGTTEYRRPKEDVEVVETDEDEDDNDNESATEHGSKEENDNDEEEEDEDNEDDDDDDDDDDDDEEDEQSEQTSVTSEPKEVFSCDMCPKTFIDKNDFKSHVKSHPQLADLHLQMKRKTPVVAIAELPKQEVEKQTFYVCSICSQVFKTPAALSVHQTRDHQQRAPSTGKFPCASCSEGFPTIAALSEHVKASHAKMFTCTRCNKKSFSSYVNLTKHYRTCQPQVMYSQNKSQLESSTNLSEDSQDSSADESSCTKSSRHTDPTYSNTSNNMNIVDQGAKRADAQKNLDLVIGIPNIGLRRLSSSSDDSDEENITAQTSIKSNIGNTAEVESVNKLPENSNPDSSDHEIRASINSVVSSEPREDSANVSPEECATVPDSASSTQEIAKSVITLSDDESQASVKSIKQEEKDIFVVSLSDDSEMSIKDCKEDAANENCVVLEVINSEVKQDLTAPQETEPELEPEFVSVSDRTESDTDVVELAAPDADEQNTVCEVKPETVNEPKAEGTKGKDYLFSLEDALDKLEAENTEK
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01410930;
- 90% Identity
- -
- 80% Identity
- -