Tyan013674.1
Basic Information
- Insect
- Topomyia yanbarensis
- Gene Symbol
- -
- Assembly
- GCA_030247195.1
- Location
- CM058068.1:201243120-201260816[-]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 65 1.7e-05 0.0017 20.7 1.4 2 23 35 56 34 56 0.98 2 65 0.0018 0.18 14.3 0.8 1 23 60 82 60 82 0.98 3 65 0.00018 0.018 17.5 3.3 1 23 88 110 88 110 0.97 4 65 0.0027 0.27 13.8 2.7 1 23 116 138 116 138 0.99 5 65 2.4e-05 0.0025 20.2 0.4 1 23 144 166 144 166 0.97 6 65 3.6e-06 0.00036 22.8 1.5 1 23 173 195 173 195 0.99 7 65 7.5e-05 0.0077 18.6 0.4 1 23 201 223 201 223 0.99 8 65 0.00075 0.076 15.5 0.5 2 23 232 253 231 253 0.96 9 65 0.0056 0.57 12.7 0.6 1 23 259 282 259 282 0.95 10 65 0.0056 0.57 12.8 1.4 1 23 287 309 287 309 0.97 11 65 0.00093 0.095 15.2 4.3 1 23 315 338 315 338 0.97 12 65 7.8e-06 0.00079 21.7 1.9 2 23 362 384 361 384 0.98 13 65 0.016 1.6 11.4 2.6 1 23 389 411 389 411 0.98 14 65 0.004 0.41 13.2 6.4 1 23 417 439 417 439 0.97 15 65 0.0031 0.32 13.6 1.6 1 23 445 467 445 467 0.96 16 65 0.00025 0.026 17.0 1.9 1 23 473 495 473 495 0.98 17 65 6.6e-05 0.0067 18.8 2.7 1 23 501 523 501 523 0.97 18 65 0.00016 0.016 17.6 2.8 1 23 529 551 529 551 0.94 19 65 3.6e-05 0.0037 19.6 1.0 1 23 557 579 557 579 0.98 20 65 0.00044 0.045 16.2 5.4 1 21 585 605 585 607 0.95 21 65 4.5e-07 4.6e-05 25.6 1.4 1 23 613 635 613 635 0.99 22 65 0.017 1.7 11.3 0.1 1 23 641 663 641 663 0.97 23 65 0.17 17 8.1 0.4 2 23 670 692 670 692 0.95 24 65 1.9e-05 0.002 20.5 2.1 1 23 697 719 697 719 0.98 25 65 0.0024 0.24 13.9 0.8 2 20 726 744 725 746 0.93 26 65 2.8e-07 2.9e-05 26.3 1.4 2 23 778 799 777 799 0.98 27 65 0.0049 0.5 12.9 0.7 1 23 805 827 805 827 0.98 28 65 1.6e-05 0.0017 20.7 2.7 1 23 833 855 833 855 0.97 29 65 0.00017 0.017 17.6 0.9 1 23 861 883 861 883 0.99 30 65 2.1e-05 0.0022 20.4 1.9 1 23 889 911 889 911 0.96 31 65 0.0016 0.16 14.5 0.9 1 23 917 939 917 939 0.96 32 65 0.017 1.8 11.2 1.1 1 23 945 967 945 967 0.94 33 65 0.0001 0.01 18.2 1.3 1 23 973 995 973 995 0.98 34 65 1.3e-05 0.0014 21.0 1.0 1 23 1001 1023 1001 1023 0.98 35 65 4.8e-06 0.00048 22.4 0.9 1 23 1029 1051 1029 1051 0.99 36 65 9.4e-05 0.0096 18.3 1.7 1 23 1059 1081 1059 1081 0.98 37 65 0.046 4.7 9.9 1.7 1 23 1087 1110 1087 1110 0.96 38 65 0.00012 0.012 18.0 1.1 1 23 1115 1137 1115 1137 0.98 39 65 0.0016 0.16 14.5 1.0 2 23 1144 1166 1143 1166 0.90 40 65 0.0005 0.051 16.0 0.0 2 23 1191 1212 1190 1212 0.98 41 65 4.1e-05 0.0041 19.5 0.2 2 23 1220 1241 1220 1241 0.97 42 65 2.1e-05 0.0022 20.4 1.7 1 23 1247 1269 1247 1269 0.97 43 65 0.07 7.1 9.3 0.3 1 21 1275 1295 1275 1296 0.95 44 65 9.3e-06 0.00094 21.5 1.7 1 23 1303 1325 1303 1325 0.98 45 65 0.005 0.51 12.9 0.9 1 23 1331 1353 1331 1353 0.97 46 65 0.00034 0.035 16.6 4.6 1 23 1359 1381 1359 1381 0.93 47 65 8.5e-06 0.00086 21.6 1.1 1 23 1387 1409 1387 1409 0.98 48 65 3.6e-05 0.0036 19.7 3.4 1 23 1415 1437 1415 1437 0.98 49 65 0.0069 0.71 12.5 0.0 1 23 1443 1465 1443 1465 0.99 50 65 0.009 0.92 12.1 0.3 1 23 1473 1495 1473 1495 0.96 51 65 8.8 9e+02 2.7 1.3 1 23 1501 1524 1501 1524 0.75 52 65 3.7e-06 0.00038 22.7 0.8 1 23 1529 1551 1529 1551 0.98 53 65 0.00022 0.022 17.2 1.9 2 23 1558 1580 1557 1580 0.95 54 65 5.6e-06 0.00057 22.2 2.9 1 23 1619 1641 1619 1641 0.99 55 65 5.4e-05 0.0054 19.1 1.3 1 23 1647 1669 1647 1669 0.99 56 65 0.00062 0.063 15.8 0.3 1 23 1675 1697 1675 1697 0.99 57 65 1.8e-06 0.00019 23.7 0.5 1 23 1703 1725 1703 1725 0.99 58 65 0.0018 0.18 14.3 3.4 1 23 1731 1753 1731 1753 0.93 59 65 0.097 9.8 8.9 2.3 1 21 1759 1779 1759 1781 0.94 60 65 2.2e-06 0.00023 23.5 1.0 1 23 1787 1809 1787 1809 0.98 61 65 7.5e-05 0.0076 18.7 0.1 1 23 1815 1837 1815 1837 0.98 62 65 1.3e-06 0.00013 24.2 1.4 1 23 1843 1865 1843 1865 0.99 63 65 8.8e-05 0.0089 18.4 1.4 1 23 1871 1893 1871 1893 0.93 64 65 0.00088 0.089 15.3 0.6 1 23 1899 1921 1899 1921 0.98 65 65 0.00028 0.028 16.9 2.8 1 23 1927 1949 1927 1949 0.99
Sequence Information
- Coding Sequence
- ATGACACGAGGACAAAAACCTTCAAAAATTACCGACACAATTTCCGCAAGCCATTCCGACGACGAGTCGAATCGTGTTGATACCGGCTTCGAAGAGTTCAACAAATGCGAGATTTGCGACCGTCAATTCGCTAAACTGCGCAACCTGAAACTGCACATGAGAACCCACACCGAAGCTCACCGGTGCGACATTTGCGGGAAAAGCATTGCGTGGCGGGAGTCACTCGAAAGGCATCGAAGAACTCATACCGGCGAACAGCCGTTTCGTTGCGAAATATGCGGCAAACGTTTCGTATGCAAACAAAATCTGAACAGGCATATGAGCTTGCATACTGGAAATCACAAGTACAAGTGCGATGTTTGTGGAAAAGGATACAGCTCAAGGTGTAGGTTTATGGACCACGTGAGATCCCATACAGGCGAGCGCTCGTTCAACTGTGATGTCTGTGGCAAAGGATTTACTACAAGAAGCACCCTTGTGATTCACGAGAAATCCCATAGCATCGACGCACCGCTGTATCGGTGCAATGTTTGCAATAAGAAGTTCAGAAGTGATTATCAACTCGTTGCACACATGCGCTCTCATACTGATGAACGGCCGTTCAAATGTGAAGTGTGCGGTTCGTCGTTCAAATTTTCTCCTAACCTGATAGAGCACAGGAGGATTCACTTAAACCTAGCATCAAACACGATCGAATGTGACATATGCGGAGCATCGTTCAACCACCGAAGAAGTCTCAGTGCTCACAGTAAGGTTCATCGGCAGACACCGTCCCACGAATGCTCTATTTGCGACAAACAATTTATCGCCTCTAGAGAGCTGGATAGGCATAGAGTCACCGTGCACAATGATCGACCGCACAAATGTACGATCTGTGACGAAGCCTTCTTCGCTCTAGCGCACTTGCGGGAACATCTGAATCTTCACAATAGCGCGGTTCGGCACAAATGTCACATTTGCGAGCGGAGCTTTGCAAAGAAATCTGGTCTTTATACACATGTGAAAGTAAAGCACGCTCAAGATAAACCCCTCCCTTCCGACGACGAGCCGGATGGTGATGAGTCTGGCCTCGATGAGTCCAACAAATGCAAGATATGCGATCGACAATTCGCGAAACTGCGCAACTTAGAACGGCACATGCGGACCCAACACGAAAAATCAGCCCACCGGTGTGACATTTGCGGGAAAACCATTACGCGCAGCGACGCCTGCGAAATACATAGACGAATGCATGCCGGCGAACGGCCGTTCTGTTGCGAAATATGCGGCAAAGGTTTTACATGCAAACACTATCTGAACAGGCATATGATAGTGCACACAGGAAATCACAACTACAAGTGCGATGTTTGTGGCAGAGGATACAGTTCAAAGTGTAGGCTTTTAAGACACGTGGGATCCCATACAGGCGAGTGCCGCTATGAATGTGACACATGTGGAAAAAGTTTTGTTAGTCAATGTCTCCTGGACAACCATACGCTAAGGCATACAGCAAATCGCACGTTCAACTGCGATATTTGTGGCAAAGGATTTACTACAAGAAACTCCCTCGTGCATCACAAGAAATCTCATAGCACCGAGCGCCCACACAAATGTGACATATGTGGTAAAACGTTCAAATATAAATATGCGCTACGTTATCACAGTGGAATGCATATGGCTGAACATCCTTTCACCTGTAACATTTGTGGCAAGGGATTCTCTTTCCGTGGACCGTTCAATAAACATCAGAAAGTTCATAATGACGCACGTCTGCATCGGTGTAATGTTTGCGATAAAGAGTTCATAAGTAATTCTCACCTCGTCACACACATGCGTTGTCACACAGATGAACGGCCATACAAATGTGAGGTTTGCGGTTTTTCGTTCAGGTTCTCTAGTAACCTTGCATCTCACATGAGACGTCACCTAACACCAAAACCATTCGAATGTGACGTATGCGGAGCAAAGTTTGGCAAGAAGACAGTCCTGAACGCTCACAGTAAGATTCATCGGCAGGACCTGTTCCGTGAATGTCCCAATTGTGACAAACAATTTCTCACCACGAGAGAACTGGATAGGCATAGAGTTACCATTCACCTTGATCGACCGTACAAATGTACGACCTGTGACAAAGGCTACACTACGATGTCGAACTTGCGGCAACATCTGAAACTTCACACTGACGGGAATCGGCTGAAATGTCACATTTGTGAGCAGAGCTTTGCTAAGAAAACATTGCTTAATAGACATAGCAACTGGTACTGTACATTTCTGTCCGAGCAGACTACTGATTTCCGCCACTCCGACGACAACGAACCGGATGGTGATGATACCGACCTCAAAGGGTCCAACAAATGCGATATTTGCGATCGACAATTCACTAAACCAGGCAACTTAAAACGGCACATGCGAACCCACACCGAAAAACCATCCCACCGGTGCGGCATTTGTGGGAAAATCTTTATGCTGAGGATCTTACTCGTAATACATAGAAAAACCCATAACGGCGAATGGCCGTTTCGTTGCGAAACGTGCGGCAAACGTTTTACGCACAAACCCTCTTTGTACAGGCATATGGCAGAGCATACCGAAGAGTACGATTACAAGTGCGATGTTTGTGGCAAAGGATACAGTTCAAACTATAGGTATATAACCCACGTGAGATCTCATACGGGTGAGCGCCCGCACGAATGTGACATATGTGGAAAACGTTTTGCAAGCCAGAGCTATCTGAAAAGTCACTCGCTAAGACATGCCGGAAATCAAAGATTCAACTGTGATGTTTGCGGCAAAGGATGTAGTACAAGAACCGCGCTTAGGATGCACGAAATTTCTCACAGCGCCGAGCGCCCACACAAATGTGACATATGTGGGAAAGGTTTCGCAAAGATTGATTCGCTGCGGTGTCACATTGGAGTGCATACGGCTGAACATCCTTTCCAATGTAACATTTGTGGCAAGGAATTCTCTTTCCTTGTGCGGTTGAAGCAGCATCGGAAAATTCACACAGACGAACGCCTACATCGATGTGATGTTTGCAACAAAAGGTTCAGGAGTAATGCTCTCCTCGGCGCACACATGCGGATTCATACGGGTGAACGACCATACAAATGTGAAGTTTGCGGTTCTTCGTTTAGGACTTCTAATACCCTGGATAGGCACATGAGAATACACTCAAACTTAGCATCAAACCCGTTCGAATGTGACATATGCGGAGCAAAGTTTCGCCAACGAAGTAGTCTTAGCCATCACAGTAAGGTTCATCGGCAGACACCGTTCCATGAATGTCCTAATTGTGACAAACAATTCCTCACCACTAGAGAGCTGGATAGGCATAGATCTACCATTCACAATGAGCGACCGTACAAATGTACCATCTGTGACAACGGCTACACTACCATGTCGAACTTGCGGCAACATCTGAAACTTCACACTGACGGGAATCGGCTAAAATGTCATATTTGTGAGCAGAGCTTTGCTAAGAAAACATTGCTTAATAAACATGCAATAGAAATGCACGCTCAAAGTGAACCCCTCCACTCCGACGACAACGATCCGGATGGTGATGATACCGACCTCAAAGGGTCCAACAAATGCGAGATTTGCGATCGACAAATCGCTGGACCTGGCAACTTAGAACGGCACATGAGAATCCACACCGAAAAAAAACCATCCCGCCGGTGCGACATTTGCGGGAAAGTCTTTGCGCGGAAGCTCTCACTCGAAGCACATAAAAGAGCGCATACCGGCGAACGGCCGTTTCATTGCGAAATATGCGGTAAAGATTTCGCACGCAAACCCTATCTGACCAAGCATATGACATTGCATACAGGAAATCACAAATATAAGTGCGATGCTTGTGACAAAGGATTCAGTGTGAGGAAAAGGTTTATGATTCACGTGAGATCCTATACAGGCGAGCGCCCGTACGAATGTCACATCTGTGGAAAAGGTTTCGTTAGGCAGTGCCTTCTCAACATTCATATGCTAGAGCATACAGAAAAACGCATGTTCAACTGCGATATTTGCGGCAAAGGATTTGTTACAAGAGGCATCCTTGTGCATCACAAGAAATCTCATAGCATCGAGCGCCCACACAAATGTGACATATGTGGTAAAACTTTCAAACATAACTATACGCTACGTTATCACAGTGGAGTGCATACGGCTGAACATCCTTTCCAATGTAACATTTGTGGCAAGGAATTCTCTTTCCTTGTGCAGTTGAAGCAGCATCGGAAAGTTCACACAGACGAACGTCTGCATCGATGTAATGTTTGCAACAAAGAGTTCAGAAGTGATTATCATCTCGTTGCACACATGCGGATTCATACTGGTGAACGGCCATACAAATGTGAAGTTTGCGGTTCTTCGTTTGCGATTGCTAATGTGCTGGCAACGCACATGAAAATTCACTCGAACCTAGCATCGGACCCGTTCGAATGTGACATATGCGGAGCAAAGTTCCGCCTACAAAGAACTCTTAGCATTCACAGTAAGGCTCATCGACAGACACCGTTCCATGAATGTTCTAATTGTGACAAACAATTTCTCACCATTAAAGAGCAGGATAGGCATAGAGTAACCATGCACAATGATCGACCGTACAAATGTACGATCTGTGACAAAGCCTTCAATACCATGGGGAACTTGCGGTCACATCTGAAACTTCACACTGACGGAAACAGGCTTAAATGTCATATTTGTGAGCAGAGCTTTGCTAAGAAAACATTGCTTAATAGACATGTAAGAGAAATGCACGCTCAAAGTAAACCCTGTGTAATCCACTGCGACAACCAGCCGCTGGATGGTGATGGCACCAACGGTGAGAGCTCGAACAAATGTGACCTACCCAGCAGTTCACAACAGCCATCCTTCAAATGTGACACCTGTGGCAAAGAATTCCGACTCAAGCGACAGTTGCAGCGACATTTGAAAACTCACAGTCAAGAACGTCACTATCGGTGCAGTGCCTGTGGCCGAACATTCAGGATCAAGTACCAACTCGTCCTGCACATGCGGCGTCATGCTGTCGATCGGCCGTTCCGCTGTGAAACCTGCGGGGAAAGTTTTGTACTGGAAATTTTGCTGGAAAATCACCGGAGAAAACATACTGGAACCCAGAGGTTTAAGTGTGACGTTTGTGGCAAAGGATACGATCAAAACAGTCAGATGAAGGCTCACATGAGATCCCATTCGAGCGAACGTCCCCACAAATGTGACCAGTGCGATAAAAGTTTCAAAAATAAGAAATCGCTACGGTTTCACATTGCCGGCCATACGAACGATTTTCCCTTCAAATGTGAGGCATGTGGTAAAGCATTTTCCATCGATAAAAAGTTACAACGGCATGTGAAATGTCATAATAAGGAACTGCTCTATCGGTGTAGCATTTGTGACAAAGGATTCCGATTGAATTCCCAGCTTAACAAACACCTGCGTTGGCATACTGGGGAATGTCGGTATCAGTGCGGAATTTGCGGAAAAAGTTTTGCCGATGAACGCTATCTGGGAATTCATACGAAGATACACACGGGTGATCAGCAGTACAAGTGTGAAGTATGCGGAAAAGGATGCAGTTCCAACAGTAACTTTGTAGCTCACATGAGATCCCACACGGGCGAACGCCCGTACAAGTGTGACACATGCGATAAAACATTCGCCACCAAATATTCCCTGCAGTATCACAGTGGGATACACGCGACGGAACATCCCTTCAAGTGTGACACTTGTGGGAAGGGATATTCCATCGAGGTACAGTTCAAGCGACATGTGAAAATTCATAGCGACGAACGGCTGTATCAGTGTAGTATTTGCAGCAAAGCATTCAGGATTCATTCGCGCCTTGTCACACACATGCGGCGGCACACTGGTGAACGGTCGTGA
- Protein Sequence
- MTRGQKPSKITDTISASHSDDESNRVDTGFEEFNKCEICDRQFAKLRNLKLHMRTHTEAHRCDICGKSIAWRESLERHRRTHTGEQPFRCEICGKRFVCKQNLNRHMSLHTGNHKYKCDVCGKGYSSRCRFMDHVRSHTGERSFNCDVCGKGFTTRSTLVIHEKSHSIDAPLYRCNVCNKKFRSDYQLVAHMRSHTDERPFKCEVCGSSFKFSPNLIEHRRIHLNLASNTIECDICGASFNHRRSLSAHSKVHRQTPSHECSICDKQFIASRELDRHRVTVHNDRPHKCTICDEAFFALAHLREHLNLHNSAVRHKCHICERSFAKKSGLYTHVKVKHAQDKPLPSDDEPDGDESGLDESNKCKICDRQFAKLRNLERHMRTQHEKSAHRCDICGKTITRSDACEIHRRMHAGERPFCCEICGKGFTCKHYLNRHMIVHTGNHNYKCDVCGRGYSSKCRLLRHVGSHTGECRYECDTCGKSFVSQCLLDNHTLRHTANRTFNCDICGKGFTTRNSLVHHKKSHSTERPHKCDICGKTFKYKYALRYHSGMHMAEHPFTCNICGKGFSFRGPFNKHQKVHNDARLHRCNVCDKEFISNSHLVTHMRCHTDERPYKCEVCGFSFRFSSNLASHMRRHLTPKPFECDVCGAKFGKKTVLNAHSKIHRQDLFRECPNCDKQFLTTRELDRHRVTIHLDRPYKCTTCDKGYTTMSNLRQHLKLHTDGNRLKCHICEQSFAKKTLLNRHSNWYCTFLSEQTTDFRHSDDNEPDGDDTDLKGSNKCDICDRQFTKPGNLKRHMRTHTEKPSHRCGICGKIFMLRILLVIHRKTHNGEWPFRCETCGKRFTHKPSLYRHMAEHTEEYDYKCDVCGKGYSSNYRYITHVRSHTGERPHECDICGKRFASQSYLKSHSLRHAGNQRFNCDVCGKGCSTRTALRMHEISHSAERPHKCDICGKGFAKIDSLRCHIGVHTAEHPFQCNICGKEFSFLVRLKQHRKIHTDERLHRCDVCNKRFRSNALLGAHMRIHTGERPYKCEVCGSSFRTSNTLDRHMRIHSNLASNPFECDICGAKFRQRSSLSHHSKVHRQTPFHECPNCDKQFLTTRELDRHRSTIHNERPYKCTICDNGYTTMSNLRQHLKLHTDGNRLKCHICEQSFAKKTLLNKHAIEMHAQSEPLHSDDNDPDGDDTDLKGSNKCEICDRQIAGPGNLERHMRIHTEKKPSRRCDICGKVFARKLSLEAHKRAHTGERPFHCEICGKDFARKPYLTKHMTLHTGNHKYKCDACDKGFSVRKRFMIHVRSYTGERPYECHICGKGFVRQCLLNIHMLEHTEKRMFNCDICGKGFVTRGILVHHKKSHSIERPHKCDICGKTFKHNYTLRYHSGVHTAEHPFQCNICGKEFSFLVQLKQHRKVHTDERLHRCNVCNKEFRSDYHLVAHMRIHTGERPYKCEVCGSSFAIANVLATHMKIHSNLASDPFECDICGAKFRLQRTLSIHSKAHRQTPFHECSNCDKQFLTIKEQDRHRVTMHNDRPYKCTICDKAFNTMGNLRSHLKLHTDGNRLKCHICEQSFAKKTLLNRHVREMHAQSKPCVIHCDNQPLDGDGTNGESSNKCDLPSSSQQPSFKCDTCGKEFRLKRQLQRHLKTHSQERHYRCSACGRTFRIKYQLVLHMRRHAVDRPFRCETCGESFVLEILLENHRRKHTGTQRFKCDVCGKGYDQNSQMKAHMRSHSSERPHKCDQCDKSFKNKKSLRFHIAGHTNDFPFKCEACGKAFSIDKKLQRHVKCHNKELLYRCSICDKGFRLNSQLNKHLRWHTGECRYQCGICGKSFADERYLGIHTKIHTGDQQYKCEVCGKGCSSNSNFVAHMRSHTGERPYKCDTCDKTFATKYSLQYHSGIHATEHPFKCDTCGKGYSIEVQFKRHVKIHSDERLYQCSICSKAFRIHSRLVTHMRRHTGERS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -