Mtil018423.1
Basic Information
- Insect
- Mimas tiliae
- Gene Symbol
- ZFY
- Assembly
- GCA_905332985.1
- Location
- HG995238.1:439377-447996[+]
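The location string above packs the contig accession, coordinates, and strand into one field. A minimal sketch of splitting it apart follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Layout assumed from the record: <contig>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into its four parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("HG995238.1:439377-447996[+]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

Under that assumption the span is 8620 bases.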
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger follows the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (a range where shown in brackets). The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  36   0.00082     0.057   14.3   3.8         2      21       161     180       160     181  0.94
 2  36      0.35        24    6.0   2.0         1      23       214     237       214     237  0.96
 3  36    0.0029       0.2   12.6   2.9         2      23       243     264       242     264  0.95
 4  36       2.3   1.6e+02    3.5   3.5         2      21       268     287       267     288  0.92
 5  36    0.0032      0.22   12.4   0.9         2      23       330     352       329     352  0.95
 6  36    0.0031      0.22   12.5   1.7         1      23       385     407       385     407  0.98
 7  36    0.0035      0.24   12.3   2.0         1      23       413     436       413     436  0.97
 8  36    0.0084      0.58   11.1   0.7         1      23       445     468       445     468  0.95
 9  36     0.036       2.5    9.1   0.3         1      21       476     496       476     497  0.94
10  36      0.48        33    5.6   0.1         2      21       615     634       614     635  0.94
11  36     0.048       3.3    8.7   0.3         1      23       665     687       665     687  0.97
12  36   0.00011    0.0077   17.0   0.3         1      23       691     713       691     713  0.96
13  36     0.065       4.5    8.3   2.2         3      20       718     735       717     737  0.92
14  36    0.0014     0.099   13.5   2.4         1      23       747     770       747     770  0.95
15  36   0.00056     0.038   14.8   1.1         1      23       776     798       776     798  0.98
16  36     0.034       2.4    9.2   3.4         1      23       805     827       805     827  0.98
17  36   0.00038     0.026   15.3   0.7         1      23       842     865       842     865  0.97
18  36     0.029         2    9.4   2.9         2      20       979     997       978     999  0.85
19  36       1.8   1.3e+02    3.7   0.7         3      23      1005    1025      1004    1025  0.92
20  36      0.01      0.69   10.9   0.5         2      23      1031    1053      1030    1053  0.94
21  36      0.28        19    6.3   5.1         1      23      1058    1080      1058    1080  0.97
22  36       2.4   1.6e+02    3.4   7.2         1      21      1083    1103      1083    1104  0.95
23  36    0.0009     0.062   14.2   0.3         1      23      1116    1139      1116    1139  0.95
24  36   2.3e-05    0.0016   19.2   1.5         1      23      1145    1167      1145    1167  0.98
25  36       7.2     5e+02    1.9   0.5         1      23      1201    1224      1201    1224  0.87
26  36   0.00079     0.055   14.3   0.8         1      23      1232    1255      1232    1255  0.95
27  36   0.00017     0.011   16.5   0.7         2      21      1366    1385      1365    1386  0.93
28  36       5.4   3.7e+02    2.3   0.1         3      23      1393    1414      1392    1414  0.95
29  36     0.012      0.86   10.6   0.8         3      23      1421    1442      1419    1442  0.94
30  36    0.0036      0.25   12.3   0.7         2      19      1448    1465      1447    1469  0.88
31  36       2.1   1.4e+02    3.6   2.8         2      21      1473    1492      1473    1493  0.93
32  36   1.5e-06   0.00011   22.9   0.4         1      23      1519    1542      1519    1542  0.95
33  36   0.00069     0.047   14.5   0.5         1      23      1548    1570      1548    1570  0.95
34  36      0.17        12    7.0   0.1         1      23      1576    1598      1576    1598  0.97
35  36    0.0058       0.4   11.6   0.4         1      23      1604    1627      1604    1627  0.94
36  36   0.00077     0.053   14.4   0.3         2      23      1634    1656      1633    1656  0.94
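The consensus pattern quoted in the description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be written as a regular expression for a quick visual check of a protein sequence. The sketch below is only a rough illustration and is not how the hmmscan hits were produced (hmmscan scores a profile HMM). The function name `find_c2h2_candidates` is hypothetical, and because the description does not say which residues are allowed at the # positions, they are treated here as "any residue", which is an assumption.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# '#' and 'X' are both written as '.', so X3-#-X5-#-X2 collapses to 12 residues.
C2H2_RE = re.compile(r"..C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2_candidates(protein: str):
    """Return (start, end, text) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive. Matches are candidates only:
    a regex cannot replace profile-HMM scoring.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_RE.finditer(protein.upper())
    ]


# Fragment copied from the protein sequence of this record.
fragment = "GEKPYVCQHCGETFTASSRKSEHIRRTHKNT"
for start, end, text in find_c2h2_candidates(fragment):
    print(start, end, text)
```

Because `finditer` returns non-overlapping matches and the quantifiers are greedy, closely packed fingers may be reported with slightly different boundaries than the HMM alignment coordinates.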
Sequence Information
- Coding Sequence
- ATGACCCGACAAGTGGATGTAAAGGCATTAGTTTCCCATATAGTAAGGGGTGACGGCACGGACAAATGCCGAATTTGTATGGGTGAGACAACCGAGGGTCAAGTTTTCCTCGGCGACACAGTAATGATGGACGGAGACAAGGCGGTAACCCTTGCTGAGCTTCTAGAAATTATCACTGGAGTCGAGGTGGACGTGGATGACGACCTGCCCGACGGGCTGTGCAGAACATGCTCAGCATCTGCATTTAAAGCTGCAGAGTTCCGAACATTGTGTCATCAGTCTAGCGGACACTGGACAACTGTACTAGACATGCTCAACATGATACCAAAAAACAAACTTGTTGCTTATTCAAAAAATATCTATGCATTGATATCGGAAAACAACATTTCCATACATAACAATAGACTTTTAAATGATGTCAAAGCAAAACCACTAAAAGTAAAAGCCATGAAACGTAATAATGTTAAAAGTTTGCAATGTCAATGTTCAAATTGTGGTAAAAAGTTTACATATGCATACGACCTTCATACTCATTTAAAACAGTCCATGGATTTGATCCAAGCTTGCTATGTTTGTGCAAAAACTATGAACAGAGATGAGTTGATCCAACATTTGATCCAAAAGCACAACAAGAAGCCACATCCTTGCAAAAAGTGTCCTACGTTATTGAATACGCTTGATCGATATAAACAGCACATGATGAAGGAGCATGGTCCTGGAGTGTGTACATGTGGGGAATGTGGGAGAACATTCCAATCCAAACATGCATACTACGCCCATCTATCTATCCACACACCTAAAAGCTGTCCCCGATGCTCCAAACTGTTCCGTAACAAAAGATGTTACGTGTATCATGTGAAGAAATGTTGTGACTTGCACAAGAACAGGAAGGATGTGCACCAAACGAAGCACAAGTTGTCCATACTGGTGAAAAATGAGGACAAACAGATCAAAGTAGGCCTCAGAGGCAGCGCAAATAATGAATGCATTTGTGACTACTGCAACAAACGCTTCTCTGGAAAGAAATTCGTGGCGGCACACATACAAATAGTACATCTAAAGAACACACATAGGCCGTGTGCTCACTGTGGGAAGATGCTTGCAGCAGCTCGTATGTCGTCCCATCTGAAGAAGCACCAACCGTCACAGAATTACAACTGTGAGCACTGTGGCATGGTGCTAAAGACTAAATTGGGATATAGGCAGCATTTGCGCCTCCACACTGGTGAAAAACCCTATGTGTGCCAGCATTGCGGAGAGACATTCACCGCATCTTCTAGAAAATCTGAACATATCCGTAGAACTCATAAGAACACAGAGGAAGTGTTACGTCATGCGTGTGAACAATGTCCGGCGAAGTTTCGCCTCCCATACAGGTTGAAGAAGCATGTATTAACTGTCCATAGTAAGGCGAAGACACGTCTGTACGTGTGCAACAACTGCGAGGTGAAATTCGCAAGCTGTAGGGCGCTGTTAGCACACAGTAGAATGTGTAACACAATGGATATAGTGCAGACCAGCTCAGTGCTGCCAAATAAATTGTGTAAACCCTGCATGGCACACATAACAACTTTCGTACCATTCCTGCTCCTATGCAAGGAATCCTGCCAGCGGTGGGACCAAATCACAGGAATGTTTGACGACATCCAAGTACCAAAGAAAGCAAAAAGCTATTACTTCTTAGTCTTAAAAGATATACGGTCTTACTCCAGTACAAAAACAGCTACAAATGAAAAGGAGGTGTTGCGGAATGTGAAAACTAAAATATCTAACCGATATTGTTACAAAAAGCGATTGGCGAAAGAGAACCAGATGTGCCGAAGGTGGGATACACCAAAGATGACTTGCCCAGAATGCGAAGGTACATATACGAATATTGTAAAATACAATACTCATATTAAAAACGTCAAGAAGAGAATGTGCGTACACTGCACCACTCTGATAGAACTAGAAAAATACAAGGTACATTTGAAAGATCATTCAGATACAGTATTTGCT
TGTAACTTATGCCCTGAAGCATTTCTAAGAGAAAAGTCGTTGAATCTTCATAGATCGAAACACAGAGGAAACCATGTCTGCGTGGATTGTAAGCAAAGTTTCGTTACGTCAGCGAATCTCGCCGCACATTTGGTGAAACACCAGTCTGAAATATGCGCTGGATGCAATAAGAAATTTAGCAACAGAATATGCTACACAAAACACAAGGAAAAATGTCTAATTGTAGAGAATCATAAGTCCTACATCTGTGACTACTGCTCAAAAGAATACAATAAAAGGAGTACTTTAAAACTGCACATTATTCACAATCATTTGACGGGGAAGCAGTTCCAATGTGACATATGTGGTAGGACTTTCCTCAGCCGCGTGCATTTGGCCGAGCATAGCAATACCCACAACAAGATCCTTGATCGATACGTATGCTCAACATGTGGAAGCAAATTCAGTACCAGGCGAGGTCACGAGAGACATATAAAGAAGCATAATACGGGCACTTACGAAGTTAAGTCTAAAAAAAAGATTATATATGTCTGTCCGATCTGTAAGAAGGTTTACTATCAAAAAAAGAGCATTGATAGCCATTTGAGGGTGAAACATGATGTGGCTAAGGTGGGAGAGGAGTCTGCGCTGCCGCAGGGGCTCTGCAGGACCTGCTCCGAGCAAGCCATGGCGGCATTCCGCTACAGACAGCTCTGTCTGAACTCCGACCAGCACTGGAGAGACGCAACCGTCTGTCTCGCGCAGATTCACGCACCCGCTGACGACGACAAAACATTCTACGTTCTGTATAACGGGCAAATTATAGTTATAAAAGATCAGATCGAACGAGTGCCGAACGAATCGGAGGCTCTGAAAAAACTTAACCACCGGCTTAAAGATCCGGAACCCGATAAGCCGCCGAGGAGGGCGAAGCGATCATACGCGCCGATCAGTTGCTACTGCACGGACTGCGGGAAAACCTTCTCGAATACTGAATTCCTGAACTTCCACCTGAAGAACTCGTTGAAACGTGCTTGTCATATGTGCGGCGTTGTTATACCTAAGAAGAACCTATCGAAACACTTGCAAGATAAACACGAAGTGGCGGTGGTAGAGTGTAGTACATGTCACAGCGTCTTCGACAACCAGTGGCAGTTAGAACAGCACTGGGTGACATATCATGGCGTTGATACGTACTCTTGTCACTGCTGCGGTAGCGGCTTCATTAACCACAGAGCTTTGCGTGCCCACATGTACTCCCATACGCTGTTCCATTGCAAGGCTTGTGGAAATAGTTTCGACAATCGCAAGTGCTACAAGCATCACCAAAAACAGTGCAAGAATACAGATGAGCCGCACCATGAAGTGTACATTTGTGATTACTGCAACATAGAATATAATAGGAAGCCGTCGCTTCGGATCCATATAATCCAAAAACATCTGAACGTTAACCCTTTCGTGTGCCAAGTGTGTGGTAAAAGAGCTTCGACGATGGCGCATTTAAAATCTCATATGAAAACGCATAACGCGGCTCGTAGCGTTTTCCAATGTCATTGCGGTGCTAAAATGCGTACTGAGGTCGGATACCAGCTTCATCAACGGATCCACTCGGGGGAGAGGCCTTACGAGTGTGAGACTTGTGGTGATCGGTTTCTGTCGGCGTCGCGACGCCTCGATCACATTAAACGAAGGCATCGCACCGCTGGACACATGCCTCACGCCTGCTACAAGTGTTCTGCGTCCTTCATCAGACCATGGGAATTACGGAAACATTATCTTACTGTGCATAATGATATAAAGGAAGAGTCCCTGTGCCCGAACAGCGCATGCGTGGCGTGTGCGAGCACAGCGCTGGCCGCGCAGGAATTCCGCTACTTCGTACAGTCCTCCCAAAAGATGTGGGGCAAAGCCATCACCAATCTCTCAGCCCTGCCTACCACCATGTCTTCCCCGCTGAAGTCGCTATGCGTGCTAGTAACCCCAAATGATATGCGCATACAGGTGGTCAAGAACTACTTCGG
TGGTGACACTAAAACCGTCTTAAATCGGCTGAAAATAGGGATAGGTAAAAAGCCTTTTGGTCGGAAACCTCGCACCCACCGTACAGGTCCGTCATGCGTCTGCCCAGACTGTGGGAAGAGCTTTGTAAGCCCACATTTCCTGAGCCTCCACCTCATGAACAGCGGACAGAAGGAGGCCTGCGTACTGTGTGGTGATGTGGTGTCTCGCGGCAAGAAAATGAAAGAGCACATGAGTACAGTACATCGGACTGACCTACATCTCTGTACCGACTGCCCATTACTGTTTTCCACCGAAGAGGAGATGAAGAACCACATTAAACTTAATCACAAGCCAGGCTCTTTAACGTGTAATGAATGCGGCAGGAGTTTCCCTAGAAAAGTGGCGTTCGACCACCACCCTCAGATGCACATGGTGCGCACGTGTCGTAGTTGTGGCGCGCAGTTCACCAATCGGGGGTGTTACAGAGAACACAGGATGCATTGCGAGCCTGACGCCAAGCCTAGTGCGGACAATGTACCGCGAGCGCGGCGGTCGAATATCCGCGACCCAGCTACATACACCTGCGACTACTGCGGCAAAACGTACAGCTCGCGGCCTCAGCTTAAAAACCACATCTTATGGATCCACATGGATGTCCGTCCCCACCAGTGCCAGTGGTGTGGGAAACGGTTTTACACCCCGGCGCGTCTCGCCGAGCATACTGTCGTGCACACTCGTGTACGGAACTTCGAATGTGACATATGTGGTGCTAAGTTAGTGAGTAAAATGGCCGCCATTTATCACAAAAGAAGGCACACGGGTGAGCGGCCTTATAAATGTGAGGACTGCGGTGAAGGGTTCATATCATCGTCGCGGAGATTGGAGCACGCCAAGCGGAAGCACAATAAAGGCTCGAAACTACAGTGTCAGTACTGTCCAGCCAGCTTTGTGCGGAAACAAGAGCTGGCGAACCACAAAGACAAGCTGCACATGCCTGAAATTATGTTTGATAATATCGAACATATTGAAATAAAGAAGAACAGCACGCCACGGACGTATTAG
- Protein Sequence
- MTRQVDVKALVSHIVRGDGTDKCRICMGETTEGQVFLGDTVMMDGDKAVTLAELLEIITGVEVDVDDDLPDGLCRTCSASAFKAAEFRTLCHQSSGHWTTVLDMLNMIPKNKLVAYSKNIYALISENNISIHNNRLLNDVKAKPLKVKAMKRNNVKSLQCQCSNCGKKFTYAYDLHTHLKQSMDLIQACYVCAKTMNRDELIQHLIQKHNKKPHPCKKCPTLLNTLDRYKQHMMKEHGPGVCTCGECGRTFQSKHAYYAHLSIHTPKSCPRCSKLFRNKRCYVYHVKKCCDLHKNRKDVHQTKHKLSILVKNEDKQIKVGLRGSANNECICDYCNKRFSGKKFVAAHIQIVHLKNTHRPCAHCGKMLAAARMSSHLKKHQPSQNYNCEHCGMVLKTKLGYRQHLRLHTGEKPYVCQHCGETFTASSRKSEHIRRTHKNTEEVLRHACEQCPAKFRLPYRLKKHVLTVHSKAKTRLYVCNNCEVKFASCRALLAHSRMCNTMDIVQTSSVLPNKLCKPCMAHITTFVPFLLLCKESCQRWDQITGMFDDIQVPKKAKSYYFLVLKDIRSYSSTKTATNEKEVLRNVKTKISNRYCYKKRLAKENQMCRRWDTPKMTCPECEGTYTNIVKYNTHIKNVKKRMCVHCTTLIELEKYKVHLKDHSDTVFACNLCPEAFLREKSLNLHRSKHRGNHVCVDCKQSFVTSANLAAHLVKHQSEICAGCNKKFSNRICYTKHKEKCLIVENHKSYICDYCSKEYNKRSTLKLHIIHNHLTGKQFQCDICGRTFLSRVHLAEHSNTHNKILDRYVCSTCGSKFSTRRGHERHIKKHNTGTYEVKSKKKIIYVCPICKKVYYQKKSIDSHLRVKHDVAKVGEESALPQGLCRTCSEQAMAAFRYRQLCLNSDQHWRDATVCLAQIHAPADDDKTFYVLYNGQIIVIKDQIERVPNESEALKKLNHRLKDPEPDKPPRRAKRSYAPISCYCTDCGKTFSNTEFLNFHLKNSLKRACHMCGVVIPKKNLSKHLQDKHEVAVVECSTCHSVFDNQWQLEQHWVTYHGVDTYSCHCCGSGFINHRALRAHMYSHTLFHCKACGNSFDNRKCYKHHQKQCKNTDEPHHEVYICDYCNIEYNRKPSLRIHIIQKHLNVNPFVCQVCGKRASTMAHLKSHMKTHNAARSVFQCHCGAKMRTEVGYQLHQRIHSGERPYECETCGDRFLSASRRLDHIKRRHRTAGHMPHACYKCSASFIRPWELRKHYLTVHNDIKEESLCPNSACVACASTALAAQEFRYFVQSSQKMWGKAITNLSALPTTMSSPLKSLCVLVTPNDMRIQVVKNYFGGDTKTVLNRLKIGIGKKPFGRKPRTHRTGPSCVCPDCGKSFVSPHFLSLHLMNSGQKEACVLCGDVVSRGKKMKEHMSTVHRTDLHLCTDCPLLFSTEEEMKNHIKLNHKPGSLTCNECGRSFPRKVAFDHHPQMHMVRTCRSCGAQFTNRGCYREHRMHCEPDAKPSADNVPRARRSNIRDPATYTCDYCGKTYSSRPQLKNHILWIHMDVRPHQCQWCGKRFYTPARLAEHTVVHTRVRNFECDICGAKLVSKMAAIYHKRRHTGERPYKCEDCGEGFISSSRRLEHAKRKHNKGSKLQCQYCPASFVRKQELANHKDKLHMPEIMFDNIEHIEIKKNSTPRTY
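One way to sanity-check a record like this is to translate the coding sequence and compare the result with the listed protein. The sketch below assumes the standard genetic code (the record does not state which table was used) and uses a hypothetical helper named `translate`. If the coding sequence is wrapped over several lines, join the lines before translating.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first; codons containing characters other than
    A, C, G, T are translated as 'X'. A trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons copied from the coding sequence of this record.
print(translate("ATGACCCGACAAGTGGATGTAAAG"))
print(translate("CCACGGACGTATTAG"))
```

The two fragments translate to the first eight and last four residues of the listed protein, so at least the ends of the two sequences agree under the standard code.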
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -