Amar014268.1
Basic Information
- Insect
- Agriopis marginaria
- Gene Symbol
- ZFY
- Assembly
- GCA_932301435.1
- Location
- CAKOAR010000134.1:417143-430449[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 36 0.058 3.4 8.6 4.5 1 23 166 188 166 188 0.94 2 36 0.066 3.9 8.5 0.3 1 23 193 216 193 216 0.93 3 36 0.45 27 5.8 3.1 1 23 249 272 249 272 0.93 4 36 4.4e-05 0.0026 18.5 3.5 2 23 279 300 279 300 0.95 5 36 0.00013 0.008 16.9 1.5 1 23 307 330 307 330 0.97 6 36 5.2 3.1e+02 2.5 0.4 1 9 459 467 459 481 0.65 7 36 0.032 1.9 9.5 3.8 1 23 511 533 511 533 0.98 8 36 0.026 1.5 9.8 0.5 1 23 538 561 538 561 0.93 9 36 0.0067 0.4 11.6 3.0 1 23 594 617 594 617 0.94 10 36 0.00011 0.0068 17.2 4.8 1 23 624 646 624 646 0.95 11 36 0.00054 0.032 15.0 1.3 1 23 653 675 653 675 0.98 12 36 0.00057 0.034 15.0 0.1 1 23 688 711 688 711 0.96 13 36 5.5 3.2e+02 2.4 0.2 1 8 826 833 826 848 0.72 14 36 0.059 3.5 8.6 3.4 1 23 878 900 878 900 0.96 15 36 0.0023 0.14 13.0 0.4 1 23 905 928 905 928 0.94 16 36 3.2 1.9e+02 3.1 0.3 6 21 935 950 934 951 0.95 17 36 0.2 12 6.9 3.3 1 23 960 983 960 983 0.89 18 36 8.6e-05 0.0051 17.5 2.7 2 23 991 1012 991 1012 0.95 19 36 5.2e-05 0.0031 18.2 1.9 1 23 1019 1042 1019 1042 0.97 20 36 0.15 9.1 7.3 4.1 2 21 1161 1180 1160 1181 0.93 21 36 0.86 51 4.9 0.7 3 23 1187 1207 1186 1207 0.92 22 36 0.022 1.3 10.0 0.5 1 23 1240 1262 1240 1262 0.92 23 36 0.0021 0.13 13.1 0.4 1 23 1297 1320 1297 1320 0.93 24 36 0.00029 0.017 15.9 3.1 1 23 1326 1348 1326 1348 0.98 25 36 1.9 1.1e+02 3.9 1.8 5 23 1358 1376 1356 1376 0.96 26 36 3.1 1.9e+02 3.2 1.4 1 23 1382 1405 1382 1405 0.87 27 36 0.015 0.91 10.5 1.0 2 21 1529 1548 1528 1549 0.94 28 36 2.1 1.3e+02 3.7 0.0 3 23 1556 1577 1554 1577 0.94 29 36 0.45 26 5.8 1.4 2 23 1583 1605 1582 1605 0.93 30 36 1e-05 0.00061 20.4 2.8 1 23 1610 1632 1610 1632 0.98 31 36 0.52 31 5.6 3.6 2 20 1636 1654 1636 1656 0.93 32 36 3e-05 0.0018 19.0 0.6 1 23 1682 1705 1682 1705 0.95 33 36 0.094 5.6 8.0 0.7 1 23 1711 1733 1711 1733 0.84 34 36 0.19 11 7.0 0.1 1 23 1739 1761 1739 1761 0.97 35 36 0.013 0.8 10.6 0.7 1 23 1767 1790 1767 1790 0.94 36 36 1.9e-05 0.0012 19.6 0.2 2 23 1797 1819 1796 1819 0.96
Sequence Information
- Coding Sequence
- ATGATTGAATGGACGAACTGCCTGCCGCCACGCCTCTGCCGAGCCTGCTCCTCTTCCATCACCGACTTTATACCCTTCATCGTACTGTGCCGAGAATCAACTCAGAAATGGAAACAAACCGCCAACTATTTAAACACGGTGTCAGTAAACAAGCAGGCCACCACGGCGTACGTGTTATTGGGCGACGAGCCACAGACTATAGTGGACAGAAGCACCACCAACAACAAAACACCCAAAGAAGTCATGAGAAATATAAGACTGCGTCTCAATTATATTCGTTCCGCGCGGAAAAAGGCAAAAGCAAATATTAGTAATAGCAACCAAAGAGGGTCTCATAAATTTTCGTGTCCGGATTGCAAAAACGAAGGATTTGAAGCGTTATACAAGTTCAACTTCCATTTAAACCTCAACAAGAAACATTGCTGTCCACATTGCTTCGCAATAATCACCATGAAGGAGTTGGCGGAGCATCTCATATCACACAACATGCCCAGTTTTGAATGTGACATCTGCCATATGCATTTTGAAACAGAATCACGTTTGCTGAGGCACAAAACCTTTCACAAGAGAGGGAAATTCATGTGTCCAGAATGCAAATACACTTTTCTGACAATGGATCGGCTGACAGCCCACACGAACGGCGTCCACGCCACAGTGCGCTGCGCATGCGACAAGATCTTCCACGGGAAAGGCTGCTACATCTTTCACAAAAAAAGAATGTGCCCGAAAAAGCGTATACAAACGTACATTTGTGATTATTGCTCGAAAGAATTTAATAAGAAATGTATTTTAGGTAATCATATCAAAAACTTTCACGTGGTAAAATATGGAGAACAGTGTGATAAATGTGGGAAAACGTTTAAGAACAAAGCTCATCTGAAAGAGCACAGCTATATGCATGACAAAGTGCTGGACCGGTTCTCGTGCGAGTTCTGCGATAAAAACTACAGCACCCCAAGAGGATACAGGAGGCATATGACGAGAGTACATTACCAAGTGAACCCGGACTTATACAAGAAGAAGACATATCCTTGTAAAATTGAATGGACGCCCTGTCTGCCGCCACGCCTCTGCCGAGCCTGCTCCTCGTCCATCATCGACTTTTTACCCTTCTTCGTATTGTGCCGAGAATCAGTTCAGAAATGGAATCAAACCGCTGACTGTATAAACACGATTTCGGTCAGCAAACAGGCCACCACAGCGTACGTGTTACTGGGCGACGAGCCACAAATTTTGGTAGACAGAAGCACGACCAAAAACAAAACGCCGAAAAATATCATTGAGAACATAACACTGCGTCTCAAGTATATGCGTGCTGTGAAGGAAAAGGCGAAAGCAAAAATGACAATATGCAAGCGGAGAGGTTCTCACAAATTTTCATGCCCGGATTGCAAAAAGGGAGGCTTCGAAAAGTTTTACAAGTTTAACGCCCACTTGAACATCAACAAGAAGCATTGCTGTCCACATTGCTTAGCAATAATCGACATGAAGGAGTTTGGGGACCATCTCACATCACACAACATGCCCAGTTTCGAGTGTGACATCTGTCATATACATTTTGAAAAAGAAATAAGCTTGCTGAGACACAAAAAATATCATAAGAGAGGGAAATTCACGTGTCCAGAATGCAAATACACGTTTCTGACAATGGACGGGCTGACCGTCCACACGAACTTCGTCCACGCCACAGTGCGCTGCGCTTGCGACAAGGTCTTCCACGGGAAAGGCTGCTATTTCGGTCACAAGAAGAGAGTATGCGCCAAAAAACGTACACAGAACTACATTTGTGATTACTGCTCAAAGGAATACAAGCAGAAATCTATTTTGAGGAAACACATAGAACACTTTCACTTGGTGTTACATTCTAAGCATCAGTGCGATCATTGTGGTAAAATGTTTAAGAATAATACGGAATTGAAAGAGCACAGCTATATGCATGACAAAGTGCTGGACCGATTCTCCTGCGAGTACTGCGGTAACAACTACAGCACGGCCAGAGGTTACAGTAAACATATGAATAAACATTATCAAGTTAATCCTGAACTGTACAAGAAGAAGACGTACCCTTGTGAAATTTGCGGCAAGGTCTTGATCAGTGAACTGAACCTGAAGAATCATACTTTGAGACAACATAAGAAAATTGAATGGACGAACTGCCTGCCGCCACGCCTCTGCCGAGCCTGCTCCTCTTCCATCACCGACTTTATACCCTTCATCGTACTGTGCCGAGAATCAGCTCGGAAATGGAAACAAACCGCCAACTATTTAAACACGGTGTCAGTAAACAAGCAGGCCACCACAGCGTACGTGTTATTGGGCGACGAGCCACAGACTATAGTGGCCAGAAGCACCACCAACAACAAAACACCCAAAGAAGTCATGAGAAATATAAAACTGCGTCTCAGTTATATTCGTTCAGCGCGGAAAAAGGCAAAAGCAAAAATTGCAATAAGCAAGCAAAGAGGGTCTCATAAATTTTCGTGTCCGGATTGCAAAAAAGAAGGATTTGAAGCGTTATACAAGTTCAACTCCCACTTAAACATCAATAAGAAACATTGCTGTCCACATTGCTTCACAATAATGACCATGAAGGAGTTTGGGGAGCATCTCACAACACACAACATGCCCAGTTTCGAGTGTGACGTCTGTCATCTACATTTTGAAACAGAAACACGTTTGCTGAGACACCAAATATTTCATAAGAGAGGGAAATTCATGTGTCCAGAATGCAAATACACTTTTATGACAATGGACGGGCTGACCGTCCACACGAACTTCGTCCACGCCACAGTCCGCTGCGCTTGCGACAAAGTCTTCCACGGGAAAGGCTGTTATATCCGTCACAAAAGAATTTGCCCCAGGAGACCAGCACAAAAGTTCACGTGTGATTATTGCTCGAAAGATTTCAAACAGAAATGTATATTAGGAAGACACATCGTAAACTTTCACACGGTGATACAATATCAAGAACAGTGTGACAATTGTGGAAAAATGTTTAAGAACAAAGCTCATCTGAAAGAGCACAGCTATATGCATGACAAAGTGCTGGACCGGTTCTCGTGCGAGTACTGCGATAAAAACTACAGCACCCCAAGAGGATACAGAAAGCATATGACCAGAATGCATTATCAAGTCAATCCTGTGTTGTATAAGAAGAAGACTTACCCTTGTAAATTCTGCGGCAAGGTAGACGAGGAGACAGACCTGCCACCTGGCCTGTGCCGGGATTGCACTGAGGACATCATAACAGCTGTGACCTTCAAGCAGCTGTGCCAGCAATCAGCTCAGTACTGGGAGGAGGCTACCGATTACCTTGCAAGCATTAACAAACCTACAGTTGCAGACAAGACGTTATTCATCTTCTACCCTCAAGAAAAACCAATCTACAACGTTACGGAACCTGCTTCAACCACACAAATCGCTTTGGATGTCCTGAACAAACCAAAACGGATTAAAACTAAAAAGAAACGCTTCTCGTGCAGCTGCCCCTTTTGCGATAAAAAGTTTAGCTGCTTGGAAAATTTAAACGAACATTTGAAGTACACTTTGCAGAGATTTTGCCACGAATGCGGTATCGGCGTCAGCAAGAAAAATTTAGCTATACATTTGACTCTGAAACATAACATAGAAGTATTGGAGTGCGAATCTTGTCACGATATATTTCGGAATGATGCTGAAGGACAGAGGCATTGGGTGAAGCATCACGGGCCTGGTACGCATTCATGTTCATTGTGCGGAACTGGTTTCCCCAGCGAACGAGGCCTCAGAGCCCACAGCTACTTACACACGCTCTTTCATTGTACATGCGGTGCTAACTTTGAGAACAGTCGATGTCACAAGTACCATATATCCAACTGCAAGCCGAAAAAGCGTCAAATCGAATCTCACTTCGAATGCGATCACTGCGGCGTCGTATACGACAAGAAGCCGTCGCTTCGCATCCACATCGTTCAAAAACATCTAAACGTACTCCCATTCGTGTGTCAAACATGCGGCAAGCGTACATCTACTCTGGCGCATTTGAGATCTCACTCTAAAACTCACGAAACCGAGCGGAAGGTTCTAGAATGTCATTGCGGATCTAAATTTCGTACCGATCTAGGATACAAGTTGCATATGCGAATCCATACTGGGGATAAGCCTTATAAATGCGACTTTTGCGAGGAAAGCTTCCTATCTGCATCTAGAAGACTAGATCATACCAAAAGACGTCACCAGTCTTCGAAAGAGATGCCGCACGGGTGCAGTGAGTGTAGTGCAAAATTCATAATAGAAGATGATTCTTGCCCGATCGGCGCATGCGTCGGATGCGCGAGCGCAGCCCTCTCCGCGCAAGAATTCCGATTATTCGTTCGCAACTCGGAGAAACTGTGGCACAAAGCCATCGCCAATCTCGCCAACCTCGAAAACGAACAAGAGGAACGCGCCAAATCGATTTGCGCCTTCTTCACACCCAACGACTTAACAGTCCTAACCGCCAGAGACTACACTGGAGGCGAACCAAAAGAAGTCGTCAATCGTTCCAACACTTCCGTCAAAAAAACCAAACGGAAGGTCAGACGGCACCACACGGGACCTATGAGCACTTGCCCCGATTGTCGGAAAACTTTCATGAGCCCATTTTTTTTGCATGAGCATTTGAAAAACAACGGGCCTAGGACGGCATGTGTGACTTGCGGTGCTGTAGTGGCTAGAGGTCAAGATTTAGTAGAACACATGGCTTCCGCTCACAACGAGGTCTTGACGATGTGCCCCCACTGCCCTTGCTTGCTTAAAACTGAAGAAGAGCTGAGTCAGCATTTGAAAGACGCTCACAAACCAGGCGTTTTTACTTGCTTAGACTGCGGCCGGACCTTCCCAAGAAGGTCTACCTTCGAGCATCATTTGCAGATGCATGGGGTCCGAACTTGTCGCAGCTGCGACCGGCAGTTCACGAACAGAAGCTGCTATAGAGAGCATAGGACACAATGCGAGCCCGACGCCAAACCCGACAAAAATACTGTACCCAGAAATCGACGATCGAACATTCGAGACCCTGCGGTGTTTATATGCGATTACTGCAACAAGAAGTATACGACGCGTCCTCAGTTGAAGAACCATATACTGTGGATCCATATGGATCACAGACCACATCAATGTCAATGGTGCGGGAAGAGGTTTTTCACTCTGGCTCGAATGGCTGAGCACTCTGTGGTCCACACGCGCGCGCGAAACTTCGAGTGCGACATTTGTGGCGCTAAATTGGTTTCCAAAATGGCGGCCGTGTACCACAGACGGCGTCACACCGGGGAAAAACCTTACAAGTGCGAAGACTGTGATGCTACCTTCATTTCTTCATCGCGGAGACTTGAACATGCTAAAAGAAAACACAATAAAGGTACCAGAATTCAGTGTCCGCTGTGTCCGAACAACTTTGTCCGGAAAAGTGAGCTTAAAAGGCATATGGATAAGGCTCACAATACTGTGGATGAGGTGGAAAAGAAACTTGATATTATAGGGGATTGGAGGATGGTATCAGAACATCCCCAATTGGAGAGTGTGGCGACAATGTAA
- Protein Sequence
- MIEWTNCLPPRLCRACSSSITDFIPFIVLCRESTQKWKQTANYLNTVSVNKQATTAYVLLGDEPQTIVDRSTTNNKTPKEVMRNIRLRLNYIRSARKKAKANISNSNQRGSHKFSCPDCKNEGFEALYKFNFHLNLNKKHCCPHCFAIITMKELAEHLISHNMPSFECDICHMHFETESRLLRHKTFHKRGKFMCPECKYTFLTMDRLTAHTNGVHATVRCACDKIFHGKGCYIFHKKRMCPKKRIQTYICDYCSKEFNKKCILGNHIKNFHVVKYGEQCDKCGKTFKNKAHLKEHSYMHDKVLDRFSCEFCDKNYSTPRGYRRHMTRVHYQVNPDLYKKKTYPCKIEWTPCLPPRLCRACSSSIIDFLPFFVLCRESVQKWNQTADCINTISVSKQATTAYVLLGDEPQILVDRSTTKNKTPKNIIENITLRLKYMRAVKEKAKAKMTICKRRGSHKFSCPDCKKGGFEKFYKFNAHLNINKKHCCPHCLAIIDMKEFGDHLTSHNMPSFECDICHIHFEKEISLLRHKKYHKRGKFTCPECKYTFLTMDGLTVHTNFVHATVRCACDKVFHGKGCYFGHKKRVCAKKRTQNYICDYCSKEYKQKSILRKHIEHFHLVLHSKHQCDHCGKMFKNNTELKEHSYMHDKVLDRFSCEYCGNNYSTARGYSKHMNKHYQVNPELYKKKTYPCEICGKVLISELNLKNHTLRQHKKIEWTNCLPPRLCRACSSSITDFIPFIVLCRESARKWKQTANYLNTVSVNKQATTAYVLLGDEPQTIVARSTTNNKTPKEVMRNIKLRLSYIRSARKKAKAKIAISKQRGSHKFSCPDCKKEGFEALYKFNSHLNINKKHCCPHCFTIMTMKEFGEHLTTHNMPSFECDVCHLHFETETRLLRHQIFHKRGKFMCPECKYTFMTMDGLTVHTNFVHATVRCACDKVFHGKGCYIRHKRICPRRPAQKFTCDYCSKDFKQKCILGRHIVNFHTVIQYQEQCDNCGKMFKNKAHLKEHSYMHDKVLDRFSCEYCDKNYSTPRGYRKHMTRMHYQVNPVLYKKKTYPCKFCGKVDEETDLPPGLCRDCTEDIITAVTFKQLCQQSAQYWEEATDYLASINKPTVADKTLFIFYPQEKPIYNVTEPASTTQIALDVLNKPKRIKTKKKRFSCSCPFCDKKFSCLENLNEHLKYTLQRFCHECGIGVSKKNLAIHLTLKHNIEVLECESCHDIFRNDAEGQRHWVKHHGPGTHSCSLCGTGFPSERGLRAHSYLHTLFHCTCGANFENSRCHKYHISNCKPKKRQIESHFECDHCGVVYDKKPSLRIHIVQKHLNVLPFVCQTCGKRTSTLAHLRSHSKTHETERKVLECHCGSKFRTDLGYKLHMRIHTGDKPYKCDFCEESFLSASRRLDHTKRRHQSSKEMPHGCSECSAKFIIEDDSCPIGACVGCASAALSAQEFRLFVRNSEKLWHKAIANLANLENEQEERAKSICAFFTPNDLTVLTARDYTGGEPKEVVNRSNTSVKKTKRKVRRHHTGPMSTCPDCRKTFMSPFFLHEHLKNNGPRTACVTCGAVVARGQDLVEHMASAHNEVLTMCPHCPCLLKTEEELSQHLKDAHKPGVFTCLDCGRTFPRRSTFEHHLQMHGVRTCRSCDRQFTNRSCYREHRTQCEPDAKPDKNTVPRNRRSNIRDPAVFICDYCNKKYTTRPQLKNHILWIHMDHRPHQCQWCGKRFFTLARMAEHSVVHTRARNFECDICGAKLVSKMAAVYHRRRHTGEKPYKCEDCDATFISSSRRLEHAKRKHNKGTRIQCPLCPNNFVRKSELKRHMDKAHNTVDEVEKKLDIIGDWRMVSEHPQLESVATM
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -