Ecor015516.1
Basic Information
- Insect
- Electrophaes corylata
- Gene Symbol
- -
- Assembly
- GCA_947095575.1
- Location
- OX352725.1:5531170-5543439[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 9 0.16 7.9 7.3 0.3 3 23 3763 3784 3762 3784 0.92 2 9 0.95 48 4.8 1.1 2 23 3819 3840 3818 3840 0.96 3 9 0.001 0.05 14.2 0.4 2 20 3845 3863 3844 3865 0.93 4 9 0.00025 0.012 16.1 0.7 2 23 3886 3907 3885 3907 0.96 5 9 0.00037 0.019 15.6 1.3 1 23 3914 3936 3914 3936 0.98 6 9 0.14 7.1 7.4 0.6 1 20 3942 3961 3942 3964 0.92 7 9 3.8e-05 0.0019 18.7 0.8 1 23 3969 3991 3969 3991 0.97 8 9 5.1e-05 0.0025 18.3 3.8 1 23 3997 4019 3997 4019 0.99 9 9 0.001 0.05 14.2 2.4 1 23 4025 4048 4025 4048 0.94
Sequence Information
- Coding Sequence
- ATGAGTTCCGGCATTGTTATTTGCAGACTTTGCGCGGAATCGAAGCACAACAGCAAACAAGTCGACCTAGAATGCGATATGATAAAGCGAGACGAGGTAATAGAACACTTGGCAAAACTCGACACCGTTCTAGATTTCAACGATGAAAAACTTCCCAGCACAGTGTGCTTAGACTGCATCTGCACCCTAGATAAATCCTTCGACTTCGTAGTCAATCTAGAAAGCGCTCAGAAAGTTTTACACGACTTATTCTACAAACGAACATCGACACGGATCGACCTATCTGACGATGTAATTTTTTTCGAAAGGTCTGGGTTGGATGACTGTGGTTTTTTAACAGAGTCGGAAAGCGATTTTTCTATTGAATCTCAAGACGCAGCGACTCCTAGCACCGATGATGACCTGATCTCGTTAACAGATTTGATAATATATGATCCAAATTCAAGCATTGGGTTTGATGAAACATCTGCGGTAATCAGAACATACAACAGTAATACAAGATCCAACATACAATCAACTGAATTTTCTTTAGATTTGGGAGTAATTCATCCAGATTCCAGTGTTGGCTCTAATGAAGCTATAAAGGTAAAAACTCCTAGTATTGAAGTAGAATCTGATGATATTATTCAGTTACAAGATTCGAGACTCAATCACTCAAATGCTACTGTTGACTCCGATGAATCTGATGCTATAACCCAGTTGAAATGTTTGATAAATAATAAATCAAACTCCAGTGTTCCTTCTGTTCAAGAGAGTTTGGCAAGCAACAGATCCGACCCCAGTGTAACCACGGTACAGGAATGTTTGGCAAGCAATAGATATGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGTTATGACCCCAGTGTAACCGCCGTTGATAAATGTTTGGCAAACAATAAATCCGACTCCAGTGTAAGTGCCATTAAAGAATGTTTGGTAAGCAATAAATCAAACTCCAGTGCTAGCTCTGTTCAAGATAGTTTGGAAAGCAATAGATCCGACCCCAGTGTAGCCACCATTCAAGAATATTTGGCAAACAATAGATATGACCCCAGTGTAACTACTGTTCAAGAATATTTGGCAAGCAATAGATATGACCCCAGTGTAACTACCCTTCAAGAATATTTGGCAAGCAATAGATGTGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGATATGACCCCAGTGTAACAACCGTTCAAGAATATTTGCCAAGCAATAGATGTGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGATATGACCCCAGTGTAACCACTGTTCAAGAATGTTTGGCAAGCAATAGTTATGACCCCAGTGTAACCGCAGTTGATAAATGTTTGGCAAACAATAAATCCGACTCCAGTGTAAGCGCCATTAAAGAATGTTTGGCTAGCAATAAATCAAACTCCAGTGCTAGCTCTGTTCAAGATAGTTTGGAAAGCAATAGATCCGACCCCAGTGTAGCCACCATTCAAGAATATTTGGCAAACAATAGATATGACCCCAGTGTAACTACTGTTCAAGAATATTTGGCAAGCAATAGATATGACCCCAGTGTAACTACCCTTCAAGAATATTTGGCAAGCAATAGATATGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGATATGACTCCAGTGTAAATACCGTTCAAGAATATTTGCCAAGCAATAGATGTGACCCCAGTGTAACCACTGTTCAAGGATATTTGGCAAGCAATAGATATAACCCCAGTGTAACCACTGTTCAAGAATATTTGGAAAGCAATAGTTATGACCCCAGTGTAACTACCGTTCAAGAATATTTGGCAAGCAATAGATGTGACCCAAGTGTAACCACCGTTCAAGAATGTTTGGCAAGCAATAGATCCGACCCCAGTGTAACCACCATTCAAGAATATTTGGCAAGCAATAGATGTGACCCCAGTGTAACCACCGTTCAAGAATGTTTGGCAAGCAATAGATGTGACCCCAGTGTAACCACCGTTCAAGAATATTTGGCAAGCAATAGATATGACCCCCGTGTAACCACCATTCAAGAATATTTGGCAAGCAATAGATATGACCCCAGTATAACCACCGTTCAAGAATATTTGGCAAGCAATAGATGTGACCCCAGTGTAACCATTCAAGAATGTTTGGCAAGCAATAGATGCGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGATATGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAATCAATAGATATGACCCCAGTGTAACTACCGTTCAAGAATATTTGGCAAGCAATAGATGTGACCCCAGTGTAACTACCGTTCAAGAATATTTGGCAAGCAATAGATGTGACCCCAGTGTAACCACCGTTCAAGAATATTTGGCAATCAATAGATATGACCCCAGTGTAACTACCGTTCAAGAATATTTGGCAAGCAATAGATGTGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGATGCGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGATGTGACCCCAGTGTAACCACCATTCAAGAATGTTTGGCAAGCAATAGATATGACCCCAGTGTACCCACCGTTCAAGAATATTTGGCAAGCAATAGATATGACCCCAGTGTAACCACCCTTCAAGAATATTTGGTTAGCAATAGATGTGACCCCAGTGTAACCATTCAAGAATGTTTGGCAAGCAATAGATATGACCCCAGTGTAACTACCGTTCAAGAATATTTGGCAAGCAATAGATGTGACCTCAGTGTAACCGCAATTGAAAAATGTTTGGCAAGCAATAAATCAAACTCCAGTGTAAGTACCATTAAAGAATGTTTGGCAAGCACTAAATTCAAATCCAATGTTAACTTGGTTCAAGATAGTTTGGCAAGCAATAAATCCGGTCCCTGTGCTAGCACTGTTCAGGAGAGTTTGGCAAGTACAAAATCAAAGTCCAATGCTAGCTCTGTTCAAGATAGTTTGGCAAGCAAATCTGACCCCAGTGTAAGCGCCATAACAGAATTTTTGACAAACAATAAATCAAAATTCAGTGCTAGCCCTATTCCAGAGAGATTGTCAAGCACAAAATCAAAATACAATGCTAGCTCTGTTCAAAAGAGTTCGGCAAGCAATAGACCCAACCCCAGTGTAAGCGCCATTAAAAAATGTGTGGCAAGCAATAAATCTGTTCAAGATAGTTTGGCAAGCAATAAATCAAAGTCCAATGCTAGCTCTATTCAAGAAAGTTTGACAAGCAATAAATCAAAATCTAGTGCTAGCCTTATTCAAGAGAGATTGTCAAGCACAAAATCAAAGTCCAATGCTAGCTCTGTTCAAGAGTGTCTGGCAAACAATAGATCTGATCCCAGTGTAAGCGCTATTAAAGAATGTTTGGCAAGCAATAAATCCGACCCCTGTGCTAGCTCTGTTCAAGAGAGTTTGGCAAGCAATAAATCTGGCGCCATAAAAGAATGTTTGACAAGCTATAAATCAAAATCCAGTACTAGATCTGTTCAAAATAGTTTGGCAAGCATTAAATCAAAGTCCAATGCTAGCTCTGCTCAAGAGAATTCGGCAAGCAATAAATCCAACCCCAGTGTAAGCGCTATTAAAGAATGTTTGGCAAGCAATAAATCAAAAGCTAATACTAAATCTGTTCAAGATAGTTTGGCAAGCAATAAACCAAAATTCAGTGCAAGCCCTATTCAAGAAAGATTTTCAAACACAAAATCAAAGTCCAATGCTAGCTCTGTTCAAGAGAGTCTGTCAAACAATAGATCTGATCCCAGTGTAAGCGCTATCAAAGAATGTTTGGCAAGCAATAAATCTGACCCCTGTGCTAGCTCTGTTCAAGAGAGTTTGGCAAGCAATCAATCTGACCCTAGTATAAGCGCCATAAAAGAATGTTTTACAAGCTATAAATCAAAATCCAGTACTAGATCTGTTCAAGATAGTTTGGCAAGCATTAAATCAAAGTCCAATGCTAGCTCTGCTCAAGAGAATTCGGCAAGCAATAAATCCAACCCCAGTGTAAGCGCTTTTAAAGAATGTTTGGCAAGCAATAAATCAAAAGCTAATACTAAATCTGTTCAAGATAGTTTGGCAAACAATAAATCAAAATCCAGTGCTAGATCTGTTCAAGAGAGTTTGGCAAGCAATAAATCCAACCCTAGAGAAAGCGCTATTAAAGAATGTTTGGCAAGCAATAAATCCGGTCCCTGTGCTAGCACTGTTCAGGAGAGTTTGGCAAGTACAAAATCAAAGTCCAATGCTAGCTCTGTTCAAGATAGTTTGGCAAGCAAATCTGACCCCAGTGTAAGCGCCATAACAGAATGTTTGACAAACAATAAATCAAAATTCAGTGCAAGCCCAATTCAAGAAAGATTGTCAAACACAAAATCAAAGTCCAATACTAGCTCTGTTCGAGAGAGTTCTTCAAGCAATAGACCCAAACCCAGTGTAAGCGCCATTAAAGAATGTTTGGCAAGCTATAAATCAAATGCTAATACTAAATCTGTTCAAGATACTTTGGCAAACACTAAATCAAAGTCCAATGGCAGCTGTTTTCAAGACAGTGTGACAAGCAGTAAATCAAAATCCAGTGCTAGCCCTATTCAAGAGAGATTGTCAAGCACAAAATCAGAGTCCAATGCTAGCTCTGTGGAAGACAGTCTGGCAAGCAATAGATCTGACCTCAGTGTTAGCGCCATAAAAGAATGTTTGGCAAACAATAAATCAAAATCCAGTACTAGACCTGTTCAAGATAATTTGGCAAGCAATAAATTCGACCCCTGTGCTAGCTCTGTTCAAGAGAGTTTGGCAAGCAATAAATCTGACCTCAGTGTACGCGCCATAAAAGAATGTTTGACAAGCTATAAATTAAAATCCAGTAATAGATCTGTTCAAGATAGTTCGGCAAGCATTAAATCCAACCCTAGTGTGAGCGCTATTAAAGAATGTTTGGCAAGCAATAAATCCGACCCCTGTGCTAGCACTGTTCAAGATAGTTTGGCAAGCAATAAATCTGATCCCAGTGTAAGCGCCATAAAAGAATGTTTGACAAGCTATAAATCAAAATCCAGTACTAGATTTGTTCAAGATAGTTTGGCAAGCATTAAATCAAAGTCCAATGCTAGCTCTGTTCAAGAGAGTTTGGCAAGCAATAAATCCAACCCTAGTGTAAGCGCTATTAAAGAATGTTTGGTAAGCAATAAATCCGACTCCTGTGCTTACTCTGTTCAAGATAGTTTGGCAAGCAATAAATTTGATCCCAGTGTAAGCGCCATAAAATCAAAATCAAGTGCTAGCTCTGCCCAAGTCAGTTCGGCAAGCAATATATCCAATCGAAGTGTAAGCGTCATTAAAGAAAGCTTGGTTAGCCATAAATCAAAATCCAGTGCAAACACTGTTCAAGAGAGTTTGGCGAGCACAAAATCAAAGTCCAATGCTAGCTCTGTTGAAGCAAGTATTGCAAGCAATAGATCTGATCCCAGTGTAAGCACCATAAAAGAATGTTTGGCAAACAATAAATCAATGTCCAATGCTAGCTCTGTTCAAGAGAGTTTGGCAAGCAATAGATCTGATCCCAGTATAAGCGCTATCAAAGAATGTTTGGCAAGCAATAAATCCGACCCCTGTACTAGCTCTGTTCAAGAGAGTTTGGTAAGTAATAAATCTGACCCCAGCGTAAGCGCCATAAAAGAATGTTTGGCAAATAATAAATCAAAATCCAGTGCTAGATTTGTTCAAAAGAGTTTGACAAGCAATAAATCTGACCCTAGTATAAGCGCCAAAAAAGAATATTTGACAAGCTATAAATCAAAATCCAGTACTAGATCTGTTCAAGATAGTTTGGCAAGCATTAAATCAAAGCCCGATGCTAGCTCTGTTGAAGAACGTTTGGCAAGCACAAAACCAAAGTCCAATGATAGCTATGTTGCAGAGAGTTTGGCAAGCAATACATCTAACCCCAGTGTAAGCACCATAAAAGAATGTTTGGCAAACAATAAATCAAAATCCAGCGCTAGCTCTATTCAAGAGAGTTTGACAAGCAATCAATTCTACCCAAGTGTGAGCGCCATAAAAGAATATTTGGCAACAAATAAATCCGGTCCCTGTGCTAGCACTGTTCAAGAGAGTTTGGCAAGCAATAAATCCAACCCCAGTGTAAGCGCTATTAAAGAATGTTTGGCAAGCAATAAATCCAACCCCTGTGCTAGATCTGTTGAAGAGAGTTTGGCAAGCAATAAATCTGATCCCAGTGTAAGCGCCATTAAATCAAAATCAAGTGCTAGCTCTGTCCAAGACAGTTCGGCAAGCAATATATCCAATCGAAGTGTAAGCATCATTAAAGAAAACATGGTAAGCTATAAATCCAGTGCTAACACTGTTCAAGAGAGTTTGGCAAGCACAAAATCAAAGTCCAATGCTAGCTCTGTTGAAGAGAGTTTGGCAACCAATAGATCCAACCCCAGTGTAAGTGCTATTAAAGAATGTTTGGCAAGCAATAAATTCGACCCCAGTGCTACCTCTGTTCAAGAGAGTTTGGCAAGTAATAAATCTGACCCCAGCGTAAGCGCCATTAAAGAATGGTTGACAAACTATAAATCAAAATCCAGTACTAGATCTGTTCAAGATAGTTTGGAAAGCATTAAATCAGAGTCCTATACAAGCTCTGTTCAAGAGAGTCTAGCAAGCAATAGATCTGACCCCAATGTAAGCGCCATCAAAGAATGTTTGGCAAGCAATAAATCTGACCCCTGTGCTAGCTCTGTTCAAGATAGTTTGGTAAGCAATAGATCTGACCTCAGTTTAAGTGCCATAAAGGAATGTTTGGCAAGCTATAAATCAAAACTCAGTATAAGATCTTTACAAGATAGTTTGGCAAGCACTGAATCGAAGTCCAATGCTAGCTCTGTTCAAGAGAATCTGGCAAGCAATAGATCCGTCCCCAGTTTAAGCGCCATAAAAGAATGTTTGGCAAGCTATAAATTAAAATCCAGTACTAGACCCGTTCAAGATAGTTTGGAAAGCACTAAATCAGAGTCCTATATAAGCTCTGTTCAAGAGAGTCTAGCAAGCAATAGATCTGACCCCAGTGTAAGCGCCATCAAAGAATGTTTGGCAAGCAATAAATCTGACCCCTGTGCTAGCTCTGTTCAAAATAGTTTGGCAAGCAATAGATCTGACCTCAGTTTAAGTGCCATAAAGGAATGTTTGGCAAGCTATAAATCAAAACTCAGTACAAGATCTGTACAAGATAGTTTGGCAAACACTGAATTGAAGTCCAATGCTAGCTCTGTTCAAGAGAATCTGGCAAGCAATAGATCTGGCCCCAGTTTAAGCGCCATCAAAGAATGTTTGGCAGGCTATAAATCAAAACCTAATACAAGATCTGTACAAGATAGTTTGGCAAGCACTGAATCGAAGTCCAATACTAGCTCTGTTCAAGAGAATCTGGCAAGCAATAGATCTGGCCCCAGTTTGAGCGCCATAAAAGAATGTTTGGCAAGCTATAAATCAAAACTTAGTACAAGATCTATACAAGATAGTTTGGCAAGCACTGAATCGAAATCCAATGCTAGCTCTGTTCAAGAGAGTCTGGCAAGCAATAAATCTGACCCCAGTGTAAGCGCTATCAAAGAATGTTTGGCAAGCTATAAATTAAAATCCAGTACTAGATCTGTACAAGATAGTTTGGCAAGCACTGAATCGCAGTCCAATGCTAGTTCTGTTCAAGAGAATCTGGCAAGCAATAGATCTGGCCCCAGTTTAAGCGCCATCAAAGAATGTTCGACAAGCTATAAATCAAAACTTAGTACAAGATCTGTCCAAGATAGTTTGGCAAGCACTGAATCGAAATCCAATGCTAGCTCTGTTCAAGAGAGTCTGGCAAGCAATAGATCTGACCCCAGTGTAAGCGCTATCAAAGAATGTTTGGCAAGCAATAAATCAAAATCCAGTACTAGATCGGTTCAAGAGAGTTTGGCAAGCATTAAATCAAAGTCCAATACTAACTCTGTTCAAGAGAGTTCGGCAAGCAATAAATCTGATCTCAGTGTATGCGCCATAAAAGAATGTTTGACAAGCTATAAATCAAAATCCAGTACTAGATCTGTTCAAGATAGTTTGGCAAGCATTAAATCAAAGTCCAATACTAACTCTGTTCAAGAGAGTTCGGCAAACTATAAATCTGACCCCGGTGCTAGCTCTGTTCAAGAGAGTTTGGCAAGCAATAGATCTGACCCCAGTATAAGCGCTATCAAAGAATGTTTGGCAAGCAATAAATCCGACCCCTGTACTAGCTCTGTTCAAGAGAGTTCGGCAAGCAATAGATCTGACCCCAGTGTGAGCGCTATCAAAAAATGTTTGGCAAGCAATAAATCCGACCCCGGTGCTAGCTCTGTTCAAGAGAGTTTGGCAAGCAATAAATCTGATCCCAGTGTAAGCGCCATAAAATCAAAATCAAGTGCTAGCTCTACCCAAGACAGTTCGGCAAGCAATATATCCAATCGAAGTGTAAGCGTCATTAAAGAAAGCTTGGTAAGCCATAAATCAAAATCCAGTGCTAGCACTGTTCAAGAGAGTTTGGCAAGCAATAAATCCAACCGCAGTGTAAGCACCACCAGTAAAGAATGTTCGGCAAGTAAAACAAACTCCAGTGTAAGCTCGGTTTTAAACAGTTTGCCAAGCAATAAATCTGACCCTAGGGTTAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCAGACCCTAGTGCTAGCTCTGCTCAAAAAAGTTTGTCAAGCTCTCAATTTGACCCTAGTGTTAGCTCTGCCGAAAGAAGTTTGTCAAACTCTAAATCTGACCCTAGTGTTAACTCAGTCAAAAGAAGTTTGTCATGCTTTCAATCTGACCATAGTGTTAGCTCTGCCCAAAGAAGTTTGTCAAAGTCTAAATCTGACCCTAGCGTTAGCTCTGCCAAAAGAAGTTTGTTTAACCCTAAATCAGACCCTAAGGTTAGCTCGGCCCAAAACAGTTTGTCAAACTCTCAATCTGACCCTAGTCTTAGCTCTGCAAAAACAAGTTTGTTAAAACCTAAATCTGACCCTAGCGTTAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCAGACCCTAATGTTAACTCTGCCCAAAACAGTTTGTCAAGCTCTCAATCTGACCCTAGTGTTAACTCTGCAAAAATAAGTTTGTCAAAGCCTAAATCTGACCCTAGCGTAAGCTCTGCCAAAAGAAGTTTGTCAAACTCAAAATCCGACCCTAGTGTTAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCTGACCCTAGTGCTAGCTCTGCCAAAAGAAGTTTGTTAAACTCTAAATCAGACCCTAATGTTAGCTCGGCCCAAAACAGTTTGTCAAACTCTCAATCTGACCCTAGTCTTAGCTCTGCAAAAACAAGTTTGTCAAAGCCTAAATCTGACCCTAGCGTTAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCAGACCCTAATGTTAACTCTGCCCAAAACAGTTTGTCAAGCTCTCAATCTGACCCTAGTGTTAACTCTGCAAAAACAAGTTTGTCAAAGCCTAAATCTGACCCTAGCGTAAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCTGACCCTAGTGTTAGCTCTGCAAAAATAAATTTGTCAAAGCCTAAATCTGACCCTAGCGTAAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCTGACCCTAGTGTTAGCTCTGCCAAAAAAAGATCGTCGAGCTCTCAATCTGACCCTAGCGTTAGCTCTGCCAAAAGAAGTTCGTCAAACTCTAAATCTGACCCCAGTGCTAGCTCTGCCAAAATAAGTTTGTCTAACTCTCAATCTGACTCTAGTGTTAACTCTGCCAAAATAAGTTTGGCATGCAATAAATCTACCCTCAGTGCAGACGTAGCTCAAGGAAGTTTGATAAATAAATCGAACCCTTTTGTAAGATGTATTCGAGAGAGTTTGTCAAACTCTAAATCTGACCCTAGCGTTAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCAGACCCTAATGTTAACTCTGCCCAAAACAGTTTGTCAAGCTCTCAATCTGACCCTAGTGTTAACTCTGCAAAAACAAGTTTGTCAAAGCCTAAATCTGACCCTAGCGTAAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCTGACCCTAGTGTTAGCTCTGCAAAAATAAATTTGTCAAAGCCTAAATCTGACCCTAGCGTAAGCTCTGCCAAAAGAAGTTTGTCAAACTCTAAATCTGACCCTAGTGTTAGCTCTGCCAAAAAAAGATCGTCGAGCTCTCAATCTGACCCTAGCGTTAGCTCTGCCAAAAGAAGTTCGTCAAACTCTAAATCTGACCCCAGTGCTAGCTCTGCCAAAATAAGTTTGTCTAACTCTCAATCTGACTCTAGTGTTAACTCTGCCAAAATAAGTTTGGCATGCAATAAATCTACCCTCAGTGCAGACGTAGCTCAAGGAAGTTTGATAAATAAATCGAACCCTTTTGTAAGATGTATTCGAGAGAGTTTGTCAAACTCTAAATCTGACCCTAGTGTTATCTCTACCAAAAAAAGTGGATCATGTTCTCAATCTGGACTTAATACTAGCTCTGCCAAAATAAATTTGTTAAGCTCTCAATCTGACCCTAGCGCTAGCTCTGACAAAATAAGTTTGGCATGCGATAAATCAACCCTCAATGTAAGCATAGCTCAAGGAAGTTTAAAAAGCAATAAATTGAGACCTATTGTAAGATGTATTAGGGAGAGTTTGGCAAACAATAAATTAAACCTTAGTGTAAATTCTGTTGAAAAAAGTTTGGCAAAGAATAAATTAAATCCCAGTGTATACTCCCTTAAAAGAAGTTTGACAAGCAATAAATCAAGTGCTAGCTCCATTAAAAATAAAGCCGATTCTGACAGCATAATTCCTGTAAAAGATTTGGGATCCAATCAATCAGATACCAACGTCGTCTCTGATAAACCCATAGAAATAATATCTTCTAATAGTAAAACCGACTCCGAGTCTGGTGAACCTAAAGAAAAAAAATTAAAAAAGCAAAAGACACCTGTTAAAGAAAGTACCTCAGTAGACCCTCTTTCTCAAGACGAGCTAATCGAGGCATGGGAAGACTATAACTGGTTATGTACTTACTGCGAGACTGTATTCCCAAATATAGATGAACTGCATTCACATTCTATGGAGGTCCATACCACTTGCAACCCTTACCGATGTAAACATTGCAAGCTTCGTAAACAATGCTTAACCCCTTTCCTGGAACATATAGAATCCCACATGAAAAAACTAAAATTAACGTGTTTTAAATGTCAGGAAAAGTTTGAAAATGTGCTTAAAGCCAAGAAGCATATTAAAGTTCATTTTAAGAAGCTCAGCTGTCCAGGCTGCTTCACAACGTTCAAAAACAAGAAGGAGCTTAAAACACACCAGGATGTCTATTTAAAATTGAAATATATGAAAAAAGTAAAAAATCCACTAGTCAGTGACGACAAGTTAACTTGTACAGTTTGTTCAAAAACTTATATTAATGAATCTAGTCTTCGGAAACATCTTTTATTGCACACAGGAAGAAAACGTGATTATGTATGTGAAATTTGTGGTAAGGAATACTTTGAGAAAAACCATTTGACATATCATATGGGAACTCATGGTGATGACCGGCCTTTCAAATGTAAAGTCTGTAATCGGGGCTTCCAAAGGTTAAATGTTCTAAAAGAGCATGTTTGGTCACACAATGAAAAATTATTTTCCTGTGATAAGTGTGACAAGTCATTCAGTCTAGAGAAGAATCTTAAATTACATATGGTCGTTCATTCTAATAAACTACCGTTCAAGTGTTCGGAGTGTGAAAAATGTTTTAGGATCAGAGGAACATTGAAAAGACATCTACGTATTCATACTGGAGAGAAACCATTTTTGTGCAAGCTGTGTAGACTAACATTCAGGTTTAGATCTAATCTGAAGGATCATATGTTGTCTCAGCATGACATGAATATTTCTAAGAAGAAACGTAAGACAGTTGATACCGAAGAAACCCAAGAGTGGAAAAAGGAAATGCTGCAAAGAAAGCATGTCAAAAGAGTAGCAGGAAAAGTAATAGAACCAACTTGA
- Protein Sequence
- MSSGIVICRLCAESKHNSKQVDLECDMIKRDEVIEHLAKLDTVLDFNDEKLPSTVCLDCICTLDKSFDFVVNLESAQKVLHDLFYKRTSTRIDLSDDVIFFERSGLDDCGFLTESESDFSIESQDAATPSTDDDLISLTDLIIYDPNSSIGFDETSAVIRTYNSNTRSNIQSTEFSLDLGVIHPDSSVGSNEAIKVKTPSIEVESDDIIQLQDSRLNHSNATVDSDESDAITQLKCLINNKSNSSVPSVQESLASNRSDPSVTTVQECLASNRYDPSVTTIQECLASNSYDPSVTAVDKCLANNKSDSSVSAIKECLVSNKSNSSASSVQDSLESNRSDPSVATIQEYLANNRYDPSVTTVQEYLASNRYDPSVTTLQEYLASNRCDPSVTTIQECLASNRYDPSVTTVQEYLPSNRCDPSVTTIQECLASNRYDPSVTTVQECLASNSYDPSVTAVDKCLANNKSDSSVSAIKECLASNKSNSSASSVQDSLESNRSDPSVATIQEYLANNRYDPSVTTVQEYLASNRYDPSVTTLQEYLASNRYDPSVTTIQECLASNRYDSSVNTVQEYLPSNRCDPSVTTVQGYLASNRYNPSVTTVQEYLESNSYDPSVTTVQEYLASNRCDPSVTTVQECLASNRSDPSVTTIQEYLASNRCDPSVTTVQECLASNRCDPSVTTVQEYLASNRYDPRVTTIQEYLASNRYDPSITTVQEYLASNRCDPSVTIQECLASNRCDPSVTTIQECLASNRYDPSVTTIQECLAINRYDPSVTTVQEYLASNRCDPSVTTVQEYLASNRCDPSVTTVQEYLAINRYDPSVTTVQEYLASNRCDPSVTTIQECLASNRCDPSVTTIQECLASNRCDPSVTTIQECLASNRYDPSVPTVQEYLASNRYDPSVTTLQEYLVSNRCDPSVTIQECLASNRYDPSVTTVQEYLASNRCDLSVTAIEKCLASNKSNSSVSTIKECLASTKFKSNVNLVQDSLASNKSGPCASTVQESLASTKSKSNASSVQDSLASKSDPSVSAITEFLTNNKSKFSASPIPERLSSTKSKYNASSVQKSSASNRPNPSVSAIKKCVASNKSVQDSLASNKSKSNASSIQESLTSNKSKSSASLIQERLSSTKSKSNASSVQECLANNRSDPSVSAIKECLASNKSDPCASSVQESLASNKSGAIKECLTSYKSKSSTRSVQNSLASIKSKSNASSAQENSASNKSNPSVSAIKECLASNKSKANTKSVQDSLASNKPKFSASPIQERFSNTKSKSNASSVQESLSNNRSDPSVSAIKECLASNKSDPCASSVQESLASNQSDPSISAIKECFTSYKSKSSTRSVQDSLASIKSKSNASSAQENSASNKSNPSVSAFKECLASNKSKANTKSVQDSLANNKSKSSARSVQESLASNKSNPRESAIKECLASNKSGPCASTVQESLASTKSKSNASSVQDSLASKSDPSVSAITECLTNNKSKFSASPIQERLSNTKSKSNTSSVRESSSSNRPKPSVSAIKECLASYKSNANTKSVQDTLANTKSKSNGSCFQDSVTSSKSKSSASPIQERLSSTKSESNASSVEDSLASNRSDLSVSAIKECLANNKSKSSTRPVQDNLASNKFDPCASSVQESLASNKSDLSVRAIKECLTSYKLKSSNRSVQDSSASIKSNPSVSAIKECLASNKSDPCASTVQDSLASNKSDPSVSAIKECLTSYKSKSSTRFVQDSLASIKSKSNASSVQESLASNKSNPSVSAIKECLVSNKSDSCAYSVQDSLASNKFDPSVSAIKSKSSASSAQVSSASNISNRSVSVIKESLVSHKSKSSANTVQESLASTKSKSNASSVEASIASNRSDPSVSTIKECLANNKSMSNASSVQESLASNRSDPSISAIKECLASNKSDPCTSSVQESLVSNKSDPSVSAIKECLANNKSKSSARFVQKSLTSNKSDPSISAKKEYLTSYKSKSSTRSVQDSLASIKSKPDASSVEERLASTKPKSNDSYVAESLASNTSNPSVSTIKECLANNKSKSSASSIQESLTSNQFYPSVSAIKEYLATNKSGPCASTVQESLASNKSNPSVSAIKECLASNKSNPCARSVEESLASNKSDPSVSAIKSKSSASSVQDSSASNISNRSVSIIKENMVSYKSSANTVQESLASTKSKSNASSVEESLATNRSNPSVSAIKECLASNKFDPSATSVQESLASNKSDPSVSAIKEWLTNYKSKSSTRSVQDSLESIKSESYTSSVQESLASNRSDPNVSAIKECLASNKSDPCASSVQDSLVSNRSDLSLSAIKECLASYKSKLSIRSLQDSLASTESKSNASSVQENLASNRSVPSLSAIKECLASYKLKSSTRPVQDSLESTKSESYISSVQESLASNRSDPSVSAIKECLASNKSDPCASSVQNSLASNRSDLSLSAIKECLASYKSKLSTRSVQDSLANTELKSNASSVQENLASNRSGPSLSAIKECLAGYKSKPNTRSVQDSLASTESKSNTSSVQENLASNRSGPSLSAIKECLASYKSKLSTRSIQDSLASTESKSNASSVQESLASNKSDPSVSAIKECLASYKLKSSTRSVQDSLASTESQSNASSVQENLASNRSGPSLSAIKECSTSYKSKLSTRSVQDSLASTESKSNASSVQESLASNRSDPSVSAIKECLASNKSKSSTRSVQESLASIKSKSNTNSVQESSASNKSDLSVCAIKECLTSYKSKSSTRSVQDSLASIKSKSNTNSVQESSANYKSDPGASSVQESLASNRSDPSISAIKECLASNKSDPCTSSVQESSASNRSDPSVSAIKKCLASNKSDPGASSVQESLASNKSDPSVSAIKSKSSASSTQDSSASNISNRSVSVIKESLVSHKSKSSASTVQESLASNKSNRSVSTTSKECSASKTNSSVSSVLNSLPSNKSDPRVSSAKRSLSNSKSDPSASSAQKSLSSSQFDPSVSSAERSLSNSKSDPSVNSVKRSLSCFQSDHSVSSAQRSLSKSKSDPSVSSAKRSLFNPKSDPKVSSAQNSLSNSQSDPSLSSAKTSLLKPKSDPSVSSAKRSLSNSKSDPNVNSAQNSLSSSQSDPSVNSAKISLSKPKSDPSVSSAKRSLSNSKSDPSVSSAKRSLSNSKSDPSASSAKRSLLNSKSDPNVSSAQNSLSNSQSDPSLSSAKTSLSKPKSDPSVSSAKRSLSNSKSDPNVNSAQNSLSSSQSDPSVNSAKTSLSKPKSDPSVSSAKRSLSNSKSDPSVSSAKINLSKPKSDPSVSSAKRSLSNSKSDPSVSSAKKRSSSSQSDPSVSSAKRSSSNSKSDPSASSAKISLSNSQSDSSVNSAKISLACNKSTLSADVAQGSLINKSNPFVRCIRESLSNSKSDPSVSSAKRSLSNSKSDPNVNSAQNSLSSSQSDPSVNSAKTSLSKPKSDPSVSSAKRSLSNSKSDPSVSSAKINLSKPKSDPSVSSAKRSLSNSKSDPSVSSAKKRSSSSQSDPSVSSAKRSSSNSKSDPSASSAKISLSNSQSDSSVNSAKISLACNKSTLSADVAQGSLINKSNPFVRCIRESLSNSKSDPSVISTKKSGSCSQSGLNTSSAKINLLSSQSDPSASSDKISLACDKSTLNVSIAQGSLKSNKLRPIVRCIRESLANNKLNLSVNSVEKSLAKNKLNPSVYSLKRSLTSNKSSASSIKNKADSDSIIPVKDLGSNQSDTNVVSDKPIEIISSNSKTDSESGEPKEKKLKKQKTPVKESTSVDPLSQDELIEAWEDYNWLCTYCETVFPNIDELHSHSMEVHTTCNPYRCKHCKLRKQCLTPFLEHIESHMKKLKLTCFKCQEKFENVLKAKKHIKVHFKKLSCPGCFTTFKNKKELKTHQDVYLKLKYMKKVKNPLVSDDKLTCTVCSKTYINESSLRKHLLLHTGRKRDYVCEICGKEYFEKNHLTYHMGTHGDDRPFKCKVCNRGFQRLNVLKEHVWSHNEKLFSCDKCDKSFSLEKNLKLHMVVHSNKLPFKCSECEKCFRIRGTLKRHLRIHTGEKPFLCKLCRLTFRFRSNLKDHMLSQHDMNISKKKRKTVDTEETQEWKKEMLQRKHVKRVAGKVIEPT
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -