Oiva019664.1
Basic Information
- Insect
- Oeneis ivallda
- Gene Symbol
- -
- Assembly
- GCA_029955525.1
- Location
- JARPMR010000013.1:8477034-8504226[-]
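The Location value appears to follow a `contig:start-end[strand]` layout. The sketch below splits it into fields and computes the genomic span. The helper name `parse_location` is hypothetical, and the 1-based, inclusive coordinate convention is an assumption, not something this record states.

```python
import re

# Assumed layout, inferred from this record: contig:start-end[strand].
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a Location string into its parts (hypothetical helper)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "contig": match["contig"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumes 1-based, inclusive coordinates.
        "span": end - start + 1,
    }


loc = parse_location("JARPMR010000013.1:8477034-8504226[-]")
print(loc)
```

The span covers the whole locus on the contig, so it can be longer than the coding sequence if the gene contains introns.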
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (a range in brackets means a variable number). The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
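| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 24 | 0.00014 | 0.014 | 17.3 | 1.5 | 1 | 23 | 388 | 410 | 388 | 410 | 0.96 |
| 2 | 24 | 0.011 | 1.1 | 11.3 | 0.6 | 1 | 20 | 416 | 435 | 416 | 438 | 0.94 |
| 3 | 24 | 3.5e-05 | 0.0035 | 19.2 | 5.4 | 1 | 23 | 457 | 479 | 457 | 479 | 0.98 |
| 4 | 24 | 2.5e-05 | 0.0026 | 19.6 | 5.6 | 1 | 23 | 485 | 507 | 485 | 507 | 0.99 |
| 5 | 24 | 5.5e-05 | 0.0056 | 18.5 | 5.3 | 1 | 23 | 513 | 535 | 513 | 535 | 0.97 |
| 6 | 24 | 5.5e-05 | 0.0056 | 18.5 | 5.3 | 1 | 23 | 547 | 569 | 547 | 569 | 0.97 |
| 7 | 24 | 0.00075 | 0.075 | 15.0 | 7.5 | 1 | 23 | 581 | 603 | 581 | 603 | 0.95 |
| 8 | 24 | 1.8e-05 | 0.0018 | 20.1 | 7.3 | 1 | 23 | 609 | 631 | 609 | 631 | 0.97 |
| 9 | 24 | 1.1e-06 | 0.00011 | 23.8 | 2.9 | 1 | 23 | 637 | 659 | 637 | 659 | 0.97 |
| 10 | 24 | 0.00016 | 0.016 | 17.1 | 6.7 | 1 | 23 | 665 | 687 | 665 | 687 | 0.97 |
| 11 | 24 | 0.00016 | 0.016 | 17.1 | 6.5 | 1 | 23 | 693 | 715 | 693 | 715 | 0.97 |
| 12 | 24 | 7.6e-07 | 7.7e-05 | 24.4 | 1.9 | 1 | 23 | 721 | 743 | 721 | 743 | 0.98 |
| 13 | 24 | 1.5e-05 | 0.0015 | 20.3 | 7.7 | 1 | 23 | 749 | 771 | 749 | 771 | 0.98 |
| 14 | 24 | 5.4e-06 | 0.00055 | 21.7 | 7.6 | 1 | 23 | 777 | 799 | 777 | 799 | 0.98 |
| 15 | 24 | 2.6e-06 | 0.00026 | 22.7 | 5.3 | 1 | 23 | 805 | 827 | 805 | 827 | 0.98 |
| 16 | 24 | 0.043 | 4.3 | 9.4 | 9.2 | 1 | 18 | 833 | 850 | 833 | 858 | 0.80 |
| 17 | 24 | 1.2e-05 | 0.0012 | 20.6 | 4.0 | 1 | 22 | 864 | 885 | 864 | 888 | 0.91 |
| 18 | 24 | 1.4e-05 | 0.0014 | 20.4 | 6.5 | 1 | 23 | 894 | 916 | 894 | 916 | 0.97 |
| 19 | 24 | 0.00018 | 0.018 | 16.9 | 7.3 | 1 | 23 | 922 | 944 | 922 | 944 | 0.97 |
| 20 | 24 | 7.2e-06 | 0.00073 | 21.3 | 5.9 | 1 | 23 | 950 | 972 | 950 | 972 | 0.98 |
| 21 | 24 | 8.9e-05 | 0.0089 | 17.9 | 7.8 | 1 | 23 | 978 | 1000 | 978 | 1000 | 0.97 |
| 22 | 24 | 5.8e-06 | 0.00058 | 21.6 | 4.0 | 1 | 23 | 1006 | 1028 | 1006 | 1028 | 0.96 |
| 23 | 24 | 0.0011 | 0.12 | 14.4 | 0.3 | 1 | 23 | 1035 | 1057 | 1035 | 1057 | 0.96 |
| 24 | 24 | 0.023 | 2.3 | 10.3 | 5.3 | 1 | 23 | 1065 | 1087 | 1065 | 1087 | 0.99 |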
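The Hmmscan Out values come in groups of 13 per domain hit, so they can be read back into records by splitting on whitespace and chunking. The sketch below does this for the first three rows of this record and then filters by independent E-value. The `DomainHit` type, the `parse_domain_rows` helper, and the 0.01 cut-off are illustrative assumptions, not part of the database or of HMMER.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row; field names mirror the column headers."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_rows(text):
    """Chunk whitespace-separated values into 13-field DomainHit records."""
    tokens = text.split()
    width = len(DomainHit._fields)
    if len(tokens) % width != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), width):
        row = tokens[i:i + width]
        hits.append(DomainHit(
            int(row[0]), int(row[1]),
            float(row[2]), float(row[3]), float(row[4]), float(row[5]),
            *(int(v) for v in row[6:12]),
            float(row[12]),
        ))
    return hits


# First three rows copied from this record (header line omitted).
ROWS = """
1 24 0.00014 0.014 17.3 1.5 1 23 388 410 388 410 0.96
2 24 0.011 1.1 11.3 0.6 1 20 416 435 416 438 0.94
3 24 3.5e-05 0.0035 19.2 5.4 1 23 457 479 457 479 0.98
"""

hits = parse_domain_rows(ROWS)
# Illustrative threshold only; choose one that suits the analysis.
confident = [h for h in hits if h.i_evalue < 0.01]
for h in confident:
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

Because the parser only relies on whitespace, it should accept the values whether they arrive on one line or one row per line.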
Sequence Information
- Coding Sequence
- ATGAATTCGGATCACCACAGTATCAATACGGGTGGTGGCCAACCTCCAAGCAATTCGGAGTCTCAGAATGCGAGGGTGCAATCAGCGCAGCAACATCAGCAGCAGCAAGCTAACTTAACGCCTACAACTTCGGCTACCGACTTGCGAGTGAACTCTGCAGCTGTGAACGTTGCTTTGTCGAGCGTGGCTAAGTACTGGGTGTTTACAAACCTTTTCCCGGGCCCATTGCCGCAGGTGTCCGTCTACGGTTTACCCAGTACGAGGATTGAAAATGGGAAACCAGTTCAGGACCTTGGTCAGGCGCATGCCAGTATATTGAACGGGGATCCCAATATCATACTGGGGCATGCTGGGCAACCTCAAGTCACAGTGTCGGCAGCTGGACAGCAGATTCCTGTGTCACAGATCATTGCAACACAGGCTGCACAAGGACATGATCTGGTGGGACACAGCCAGCAGCAAGAGGCGTCGGCCAGTGCTGCCCAGCTGGCAGTGGCGAGCCAGGCTTCCCAGCAGGTACCCAATAATCGGGTCGACTTTGTACAACACCATAACATTGATATGATCCAACTGACCGTGAGCGAGGATGGCATAGTGACAGTGGTGGAGCCGGGTGGCGGGAAACTGGTGGACAAGGAAGAGCTGCACGAGGCAATCAAGATGCCGTCAGATCATACGCTGACTGTGCACCAGTTGCAGCAGATTGTCGGACATCATCAGGTCATAGACAGTGTAGTGCGCATCGAGCAAGCCACAGGCGAGCCGGCTAACATCCTGGTAACACAGAACCCAGACGGCACCACCTCCATAGAGACCAGCGCCGTCGACCCGCTGCTCGTCAAAGACGAGAAGAACGTCGCCAAAATAGAGTCCGCGCAGTTCGCTATACCCGCCGAAATTAAGGACATCAAGGGAATAGACTTAAAGAGTGCAATGGGTATGGAAGGGGCAGTGGTCAAGATATCAACAGGCAGTGAGCATGACCTTCACACCATGTACAAAGTTAATGTGGAGGACCTGTCACAACTGCTGGCGTATCATGAAGTTTTTGGGAAACTAAGCACTGAAGGACAGCAGCAAGCCAAGGTAATAAACGACGTGGACGTCGAAGTGGCAGGCACAAGCGCGGCGATGTCGGAAGCTGAGACCTCCCCCGGACACCACGCCTGCGATATCTGTGGGAAGATATTCCAGTTCCGCTACCAGCTTATTGTTCACAGACGGTACCACGGCGAGAGTAAGCCATACACATGTCAAGTTTGTGGATCAGCTTTTGCCAATGCAGTTGAATTGTCGAAGCATGGAAAATGTCACCTTGCGGGTGACCCAGCCGAGCGTCAAGCCAAGAGACTAGCTCAAGACAAACCGTACGCGTGTTCCACGTGCCACAAGACGTTCTCGCGCAAGGAGCACCTCGACAACCACGTCCGCAGTCACACAGGAGAGACGCCTTATAGGTGTCAGTACTGCTCGAAGACGTTCACTCGCAAGGAGCACATGGTGAACCACGTGCGCAAACACACGGGCGAGACGCCGCACCGCTGCGAGATCTGCAAGAAGAGCTTCACGCGCAAGGAGCACTTCATGAACCACGTCATGTGGCACACGGAGTGTAACAAACACACGGGCGAGACGCCGCACCGCTGCGAGATCTGCAAGAAGAGCTTCACGCGCAAGGAGCACTTCATGAACCACGTCATGTGGCACACGGAGTGTAACAAACACACGGGCGAGACGCCGCACTGCTGCGAGATCTGCAAGAAGAGCTTCACGCGCAAGGAGCACTTCATGAACCACGTCATGTGGCACACGGGTGAAACGCCGCACCATTGTCAAATATGCGGGAAGAAGTATACTAGGAAGGAGCATTTAGTGAACCATATGCGATCCCATACAAACGATACACCGTTCAGATGCGAACTGTGCGGCAAGTCCTTTACGAGGAAGGAACACTTCACCAATCACATATTGTGGCATACGGGTGAAACCCCACACCGC
TGCGACTTCTGTTCAAAGACATTCACTCGGAAAGAACATTTACTCAACCATGTGCGTCAACATACCGGCGAATCGCCGCATCGGTGCAACTTCTGCTCCAAATCGTTCACGCGCCGCGAACACCTCGTGAACCATGTACGACAACACACCGGGGAGACGCCTTTCCAGTGTGGATACTGCCCTAAAGCTTTCACCAGAAAGGACCATCTCGTAAACCACGTCCGCCAACACACGGGGGAGTCCCCACATAAGTGTTCATATTGTACGAAGTCATTCACGCGCAAGGAGCATCTTACCAATCACGTGCGCCAACACACCGGGGAATCCCCGCATCGATGTACCTACTGCTCCAAATCGTTCACAAGAAAGGAACATCTTACTAATCATATAAGACAGCATACGGGGGAGACTCCCCACAAGTGCACGTATTGCCCGCGAGCCTTCTCCCGCAAGGAGCATCTGAACCAACACATCCGGCAACACACCGGGGACACACCGCACACCTGCACGTACTGCAACAAGAGCTTCACCAGGAAGGAGCACCTTGAGCATCTGAACCAACACATCCGGCAGCACACCGGGGACACGCCGCACACCTGCACGTACTGCAACAAGAGCTTCACCAGGAAGGAGCACCTTGTGACCCATGTTCGGTATGTAGCACACACCGGGGACACGCCGCACACCTGCACGTACTGCAACAAGAGCTTCACCAGGAAGGAGCACCTTGTGACCCATGTTCGGCAACACACTGGGGAGACGCCATTCAAATGCACGTTCTGTTCAAAGTCCTTCTCAAGAAAGGAGCACCTAACAAACCACGTCCACCTCCACACGGGGGAAACTCCGCACAAATGTCCCTTCTGCACCAAGACATTCTCCAGAAGAGAACATTTGACTAACCATGTCCGGATACACACTGGCGAGTCGCCGCATCGGTGCGAGTTCTGTCAGAAGACGTTCACCCGCAAAGAGCACCTCACCAACCACTTGAAGCAGCACACGGGCAACACGCAGCATGCCTGTAAAGTGTGCTCCAAACCTTTCACTAGGAAGGAGCATCTTGTTCAACATATGAGATCCCACAGTTGCGGCGACCGACCGTTCAGTTGCGGCGAATGCGGGAAATCGTTCCCCCTGAAAGGCAACCTGCTATTTCACGAGCGGTCGCACAAAAGCGGCAACCCTAAAACATTCCGCTGCGAAATCTGCTCAAAGGAGTTCATGTGCAAGGGTCACTTGGTGTCGCATCGGCGTACCCATACAGAAGCAGGCGAATCTGCGACACCAGCAGAGCAGGGGGACGAGAATGAGAATTGCGGCGATTACGCCAAGTGCGAGAAGGATAACACCGAAATACCAGAGAGGAAACACGATATCAGGAGATTCAATTGCAGGCATAATAACATCCTCAGGTTCAGGCAATTACGTTACAAAGGCTAG
- Protein Sequence
- MNSDHHSINTGGGQPPSNSESQNARVQSAQQHQQQQANLTPTTSATDLRVNSAAVNVALSSVAKYWVFTNLFPGPLPQVSVYGLPSTRIENGKPVQDLGQAHASILNGDPNIILGHAGQPQVTVSAAGQQIPVSQIIATQAAQGHDLVGHSQQQEASASAAQLAVASQASQQVPNNRVDFVQHHNIDMIQLTVSEDGIVTVVEPGGGKLVDKEELHEAIKMPSDHTLTVHQLQQIVGHHQVIDSVVRIEQATGEPANILVTQNPDGTTSIETSAVDPLLVKDEKNVAKIESAQFAIPAEIKDIKGIDLKSAMGMEGAVVKISTGSEHDLHTMYKVNVEDLSQLLAYHEVFGKLSTEGQQQAKVINDVDVEVAGTSAAMSEAETSPGHHACDICGKIFQFRYQLIVHRRYHGESKPYTCQVCGSAFANAVELSKHGKCHLAGDPAERQAKRLAQDKPYACSTCHKTFSRKEHLDNHVRSHTGETPYRCQYCSKTFTRKEHMVNHVRKHTGETPHRCEICKKSFTRKEHFMNHVMWHTECNKHTGETPHRCEICKKSFTRKEHFMNHVMWHTECNKHTGETPHCCEICKKSFTRKEHFMNHVMWHTGETPHHCQICGKKYTRKEHLVNHMRSHTNDTPFRCELCGKSFTRKEHFTNHILWHTGETPHRCDFCSKTFTRKEHLLNHVRQHTGESPHRCNFCSKSFTRREHLVNHVRQHTGETPFQCGYCPKAFTRKDHLVNHVRQHTGESPHKCSYCTKSFTRKEHLTNHVRQHTGESPHRCTYCSKSFTRKEHLTNHIRQHTGETPHKCTYCPRAFSRKEHLNQHIRQHTGDTPHTCTYCNKSFTRKEHLEHLNQHIRQHTGDTPHTCTYCNKSFTRKEHLVTHVRYVAHTGDTPHTCTYCNKSFTRKEHLVTHVRQHTGETPFKCTFCSKSFSRKEHLTNHVHLHTGETPHKCPFCTKTFSRREHLTNHVRIHTGESPHRCEFCQKTFTRKEHLTNHLKQHTGNTQHACKVCSKPFTRKEHLVQHMRSHSCGDRPFSCGECGKSFPLKGNLLFHERSHKSGNPKTFRCEICSKEFMCKGHLVSHRRTHTEAGESATPAEQGDENENCGDYAKCEKDNTEIPERKHDIRRFNCRHNNILRFRQLRYKG
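The C2H2 pattern quoted in the Description field can be approximated with a regular expression and run against the protein sequence. The sketch below is a rough illustration: it treats the fold-stabilising `#` positions as any residue (an assumption that loosens the pattern), so the 12 residues between the second Cys and the first His collapse into a single fixed-length gap. It is not a substitute for the profile HMM search reported under Hmmscan Out, and the names `C2H2_RE` and `find_c2h2` are hypothetical.

```python
import re

# C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], with '#' relaxed to any residue:
# X3 + # + X5 + # + X2 = 12 residues between the second Cys and the first His.
C2H2_RE = re.compile(r"C.{1,5}C.{12}H.{3,6}[HC]")


def find_c2h2(protein):
    """Return (1-based start, matched text) for non-overlapping matches."""
    return [(m.start() + 1, m.group()) for m in C2H2_RE.finditer(protein)]


# Short fragment copied from the Protein Sequence field of this record.
fragment = "ACDICGKIFQFRYQLIVHRRYHGESKPYTCQVCGSAFANAVELSKHGKCHLAGD"

for start, text in find_c2h2(fragment):
    print(start, text)
```

Positions are reported relative to the string passed in, so scanning the full protein sequence instead of the fragment gives coordinates that can be compared with the alignment coordinates in the hmmscan table.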
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00802108
- 90% Identity
- -
- 80% Identity
- -