Enis020374.1
Basic Information
- Insect
- Epinotia nisella
- Gene Symbol
- -
- Assembly
- GCA_932294385.1
- Location
- CAKOAM010000119.1:439458-465762[+]
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 49 0.0075 0.51 11.4 2.0 1 23 7 30 7 30 0.96 2 49 0.44 30 5.8 0.6 1 23 35 58 35 58 0.95 3 49 0.11 7.4 7.7 0.0 1 20 66 85 66 86 0.94 4 49 0.071 4.8 8.3 0.0 3 23 98 118 97 118 0.96 5 49 0.077 5.3 8.2 0.8 6 23 122 140 120 140 0.94 6 49 0.095 6.5 7.9 1.6 5 23 161 180 160 180 0.94 7 49 0.095 6.5 7.9 1.6 5 23 201 220 200 220 0.94 8 49 0.095 6.5 7.9 1.6 5 23 241 260 240 260 0.94 9 49 0.095 6.5 7.9 1.6 5 23 281 300 280 300 0.94 10 49 0.095 6.5 7.9 1.6 5 23 321 340 320 340 0.94 11 49 0.095 6.5 7.9 1.6 5 23 361 380 360 380 0.94 12 49 0.095 6.5 7.9 1.6 5 23 401 420 400 420 0.94 13 49 0.095 6.5 7.9 1.6 5 23 441 460 440 460 0.94 14 49 0.095 6.5 7.9 1.6 5 23 481 500 480 500 0.94 15 49 0.095 6.5 7.9 1.6 5 23 521 540 520 540 0.94 16 49 0.095 6.5 7.9 1.6 5 23 561 580 560 580 0.94 17 49 0.095 6.5 7.9 1.6 5 23 601 620 600 620 0.94 18 49 0.095 6.5 7.9 1.6 5 23 641 660 640 660 0.94 19 49 0.095 6.5 7.9 1.6 5 23 681 700 680 700 0.94 20 49 0.095 6.5 7.9 1.6 5 23 721 740 720 740 0.94 21 49 0.095 6.5 7.9 1.6 5 23 761 780 760 780 0.94 22 49 0.095 6.5 7.9 1.6 5 23 801 820 800 820 0.94 23 49 0.095 6.5 7.9 1.6 5 23 841 860 840 860 0.94 24 49 0.095 6.5 7.9 1.6 5 23 881 900 880 900 0.94 25 49 0.095 6.5 7.9 1.6 5 23 921 940 920 940 0.94 26 49 0.095 6.5 7.9 1.6 5 23 961 980 960 980 0.94 27 49 0.095 6.5 7.9 1.6 5 23 1001 1020 1000 1020 0.94 28 49 0.095 6.5 7.9 1.6 5 23 1041 1060 1040 1060 0.94 29 49 0.095 6.5 7.9 1.6 5 23 1081 1100 1080 1100 0.94 30 49 0.095 6.5 7.9 1.6 5 23 1121 1140 1120 1140 0.94 31 49 0.095 6.5 7.9 1.6 5 23 1161 1180 1160 1180 0.94 32 49 0.095 6.5 7.9 1.6 5 23 1201 1220 1200 1220 0.94 33 49 0.095 6.5 7.9 1.6 5 23 1241 1260 1240 1260 0.94 34 49 0.095 6.5 7.9 1.6 5 23 1281 1300 1280 1300 0.94 35 49 0.095 6.5 7.9 1.6 5 23 1321 1340 1320 1340 0.94 36 49 0.095 6.5 7.9 1.6 5 23 1361 1380 1360 1380 0.94 37 49 0.095 6.5 7.9 1.6 5 23 1401 1420 1400 1420 0.94 38 49 0.095 6.5 7.9 1.6 5 23 1441 1460 1440 1460 0.94 39 49 0.095 6.5 7.9 1.6 5 23 1481 1500 1480 1500 0.94 40 49 0.095 6.5 7.9 1.6 5 23 1521 1540 1520 1540 0.94 41 49 0.095 6.5 7.9 1.6 5 23 1561 1580 1560 1580 0.94 42 49 0.049 3.4 8.8 0.0 2 20 1687 1705 1686 1707 0.94 43 49 0.0038 0.26 12.3 1.0 1 23 1718 1740 1718 1740 0.96 44 49 0.0064 0.44 11.6 0.2 1 21 1746 1766 1746 1767 0.94 45 49 0.0015 0.1 13.6 0.3 1 21 1781 1801 1781 1802 0.94 46 49 0.0064 0.44 11.6 0.2 1 21 1853 1873 1853 1874 0.94 47 49 0.0015 0.1 13.6 0.3 1 21 1925 1945 1925 1946 0.94 48 49 0.0064 0.44 11.6 0.2 1 21 1997 2017 1997 2018 0.94 49 49 0.0057 0.39 11.7 0.1 2 23 2056 2078 2055 2078 0.95
Sequence Information
- Coding Sequence
- ATGCACACAGGAAAGGTGTTCAAATGCAAGCAGTGTGACGCTACCTTCACGAAAAGCACTTCGTACTACGGCCACGTGCGGCTGAACCACCCGTCCAAGGACCACGTGTGCGGGATCTGTGGGGACTCCTTCATCGGAGCCCTGGGGCTGCGGCACCATCACGCCAAGGCGCACAAGGAGTTGTCGAAAGCCAAGTTCGAATGCAAGTCGTGCGGTGTGGCGTTCGTTAGTTCGGAAGCTATGAGCGCACATACGGACGGGCGTACGCAATGCGACGCCAATTTGAAAGCTTGTGCGGAGTGCGGTGCGGTGTTCGGTACGGAGGAGGCGCTGAGCTCGCACGCGGACACACACGTGTTCTGGTGCAAGGAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGTGTTCCCAGTCGTTCGTGAGCCAGAGCTCCCTCGACACGCACCACCAGCGGATCCACCTCCGCATCAAGTGGCCCGCGTACGGGCGCACGCGCGAGATGAACAAGGAGTACTACAAGAAGGTACCCGCTGTGCTTACAGCAGCCATGTCCAGATTGTCCAGATCCGTGGGGACTAGAGGGCTCAAGCCTCCTCCCAAATGCTGTATTGTAATATCAAAACGCAGCCTTGTCAAGGAAGGGAACTGTCACTTTGACAGCGCTCTCTTATATCCTACCGACTGCGCCATGTGCGCAAAAGCTGTGTGCGGCAGGATACCGGGCAAAGCACCTGGGAACACCGGGATAAAAGAGAGGACGCGGCTGCGAGCGCAGGTGGTGTGTGAAATATGTGGAGCGAAGCTCGTGTCTAACGCGGCGCTAGAGAGGCACCAACTGGGTCAGTGCAAGCGGGAAAAAAACACCAAATCGTACAAGTGTCCGTTCTGCGAAAAAGCTTGCGCCACTAGACAGGGCTTGCAGCTTCACGTGACATTACACACGGGAGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCTCGAGTTCCGGCAGGCGGGCGCGTTGCGGCGCCACACGCAGGCGGACCCTGGGGCGATGACGTTACACACGGGAGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCGCGAGTTCCGGCAGGCGGGCGCGCTGCGGCGCCACACGCAGGCGGTGAGTGCGCGGGAGTGGCGAGCTGCTCCCGGGAGAGGACCCTGGGGCGATGACGTTACACACGGGAGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCGCGAGATCCGGCAGGCGGGCGCGCTGCGGCGCCACACGCAGGCGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCTCGAGTTCCGGCAGGCGGGCGCGCTGCGGCGCCACACGCAGGCGGTGAGTGCGCGGGAGTGGCGAGCTGCTCCCGGGAGAGGACCCTGGGGCGATGACGTTACACACGGGAGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCGCGAGATCCGGCAGGCGGGCGCGCTGCGGCGCCACACGCAGGCGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCGCGAGTTCCGGCAAGCGGGCGCGCTGCGGCGCCACACGCAGGCGGTGAGTGCGCGGGAGTGGCGAGCTGCTCCCGGGAGAGGACCCTGGGGCGATGACGTTACACACGGGAGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCGCGAGTTCCGGCAGGCGGGCGCGCTGCGGCGCCACACGCAGGCGAGAAGCCGCACAAGTGCGCGCAGTGCTCGCTCGAGTTCCGGCAGGCGGGCGCGCTGCGGCGCCACACGCAGGCGGTGAGTGCGCGGGAGTGGCGAGCTGCTCCCGGGAGAGGACCCTGGGGCGATGACGTTACGGAGCCGATTAGTGTGCAGTATGGAGTGCACCTGGGCATCGACACGCGCGTGCAGTGCGCGACGTGCGGCAAGTACTTCTCCACGCCGAGCACGCGCGCGCTGCACGTCAACAGCGTGCACCTGCGGCTGCCCGCGCGCCGCCGGAAGGACCCTCAGGCGGACCCTACACGTCAATAA
- Protein Sequence
- MHTGKVFKCKQCDATFTKSTSYYGHVRLNHPSKDHVCGICGDSFIGALGLRHHHAKAHKELSKAKFECKSCGVAFVSSEAMSAHTDGRTQCDANLKACAECGAVFGTEEALSSHADTHVFWCKESFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKCSQSFVSQSSLDTHHQRIHLRIKWPAYGRTREMNKEYYKKVPAVLTAAMSRLSRSVGTRGLKPPPKCCIVISKRSLVKEGNCHFDSALLYPTDCAMCAKAVCGRIPGKAPGNTGIKERTRLRAQVVCEICGAKLVSNAALERHQLGQCKREKNTKSYKCPFCEKACATRQGLQLHVTLHTGEKPHKCAQCSLEFRQAGALRRHTQADPGAMTLHTGEKPHKCAQCSREFRQAGALRRHTQAVSAREWRAAPGRGPWGDDVTHGREAAQVRAVLARDPAGGRAAAPHAGEKPHKCAQCSLEFRQAGALRRHTQAVSAREWRAAPGRGPWGDDVTHGREAAQVRAVLARDPAGGRAAAPHAGEKPHKCAQCSREFRQAGALRRHTQAVSAREWRAAPGRGPWGDDVTHGREAAQVRAVLARVPAGGRAAAPHAGEKPHKCAQCSLEFRQAGALRRHTQAVSAREWRAAPGRGPWGDDVTEPISVQYGVHLGIDTRVQCATCGKYFSTPSTRALHVNSVHLRLPARRRKDPQADPTRQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -