Ccol001168.1
Basic Information
- Insect
- Columbicola columbae
- Gene Symbol
- -
- Assembly
- GCA_016920875.1
- Location
- JADMLL010000009.1:4002965-4009503[+]
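
A minimal sketch of splitting the Location field above into its parts. The `Location` type, the `parse_location` helper, and the regular expression are hypothetical names introduced for illustration. The span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Hypothetical pattern for strings shaped like "contig:start-end[strand]".
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    m = LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["contig"], int(m["start"]), int(m["end"]), m["strand"])


loc = parse_location("JADMLL010000009.1:4002965-4009503[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```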
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines co-ordinate a zinc ion. The zinc finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers give the number of residues (a range in brackets, such as X(1-5), means one to five residues). The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
 # of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to  acc
 1 49  0.00012   0.0096  16.3  4.4        2     23       19     41       18     41 0.97
 2 49   0.0011    0.085  13.3  1.6        2     23       49     71       48     71 0.96
 3 49     0.01     0.77  10.3  3.1        1     23       77     99       77     99 0.99
 4 49  8.6e-05   0.0066  16.8  1.0        2     23      106    127      105    127 0.97
 5 49  0.00022    0.017  15.5  2.7        2     23      132    154      131    154 0.96
 6 49  6.3e-05   0.0048  17.2  0.1        2     23      163    184      162    184 0.96
 7 49   0.0012    0.092  13.2  3.4        2     23      190    211      190    211 0.98
 8 49  3.4e-05   0.0026  18.0  0.3        1     23      217    240      217    240 0.97
 9 49  3.4e-06  0.00026  21.2  0.7        2     23      248    269      247    269 0.96
10 49  1.4e-05   0.0011  19.3  1.5        1     23      278    300      278    300 0.98
11 49     0.55       42   4.8  2.6        1     23      306    328      306    328 0.97
12 49  9.2e-05   0.0071  16.7  0.4        1     23      334    356      334    356 0.98
13 49  1.2e-06  8.9e-05  22.6  0.5        1     23      362    384      362    384 0.98
14 49     0.49       37   4.9  0.1        5     23      448    466      445    466 0.92
15 49     0.93       71   4.1  0.2        2     23      470    492      469    492 0.92
16 49   0.0028     0.21  12.0  0.6        2     23      500    522      499    522 0.96
17 49  0.00086    0.066  13.6  0.4        1     23      557    580      557    580 0.95
18 49   0.0002    0.015  15.6  3.3        1     23      583    606      583    606 0.96
19 49  6.1e-06  0.00047  20.4  0.3        3     23      616    636      614    636 0.96
20 49  8.2e-05   0.0063  16.8  0.7        1     23      641    663      641    663 0.98
21 49    0.018      1.4   9.5  4.5        2     23      670    692      669    692 0.92
22 49   0.0039      0.3  11.5  3.2        2     20      698    716      697    719 0.94
23 49  0.00012   0.0089  16.3  1.6        1     23      728    750      728    750 0.97
24 49  1.6e-06  0.00012  22.2  0.7        1     23      757    779      757    779 0.99
25 49  1.9e-07  1.5e-05  25.1  0.3        1     23      785    807      785    807 0.98
26 49    0.028      2.2   8.8  0.2        1     17      813    829      813    830 0.93
27 49     0.24       18   5.9  1.0        2     23      918    938      917    938 0.92
28 49      0.1        8   7.1  0.1        6     23      949    966      949    966 0.98
29 49    0.073      5.6   7.5  5.6        1     23      969    992      969    992 0.96
30 49  0.00035    0.027  14.8  4.5        1     23     1002   1024     1002   1024 0.98
31 49    0.014      1.1   9.8  3.6        1     23     1030   1052     1030   1052 0.97
32 49   0.0019     0.15  12.5  1.3        1     23     1058   1081     1058   1081 0.96
33 49  2.5e-06  0.00019  21.6  0.4        1     23     1088   1110     1088   1110 0.98
34 49   0.0054     0.42  11.1  5.2        1     23     1120   1142     1120   1142 0.97
35 49     0.19       14   6.3  1.7        3     23     1150   1169     1148   1169 0.95
36 49    3e-06  0.00023  21.3  1.8        1     23     1175   1197     1175   1197 0.96
37 49  4.7e-05   0.0036  17.6  1.6        1     23     1206   1228     1206   1228 0.98
38 49  4.4e-06  0.00034  20.8  0.3        2     23     1237   1258     1236   1258 0.96
39 49   0.0025     0.19  12.2  0.9        1     23     1264   1287     1264   1287 0.95
40 49  0.00094    0.072  13.5  1.4        1     23     1342   1365     1342   1365 0.96
41 49   0.0012    0.089  13.2  3.5        1     23     1370   1392     1370   1392 0.94
42 49   0.0018     0.14  12.6  4.1        1     23     1400   1422     1400   1422 0.98
43 49      4.4  3.4e+02   1.9  4.0        1     23     1428   1451     1428   1451 0.93
44 49   0.0013      0.1  13.0  1.2        1     23     1456   1478     1456   1478 0.98
45 49  1.7e-06  0.00013  22.1  0.4        3     23     1490   1510     1488   1510 0.97
46 49  0.00026     0.02  15.2  0.5        1     23     1516   1538     1516   1538 0.98
47 49  0.00035    0.027  14.9  1.1        1     23     1544   1566     1544   1566 0.97
48 49  6.8e-06  0.00052  20.2  2.5        1     23     1572   1594     1572   1594 0.98
49 49  0.00017    0.013  15.8  0.3        3     23     1602   1622     1600   1622 0.98
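
The residue pattern quoted in the Description can be approximated with a regular expression. The sketch below is illustrative only and is not how hmmscan detects domains, which uses a profile HMM. It assumes that the positions marked # may hold any residue, and the names `C2H2_RE` and `find_fingers` are hypothetical. The test fragment is the first 48 residues of this entry's protein sequence.

```python
import re

# Regex rendering of  #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' positions are matched as any residue, like X.
C2H2_RE = re.compile(
    r"""
    .        # fold-stabilising position (marked '#' in the pattern)
    .        # X
    C        # first zinc-coordinating Cys
    .{1,5}   # X(1-5)
    C        # second zinc-coordinating Cys
    .{3}     # X3
    .        # fold-stabilising position
    .{5}     # X5
    .        # fold-stabilising position
    .{2}     # X2
    H        # first zinc-coordinating His
    .{3,6}   # X(3-6)
    [HC]     # final His or Cys
    """,
    re.VERBOSE,
)


def find_fingers(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, sequence) for non-overlapping matches, 1-based inclusive."""
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2_RE.finditer(protein)]


# First 48 residues of the protein sequence in this record.
fragment = "MGFETSKELNEHNDECHVECDLCEQIFKRRFHLKAHMKTCHSDINGKV"
hits = find_fingers(fragment)
print(hits)  # [(18, 41, 'VECDLCEQIFKRRFHLKAHMKTCH')]
```

On this fragment the single match spans residues 18 to 41, the same envelope coordinates hmmscan reports for domain 1. A plain regular expression will miss divergent fingers and can report chance matches, so it is a sanity check rather than a replacement for the profile search.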
Sequence Information
- Coding Sequence
- ATGGGATTCGAGACGTCGAAAGAATTGAACGAGCACAACGACGAGTGTCACGTCGAATGCGATCTCTGCGAACAGATATTCAAGAGGAGATTCCATCTCAAGGCCCACATGAAAACTTGTCACTCGGACATAAACGGAAAAGTCAGCTGCGATTATTGTCCCAAGACATTCTCGACTCACATGTCGAGATGGGTTCACACTAAAACGATTCACGTGAACAACTGGAAATACAAGTGCAATGATTGCGAGTACGGAACGAATCATGAGACTCTCTTTCAGAACCACGTGAAGAAACACGTGGAACCGTTAAACCTGAGATGCAGTCAGTGCGATAAAGAATTCAAGAGTCAGTATCGATATGAAGTTCACATCAAGATCCACGAAAAAGGTTTGACGTGTTCCGTTTGTAATAAGAAATTCGACAAACGGAAGAACATGCTGAGGCACATGAGCCGGAATCATGGACAGGAGAAGAAGAGAGACGTTCCATGCACGCTATGTAATAAAATGTTTTTCGACAAATGGGCACTTCGAGTCCACATGGAAGTCCACAACAACAAAAAGGAAAAGTGTCCTTTTTGCGACTACGCCACCTGTCATCCGAAAAACCTGAAAGCTCACGTCAAGAACCACATAAAAGAGTTCGACTTTGTCTGTGAACTTTGCGGACGAGGTTTCGTGTACAACTTCATGTTGAGAGATCACATGATGAAGGAACATGGAGACGGTTTACCGAGGATTCCGTGCGAGATATGCGGGAAGACCTTCGCCACTCAACATTACCTCAACATTCACAAACAATCTCACGAGCCCGGCTACGAGCAGAGGAATCACCAATGCGAGATCTGTGGAAAGAGATTCCTCACTCGTAGCATGCTTCTACGACACATACGCGGTCACAGCAAAGTCGTTCAATACGTTTGTAGATACTGCAACAAGTTCTTGTCCTGTTTGGCCACTCTCAAAGACCACGAAAAGATACACACCGGCGAGAAACCATTCATATGCGAAGTTTGCGGAAAGTCTTTCGGTGCTAAAAAGTACTTGGTCACGCACTCGAGGACTCACACCAATGAGAAACCTTACTCGTGTAAAGAGTGTGGCGCTGCTTTCTCTCAAAAGACGACACTCTCGGCTCATCAAAGATCTCATTTAAAGAGGGATTCTCCCAAGATGGACTCTTCTCAGCCTGAGCAGAAAATCCTCGCTTTGGCGTCGTCAGAATTGTTTGGAGACGAAGACGGAAATAAGATTTCCGAACCAAAGACGAAAGAGACGGAAGAAAGAGAAAGTCAGAGGCGACTAGTCGAACTAAGCTTCAAAGGAAAGAACTACTGTATAATATGCGAGGTTCAGTATTCGGAGCAATCCGAGTTCGAAGTCCACGCAAGAAGTCATAAAAGGGTTCCGTGTATGATTTGTGGAAAGATATGTATAACTGGTCTCAGGTTGTTGAGTCACGTGAGAAACGTTCACTCGAACAGACCGAAGAGGAAATCGTGCGATCTTTGTAGTAAATTGTTCGCGACGAACAACTCGCTTTGGGTTCATAAGAAAACAGTTCACAAGAAGAAATGGGATCTCACGTGTTCTGTTTGCGGCTACGGTACCAACTGTAAGGAAGTCTTGGAGGAACACTTACTGAAAGCCGGTGAAGATCATTCGACGTTCAAGTGCGAAGCCTGTGATAAGGAATACGCGAGCCAACAGAACCTAGACAAACACGTGCAGTTGTATCACGTGAATCACAATTGTAAATATTGTAATAAAGAGTACGGGAGTCGGAACAATTTGTTGAGACACATTCAGCGATCTCATGGAAAGCAAAAACCTCGGAACATAGCCTGCGAACAATGCGACAAAATGTTCTACACAAAATGGCAGCTCAAAGTTCACATGAACCTTCACATAAACGAGAAACATCAGTGCCCCTACTGCGACTACAGCACCGTCTACCCTCAAGCTCTGAAGAACCACGTTAAAATCCACACGAAACAG
TATAACTGCATATGCGATCTCTGCGGAAGAGGGTTCATGACCAAATGCACTCTAAGAAATCACATAGCCCACGAGCACTTGGGCATCGTCGTCAAGTGTTCCATCTGCGGGAAAACCTTCCACGCTCAAAAGTATTTGAATCTCCACATGGAGTGTCACCAGCCGGGCTACGAGATGAGAAATCATCAATGCGACATTTGCGGAAGGCGGTTTTTGAGAAGACCGATGCTCCGACGACACCTGAGAGCTCATAAGGGGCTCACCAAGACTTATCAATGTCAGTTCTGCGACAAGGTGGTCACGTCATCCTCGTCTCTCAAAGATCACATGAACATTCACACAGGGCTGAGACCGTACATCTGCGAGATTTGCGGATACGATTTCGCGAGAAAGAATTATTTGACGGCTCACATGAGGACTCACACGAAGGAGAAACCTTTTTCCTGCGAAGTGTGCGGATCTTCCTTTACCCAGAGAGGAACTCTAACGGTAAACGATTGGATTGATTCTAACAGTGTTGACACCCCAAGTTGGATAAATTTGGGTAAATTAGAAGAAACCGTTGAGGAACCGGTAATTGACGAAACCGTAGTGATAGAAAATCTCGAAATCGGAAATAATGAAAGTAATTCGAACGAAACGACGGAAGAATCTCAAAGAGAAGGTGATTCCAATATGGCGGCGGATACAGTCAAAAAGAAGTCGAAGACGGGGAAGAGGATAAACCGACCGGCTGAATATAACTATCCTCAGAAGTGCGAGTTCTGTGGTCAGGTATTTTTTGAGAAGGATATGGTTCGACATAAGAAAGCACATCCGGAGTCGTTCACCCTCGATTGCATTCCGTGTGGGAAATTCTTCAACAGCAAAGCCGCCTACGACGTTCACTACAACATCCACAAGATATACCATTGCGTGACGTGTAAGAAGGATTTCAACGCGAGGAAACACTATCGGAGTCACATGGAACGGATTCACAAGTCCGAGGGCGAGAGACCTCGCATCCATGAGTGTGATATATGTCGGAAGACGTTCAAAGACAAATTCGTTTTGAACACTCACAAGAAGACTCACTTCAACATAAAAGCTCACAAGTGTCCCATGTGTGACTACGCCTGTAATAATTTGACTTACATGAGACAGCACGTCATGAATCACACGAACCAACTTCAGTTTAAGTGTCACATATGCTATCGCGGTTTCATGACCGATCAGGACCTTCAGGTTCACATCGATAAACTCCATGCGAACGAGCCTCAGGTATATAGCTGCGATGTCTGCGGGAAGATATCGACGAGCAAAGGGAATCTGAGACAGCATCAGAAGATTCACAATCCCGAACATAATCAAAAGAAGGATCATCAGTGTTTCGAATGCGGGAAGACTTTCGCGTTAAAATGCCGGCTAAAAAAACACGCCCTCTCGCACAAAGGAGAGGACAAGGTTCACTGCACCGTTTGCGGGAAGTTGTTATCGAAAGACTCTTACAAGAAACACTTGAAAGCTCACAGCGGAGAAAAACTATTCCTCTGCGATTATTGTGGCGAATCTTTCGCGACGAACAGTTGCCTAATGGTTCACAGACGGACTCACACGGGAGAGAAACCGTATAGACCCTACTCTTGTGATCAGTGTGGAAAGAACTTTGCTCAGCATTCGGTACTTCTCGCTCATAAGAAAACCCATTCGGGTGCACCTGTGAGGCCTTTATACTGCGAGAACTGCGGGAAAATGTTCACGCAACCTTCGCTTCTCGCGTCGCACAAGAAAATTCACGTCACCGGTGATAATTATATTTGTGACCTTTGCGGGAAAGCTTTTCCCGAACATTGTCTTCTAATCGCTCATAGGGATTCCTATCACGTGCCTGGCAGACCCTTTAGTTTTGTACATTTCGCAAACCAACTCGGATTACAAGATGCCAAAGAGACTAAAATGTTTGGACAACTGGTGGTTGAACATGAACACAAAGAGGAAGTTGAAGTGAGTGAAATGAAAGA
TGGACAGGAAAGAGATGAAAGAAAACACGAATGCGACACTTGCGGGAAGAAATTCGGCGGAAAACAAAGTGTCAACCGTCACGTTAAGCGAATCCATAGAGTTCGATCGCACGCTTGCCAACACTGCGAAAAAAAATTTTCCTACGCTTATCAACTAAAGCTCCATTCCATTGTTCACGACCTAAGCAACGAAAAAGAGTTCACATGTTCCTATTGTGGCTATCGAACCAACTACAAACAGTACTTCAAAGATCACGTTCGCAGACACACTTCCGACTACAAGCATTTCTGTAAGCAGTGTAACTCGGGTTTCCACGAGAAAAGAGGTTTGGAATCGCATAAGATCAAAGCTCACGGGGAACCTTTGTTCGTGTGCGAAATTTGTAAAAGATTCTACTACAGTTCCGAACGTTTGAAAGTGCATCTCTCAAGTCATGAACCGGAAAAGTGTAAAAGAGAAAAAGTTGCTTGTGAAATTTGCGGGAAGGCATTTCACCGGAAATACGATTTAAAAGTTCATATAATGAGTCACACGGGAGAACGCCGTTTCGTTTGTTCCGTCTGTCAGAAGGCGGTGAGTTCGAGATCTCGACTGAATCTTCATCTGAAATCTCATTCCGGGGTAAAGGATTTCGTTTGTCATTTGTGCGGGAAACCGTTCGCTTCCGGTCAGTATTTGAAACTTCATTTGTTCAGTCACAAGGGAATCAAACCGTACAAATGTGACTCGTGCGATAAGTCTTTCACTCAAAGAACGACTCTGAGGATTCATAAAAGATGGCACACGGGTGAAAGACCCTTCGGCTGTGATGTTTGCTCGAGGAAATTCGTCACGAAAACGGCACTGAAGGCACATGTTAAGATACATTTTTGA
- Protein Sequence
- MGFETSKELNEHNDECHVECDLCEQIFKRRFHLKAHMKTCHSDINGKVSCDYCPKTFSTHMSRWVHTKTIHVNNWKYKCNDCEYGTNHETLFQNHVKKHVEPLNLRCSQCDKEFKSQYRYEVHIKIHEKGLTCSVCNKKFDKRKNMLRHMSRNHGQEKKRDVPCTLCNKMFFDKWALRVHMEVHNNKKEKCPFCDYATCHPKNLKAHVKNHIKEFDFVCELCGRGFVYNFMLRDHMMKEHGDGLPRIPCEICGKTFATQHYLNIHKQSHEPGYEQRNHQCEICGKRFLTRSMLLRHIRGHSKVVQYVCRYCNKFLSCLATLKDHEKIHTGEKPFICEVCGKSFGAKKYLVTHSRTHTNEKPYSCKECGAAFSQKTTLSAHQRSHLKRDSPKMDSSQPEQKILALASSELFGDEDGNKISEPKTKETEERESQRRLVELSFKGKNYCIICEVQYSEQSEFEVHARSHKRVPCMICGKICITGLRLLSHVRNVHSNRPKRKSCDLCSKLFATNNSLWVHKKTVHKKKWDLTCSVCGYGTNCKEVLEEHLLKAGEDHSTFKCEACDKEYASQQNLDKHVQLYHVNHNCKYCNKEYGSRNNLLRHIQRSHGKQKPRNIACEQCDKMFYTKWQLKVHMNLHINEKHQCPYCDYSTVYPQALKNHVKIHTKQYNCICDLCGRGFMTKCTLRNHIAHEHLGIVVKCSICGKTFHAQKYLNLHMECHQPGYEMRNHQCDICGRRFLRRPMLRRHLRAHKGLTKTYQCQFCDKVVTSSSSLKDHMNIHTGLRPYICEICGYDFARKNYLTAHMRTHTKEKPFSCEVCGSSFTQRGTLTVNDWIDSNSVDTPSWINLGKLEETVEEPVIDETVVIENLEIGNNESNSNETTEESQREGDSNMAADTVKKKSKTGKRINRPAEYNYPQKCEFCGQVFFEKDMVRHKKAHPESFTLDCIPCGKFFNSKAAYDVHYNIHKIYHCVTCKKDFNARKHYRSHMERIHKSEGERPRIHECDICRKTFKDKFVLNTHKKTHFNIKAHKCPMCDYACNNLTYMRQHVMNHTNQLQFKCHICYRGFMTDQDLQVHIDKLHANEPQVYSCDVCGKISTSKGNLRQHQKIHNPEHNQKKDHQCFECGKTFALKCRLKKHALSHKGEDKVHCTVCGKLLSKDSYKKHLKAHSGEKLFLCDYCGESFATNSCLMVHRRTHTGEKPYRPYSCDQCGKNFAQHSVLLAHKKTHSGAPVRPLYCENCGKMFTQPSLLASHKKIHVTGDNYICDLCGKAFPEHCLLIAHRDSYHVPGRPFSFVHFANQLGLQDAKETKMFGQLVVEHEHKEEVEVSEMKDGQERDERKHECDTCGKKFGGKQSVNRHVKRIHRVRSHACQHCEKKFSYAYQLKLHSIVHDLSNEKEFTCSYCGYRTNYKQYFKDHVRRHTSDYKHFCKQCNSGFHEKRGLESHKIKAHGEPLFVCEICKRFYYSSERLKVHLSSHEPEKCKREKVACEICGKAFHRKYDLKVHIMSHTGERRFVCSVCQKAVSSRSRLNLHLKSHSGVKDFVCHLCGKPFASGQYLKLHLFSHKGIKPYKCDSCDKSFTQRTTLRIHKRWHTGERPFGCDVCSRKFVTKTALKAHVKIHF
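
As a consistency check between the two sequences above, the coding sequence can be translated and compared with the protein sequence. This is a minimal sketch assuming the standard genetic code; the record does not state which translation table was used. The names `CODON_TABLE` and `translate` are introduced here for illustration.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed, not stated in the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 nucleotides of the coding sequence in this record.
head = translate("ATGGGATTCGAGACGTCGAAAGAATTGAAC")
tail = translate("ATACATTTTTGA")
print(head, tail)  # MGFETSKELN IHF*
```

The translated head and tail agree with the start (MGFETSKELN) and end (IHF) of the listed protein sequence, with the final codon TGA acting as the stop.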
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -