Vata020282.1
Basic Information
- Insect
- Vanessa atalanta
- Gene Symbol
- -
- Assembly
- GCA_905147765.1
- Location
- LR990578.1:4452690-4460270[+]
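The Location field packs contig, coordinates, and strand into one string. A minimal sketch of splitting it apart follows; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    contig: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a string shaped like 'CONTIG:START-END[STRAND]'."""
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end, strand = match.groups()
    return Location(contig, int(start), int(end), strand)


loc = parse_location("LR990578.1:4452690-4460270[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc, span)
```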
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Its two conserved cysteines and two conserved histidines coordinate a zinc ion. The finger is described by the pattern #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, numbers in parentheses give the number of residues, and the positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined as the asymmetric hexanucleotide core GGGCGG, but this definition does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   51  0.0027    0.21      12.5   0.8   1         23      14        37      14        37      0.90
2   51  0.35      27        5.8    0.2   2         23      109       132     108       132     0.94
3   51  0.0013    0.098     13.5   1.5   2         23      138       160     137       160     0.95
4   51  0.057     4.4       8.3    0.6   2         23      166       188     165       188     0.92
5   51  0.00081   0.063     14.1   1.4   1         23      193       216     193       216     0.95
6   51  9.1e-07   7e-05     23.4   5.7   1         23      220       243     220       243     0.97
7   51  0.0011    0.088     13.6   0.2   2         23      250       271     249       271     0.96
8   51  0.11      8.3       7.4    1.0   2         23      310       331     309       331     0.95
9   51  0.021     1.6       9.7    0.1   2         23      359       381     358       381     0.96
10  51  0.0073    0.56      11.1   0.2   2         23      402       424     401       424     0.95
11  51  0.00046   0.035     14.9   1.3   1         23      429       452     429       452     0.97
12  51  0.0053    0.41      11.5   2.2   1         23      456       479     456       479     0.96
13  51  0.016     1.3       10.0   2.3   2         23      482       504     482       504     0.96
14  51  0.0049    0.38      11.6   2.5   1         23      581       604     581       604     0.93
15  51  0.55      42        5.2    0.1   2         23      632       654     631       654     0.94
16  51  2.7e-05   0.0021    18.7   0.5   1         23      676       699     676       699     0.93
17  51  5.9e-05   0.0046    17.7   2.8   3         23      706       726     705       727     0.94
18  51  0.0012    0.095     13.5   0.9   1         23      733       756     733       756     0.95
19  51  1.5e-05   0.0012    19.5   0.4   1         23      761       784     761       784     0.96
20  51  0.16      13        6.8    7.5   1         23      788       811     788       811     0.97
21  51  0.093     7.2       7.6    0.9   2         23      818       839     817       839     0.96
22  51  0.00042   0.032     15.0   1.1   1         23      887       909     887       909     0.95
23  51  0.4       31        5.6    0.2   2         21      937       956     936       959     0.87
24  51  0.011     0.83      10.6   0.3   2         23      980       1002    979       1002    0.95
25  51  0.013     1         10.3   1.1   2         23      1008      1030    1007      1030    0.96
26  51  0.0028    0.21      12.4   3.0   1         23      1033      1056    1033      1056    0.96
27  51  2.6       2e+02     3.0    3.3   1         23      1058      1081    1058      1081    0.95
28  51  0.0016    0.12      13.2   2.0   1         23      1155      1178    1155      1178    0.96
29  51  1.5       1.2e+02   3.8    0.6   2         23      1206      1228    1205      1228    0.93
30  51  0.05      3.8       8.5    0.4   1         23      1250      1273    1250      1273    0.93
31  51  0.00014   0.011     16.5   1.5   3         23      1280      1300    1279      1301    0.94
32  51  0.00016   0.013     16.3   1.3   1         23      1306      1329    1306      1329    0.95
33  51  0.045     3.5       8.6    0.8   2         23      1335      1357    1334      1357    0.90
34  51  0.0034    0.26      12.1   3.8   1         23      1361      1384    1361      1384    0.97
35  51  0.015     1.1       10.1   0.5   3         23      1392      1412    1391      1412    0.97
36  51  0.018     1.4       9.9    2.7   1         23      1418      1440    1418      1440    0.97
37  51  0.0044    0.34      11.8   2.7   1         23      1475      1497    1475      1497    0.96
38  51  7.3       5.7e+02   1.6    0.0   3         23      1526      1547    1524      1547    0.90
39  51  0.0099    0.77      10.7   0.7   2         23      1568      1590    1567      1590    0.95
40  51  0.00021   0.016     15.9   1.7   1         23      1595      1618    1595      1618    0.96
41  51  0.0096    0.74      10.7   1.4   1         23      1623      1646    1623      1646    0.96
42  51  0.066     5.1       8.1    2.9   2         21      1649      1668    1649      1671    0.94
43  51  0.058     4.5       8.3    3.2   3         22      1680      1699    1679      1699    0.95
44  51  0.041     3.2       8.7    0.5   1         21      1752      1772    1752      1775    0.90
45  51  0.0015    0.11      13.3   0.1   2         23      1848      1870    1847      1870    0.96
46  51  0.0001    0.0078    16.9   0.2   3         23      1877      1898    1876      1898    0.94
47  51  0.024     1.9       9.5    2.7   2         23      1903      1925    1902      1925    0.94
48  51  0.0038    0.3       12.0   4.0   1         23      1930      1953    1930      1953    0.97
49  51  9.9e-06   0.00077   20.1   5.2   1         23      1957      1980    1957      1980    0.97
50  51  0.00064   0.05      14.4   0.3   2         23      1987      2008    1986      2008    0.96
51  51  0.00016   0.013     16.3   0.6   1         23      2014      2036    2014      2036    0.97
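Each hmmscan domain row in this record carries 13 whitespace-separated values, so the table can be read by grouping tokens in runs of 13 regardless of where line breaks fall. The sketch below does that for four rows copied from the table (domains 1, 6, 16, and 27); `DomainHit`, `parse_domain_rows`, and the i-Evalue cutoff of 0.01 are illustrative assumptions, not part of the record or of HMMER.

```python
from typing import List, NamedTuple

FIELDS_PER_ROW = 13  # number of columns in the domain table of this record


class DomainHit(NamedTuple):
    """Hypothetical container for one row of the domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_rows(text: str) -> List[DomainHit]:
    """Group whitespace-separated tokens into rows of 13 fields (header excluded)."""
    tokens = text.split()
    if len(tokens) % FIELDS_PER_ROW != 0:
        raise ValueError("token count is not a multiple of 13")
    hits = []
    for i in range(0, len(tokens), FIELDS_PER_ROW):
        t = tokens[i:i + FIELDS_PER_ROW]
        hits.append(DomainHit(
            int(t[0]), int(t[1]),
            float(t[2]), float(t[3]), float(t[4]), float(t[5]),
            *(int(x) for x in t[6:12]),
            float(t[12]),
        ))
    return hits


# Four rows copied from the table above (domains 1, 6, 16 and 27).
sample = """
1 51 0.0027 0.21 12.5 0.8 1 23 14 37 14 37 0.90
6 51 9.1e-07 7e-05 23.4 5.7 1 23 220 243 220 243 0.97
16 51 2.7e-05 0.0021 18.7 0.5 1 23 676 699 676 699 0.93
27 51 2.6 2e+02 3.0 3.3 1 23 1058 1081 1058 1081 0.95
"""

hits = parse_domain_rows(sample)
# Assumption: 0.01 is an arbitrary illustrative i-Evalue cutoff.
strong = [h for h in hits if h.i_evalue <= 0.01]
print([h.index for h in strong])
```

Many individual fingers in this record have weak per-domain i-Evalues, so a strict per-domain cutoff like this one would discard most of them; it is shown only to illustrate how the columns can be used.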
Sequence Information
- Coding Sequence
- ATGTCGACGGCAGTGCCATTTAAATTTTTCAAAAACAAATTCCGTTGTTTCTTTTGCTCTAAGGACTTTAGCGATTTCGATAATTTACGGGATCACGCGATAGCCGATCATCCTCATTTCAACCAGGAATCTAAAGCCATGAAAATGATTAAAGGTAGAGATATAAATGTTAAAATTGATATTTCTAATCTAACGTGTAAGCTTTGTTTTGAAAACAGCTTAGCTTTAAATGATTTGATTAATCATCTAATATCGAAACACGGCGTCAATTATGACAAGTCAATAGAATGCTTACAACCTTTTAGAGTCGTTAAAGATAATATGCCTTGTCCAATTTGTCCGAATATGGTCTTTAGGTATTTTAAAAAACTTCTCGAACATATGAATGAATTTCATTCCGACAATAATATAGTCTGCGCACATTGCGGTCTAACATTTAGGGGACATCCGAACTACAGGGCTCACATGTCTCGCTATCATAGGAGCAAGGCGTGTAAATGTCCCGATTGCAATATGGAATTTTGGAATTTGGAGAAATTAGCACGTCACAGAGCAAACGTACACGGCACGAAAAAATATAAATGTCCTCAATGCGTAGAGAAATTTATAACTCAGCATTTGATGCGTAAACATCAAATACTCGTACACGGTTTCGGGCACAAGTGTCCGTATTGTGGTAAAATGTTCACGCGTAACTCTCATATGAAGAATCACGTAAGGCGTCTCCATTTGAAGGAGAAAAATGTAGAATGCACAGTTTGCAAGGAAAAGTTTTTTGATAGAGCTCTTTTGAATGTGCATATGGTCAAACATGTAGGTGTAGACGGACCTGCTCAGTCACTTAGTCCTAATACATTAAGGCGGAGGAATCTTCTAATACTATTCAACAATACATCAATTATACCATTCAAATGCCGTGGCAAATGTCGATGTTTTTATTGCGGCGAAGAGTTGCCTATATATGACGATTTGAGGAAGCATACGAAAGCTCACGGTCCATGCTCAGAAAGGGATAGAGCTATTAAATTAGTAAAAACCGATGATGCGGAAGTGAAAATTGATGTGTCTGATATAACGTGCGAATTATGCAACGAATCATTTGGAAACTTAGATGAAATTATATCACATTTAACAACAGAACATATGTTTCCTTATGAAAGAGACGTTAAATTGATTATTATGTCATATCGACTTATTGATAAACAGTGTCTTATATGTGATGAAAGATTTAAACAAGTAAGAGAATTAGTAATACACGTTAACAACGAGCATCCCGTGCAGTGCTTCGATTGTGACGTCTGTCAACAGAAATTCTTTAGAAAGCAGTATTTAGATGCCCATATGCGAGTTAAACATTCAAATATTTATAAATGTTTAAAGTGTCCTCAGACATTTTCATCTCACTTGGCCCTTCAAGAGCACAAAATCAAATCACATGTCGCTGTCTGTAACATATGTTTTACTAAATTCTCTACACAAATGAAAAGATTGAAGCACATGAAATCCGAACACGCCAATGAACCCTTAAAATGTGGATTCTGTTTAAAATTTATGAGTACAAAATTGGGTTTTCTCCGGCACGCTGCTAAATGTACAGAAAAAAATGAAAATGTCAATGAAACATTTGTGGTAGACGACGACGATGAAGACAAGAAACCCGCAGTTATACAGATAAGGAAAAACATAGCTTGTATATTCAATATGTCAACGGTGATACCATTCAAGTACTTTATGAGTAGGTTTAGGTGTTTTTATTGTCCACACGATTTTACTGATTGTGAAGAATTAAGAGCTCATACCGTCATTGAACATCCGATTTGTGATGTTAATTTCAAATGTATGAGGCTGCGTAACAGACAAGAAGGTTGTGTCAAAGTTGACACGTCTGTGCTTTCTTGTAAAATGTGCTTTGAAAATCTACCAAATTTAGAATCTTTAATAGAACATCTAATATCTGAACATAAGGCTTTATATGATAAGTCTGTTGATAACAATATT
CAACCGTATAATTTAATTAAAGATAATTATCCCTGTCCAGTTTGTGGGGAATCCTATACACATTTTAGCACGTTGTTAAAACATATGGGACAACTTCACACTGACAATAAAAATATATGCATGCACTGTGGCAAATCATTCCGTAACTTGCCCAATTTGCGGGTGCATATATCAAATCATCACAAAACGACCGGTAGCTATAAATGTGTTCGATGCGAACAAGAATTTTCTACAAATAAATATTTACAAACTCACTTAGGTCGCGCACACGGTATTAAAGTTTACGAATGTCCAGAGTGTTCAGAAAAGTTTACATCAAATTACGCAATGCAACGCCATATGATTAACACGCACAGCTCTGGACACAAATGTCTACACTGTGGTAAATTATTTACGTCAAACTGTTTCATGATCGATCATATAAAACGCACACATTTAAAAGAGAAAAATGTTGAATGTCAAGTGTGTTACGAAAGATTTTTCGATACCCAACGCCTAAAGACGCATATGGTAAAACATAACGGCGAACGAAACTTCCATTGCGACGTTTGCGATGTTAACGGTCCTGTCAAGAGAATTAGTGCAAATAAATTAAGAAGGTTAAATCTTCAAATTTTATTCCACAATACATCCGTAATACCATTCAAGTGGCGCGGTAAATATTTATGTTTTTACTGCGGTGAGGATTTCAAATCTTATGAAAATTTAAAGGAGCACACAAAAGATCATGGTGTGTGCTCGGATAAAGATCGAGCTTTGCGATTGGTTAAATCGGCCGATGTCGAGGTCAAAATTGATGTTTCACAAATTTCTTGTAACATATGCCATGAAAATTTTATGCAATTAGAGGAAATCATTTCTCATTTAGTCGACACGCACAATTTGCCTTATGATAGGAGAGTCAATATGTCGATAGCGCCATATCGGTTGTCAGATTTAAGCTGCCTTTTGTGTGATGAAAAATTTAATTACCTAAAGAAATTAATAGTTCATGTAAATACTGATCATCCCAGTCAAGATTTGAATTGTGTTCAATGTCAGCAGAAGTTTAACAAGACACGGGATTTAGACGCGCATATTCGCACCAAACATAGAAATCATATGTGTTCCAAATGTTTTCTTAATTTTCGCACGCGTTCAGAACTTTTAAGCCATAAGAGGGTTGCGCATAGTTTTAAATGTAATGTCTGCATTAGAAGCTTTTCCTCGATCGGCAAATGTTTCAAACACATAAAAAATGAGCACATGGGAGCTGTAATGAAATGTGGATTTTGTGTGAATGCTTCGACTTCAAAGCAAGGTTTTCACAGACATGCCATTCAATGTACGGATAATTGTAAAAAATCTGTGGAAGCTGTGACACTGGGCGTTGATAAGAAACCATGCGTTACACAAATAAGAGATAACATTGCATGTATCTTTAACATGTCTACAGCCATACCCTTTAAATATTTTATGAGCAAATTTAGATGCTTCTATTGCCCGAAAGACTTCAATGAGTGCGATGACCTAAAACAGCATACTATAATTGAACATCCTCTTTGCGATACTAAGCTGAAATCTATGAAATTACGCCATAGACACGACGGTGTGATTAAAGTTGATACGTCATCCTTGTCTTGCAAAATTTGTTTTGAAAATATACAAGATTTAGAGTCTTTGATAAAACATTTACATAATGAGCATAAAATATATTTTGATAAATCATTGTCCATTAACTTACAATCGTATAAACTCATAAAGGATAACTTTCCTTGTCCCTTTTGCGGGGATGTCTTTAGATATTTTCGGACGTTGTTAAATCATGTAGTAAAAATGCATTCGGATAATAAAAATATATGCATGCATTGTGGAATGGCATTTCGAAATGCTCCAAATTTACGAACACATATTGCACGTCACCATAAGGCTGCTAATTTCAAATGCTCTGAATGTGACTTAAGCTTCTTTTCGAATTACTATTTGCAAACTCATTTAGGTCGAGTGCACGGTACGAAAGT
CGTGGAATGTCTTCAGTGTCATGAGAAATTTACATCCGTGTATGAAATGCAAAGGCATAAGATAGACGTGCATGGTACCGGTCACGAGTGTTCCTACTGTCATAAGTTGTTTACAGGAAAATCTTCTGTCGTCGACCACATCAGACGAACTCATTTAAAAGAGAAGAATGTGGCGTGCACGGTTTGTCTCGAGAAATTTTTCGATAGACAAAGTTTGAAAGTGCACATGGTAAAACATTACGGGGAAAGGAATTTTCATTGCGATATCTGTGGGAAGAAATTTCTCTGGAAGAAAAATCTTAGAGAGCACATGACTTCACATAATGAAGGGCCGTACGACCACTGTACCTCTGAGCGGAGAAGGAAAAACTTGCAAATACTGTTCAACAATACATCAATAATTCCTTTTAAATGGCGGGGCAGGTATCTCTGTTTTTATTGTGGCCAGAACTTCACAGAATATGCGACATTCAGAAAGCATACAAAGTCTCATGGCCTGTGCAGTACAAGCGATTACGCCCTAAAAATTATCAAAGGAAATCATGTAGAAATAAAAATTGACATATCTGAAATTATTTGTGAAATATGTTACGAGCCATTGACAAGTTTTAGCGAAATAATTGATCATTTGATAGGGAAACACGAATTGGAATACGATAAAACTATTTTTACGCCATACCAGGAATATAGACTAGTCGATTGTCGATGTCAGTTATGTGGCCAAGAATTCGCGTATTTCGGTTATCTTATTAGTCACACAAATATTGCTCATCCTCAGAACAGTTTCATATGTGATAATTGCGGTGAAAGATTTAATAAGAAGAGAGATTTAGCAATGCATATAAGACATTATCATAGAGACGGTGGCTATCCGTGTGATGAGTGTTCGGAAATATTCGAATCGCATCTTCTTTTACGGAAACACAGGAATAATTTTCATTTTAGGCGATGTAAGAGTTGCGATTTACGTTTTCCGTCGTACACACTTTTACAGAAGCATATTAAGTGTGACCATTCCGATGATGGTAATTATAAAAATTGTCCACATTGTTCCAAACAATGCCAAACTTCTCAAGGCTTAAACCAGCATATTCGAAAATGTAAGCGAAAAAAAGGAACTAAAATGCCGACACCCCTTTTTCCGGATGACAATTTGCTACAGCCGAATAAGAAACAAAACGTTCAGCAAATTCGAAAAAATATTCAATGTGTCCTTAACATGTCAACCGCTGTGCCATTTAAATTTTTCGCTAAATATTCTTGCTTCTATTGCTCCAAATATTTCGTTGAATTTGATGAGCTTAAAGAACATGTAGTCGCTGAACATCCTATCTGTGATTTGAAAGAAAAATGCATGAAGAATTGTAAAGGTGAAAGGATAACGGTTAAAATTGATATATCTTCATTATCTTGTCGAGTCTGCTTTCAACCAATGGAAAATGTCGACACACTTATCGACCACCTCATAACGAAACACACGGCAGATTACGATAAATCAATGACGGGATGCTTTGAACCATTTAAAATAATCAAAGACAACATCCCTTGTCCCATATGTTCAAATGTATTCAGATATTTTGGTACACTTTTACGGCATATTAACTCTGAACACAGTAATAACAATAAAATATGCGATTTCTGCGGGCGCAGTTTTAAAAATGTTGCAAATTTAAAAGTTCATATCACTTACGCGCATACTGGCTCCTGCGAGTGTGAAATTTGTGGTGCGAAATACAAGAATCAGTGGTGTCTGGGGCGCCATAGAGCGAAATCACATAATGCTAAAGATTTCCAATGCACTAAATGTCCTGAACGATTTCAGTCCCAGTACCATAAGCAGAAACATTTGATAAAAGTACACGATATAGGTCACAAATGTACGTATTGTGGTAAAATGTTCACAAGGAACTCATTTATGAAGGATCATATACGTAGAACTCATTTGAAGGAAAAGAACGTGCCGTGTACGGTTTGCAACGAGAAATTCTTCGATAATTATC
TATTGAGGATGCACATGGTGAAACATAAAGGTGAAAGGAAGTTTAGTTGTGAGGTTTGTGGTAAAGCGTTCCTTCGACGAAGCAACTTGAGTTCCCATAAAGAAATGCACAAAAAATATGGACATGTATGA
- Protein Sequence
- MSTAVPFKFFKNKFRCFFCSKDFSDFDNLRDHAIADHPHFNQESKAMKMIKGRDINVKIDISNLTCKLCFENSLALNDLINHLISKHGVNYDKSIECLQPFRVVKDNMPCPICPNMVFRYFKKLLEHMNEFHSDNNIVCAHCGLTFRGHPNYRAHMSRYHRSKACKCPDCNMEFWNLEKLARHRANVHGTKKYKCPQCVEKFITQHLMRKHQILVHGFGHKCPYCGKMFTRNSHMKNHVRRLHLKEKNVECTVCKEKFFDRALLNVHMVKHVGVDGPAQSLSPNTLRRRNLLILFNNTSIIPFKCRGKCRCFYCGEELPIYDDLRKHTKAHGPCSERDRAIKLVKTDDAEVKIDVSDITCELCNESFGNLDEIISHLTTEHMFPYERDVKLIIMSYRLIDKQCLICDERFKQVRELVIHVNNEHPVQCFDCDVCQQKFFRKQYLDAHMRVKHSNIYKCLKCPQTFSSHLALQEHKIKSHVAVCNICFTKFSTQMKRLKHMKSEHANEPLKCGFCLKFMSTKLGFLRHAAKCTEKNENVNETFVVDDDDEDKKPAVIQIRKNIACIFNMSTVIPFKYFMSRFRCFYCPHDFTDCEELRAHTVIEHPICDVNFKCMRLRNRQEGCVKVDTSVLSCKMCFENLPNLESLIEHLISEHKALYDKSVDNNIQPYNLIKDNYPCPVCGESYTHFSTLLKHMGQLHTDNKNICMHCGKSFRNLPNLRVHISNHHKTTGSYKCVRCEQEFSTNKYLQTHLGRAHGIKVYECPECSEKFTSNYAMQRHMINTHSSGHKCLHCGKLFTSNCFMIDHIKRTHLKEKNVECQVCYERFFDTQRLKTHMVKHNGERNFHCDVCDVNGPVKRISANKLRRLNLQILFHNTSVIPFKWRGKYLCFYCGEDFKSYENLKEHTKDHGVCSDKDRALRLVKSADVEVKIDVSQISCNICHENFMQLEEIISHLVDTHNLPYDRRVNMSIAPYRLSDLSCLLCDEKFNYLKKLIVHVNTDHPSQDLNCVQCQQKFNKTRDLDAHIRTKHRNHMCSKCFLNFRTRSELLSHKRVAHSFKCNVCIRSFSSIGKCFKHIKNEHMGAVMKCGFCVNASTSKQGFHRHAIQCTDNCKKSVEAVTLGVDKKPCVTQIRDNIACIFNMSTAIPFKYFMSKFRCFYCPKDFNECDDLKQHTIIEHPLCDTKLKSMKLRHRHDGVIKVDTSSLSCKICFENIQDLESLIKHLHNEHKIYFDKSLSINLQSYKLIKDNFPCPFCGDVFRYFRTLLNHVVKMHSDNKNICMHCGMAFRNAPNLRTHIARHHKAANFKCSECDLSFFSNYYLQTHLGRVHGTKVVECLQCHEKFTSVYEMQRHKIDVHGTGHECSYCHKLFTGKSSVVDHIRRTHLKEKNVACTVCLEKFFDRQSLKVHMVKHYGERNFHCDICGKKFLWKKNLREHMTSHNEGPYDHCTSERRRKNLQILFNNTSIIPFKWRGRYLCFYCGQNFTEYATFRKHTKSHGLCSTSDYALKIIKGNHVEIKIDISEIICEICYEPLTSFSEIIDHLIGKHELEYDKTIFTPYQEYRLVDCRCQLCGQEFAYFGYLISHTNIAHPQNSFICDNCGERFNKKRDLAMHIRHYHRDGGYPCDECSEIFESHLLLRKHRNNFHFRRCKSCDLRFPSYTLLQKHIKCDHSDDGNYKNCPHCSKQCQTSQGLNQHIRKCKRKKGTKMPTPLFPDDNLLQPNKKQNVQQIRKNIQCVLNMSTAVPFKFFAKYSCFYCSKYFVEFDELKEHVVAEHPICDLKEKCMKNCKGERITVKIDISSLSCRVCFQPMENVDTLIDHLITKHTADYDKSMTGCFEPFKIIKDNIPCPICSNVFRYFGTLLRHINSEHSNNNKICDFCGRSFKNVANLKVHITYAHTGSCECEICGAKYKNQWCLGRHRAKSHNAKDFQCTKCPERFQSQYHKQKHLIKVHDIGHKCTYCGKMFTRNSFMKDHIRRTHLKEKNVPCTVCNEKFFDN
YLLRMHMVKHKGERKFSCEVCGKAFLRRSNLSSHKEMHKKYGHV
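The C2H2 pattern quoted in the Description, #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], can be approximated with a regular expression and applied to the protein sequence. The sketch below is a rough illustration, not a substitute for the HMM search: it treats the # positions as any residue (an assumption, since the record does not list the permitted residues), uses a lazy final spacer so each match ends at the first eligible His or Cys, and scans only the first 45 residues of the protein. `C2H2_PATTERN` and `find_c2h2` are hypothetical names.

```python
import re
from typing import List, Tuple

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], with '#' relaxed to any residue.
# Between the second Cys and the first His there are 3 + 1 + 5 + 1 + 2 = 12 residues.
C2H2_PATTERN = re.compile(r"..C.{1,5}C.{12}H.{3,6}?[HC]")


def find_c2h2(protein: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, segment) for non-overlapping matches, 1-based inclusive."""
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2_PATTERN.finditer(protein)]


# First 45 residues of the protein sequence in this record.
fragment = "MSTAVPFKFFKNKFRCFFCSKDFSDFDNLRDHAIADHPHFNQESK"
matches = find_c2h2(fragment)
print(matches)
```

On this fragment the single match spans residues 14 to 37, which coincides with the alignment coordinates hmmscan reports for the first domain.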
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01080895;
- 90% Identity
- iTF_01080895;
- 80% Identity
- -