Basic Information

Gene Symbol
-
Assembly
GCA_018150935.1
Location
JAECWR010000023.1:17117070-17128922[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
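The C2CH consensus in the description can be written as a regular expression. The following is a minimal sketch only: it is not the Pfam PF05485 profile HMM, the pattern and the helper name `find_c2ch` are illustrative assumptions, and a plain regex can both miss divergent domains and report spurious hits.

```python
import re

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His.
# Wrapping the pattern in a lookahead lets overlapping candidates be reported.
C2CH = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein):
    """Return 1-based inclusive (start, end) spans of candidate C2CH motifs."""
    spans = []
    for m in C2CH.finditer(protein):
        start = m.start() + 1              # convert to 1-based coordinates
        end = m.start() + len(m.group(1))  # inclusive end of the captured motif
        spans.append((start, end))
    return spans


# Synthetic example (hypothetical sequence, not taken from this record).
toy = "MK" + "C" + "AA" + "C" + "A" * 40 + "C" + "AA" + "H"
print(find_c2ch(toy))
```

Applied to the protein sequence of this record, such a scan would only give candidate positions; the hmmscan coordinates remain the authoritative domain boundaries.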
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 28 1.3e-15 2.4e-12 47.5 3.8 1 86 116 188 116 189 0.85
2 28 2.6e-15 4.8e-12 46.5 4.6 1 87 216 285 216 285 0.83
3 28 5.7e-16 1e-12 48.6 0.4 1 87 307 379 307 379 0.84
4 28 7e-16 1.3e-12 48.3 4.8 1 86 397 466 397 467 0.82
5 28 5.2e-15 9.5e-12 45.6 3.4 1 86 491 562 491 563 0.81
6 28 1.4e-12 2.5e-09 37.8 0.9 1 87 598 666 598 666 0.81
7 28 8.9e-11 1.6e-07 32.0 1.5 1 86 708 777 708 778 0.76
8 28 7.4e-17 1.4e-13 51.5 0.1 1 86 805 874 805 875 0.83
9 28 2.2e-13 4.2e-10 40.3 1.7 1 86 896 965 896 966 0.80
10 28 1.9e-14 3.6e-11 43.7 1.4 1 86 993 1064 993 1065 0.85
11 28 5.4e-13 9.9e-10 39.1 1.7 1 85 1128 1196 1128 1198 0.83
12 28 1.5e-11 2.8e-08 34.4 0.0 1 86 1221 1289 1221 1290 0.81
13 28 1e-13 1.9e-10 41.4 1.2 1 87 1413 1482 1413 1482 0.80
14 28 1.3e-10 2.3e-07 31.5 0.8 1 62 1555 1617 1555 1633 0.77
15 28 0.0013 2.4 9.0 0.1 1 58 1636 1686 1636 1701 0.81
16 28 4.9e-12 9e-09 36.0 0.7 1 86 1725 1794 1725 1795 0.83
17 28 5.6e-14 1e-10 42.2 0.6 1 86 1867 1937 1867 1938 0.80
18 28 2.7e-12 5e-09 36.8 0.8 1 86 1973 2044 1973 2045 0.81
19 28 5.5e-13 1e-09 39.1 0.7 1 86 2057 2128 2057 2129 0.80
20 28 3e-15 5.6e-12 46.3 0.1 1 87 2156 2229 2156 2229 0.81
21 28 0.0003 0.56 11.0 0.0 1 58 2262 2312 2262 2335 0.77
22 28 5.1e-14 9.3e-11 42.4 0.0 1 86 2350 2422 2350 2423 0.79
23 28 1.1e-15 2e-12 47.7 0.4 1 86 2561 2633 2561 2634 0.83
24 28 1.3e-13 2.4e-10 41.1 2.0 1 86 2689 2759 2689 2760 0.82
25 28 2.1e-13 4e-10 40.4 4.8 1 86 2868 2938 2868 2939 0.85
26 28 4.6e-13 8.6e-10 39.3 0.0 1 87 3018 3088 3018 3088 0.85
27 28 2.6e-09 4.8e-06 27.3 0.7 1 58 3107 3156 3107 3172 0.84
28 28 7.8e-10 1.4e-06 29.0 0.7 18 87 3173 3231 3159 3231 0.75
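The domain table above can be read programmatically. The sketch below assumes each data row has 13 whitespace-separated fields in the column order of the header (domain number, total, c-Evalue, i-Evalue, score, bias, hmm from/to, ali from/to, env from/to, acc). The `DomainHit` class, the function names and the 0.01 i-Evalue cutoff are illustrative assumptions, not part of hmmscan or of this database.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # domain number within the protein
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein


def parse_domain_rows(text):
    """Parse data rows of the table; header and blank lines are skipped."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13 or not fields[0].isdigit():
            continue
        hits.append(DomainHit(int(fields[0]), float(fields[3]),
                              float(fields[4]), int(fields[8]), int(fields[9])))
    return hits


def significant(hits, max_i_evalue=0.01):
    """Keep hits at or below an (assumed) i-Evalue cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


# Three rows copied from the table above.
rows = """\
14 28 1.3e-10 2.3e-07 31.5 0.8 1 62 1555 1617 1555 1633 0.77
15 28 0.0013 2.4 9.0 0.1 1 58 1636 1686 1636 1701 0.81
16 28 4.9e-12 9e-09 36.0 0.7 1 86 1725 1794 1725 1795 0.83
"""
hits = parse_domain_rows(rows)
print([h.index for h in significant(hits)])  # prints [14, 16]
```

With this cutoff, domain 15 (i-Evalue 2.4) is dropped; a different threshold would change which of the 28 reported domains are retained.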

Sequence Information

Coding Sequence
ATGGGAACAATGGAAATGCCAACCTCGCAGCATAAACCGAATGTGACAGGGAACGCGTTGCCGGCAACGGCTGCACTTAATTCATTGTTGCAACAGCGTCAGGCAAATGCCGATGGCGCCGCATTGTTTGCCTCAACGCTAAAGAACGAATCGAATGTGAAACTGGAGCGCAGCTATAGCAATTCCACCAGCGAGTCTGGCTATAGTCTGCACGACAGCAGCTATAACAATGCCTATGGCAGCGACACCTCATTGCATGCCTCTGCTGGTGGCGGTGGCGGAGGTGGCACAATTGGTGGACCACAGGCGCACTCCTCGACACTAGATGATTCAGAGGATGCACTGTGCTGTGTGCCGTTGTGCGGTGTACGCAAGAGCACCAGCCCAACGCTGCAGTTCTTTACGTTCCCCAAGGATGAAAAGTATTTACATCAATGGCTGCACAATCTGAAGATGTTTCACATTCCGGCATCCAGTTATGCAAGCTATCGCATCTGCAGCATGCACTTCCCGAAGCGTTGTATAAATCGCTACTCGCTGTGCTATTGGGCGGTGCCCACATTCAATCTGGGCCACGATGATGTTGCCAATTTGTATCAGAATCGTGAGCTAACCAATACCTTTACCACCGGCGAGGTGGCACGCTGTAGCATGCCGAACTGCACCAGCCAGCGGGGCGAGAGCAATCTGAAGTTTTACAATTTTCCCAAGGATATTAAGAGTCTGATTAAATGGTGTCAGAATGCCCGTCTGCCCGTCCAAGCCAAGGAGCCGCGTCACTTTTGTAGTCGCCACTTTGAGGAGCGCTGTATTGGCAAGTTTCGACTGAAACCGTGGGCAGTACCCACATTACATTTGGGTGCCCAGTACGGCAAGATCCATGACAATCCCAAGAACTTATACGTGGAGGAGAAACGTTGCTGTCTTAATTTCTGTCGTCGCAGTCGCTCATCCGACTTTAATATGTCACTGTATCGCTTCCCCCGGGACGAGGTGCTCCTTCGACGTTGGTGCTACAATCTACGACTGGATCCGTCGGTTTATCGTGGCAAGAATCACAAAATATGCAGTGCTCACTTTATCAAAGAGGCCTTGGGATTGCGCAAGCTATCGCCGGGTGCTGTTCCCACGCTGCATTTGGCAGGTGGCTCCTCGTCCGCCCTCAATGTGAGCGACAGCATGGACGTGTGCTGTGTGCCGAGCTGCGAGAGCAAGCGCCACAATAGCGAGAACATTACATTCCATACGATACCACGACGACCCGAGCAGATGCGCAAATGGTGTCACAATCTCAAAATAGCCGAGGACAAGATGCACAAGGGTATGCGCATCTGTAGTCTGCACTTTGAGCCCTATTGCATTGGTGGCTGCATGCGTCCCTTTGCCGTGCCCACACTGCAGTTGGGCCATGACGACGAGGACATTCATCGCAATCCGGATGTGATCAAGAAGCTCAACATACGCGAAACCTGCTGTGTAGCCGTCTGTAAACGCAATCGGGATCGAGATCATGCCAATCTGCATCGGTTCCCCAGCAATGTGACGCTCCTGACCAAATGGTGTGCGAATCTGCAGCGTCCAGTGCCCGATGGCAGTAAGCTGTTCAACGATGCCATCTGTGAGGTGCATTTCGAGGATCGCTGTTTGCGCAACAAGCGACTGGAGAAGTGGGCAGTGCCCACCTTGATGCTGGGCCATGAGAACATTGCGTATCCGCTACCCACGGCGGAACAGGTGGCCGAGTTCTATGCAAGACCCAGTGCGCCCAACAATGGCGAAGAGCAGGGCGAGTGCTGTGTGGAGACCTGCAAGCGTAATCCCAGCGTGGATGACATCAAATTGTATCGTCCGCCCGAGGAGTCATCGGTGCTGGCCAAATGGGTGCACAATCTGCAAGTGGATGCAGCGCAGCTGTCCACCCTAAGGATCTGCAATTTGCACTTTGAATCCCATTGCATTGGCAAGCGCATGCGTGCCTGGGCCATACCCACGCTCAATCT
GGCCACGCCCAGTGAGCATCTCTACGAGAATCCCGAGCAGCAGCTGCTCTACAAGCGGCGCAGTCATCACAATGTGAATTCGAGCCGAGCAACCGCAGTGAAGCCCACGTGGGTGCCACGTTGTTGTCTGCCACATTGTCGCAAGGTGCGTGCGTTGCACAATGTGCAATTGTATCGCTTCCCCAAGCTGAATCGTTCCACGCTGGCCAAGTGGGCGCACAATCTGCAGGTGCCACTGGTGGGCAGCGCCCAACGACGCCTCTGCTCTGCCCACTTTGAGCCGCATGTGCTGAGCAAGAAGTGCCCCGTGCCCTTGGCAGTGCCCACGCTGGAGCTGAACACACCAGCCGGCTATAAGATCTATCAGAATCCCGCCAAGCTCAAGGCCAACAAGCAGTGCCTGCAACGTGTCTGCATCGTGGAGAGTTGTCGCCGTCAACGTGGCCAGGGCGTACAGCTGTTCCGGCTGCCCCACAATCCGACGCAGTTGCGCAAGTGGATGCACAACATACGCATGCGACCCAGAGGCGCCATGCGACAGCAATACCGCATCTGCTCCATGCACTTTGAGACGCACTCCTTCAATGGCAAGCGCCTAAGTGCGGGCGCCATTCCTACACTGGAGCTGGGACATGCGGATGAGGACATCTATCCGAACGAGGCGCAATCGTTTGTCGAGGAGCACTGCACTGTGGAGGGCTGTGGCGCCACCAAAGAGCAGCCTGAGGTGCGTCTGTTTCGCTTCCCCAATGACGATGAGGATCTGCTCTGGAAGTGGTGCAACAATCTGAAAATGAATCCAGTCGATTGCCATGGCGTGCGCATCTGCAACAAGCACTTTGAGTCCGACTGCATTGGACCCAAGCATCTCTACAAGTGGGCCATACCCACATTGCAGCTGGGCCATGACGACGATCAGATTGAGCTCATCGACAATCCAAAGCCGGAGGAACGTTATGTGGATCCCGTGTTCAAGTGCTGTGTGCCCAGCTGCGGCAAGACGCGCAAGTTTGACGAGGTGCAGATGAACAGCTTCCCCAAGGATGCCATTATGTTCCAGCGCTGGCGTCACAATCTGCGCCTGGAGCATCTTAATTTTAAGGAGCGCGAGCGCTACAAGATTTGCAATGCCCATTTTGAGGACATTTGCATTGGCAAGACGCGCCTGAATCTGGGCTCGATACCCACGCTTGAGCTGGGCCATGAGGCGACCGAGGATCTGTATCAAGTCAATCCCGAGGAGCTGCAGAGCAATCTATTTGGGCGGCAGCGACGTGTGAATAGCGTCAAGCAGGAGGAGCTCTCCGAACCGGAGGAGGAGGAACTCATCAAGCCGGACATAACACAGGTTAAGATCAGGCGCTCCTTGGCGGATTACAAGTGCTGTGTGGCTATCTGTGGACGCACACGTCTGGAGCATGGCGCCCGGCTCTTCCCCTTCCCCACGGGCAAGCAACAGCAGAGCAAGTGGCGCCACAATCTGCGCCTGCCACCGGATGCAGTGGAGCGCAGCACACGCATCTGCAGCATGCACTTCAATCGTCGCTGCATTGATGGCAAGCAGCTAAGGAGCTGGGCCATGCCCACCCAGCAGCTGGGACATCAGGAGCTGCCCATCTATGAGAATCCCAAGAATATTCCGGGCTTTTTTACGCCCACCTGTGCCCTGGCCCATTGTCGCAAGCGGCGCAGCATTGACAACGATCTGCGCACCTATCGCTATCCGCGCGGCGAGGAGCTGCTCGAGAAGTGGCGCATCAATCTAAGATTGGCGCCAGATCAGTGCCGTGGACGCATCTGTGCCGATCACTTTGAGCCGCTGGTGCGGGGCAAGCTGAAGCTAAAGACGGGTGCAGTGCCTACGCTCAAGTTGGGCCATGAGGAGGGCGTGGTCTTTGACAATGAGGCCATCAAGGCGGGACTGCAGCAGGAGGAGGAGGAGGCGGCAGAGGAGGAGGAGGAAGAGTCGATGGTGCAGGTAAAGCAGGAGCAGCTTGAGC
CAGAGGAGGAGGAAGCCGAGCACGAGGAGCAGGACGACGACGAAGAACATGATTACTTTGATCCCTTGCAGCTGGTGGAAACCTTTGCGGCCGGCGACGAACACGATGCCGACGATGACATTGAAGATGAACCCGGCGACGATGATGCGTTGCTCCTGCTGCCGGATACGCCACCAATTGCCGCCATCTCTTTGCCGCCGCGACGCGAAAAAGCCGTCAATAATGTGACGCCCATCTGCTGCCTGCGACACTGTCGCAAGGAGCGCACGCCCAGCCATCAGCTGAGCACCTTTGGCTTCCCCAAGGATCGCCAGCAGCTGCTCAAATGGAGCGCCAATCTGCAGTTGCCGTTGGACGATTGCGTGGGACGCGTCTGCATTGAGCATTTCGAGTCCGAGATGCTGGGCACACGCAAGCTGAAGCAGCATGCGGTGCCCACCTTGAATCTGGGACATAACCATGTGCGCTACAGCTGCAATGGGCAGCCAGTGCGTGGCATTCATGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGTCGCAAACGGAAGCTGGAGGAGAGGGAGGAGGAGGAGGAACAGGAACAGGAGAAGCAGCAGCAGCAGACGACACCGGAACCAGCACAGACAGCTCCGATTAAGCGACGCTGCTGCCTGCCCAGCTGTCGCAAGCAGCCGGAAGTGCATGGCGTGCAGCTGCAACGTCTGCCCAGCAATCGCATACAGCTGCGCAAATGGCTGCACAATCTCAAGCTGGACGGCTGCCGTCTGGGCAGCCAGGCGCAGCTGTGCAGCGAGCACTTTGAGCAGCAGGACGACGCACTGCCCACGCTGCATCTGGGACATAACGACTCGCACATCTATCGCAATCGTAGCAGCGACAGCAGCACCTGCATGGTGCCCAGCTGTCCTTGTGCCCGTCTCAATCTCTATCGTTGCTACGATCTGCCCAAGCACAGGCGGGTGCAACAGGTCTGGCTGCAGTGGCTGCAGCTGCCCCAGCCCAAGTTGGCCAGCGATGGCAAGCTCTGTGTGATGCACTACATGCAGCTGTTTGAACAGGTGCCATTGCCCCCGGAGCTACCTGGTGCGGTGCTGCGTCAGTTGCAAGACACATATGAGCAGATTGCCGGTTCCACAATGGCCATGAAGTTGCGCTGTGCCGTGCCCGGCTGCTACTCCAAGTACACGGACAACATACGGCTGACCAAGCTCCCCATGTGCCCGGACATGTGCGCCAAGTGGGTGCACAACACCAAGATTAGCTATGAGCCAGCACGCCATTATGTCTATCGCATCTGCATGCTGCACTTTGAGACGCGCTGCCTGGGACCGGTGCGTCCCAAGCTGTGGGCGGTGCCCACATTGCGATTGCAGCATCACGACAAGGATATCTATCAGAATCCCAAGCAACATTGGCAATCATCGCCAGCGCTACCGACGCCAGTGTCGACACCAGTGCCGCCGCCAGTGCCGTCACCAGTGCTGCCACCTGTGCCCGTCGAGCTGCCGCTGCGCATCAAGACGGAACTGGCCAGTCCCAGTGCCAGCGCCAGTCCCAGTCCACGGGGCAAGCTGCGCATATGCTGCATTGCCAGCTGCGGCCAACAAGCCACATCCCAGACGCGTCTCTTTCGCTTTCCGAACGCCGAGACAACGGCGCTGCTCAAGTGGCTGGTCAACACCCAACAGCAGCCCCGTTTGGTGGACACCTCGCAGCTGTACATTTGCCAGATGCATTTCGAGCCGGAGGCCATCTGCAAGCAGCAATTGCGCAGTTGGGCAGTGCCCACATTGCTGCTGGGCCACTCGGGCCATATCATACCGAATGCTCGGCACAATGGCAACATTGCCGACAGTCAGGAGAATCGACAGACGCTGCAGTACATCTGGGAGAACTATTGCTCCGTGCTCAACTGCTTTCAGCCGCGCAGCGATCAACTGCGTCTCTATGCGTATCCCACGGATCGACCCACCATACGCAAG
TGGGCGGCCAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGATGGTTTCCAGGTGTGCCAGTCTCACTTTGCCCCACATTGCTTTGACCCGGAGACGGGCGAGCTGAGCGAGGATGCGGTGCCCACATTGGAGCTAAGTCGCAGCACCGTCAGCGACGAGCTGCGCTGCATTGTGGCCGGCTGTGGGAAGACAAAGGATGTACTGCGTCAACGCTTCTTCAAGATGCCCAAGCGCAACTCGCTGCTGGTCAGCTGGTGTCACAATCTGCGCCTGGATGTGTCCGCCATGAGCTTTGGCGAGCATTATGTGTGCGATCGCCACTTTGAGACGCAGTGCTTCAATCAGCAGAAACTGCTGCGTTCTGGTGCGCGTCCCACCTTGCAGCTGGGTCACGATGATGGCATTGAATTAATGCCCAATCCGGCGGAATGGGAGCTGGTAGAGGCCGCACCCGAAGCTGTCTGCTGTGTGCCCAACTGCGGCCTGGCCAAGGATGAGCAGGATGTGCAGCTGTTCGCCTTCCCCAAGCTGCGTATGCTCGCCGAGAAGTGGCTGCAGAACATACGGCTGGAGCATACGGCCACCAGGGAGCAGCTGATGCGTCTGAAGATCTGTAGTGTTCACTTTGAGGCCGACTGCCTGGACAATGGACGACCCCAGCTGGGCGCCATGCCCACGTTGCATCTGGGACACCAGCAGCAAAGTGGCATCCATCGCAGCACAGATGCGGCAACGGTGAAGGCCAAGAAGTTTTGCAATCGCAGCGGCTCCAGCTATGACTGCTGCTATCCACAGTGTGTGGAGCTGCAGAAGAGTTACCTGAGGATTAGCTATGATCTGCCCCAGGGCGAGGCACTGCGTCAGCGTTGGCTCGACTATATGGAGCTGGAGGAGACGGACGAGAAGCTGCTCAAGCTATGCCCACTGCACTTGGTGCTGCTCTATGATCATAGCCTTGAGCACTTTGAGGAGCATACGCCGGAGCAGCAGTTGGATGCCAGCTACTTGGACTGCCGCAATAGTGTGCGACTGCGCATCATCAGCTGTGCCGTGCCCGGCTGCAGGACACTGAAGCCACGAGATGGCGGCATCCTGTACGGATTGCCGCAGCGTCGGGATGTGCTTGAGATGTGGCTGCACAACATGCAGCTGGTATTCTATGAGCAGCAGCGTTACATGTACAAGATCTGCAGCCGGCACTTTGAGACCAGCTGCTTCATGGACACCACGCGACGGCTGAAGCCCTGGACGATGCCAACGCTGGAGCTGCCACGGCAGGAGGCGCCCATCTATCCCAATCCCAGCGAGGCGGAGTGGCAGCGCATGAACGAGTTGCTGGCGACAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGAACTGGAACTGGAGCCGATTGTGAAGATCGAGAGAGATGACGAGCAGGAGGACGACCAGCAGCTGCTGTTGCCTGACCCCGAGCCAGATGACTATGACAACTCACAGCAGCCGCTGGCGCTGGAGGTGCTGCTTGAAGTGGGCCACGTGGAGAAGTGCACCACCTACGAGCAAATGGACAACGAGGCCAACGAGTGCAGCCAGCTTGTCCAGCAGCCAGGAGCTGGCCACCACCAACAACACACCCACAATGGTGTTCCCAAGTACAATGCGCGTCGTTGCAGCGTCCAAAATTGCGATGTGACGTCCAATGATGTGACGGGCAGCATTAAGCTGCACAAGTTTCCCACATCGCTGGATGCGCTGCAGAAATGGATGCATAACACGCAGGTGGAGGTGGACACCAACTATGCGTGGCGCTTTCGCATCTGCAGCTATCACTTTACCGACGATTGCTTCCATGGGGCACGCATCAAGCGTGGGGCCATGCCCACATTGAGTTTGGGCCCCCGGCGTCCTGCCAAGATCTATAACAATGAGTTCAACATGACGCAGGAGCAGCCAGTGGAGCAATCGCTCGAAACAGTGCACACACCGCC
GCCGCCTGGCGAGATAAATTTGCGTCTGCCGGTACCAGCTCCTCCACGCAAATCCAGCAAATTCTGTCAGATTGATGGCTGCTCCAATCACTTGACCAGCGACAATCTGACGCTGCACAAGTTCCCGCACTCGGTGGACATGTGCGCCAAGTGGCAGCACAACACACAGGTGCCCTTCGATCCCGACTTTCGTTGGCGCTATCGCATCTGCAGCGCCCACTTTGAGCCCATCTGTCTGATGAATATGCGTCTAATGCATGGCAGTGTGCCCACCTTGAAGCTGGGAGCGCGTGCCCCGCGCCAGCTATTCGACAATGACTTTGAGGCGATTAGCATGCGGCTGGACAAGCAGAAGAGCGGCATGGAGCAGCATTTGGATATTAAGTATGAGGAGGCGCAGAAGCAGGAGGAGGAGGAGGAGCAGCAGCAGGACGAGGAGGCTGAGCTAAGCTATCTGGTGCCAGAGATGCAGCTGCATGAGGAGACAGAGGAAGCGGAGCAGGAGCAGTCGAACAGCTGGCAAACAGAGCTACACCTGCCGAGCATCAAGCTGGAACCAACCACAGCCAGCTACAATCCGGTGAAGTCCGGCTATGACAAGTGCTCGCTGGTGCATTGCCAACGCCAGCGTTCCCAGCATGGCGTCCATATCTACAAGTTTCCACGCGCCCGCCAACTGCAACAGCGCTGGATGCACAATCTGCGCATTCAGTACGATGAGCGTCGTCCATGGAAGACCATGATCTGCAGCGTGCACTTCGAACCCAACTGCATACGATTGCGCAAGCTCTGCTCCTGGGCGGTGCCCACGTTGCAGCTGGGCCAGAATGTGCCAACCGAGATCTATACGAATGAGCAGAGTCGCCAGCAGCTGGAGACGGGCAGCGATTACGAGGGCACATCCCTGGACGATGTTGATGCCTATGAAGAGGATGAATACGATGAGGAAATGGCCGAGCAATTGGCCACCGAGCCATTGGTGAAGCGGGAGCGTCGTTCACGCTTTGATCCTTTGCCGGCGGATCAGTTGCCGTATTGGCGCATCAAGACCTGCTCGTTGCCATATTGTCGCAGTCCACGTGGCGATGGCATCAAGCTCTTCCGGCTGCCCAACAACATCAGCGCCATACGTCAGTGGGAGCAGGCGACGGGCATTCACTTTACCGAATCTCAGCGGAACACGAAGCTTATCTGCAGTCGCCACTTTGAGCCGCAACTGATTGGCGTGCGTCGTCTCATGTACAATGCGGTGCCCACACTTAATCTGGCACCAAGTAGTATAACCAGCGAAGCTGTGCCGGCTCCAACTGGCCCACGTTGCTGTATGCCCGATTGCCAGGCGACTGGCAACGATTCCAAGCTGCACAAGTTTCCCAGTGATCCCATGCTACTGCATCAGTGGTGCCATGAGCTAAATCTCACAGATATCCAGCGTTATCGTGGCATGCACATCTGTGCCCAGCATCTGCCCGCCCAAACGCCCAGCTGCATTGTATGCGGCATGGAGCAATTGCAGTTGCCACTTCTCGACTTTCCCGAGCATCGCAATCAGCGCGCCAAGTGGTGCTATAATCTCAAGATCGAGCCCATTGCCAAGTGGGACAACTCGAAACAGATCTGCAGCAAGCACTTTGAGAGCTACTGCTTCATTGAGTCTGGTCAACTGGCACCCGAGGCAGCGCCAACGTTGCATTTGCTGCACAGCGATGGCAACATATTCCTAAATGATTATGCCATTGATGACAGTAACGGCAAGTTGCTGCGCATCAAGGACGAACCGCTGGACAGCGAGGACCTGATGCTATAA
Protein Sequence
MGTMEMPTSQHKPNVTGNALPATAALNSLLQQRQANADGAALFASTLKNESNVKLERSYSNSTSESGYSLHDSSYNNAYGSDTSLHASAGGGGGGGTIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASYRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLAGGSSSALNVSDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVTLLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLMLGHENIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESSVLAKWVHNLQVDAAQLSTLRICNLHFESHCIGKRMRAWAIPTLNLATPSEHLYENPEQQLLYKRRSHHNVNSSRATAVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLELNTPAGYKIYQNPAKLKANKQCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGAMRQQYRICSMHFETHSFNGKRLSAGAIPTLELGHADEDIYPNEAQSFVEEHCTVEGCGATKEQPEVRLFRFPNDDEDLLWKWCNNLKMNPVDCHGVRICNKHFESDCIGPKHLYKWAIPTLQLGHDDDQIELIDNPKPEERYVDPVFKCCVPSCGKTRKFDEVQMNSFPKDAIMFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNLGSIPTLELGHEATEDLYQVNPEELQSNLFGRQRRVNSVKQEELSEPEEEELIKPDITQVKIRRSLADYKCCVAICGRTRLEHGARLFPFPTGKQQQSKWRHNLRLPPDAVERSTRICSMHFNRRCIDGKQLRSWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRGEELLEKWRINLRLAPDQCRGRICADHFEPLVRGKLKLKTGAVPTLKLGHEEGVVFDNEAIKAGLQQEEEEAAEEEEEESMVQVKQEQLEPEEEEAEHEEQDDDEEHDYFDPLQLVETFAAGDEHDADDDIEDEPGDDDALLLLPDTPPIAAISLPPRREKAVNNVTPICCLRHCRKERTPSHQLSTFGFPKDRQQLLKWSANLQLPLDDCVGRVCIEHFESEMLGTRKLKQHAVPTLNLGHNHVRYSCNGQPVRGIHDAQPQHSVFRLWSLKHCRKRKLEEREEEEEQEQEKQQQQTTPEPAQTAPIKRRCCLPSCRKQPEVHGVQLQRLPSNRIQLRKWLHNLKLDGCRLGSQAQLCSEHFEQQDDALPTLHLGHNDSHIYRNRSSDSSTCMVPSCPCARLNLYRCYDLPKHRRVQQVWLQWLQLPQPKLASDGKLCVMHYMQLFEQVPLPPELPGAVLRQLQDTYEQIAGSTMAMKLRCAVPGCYSKYTDNIRLTKLPMCPDMCAKWVHNTKISYEPARHYVYRICMLHFETRCLGPVRPKLWAVPTLRLQHHDKDIYQNPKQHWQSSPALPTPVSTPVPPPVPSPVLPPVPVELPLRIKTELASPSASASPSPRGKLRICCIASCGQQATSQTRLFRFPNAETTALLKWLVNTQQQPRLVDTSQLYICQMHFEPEAICKQQLRSWAVPTLLLGHSGHIIPNARHNGNIADSQENRQTLQYIWENYCSVLNCFQPRSDQLRLYAYPTDRPTIRK
WAANCKHRSMQASSDGFQVCQSHFAPHCFDPETGELSEDAVPTLELSRSTVSDELRCIVAGCGKTKDVLRQRFFKMPKRNSLLVSWCHNLRLDVSAMSFGEHYVCDRHFETQCFNQQKLLRSGARPTLQLGHDDGIELMPNPAEWELVEAAPEAVCCVPNCGLAKDEQDVQLFAFPKLRMLAEKWLQNIRLEHTATREQLMRLKICSVHFEADCLDNGRPQLGAMPTLHLGHQQQSGIHRSTDAATVKAKKFCNRSGSSYDCCYPQCVELQKSYLRISYDLPQGEALRQRWLDYMELEETDEKLLKLCPLHLVLLYDHSLEHFEEHTPEQQLDASYLDCRNSVRLRIISCAVPGCRTLKPRDGGILYGLPQRRDVLEMWLHNMQLVFYEQQRYMYKICSRHFETSCFMDTTRRLKPWTMPTLELPRQEAPIYPNPSEAEWQRMNELLATXXXXXXXXXXXXXXXXXXELELEPIVKIERDDEQEDDQQLLLPDPEPDDYDNSQQPLALEVLLEVGHVEKCTTYEQMDNEANECSQLVQQPGAGHHQQHTHNGVPKYNARRCSVQNCDVTSNDVTGSIKLHKFPTSLDALQKWMHNTQVEVDTNYAWRFRICSYHFTDDCFHGARIKRGAMPTLSLGPRRPAKIYNNEFNMTQEQPVEQSLETVHTPPPPGEINLRLPVPAPPRKSSKFCQIDGCSNHLTSDNLTLHKFPHSVDMCAKWQHNTQVPFDPDFRWRYRICSAHFEPICLMNMRLMHGSVPTLKLGARAPRQLFDNDFEAISMRLDKQKSGMEQHLDIKYEEAQKQEEEEEQQQDEEAELSYLVPEMQLHEETEEAEQEQSNSWQTELHLPSIKLEPTTASYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRARQLQQRWMHNLRIQYDERRPWKTMICSVHFEPNCIRLRKLCSWAVPTLQLGQNVPTEIYTNEQSRQQLETGSDYEGTSLDDVDAYEEDEYDEEMAEQLATEPLVKRERRSRFDPLPADQLPYWRIKTCSLPYCRSPRGDGIKLFRLPNNISAIRQWEQATGIHFTESQRNTKLICSRHFEPQLIGVRRLMYNAVPTLNLAPSSITSEAVPAPTGPRCCMPDCQATGNDSKLHKFPSDPMLLHQWCHELNLTDIQRYRGMHICAQHLPAQTPSCIVCGMEQLQLPLLDFPEHRNQRAKWCYNLKIEPIAKWDNSKQICSKHFESYCFIESGQLAPEAAPTLHLLHSDGNIFLNDYAIDDSNGKLLRIKDEPLDSEDLML
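The protein sequence appears to be the conceptual translation of the coding sequence. The sketch below shows one way to reproduce that relationship. It assumes the standard genetic code (NCBI translation table 1), which is not stated in this record, and it renders any codon containing an ambiguous base such as N as X. The function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks inside the sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # unknown or ambiguous codon -> X
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the coding sequence above.
print(translate("ATGGGAACAATGGAAATGCCA"))  # prints MGTMEMP
```

Because whitespace is removed before translation, the line breaks inside the coding sequence above do not affect the result.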

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-