Basic Information

Gene Symbol
-
Assembly
GCA_018904225.1
Location
JAEIFT010000079.1:687983-701652[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 25 7.3e-16 1.1e-12 48.5 5.9 1 87 23 93 23 93 0.81
2 25 3.2e-15 4.9e-12 46.5 3.4 1 86 117 188 117 189 0.82
3 25 2.4e-13 3.6e-10 40.5 1.0 1 87 224 292 224 292 0.81
4 25 2.5e-11 3.9e-08 34.0 2.5 1 86 333 402 333 403 0.78
5 25 2.8e-16 4.3e-13 49.9 0.4 1 86 430 499 430 500 0.82
6 25 7e-14 1.1e-10 42.2 1.0 1 86 521 590 521 591 0.80
7 25 3.6e-15 5.6e-12 46.3 1.5 1 86 618 689 618 690 0.85
8 25 5.2e-12 7.9e-09 36.2 1.9 1 85 760 828 760 830 0.81
9 25 2.8e-12 4.3e-09 37.0 0.1 1 86 854 922 854 923 0.82
10 25 3.2e-14 4.9e-11 43.3 0.6 1 87 1031 1100 1031 1100 0.79
11 25 4e-09 6.1e-06 26.9 0.0 1 86 1167 1233 1167 1234 0.82
12 25 0.00047 0.72 10.7 0.0 1 59 1254 1302 1254 1321 0.74
13 25 2.5e-13 3.9e-10 40.4 0.1 1 86 1335 1404 1335 1405 0.85
14 25 1.2e-14 1.8e-11 44.6 0.3 1 87 1464 1534 1464 1534 0.83
15 25 6.7e-12 1e-08 35.8 0.3 1 86 1569 1640 1569 1641 0.81
16 25 5.2e-13 7.9e-10 39.4 2.4 1 87 1651 1723 1651 1723 0.83
17 25 9.8e-13 1.5e-09 38.5 1.2 1 86 1749 1819 1749 1820 0.79
18 25 9.2e-07 0.0014 19.4 0.2 1 58 1853 1904 1853 1923 0.83
19 25 5.4e-12 8.2e-09 36.1 0.1 1 86 1942 2013 1942 2014 0.84
20 25 1e-15 1.5e-12 48.1 0.7 1 87 2161 2234 2161 2234 0.83
21 25 2.6e-12 4e-09 37.1 3.3 1 86 2299 2369 2299 2370 0.80
22 25 3.2e-14 4.9e-11 43.3 4.5 1 86 2465 2535 2465 2536 0.84
23 25 7.4e-12 1.1e-08 35.7 0.3 1 86 2610 2679 2610 2680 0.84
24 25 1.1e-10 1.7e-07 32.0 0.8 1 58 2702 2750 2702 2766 0.85
25 25 1.4e-08 2.1e-05 25.2 1.3 18 87 2767 2825 2756 2825 0.75

Sequence Information

Coding Sequence
ATGAGTGGATCGTATCTTGGTATGGGCACTTCAAGTGGCATGAATGTCACCGATAGCATGGACATTTGCTGTGTGCCCAGCTGTGAGAGCAAGCGACATAATAATGAGAATATCACATTCCACACGATACCCCGAAGGCCGGAGCAGATGCGGAAATGGTGTCATAATCTCAAAATACCCGAGGACAAAATGCACAAAGGCATGCGTATATGCAGTTTGCATTTCGAATCCTATTGCATTGGCGGTTGTATGCGTCCGTTTGCGGTGCCCACTTTGCATTTGGGACACGACGATGAGGATATACATCGTAATCCGGATGTGATCAAGAAATTGAATATACGCGAAACCTGCTGTGTGGCCGTGTGTAAAAGGAATCGTGATCGTGATCATGCTAATTTGCATAGATTCCCCAGTAATGCACCCCTGCTGACCAAATGGTGTGCCAATCTGCAAAGATCAGTGCCGGATGGCACTAAACTATTTAATGATGCCATTTGCGAGGTGCACTTTGAGGATCGTTGTTTGAGAAATAAACGACTGGAGAAATGGGCGGTACCTACTTTGATATTGGGTCATGAGGATATAGCCTATCAGTTGCCCACACCCGAGGAAGTGGCCGAGTTCTTTGCCCGTCCCAGTGCACCCAATAATGGTGAAGAGCAGGGCGAATGCTGTGTGGAGAGTTGTAAACGTAATCCCAGTGTGGATGACATCAAACTATATCGTCCACCCGAGGAAGCGGAGGTATTAAGCAAATGGGCGCACAATCTTCAACTGGAAGTCAGCGAGTTGACCAATCTACGTATATGTAATTTACACTTTGAATCCCATTGCATAGGCAAACGCATGAGACCCTGGGCCATACCCACTTTAAATCTATCGGGTAATATAGAAAATCTCTATGAGAATCCCGAGCATACCATGTTATATAAACGCAGAACCAGAGATCCAAATAGAGATTTAAGTGGTGGACACAAACCCACTTGGGTGCCCAGATGTTGTCTACCCCATTGCCGTAAGGTGAGAGCGGTGCATAATGTGCAACTCTATCGATTCCCCAAACTGAATCGTTCCACCCTGTCCAAATGGGCGCACAATTTACAAGTACCCATGGTGGGCAGTGCACAGAGACGCGTTTGTTCGGCTCACTTTGAACCGCATGTGTTGAGTAAGAAGTGTCCGGTGCCTATGGCGGTGCCCACTTTGGAATTGAATTCTCCTGCGGGTTATAAAATCTATCAAAATCCAGCCAAACTGAAAGCCAGTAAATTGTGCCTGCAAAGGGTGTGCATAGTGGAGAGCTGTCGCAAGACCCGAGCACAGGGTATACATCTCTTCCGTTTACCCCATAGTCCCACCCAGTTAAGGAAATGGATGCATAATATACGCACACGTCCTCGTACTGCCATGAGAGCTCAATATCGCATTTGCTCTGTACACTTTGAAACGCATTCGTTTAATGGCAAAAGATTAAGTGCAGGAGCCATACCCACTTTGGAGCTGGGTCATCAGGATGATGACATATTTCCCAATGAGGCACAATCCTTTGTGGAGGAACACTGCGCCGTGGAGGGTTGTGACTCCAGCAAGGAACAGGCAGAAGTGCGCCTGTTTCGTTTTCCTACCGATGATGATGATTTGCTTTGGAAATGGTGTAATAATCTCAAAATGAATCCTGTGGATTGTGTGGGCGTGCGTATTTGTAATAAACATTTCGAGGCGGATTGTGTGGGACCCAAACATTTGTATAAATGGGCCATACCCACCCTGGAATTGGGTCATGATGATTCACAAATTGAATTGATACCCAATCCCAAGCCTGAGGAACGTTATGCCGAACCTTTGTTCAAATGTTGTGTACCCACTTGTGGCAAGACAAGGAAATTTGATGAGGTACAAATGAATAGTTTTCCCAAGGATCCCTCAATGTTTCAACGTTGGCGTCATAATCTACGTTTGGATCATTTGGATTTCAAGGAGAGAGAACGTTATAAAATATGCAATGATCATTTTGAGGATATATGTGTGGGCAAGACACGTTTAAATATGGGTTCCATACCCACTCTAGAGTTGGGACATGATGAAACTGAAGATTTATATCAAGTGAATCCTAAGGATTTGCAAAGTAATCTATTTGGACGTCAGCGTAGATTAATGGGAGGAGATGGCAAACAGGAGGAGGCGGAGCAGGAGGGGGATATGGAGCAGGTGATGTCAGAAGCCACCGTTTCGAATACCAAGCAGGTGAAAATCAAAAAATCCCTGGCCGAACTCAAGTGCTGTGTCAGCAGCTGTGGACGTAGTCGTTTGGAGCATGGCGCACGCCTATTTCCCTTGCCCACGGGCAAGCAACAGCAACTTAAATGGCGTCATAATCTACAACTGGAACCCCAACAGGTCAATAAAATGACCAGAGTCTGCAGTGCCCACTTCAATAGACGCTGCATAGATGGTAAACATTTGAGAGGATGGGCCATGCCCACTCAACAATTGGGACATAAGAAACATCATAATATCTATGAGAATCCCAAGAATATACCGGGTTTCTTTACGCCCACCTGTGCTTTGGGTCATTGTCGTAAAAGACGTAGCATAGATAATGATTTGCGTACCTATCGTTATCCCAGGACAGAGGATTTGTTAGAGAAATGGCGTGTTAATCTACGTTTGGCCCCAGATCAGTGTAGGGGACGCATATGTGCAGATCATTTTGAGCCGCAGGTGAGAGGAAAACTGAAGTTAAAAACTGGAGCGGTGCCTACCTTGAAATTGGGTCACGATGAAGATTTGGTATATGATAATGAAGCCCTTAAGGCGGGTCTAAATGAGGAAGATGAGCAGGCCAAAAAGGAGCAGCAGGCAGAAGAGGAAGATCAAGAGGAGCCCGAAGAACTGCAAGATCCCGCCGAGGGCGACGAGCAGGATGATGAGCACAGCGGCTATTTTGATCCCCTAGAACTGGTAGAAACCTTTGCCACTGATCAGGCCGATGAGCCACCCGATGAGAACTCACCTGGCGGCACTCCACCTCCTCCCTCGGCTGCTCCTCAATTAAGGCGTGATAAACCCGCCAATAATGTCACACCCATCTGCTGTCTACAACATTGCCGCAAGGAACGCACTCCACGTTATCTTTTAAGCACTTTTGGTTTTCCCAAAGATCAACAACTGCTGCTCAAATGGTGTGCCAATTTGCAATTGGAACTGGAAGACTGTATAGGACGTGTGTGCATGGAGCACTTCGAGGCCGAGGTCTTGGGTTCACGTAAACTTAAACAGAATGCAGTGCCCACTCTTAATGTGGGTCATGATTTGGATTTACCCTACACGTGTAATGGTCAGGAACGTGACTCAGAGGATCAGTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGTCGTAAAAGGAAGCTAGAGGAACCGCCAGATATTCAGCGTGTAAAACAGGAGAAGAAGGAGGAGGCGGAGGAGCCGCCGCCGAGAGAGAGTAAACCAAAGCTGTGTTGCCTCTTACATTGTGGTAAAATGAAGGGTCTACAACGTTTACCCCAGGATCAGTTGTTGCTTAACAAATGGTTACGCAATCTCAAATTAGTTATGGCCAAACCCACCGATCCTCAGGAATTGCGTTTATGTTTAGAACACTTCGAACCGCAAGTGGTAAAAAATGGTGAAGTCTTGCCCTTGGCTGTGCCCACTCTTCAGCTGGGTCATCAGGATACCAAAATATATCGTAATAATGTGAGCTCCAATGAAAGTGGTTGCATAGTGGCCAGTTGTCCCTGTGCTCGCCTCAATCTCTATCGTAGCTATGATCTACCACGAAATTCGATTATAAAACAAGCCTGGTTAAGTCATCTCAAGCTCAAGCCAAATACCCAGGGACAATTGTGTGTCATGCACTTTATGCAGCTGTACGAGGAGGTCAAAATAGAGCCTTTGTCGGCCGAACTGAAGTCCGTTTACGACTCCATAGCCAACTCTGCACAGGCCTTGAAACTGCGCTGTGCCGTACCCGGTTGTCGTTCCAAATACACCGATAATATACACTTGACCCGCTTGCCCTACGATCAGCACATCTTTAGCAAATGGTTACACAATACCAAAATTGAATATGATGCCTCCCGGCATTATATCTATCGCATTTGCCTGCTACACTTTGAGCCCGCTGCCTTGGGCGCCGTACGTCCCAAATCCTGGGCCGTTCCCACTCTCAGACTTAATCATCAGGATGAGATTTATCTGAATTCCACTAAAGAGGAACCGCAACTTCATACACCGCTGCACATTAAAACTGAACCCTCGGTACCGCTGCGTGTCAAAACCGAACTGGCTGCTTTGACCAGTCCCAGCGTTAGCTCCACGCCCAGTCCACGTGGCAAAATTCGTATTTGCTGTATACCCACTTGTCTGCAACAGTCCAGCTCTCAAATACGTCTCTATCGTTTTCCCACCGCTGAGCCGGCTCTTCTCAAATGGCTGGTCAATACGCAACAGCAACCACGTCTCGTGGATGCCCTTACGTTGTATGTGTGCCAATCCCATTTCGAACCCGAAGCCATTGTCAAAAAGCAGCTGAGAAGTTGGGCTGTGCCCACCTTGAATTTGGGTCATGATGGTCATATTATACCCAATGCCAAGCATAATGGTAACATTGCCGATAGTCAGGAGACGGAACAGGCCATGGAATTTATCCGAGAGAATTATTGTTCCGTACTCAGTTGCTTTCAGCCACGAAGTGAGGTATTGCGTCTGTACGAGTATCCCAAGGATATGCCCACAATACGAAAATGGGCGGCAAATTGTAAACATCGCTCCATGCAGGCAAGCAGTCATGGATTTCAGGTTTGCCAATTACACTTTGGTCCCGAATGCTTTGACACAGAAACTGGGGAATTGCGTGAGGGTTCAGTGCCCACTCTGGATATAAACAAAAGTCATACCGAATTGCGTTGCCTGGTATCAGGTTGTGTTAAGGATGAATCTACACCACGTCGTCGTTACTACAAGCTGCCCAAGCGATCCACCCAATTGGCGGAATGGTGTCACAATCTCTGCGTAGAACCGTCTGAACTTTCGCCCACGGATCAGCACATATGTGAGCGTCATTTTGAACCAAGCTGCCTGAATAGCTACAAACAGTTGCGTCCTGGAGCACGTCCCACCTTACATTTGGGTCACGAACAGCAAGTAGATTTGCTCAACAATCCGGATACCTTCTGTGAGCCGCCGGAAGAGGCTGTGGTCTGCTGTGTGCCACAATGTGAACGCTTGCAACAGCCGGATGAGCAGCAATTTAGTTACTTTCCCAAAATGCGTCTACTGGCCGACAAATGGCTGCACAATTTCCGACTGGAAGCAAGCAAGGAGCAGCTGGTTAAGCTTAGAGTGTGTCTAAGGCATTTCGAGTCCACATGTTTTGAAAATGAACGTGTACGTTTGGGTGCCATGCCCACTTTGGAGTTGGGTCATGACGATCGCGATATTTATCAGACCGATGAGCAGGCACTGGGTAAGGGTCGTAAACATCATAAGCGTAACAGTGGCCTGGGTATGGACTGTTGTTATGCTCAGTGTGCGGAATTGCAAAAGAATTACCAAAGAGTGGTCTATGATTTGCCCCTAGAGGAGCAACTACGTAATGATTGGCTGCAGCATTTAGGTGTGCAAGAGGAGGAGGAGCAACAGCCCTTAAAGCTGTGTCCCCTACATTTTATTATACTGTATGAACACAGCCAGGAACACTTTCCAGATCATGCACCAGAAGAGTTACTCGATCATAACTATGAGGATGCCAGGAATAGTGTGAGAATACGTTTGATTAGTTGTGCCGTACGTGATTGTAAAACCATACAGCCACGGGATGGTGGTCGTTTACATGGTCTACCACCTAGGAGAGATGTGGCCCAGATGTGGCTGGAGAATACTTTAATAGAGTACTTGGAGCCCCAGCGTTATATGCTTAAGGTGTGCAGCAAACACTTTGAGCCACATTGCTTTACGGATAGTCGCAAGCTAAAATACTGGAGTATGCCCACGCTATATCTGCCCGTGGAGGAGGCGGTACATAAAAATCTTACCGAAGAGGAATGGCAAAAAATGAATGAGCAACCACCAAGCAAAGCAGAGAAAAAGGAGAAGGAAAATGAGGTGGAGTTGGTTTATGAAGAGAACTCTTTGCTGGAGCCCATTGTCCTCATGGAGCATGCTGAATCCGAAAAAGAAGAGGAGGAGCAGGAGCCCGAAAAGGTAGAGGATTTACTTACACCCGAACTGGTCATCGATGAGCGCACTCAGGACACCCAGGAGGAGGAAATGCAAGAGCTGGAAGTTTTACTAGAGGTGGGTCATGTGGAAAAATGTGCCAGCTATGAGAAAATGGATCAGAAATCTCATTTACCCTACTCCCATGAAGCCTCAGCCACACCCTCACATTCAGCCGGCGGCGGTTCGGCTTTACGTAATCAATACAATGCCAGACACTGCAGTGTCCAGGGTTGTCAAACCAGTTCCCATGACATAGATGGACAAATTAAATTACACAAATTTCCCACCTCAATGGATGCTACCAAAAAATGGATGCACAACACCCAAGTGGATGTGGATGAAAGTGTTTCCTGGCGTTATCGCATATGCAGCTATCATTTTGAACAGGATTGCTTTAATGGTGCACGCATACGTAAGGGTTCTATGCCCACTTTACATTTGGGTCGTGCACGACCTGAAGAGATTTATGACAATGAATTTGCCCAAATGGAAATGGAGGAATCTTCTGTGGTAATTGAACCTGCTCCTGCCATAACCAACAGTAGTCGGGAAACCCAGGCCTCCAAGGGGGGTGAGGTTACACAATTGATTATACCCCGCCCTGCACCACCACGCAAATCCAGTAAATTCTGTCAAATAGAAAGCTGTCCCAATCATTTGACCAGTGAGAATATGACCCTGCATAAGTTTCCCCACTCGGTGGATATATGCGAACGCTGGCAACACAATACCCAGGTGCCCTTTGATCCGGATTATCGTTGGCGTTATCGCATTTGTAGTGCCCACTTCAAATCCGTGTGTTTGGTCAATATGAGATTGCTACATGGTAGTGTGCCCACTCTCAAGCTGGGTCCTCGAGCTCCTGCCCAATTATTTGATAATGATTTTGATGCCATGCACAGCAGATTGGATAAGAAGCAATCTCAAAAGCCTTATCCACAATATCAGCCGGAGCAAGAGTCATACTTGGAGCCTGAATGGGAGGAGGAAGAGCAGTTGTTCGATGAACCTGATGATGAAGAGGAGGAACCGCATAGCTCATTTGCCAGCTGGCGTCATCAATTGCGTCTGCCTGCCGTCAAACAGGAAAAGTCACCCTATAATCCCATCAAATCGGGCTATGATAAATGCTCTTTGGTTCACTGTCAACGTCAGAGATCACTACATGGTGTACATATCTTTAAATTTCCACGCTCTAGACGACATCAGCAACGTTGGATGCATAATTTACGCATACGCTATGACGAGCGGCGACCCTGGAAATATATGATTTGCAGTGTACACTTTGAACCGCGTTGCGTGCGCCTACGAAAACTGCGACCTTGGGCAGTGCCTACTCTGGAGTTGGGTGACAATGTGCCAGATCAAATTTATACCAATGAACAGTGTCAGGAAATGGCCACCGATATGAGTGAGGCTGAAACCGATGAGGAAGATGGCCTGCAAGAGGATGATGATGATGAGGAGGAGGAAGTGGATTTGGCTCCGTATATCAAAAAGGAGAAACGTAGTCGATCGGATCCCTGGTCAACGGGTCAATTAACCGCCTGGAAGGCCAAACAGTGTTGCCTGCCCTATTGCCGCAGACCAAGGGGTGATGGCATACGCTTGTTCCGTTTGCCCAGCAATTGTAACTCTATCAAATATTGGGAACAGGCCACCGGCATGGTATTCAAAGAGTCACAAAGGAATACACGCCTGATTTGTAGTCGTCACTTTGAGCCGGAATTAATGGGTGTAAGACGTTTAATGCGAAATGCCATACCCACCCGGAATTTGGGACCACAGAGTGAGAGCTTTTCAGCAGAAATGGTGGTGCCCACGCCGACTGGACCCATATGCTGCATGGCGGATTGCACAAGTGAGGGAAGTACCAAACTACATCAGTTTCCCAGAGATCCCACTCTCCTGCAACAATGGTGTCAATCCCTAAGACTGTCGGACTTCCAACGCTATCGCGGCAAACACATTTGCTCCGCCCATTTGCCCGCTCAGGCATTAACCTGCCTCATCTGTGGCGTGGACGATGTTCATCTGCCTTTGCTCGATTTTCCCGAAAATCGTAATCATCGCGCCAAATGGTGTTATAATCTCAAAATTGATACCATACCGAAATGGGATACATCCAAGCACATTTGCTGCAAGCATTTCGAATCATATTGCTTTGTCCAACCGGGTGAATTACTACCGGATGCCGCACCCACATTGCATTTGAATCACAATGATACGAATATATTCCACAATGATTATACCATAGATGATCACTCCATAATCCGCATTAAGGATGAACCTCTGGACAATGACGATTTCCTGTTGGTGTAG
Protein Sequence
MSGSYLGMGTSSGMNVTDSMDICCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFESYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNAPLLTKWCANLQRSVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHEDIAYQLPTPEEVAEFFARPSAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEEAEVLSKWAHNLQLEVSELTNLRICNLHFESHCIGKRMRPWAIPTLNLSGNIENLYENPEHTMLYKRRTRDPNRDLSGGHKPTWVPRCCLPHCRKVRAVHNVQLYRFPKLNRSTLSKWAHNLQVPMVGSAQRRVCSAHFEPHVLSKKCPVPMAVPTLELNSPAGYKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGIHLFRLPHSPTQLRKWMHNIRTRPRTAMRAQYRICSVHFETHSFNGKRLSAGAIPTLELGHQDDDIFPNEAQSFVEEHCAVEGCDSSKEQAEVRLFRFPTDDDDLLWKWCNNLKMNPVDCVGVRICNKHFEADCVGPKHLYKWAIPTLELGHDDSQIELIPNPKPEERYAEPLFKCCVPTCGKTRKFDEVQMNSFPKDPSMFQRWRHNLRLDHLDFKERERYKICNDHFEDICVGKTRLNMGSIPTLELGHDETEDLYQVNPKDLQSNLFGRQRRLMGGDGKQEEAEQEGDMEQVMSEATVSNTKQVKIKKSLAELKCCVSSCGRSRLEHGARLFPLPTGKQQQLKWRHNLQLEPQQVNKMTRVCSAHFNRRCIDGKHLRGWAMPTQQLGHKKHHNIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRTEDLLEKWRVNLRLAPDQCRGRICADHFEPQVRGKLKLKTGAVPTLKLGHDEDLVYDNEALKAGLNEEDEQAKKEQQAEEEDQEEPEELQDPAEGDEQDDEHSGYFDPLELVETFATDQADEPPDENSPGGTPPPPSAAPQLRRDKPANNVTPICCLQHCRKERTPRYLLSTFGFPKDQQLLLKWCANLQLELEDCIGRVCMEHFEAEVLGSRKLKQNAVPTLNVGHDLDLPYTCNGQERDSEDQSVFRLWSLKHCRKRKLEEPPDIQRVKQEKKEEAEEPPPRESKPKLCCLLHCGKMKGLQRLPQDQLLLNKWLRNLKLVMAKPTDPQELRLCLEHFEPQVVKNGEVLPLAVPTLQLGHQDTKIYRNNVSSNESGCIVASCPCARLNLYRSYDLPRNSIIKQAWLSHLKLKPNTQGQLCVMHFMQLYEEVKIEPLSAELKSVYDSIANSAQALKLRCAVPGCRSKYTDNIHLTRLPYDQHIFSKWLHNTKIEYDASRHYIYRICLLHFEPAALGAVRPKSWAVPTLRLNHQDEIYLNSTKEEPQLHTPLHIKTEPSVPLRVKTELAALTSPSVSSTPSPRGKIRICCIPTCLQQSSSQIRLYRFPTAEPALLKWLVNTQQQPRLVDALTLYVCQSHFEPEAIVKKQLRSWAVPTLNLGHDGHIIPNAKHNGNIADSQETEQAMEFIRENYCSVLSCFQPRSEVLRLYEYPKDMPTIRKWAANCKHRSMQASSHGFQVCQLHFGPECFDTETGELREGSVPTLDINKSHTELRCLVSGCVKDESTPRRRYYKLPKRSTQLAEWCHNLCVEPSELSPTDQHICERHFEPSCLNSYKQLRPGARPTLHLGHEQQVDLLNNPDTFCEPPEEAVVCCVPQCERLQQPDEQQFSYFPKMRLLADKWLHNFRLEASKEQLVKLRVCLRHFESTCFENERVRLGAMPTLELGHDDRDIYQTDEQALGKGRKHHKRNSGLGMDCCYAQCAELQKNYQRVVYDLPLEEQLRNDWLQHLGVQEEEEQQPLKLCPLHFIILYEHSQEHFPDHAPEELLDHNYEDARNSVRIRLISCAVRDCKTIQPRDGGRLHGLPPRRDVAQMWLENTLIEYLEPQRYMLKVCSKHFEPHCFTDSRKLKYWSMPTLYLPVEEAVHKNLTEEEWQKMNEQPPSKAEKKEKENEVELVYEENSLLEPIVLMEHAESEKEEEEQEPEKVEDLLTPELVIDERTQDTQEEEMQELEVLLEVGHVEKCASYEKMDQKSHLPYSHEASATPSHSAGGGSALRNQYNARHCSVQGCQTSSHDIDGQIKLHKFPTSMDATKKWMHNTQVDVDESVSWRYRICSYHFEQDCFNGARIRKGSMPTLHLGRARPEEIYDNEFAQMEMEESSVVIEPAPAITNSSRETQASKGGEVTQLIIPRPAPPRKSSKFCQIESCPNHLTSENMTLHKFPHSVDICERWQHNTQVPFDPDYRWRYRICSAHFKSVCLVNMRLLHGSVPTLKLGPRAPAQLFDNDFDAMHSRLDKKQSQKPYPQYQPEQESYLEPEWEEEEQLFDEPDDEEEEPHSSFASWRHQLRLPAVKQEKSPYNPIKSGYDKCSLVHCQRQRSLHGVHIFKFPRSRRHQQRWMHNLRIRYDERRPWKYMICSVHFEPRCVRLRKLRPWAVPTLELGDNVPDQIYTNEQCQEMATDMSEAETDEEDGLQEDDDDEEEEVDLAPYIKKEKRSRSDPWSTGQLTAWKAKQCCLPYCRRPRGDGIRLFRLPSNCNSIKYWEQATGMVFKESQRNTRLICSRHFEPELMGVRRLMRNAIPTRNLGPQSESFSAEMVVPTPTGPICCMADCTSEGSTKLHQFPRDPTLLQQWCQSLRLSDFQRYRGKHICSAHLPAQALTCLICGVDDVHLPLLDFPENRNHRAKWCYNLKIDTIPKWDTSKHICCKHFESYCFVQPGELLPDAAPTLHLNHNDTNIFHNDYTIDDHSIIRIKDEPLDNDDFLLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-