Basic Information

Gene Symbol
-
Assembly
GCA_018904105.1
Location
JAEIFR010000226.1:863133-874353[-]
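
The location string above packs a sequence identifier, a coordinate range and a strand into one field. A minimal sketch of how it could be split apart follows; the pattern, the helper name `parse_location`, and the 1-based inclusive coordinate convention are assumptions for illustration, not something the record states.

```python
import re

# Assumed layout: <sequence id>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into sequence id, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return {
        "seqid": match["seqid"],
        "start": int(match["start"]),
        "end": int(match["end"]),
        "strand": match["strand"],
    }

loc = parse_location("JAEIFR010000226.1:863133-874353[-]")
# Span length, assuming 1-based inclusive coordinates (an assumption).
span = loc["end"] - loc["start"] + 1
```

Under that convention the locus spans 11221 bases on the minus strand of the scaffold.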

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include its location at the N-terminus of the protein, a size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
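
The C2CH consensus quoted in the description can be written as a regular expression for a quick scan of a protein sequence. This is only a rough sketch: the pattern is a literal transcription of the spacing given above, the helper name `find_c2ch` is hypothetical, and a pattern match is a heuristic that does not replace the profile HMM search reported under Hmmscan Out. Note that `.` also matches unknown residues written as `X`.

```python
import re

# Consensus from the description:
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
# The lookahead lets overlapping candidates be reported as well.
C2CH = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")

def find_c2ch(protein):
    """Return (1-based start, matched span) for each C2CH-like candidate."""
    return [(m.start() + 1, m.group(1)) for m in C2CH.finditer(protein)]

# Synthetic toy sequence (not taken from the record) with one motif at position 3.
toy_motif = "C" + "AA" + "C" + "A" * 40 + "C" + "AA" + "H"
toy_hits = find_c2ch("GG" + toy_motif + "GG")
```

Running `find_c2ch` on the protein sequence of this entry would list candidate C2CH spans that can be compared with the `ali coord` ranges of the domain hits.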
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 23 4.8e-15 7e-12 45.8 4.1 1 86 378 450 378 451 0.85
2 23 3e-15 4.3e-12 46.5 4.7 1 87 478 547 478 547 0.83
3 23 6.5e-16 9.4e-13 48.6 0.4 1 87 569 641 569 641 0.85
4 23 6.5e-16 9.5e-13 48.6 5.4 1 86 741 810 741 811 0.82
5 23 7.8e-15 1.1e-11 45.1 3.5 1 86 835 906 835 907 0.82
6 23 1.2e-12 1.8e-09 38.1 1.0 1 87 942 1010 942 1010 0.81
7 23 9.1e-11 1.3e-07 32.1 1.4 1 86 1057 1126 1057 1127 0.76
8 23 6.2e-16 9e-13 48.6 0.1 1 86 1154 1223 1154 1224 0.82
9 23 2.1e-12 3e-09 37.3 1.1 1 85 1245 1313 1245 1315 0.81
10 23 3.6e-15 5.3e-12 46.2 1.6 1 86 1342 1413 1342 1414 0.85
11 23 8.4e-14 1.2e-10 41.8 1.6 1 85 1493 1561 1493 1563 0.81
12 23 1.3e-12 1.9e-09 38.0 0.1 1 86 1586 1654 1586 1655 0.81
13 23 3.5e-13 5.2e-10 39.8 1.4 1 87 1792 1861 1792 1861 0.80
14 23 1.7e-11 2.4e-08 34.4 0.3 1 61 1933 1988 1933 2012 0.81
15 23 0.031 46 4.7 0.0 1 58 2020 2070 2020 2091 0.73
16 23 4.2e-12 6.2e-09 36.4 0.2 1 86 2109 2178 2109 2179 0.85
17 23 1.1e-14 1.6e-11 44.6 2.1 1 86 2250 2319 2250 2320 0.82
18 23 4e-13 5.8e-10 39.6 1.0 1 86 2355 2426 2355 2427 0.81
19 23 7.2e-13 1.1e-09 38.8 4.2 1 87 2437 2509 2437 2509 0.84
20 23 1.2e-14 1.8e-11 44.5 0.1 1 86 2539 2612 2539 2613 0.79
21 23 0.00028 0.41 11.3 0.1 1 57 2646 2695 2646 2706 0.80
22 23 2.1e-15 3e-12 47.0 0.1 1 86 2734 2806 2734 2807 0.81
23 23 3.3e-15 4.8e-12 46.3 1.9 1 87 2946 3019 2946 3019 0.82
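
Each row of the domain table above carries thirteen whitespace-separated values. The sketch below shows one way to load rows into typed records and keep only the hits below an independent E-value cutoff. The `DomainHit` type, the `parse_domain_row` helper, and the cutoff of 0.01 are assumptions chosen for illustration; the record itself does not state which threshold was applied.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # domain number (#)
    total: int       # number of domains reported (of)
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

def parse_domain_row(line):
    """Convert one whitespace-separated table row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(int(f[0]), int(f[1]),
                     float(f[2]), float(f[3]), float(f[4]), float(f[5]),
                     *(int(x) for x in f[6:12]), float(f[12]))

# Four rows copied from the table above.
rows = """\
14 23 1.7e-11 2.4e-08 34.4 0.3 1 61 1933 1988 1933 2012 0.81
15 23 0.031 46 4.7 0.0 1 58 2020 2070 2020 2091 0.73
16 23 4.2e-12 6.2e-09 36.4 0.2 1 86 2109 2178 2109 2179 0.85
21 23 0.00028 0.41 11.3 0.1 1 57 2646 2695 2646 2706 0.80
"""
hits = [parse_domain_row(line) for line in rows.splitlines()]

I_EVALUE_CUTOFF = 0.01  # assumed threshold, not taken from the record
kept = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
```

With this cutoff, domains 15 and 21 from the sample rows are dropped, which matches their low scores and truncated HMM coverage in the table.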

Sequence Information

Coding Sequence
ATGGCACCGCCGCCGGCGCCCGCCAATCGTCATGCTCTCAATGCCAGCGGCGAGATGATAATAAAATCTGAACCTGTTGACGAACATGCGTACAAGTCCAACTATATTGATGATAATAGACCGTTTGCCGACTTTAGTAAATTTCCTGAATTCGGCGATGACATGCTGAGTCCCAAGATGGAGATGACCGTCAAGGATGAGGCGTATCAAAAGAACCCGCTCAGCTATCCACGCCGCAAGCTGCAATCGGAACGTTCGTCGGAGAGCTTACCCATTTGTCAACGTTGCAAGGAGGTGTTCTTTAAGAAACAAATCTACTTGCGGCATGTGGCACAGAGCAGTTGCACCATACAGGAGTATGACTTCAAGTGCAACATATGCCCCATGTCCTTCATGAGCACGGAGGAACTGCAAAAGCACAAGCATATGCATAGGGCCGATAAGTTCTTCTGTCACAAATACTGTGGCAAGTACTACGATACCATTGCAGAATGCGAATCACATGAGTACATGCAGCATGAATATGAAAGCTTTGTTTGCAATATGTGCTCCGTGACGTTTGCCACACGGGAGCAGCTCTATGAGCATTTGCCGCAGCACAAGTTCCAGCAACGTTACGATTGTCCCATTTGCCGGCTATGGTATCAAACGGAATTAGAGCTTCATGAGCATCGATTGGCGGCACCCTATTTCTGTGGCAAGTATTACGCAGGCGCACAGGCCACAGCGCACCAACAGCAGCAGCAGCATCAACAACACGCACAGCAACACCATCATCAGAGCGGCAACTACAAGTTGCAGGATTGTCACATGGGTACCATGGAAATGTCTCCGTCACATCATAAGACAGCGGGCAGCGCGTTGCCTGCTACTGCAGCGTTGAGCTCGTTGTTGCAACAACGACAGGCAAATGCTGATGGTGCCGCATTGTTTGCGTCGTCCCTCAAAAACGAGGTGAACGTAAAGTTGGAGCGAAGCTATAGCAATTCTACTACAAGTGAATCTGGCTACAGTCTGCATGACAGCAGTAATTATAATAATGCATATGGAAGCGATGCATCGCTGCAGGGCGCAGCTATTGGAGGTCCACAGGCGCATTCTTCAACGCTGGACGATTCAGAGGATGCGCTATGCTGTGTGCCATTATGCGGTGTGCGGAAGAGTACCAGCCCCACGCTGCAGTTCTTTACGTTTCCCAAAGATGAGAAGTACTTGCATCAATGGCTGCATAATCTCAAGATGTTTCGTGTTCCGGCCTCGAGCTATGCAACTTTTCGAATTTGCAGCATGCATTTTCCGAAGCGTTGTATCAACCGTTATTCATTGTGCTATTGGGCTGTGCCCACGTTCAATCTGGGACACGATGATGTTGCCAATCTTTATCAGAATCGTGAGCTAACGAACACCTTCACAACCGGCGAGATTGCGCGTTGCAGTATGCCTAACTGCACCAGTCAGCGGGGTGAAAGTAATCTCAAGTTTTACAACTTCCCCAAGGACATCAAGAGCCTGATCAAGTGGTGCCAGAACGCACGTCTGCCCGTCCAAGCCAAGGAGCCGCGTCACTTCTGCAGTAAACACTTTGAGGAGCGTTGCATTGGCAAGTTTCGCCTAAAGCCATGGGCGGTGCCGACGCTACATTTGGGTGCCCAATATGGCAAGATTCATGACAATCCTAAGAACTTGTATGTGGAAGAAAAGCGTTGTTGCCTTAATTTTTGTCGTCGCAGTCGCTCATCGGACTTTAACATGTCACTCTATCGCTTTCCCAGAGATGAGGTGCTTTTGCGACGTTGGTGCTACAATTTACGACTTGATCCTGCCGTTTATCGTGGCAAGAATCACAAAATATGCAGTGCACACTTTATCAAGGAAGCTCTGGGGTTACGCAAACTATCGCCGGGTGCTGTTCCTACGCTTCATTTGGGACACAACGATACGTTTAATATCTACGAGAATGAACTCTGGCCTCCACCGACAGCCTCTACGCCCACCANNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTCGGCATCGCATTATGTCGATCCAGAAGTGAGTGCCTCGTATATGGGCATTGGCGGCTCGGGTGGATCGGCATCGGGTATGAATGTGAGCGACAGCATGGATGTGTGCTGTGTGCCAAGTTGCGAGAGCAAGCGGCACAATAACGAGAACATTACATTCCATACGATACCCCGACGACCTGAGCAGATGCGGAAATGGTGTCACAATCTCAAGATACCCGAGGATAAGATGCACAAGGGCATGCGTATTTGTAGTCTGCACTTTGAGCCCTATTGCATTGGTGGCTGCATGCGTCCGTTTGCGGTGCCTACATTGAAACTAGGCCACGACGACGATGATATTCATCGCAATCCGGATGTGATTAAAAAGCTGAATATACGTGAAACCTGCTGTGTGGCTGTGTGCAAGCGTAATCGGGATCGAGATCATGCGAATCTGCATCGTTTCCCTAGCAATGTATCGCTGCTGACCAAATGGTGCGCTAATCTGCAACGGCCTGTGCCGGATGGTACTAAACTCTTTAACGATGCCATATGTGAGGTACACTTTGAGGATCGTTGCCTGCGCAACAAGCGATTGGAGAAATGGGCAGTGCCTACGCTGGTGCTGGGCCATGAGAATATTGCATATCCATTGCCCACGGCAGAGCAAGTAGCTGAGTTCTATGCCCGTCCCAGTGCACCCAATAATGGCGAGGAGCAAGGCGAATGCTGTGTGGAGACCTGTAAGCGTAATCCCAGTGTCGACGATATCAAACTCTATCGCCCGCCCGAGGAATCGTCCGTGCTGGCGAAATGGGCGCATAATCTTGAAGTGGATGCCGCCAAGTTGCCAAGCATGCGTATATGTAATCTACACTTTGAATCCCATTGCATTGGCAAGCGAATGCGACCCTGGGCCATTCCTACACTCAATCTGTCCAACAATATTGAGAATATGTATGAAAATCCTGAGCATCATATGATATATAAGCGTCGCACACAAATCAAAACCGAGAGGGGATCCTCTTCGGGTGCTGGTGCGAGCATGTCGAAACCGACTTGGGTACCACGTTGTTGTTTGCCACATTGTCGCAAGGTGCGTGCTCTCCACAATGTGCAGCTATATCGCTTCCCGAAACTCAATCGTTCTACGCTGGCAAAGTGGGCGCATAATCTTCAAGTGCCATTGGTGGGCAGTGCCCAGAGGAGACTGTGCTCCGCTCACTTTGAGCCGCATGTGCTGAGCAAAAAGTGTCCGGTACCGCTGGCGGTACCCACACTGGATCTCAATGCGCCACCCGGCTATAAAATCTATCAGAATCCCGCCAAATTAAAGGCCAACAAATTGTGTCTGCAGCGCGTCTGCATTGTTGAGAGTTGCCGACGTCAGCGTGGTCAGGGCGTGCAGCTCTTCCGTCTTCCTCACAGTCCCACGCAGCTGCGAAAGTGGATGTACAACATACGGATGCGACCAAGAGGCGCCATGCGGCAACAGTATCGCATCTGTTCCACACACTTTGAGACGCACTCGTTCAATGGAAAACGATTGAGTGCTGGAGCAATTCCAACGCTTGAGCTGGGCCATGACGATGAGGATATCTATCCGAATGAGGCGCAATCATTTGTGGATGAACACTGTGCCATCGATGGTTGTGATGCATCTAAAGAGCAGCCAGATGTGCGTCTCTTCCGATTTCCCACTGAAGATGAGGATTTGCTATGGAAGTGGTGTAATAATCTGAAAATGAATCCTGTCGATTGTATTGGTGTGCGCATATGCAACAAACATTTTGAGGCGGACTGCATTGGACCAAAACATCTGTATAAGTGGGCCATACCCACCCAGGAATTGGGCCACGATGATTCTCAGATTGAGCTGATAACTAATCCTAAGCCAGAAGAAC
GCTATGTTGATCCAGTATTCAAATGTTGTGTTCCCACATGCGGCAAGACGCGTAGATTCGATGAGGTACAAATGAATAGCTTTCCCAAAGATCCCACACTGTTCCAGCGCTGGCGCCACAATCTGCGACTCGAGCATCTCAATTTCAAGGAGCGTGAACGCTATAAGATCTGTAATGCACACTTTGAAGATATATGCATTGGTAAGACACGTCTTAACATTGGCTCCATACCCACACTGGAGTTGGGTCACGACGAAACGGAAGATCTTTTCCAAGTGAATCCTGCAGAGTTACAAAGCAATCTGTTTGGTCGCCAGCGACGGGTACAAGAATCCATGGGTATCAGAATCAAGCAGGAAGAGCACTCTGAGCAGGAGGAGGACAACAACAGGATCACGGGTGATGTCACCATGTCAGAGGCTACGGATTTGAATACGAAGCAGGTTAAAATAAAGAAAAAGCTGTCTGAGATGAAGTGCTGCGTGCCAAGCTGTGGTCGTAGTCGTCTGGAGCACGGGGCACGTTTGTTTCCCTTCCCAACTGGCAAGCAACAGCAAAGCAAGTGGCGTCACAATTTACGCCTGGAGGCCAGTGACGTGGATAAATCGTCACAAGTTTGCAGTGCACACTTCAATCGACGTTGTATTGATGGAAAGCAATTACGTGGATGGGCAATACCCACACAACAGCTGGGCCATCAGGAGCAGCCAATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCGCACTGCCGCAAGCGGCGAAGTATTGACAATGATTTACGCACTTATCGCTATCCGCGAAGTGAGGAATTGTTGGAGAAATGGCGGGTGAATCTACGCTTGTCGCCGGATCAGTGTCGTGGACGCATCTGTGCAGATCACTTTGAGCCAATGGTGCGTGGCAAGCTGAAGCTAAAGACTGGTGCGGTTCCAACGCTTAAATTGGGACATGATGAGGGCGTGGTGTTTGACAATGAGGCTATTAAGACAGGACTGCAGTTAGATGGTGCTGAGCCAGAGCCGGAAGAGGGTGACGCCAGCAGAGAATCGCTGATTAAGATTAAAAAGGAGAAAGTAGAGCCGGATGAGGAGTTGGAGCAGACGCAACTCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTATTTTGATCCATTGGAATTGGTTGAAACTTTTGCCGAGGACAATGCAGTCGATGATGATGATGATGATATTCCTGGTGATGATGATCAGCTGTTACTGCCGGAGACGCAGCAAGTGAAGCTAGAACATCCATTCCCGCCAATGCGCCGCGAAAAGGCGGTGAATAATGTTACGCCTATTTGTTGTCTAAAACATTGCCGTAAGGAACGGACTGCCACACATCAGCTCAGCACCTTTGGCTTTCCAAAGGATCGTCTGCAGCTGCGAATGTGGAGCACCAATTTGCAAATCCCTGAAAGCGACTGCGTGGGTCGCGTTTGCATTGAGCATTTTGAGTCAGAGGTGCTGGGAACGCGGAAGCTGAAACAGCATGCGGTGCCCACATTGAATCTGGGTCACAACGCACCTCTAATTTATAAATGCAATGGTCAGACTCAACCTTTGGGTATCTTCGATGAGCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCTGGAAGTAATGGAACCGCCCGATCATCATCATCATCGTCATCCACAAGCAGTGATTAAGCAGGAGGAGGAAACGGATGTACACAGCTGTTGCCTTCCCCACTGTGGAAAGCGGTCAGATCTTCATGATGTACAGCTGCATGGGCTGCCCAGTAATAGATTCCGACTGCGAAAATGGTTACACAATCTAAATCTTCCGCAGAGCATTCCAATCGGTAGTCATACTCGGGTCTGCAACGAACACTTTGACCTACAGCCTGGTGAGCCAATAGAAGATTGTATACCCACATTG
AATCTTGGACACAGCGACACAAATTTATATCGCAATCAGACTGCGCCCACCTCAGCGTGCTTGGTACCAAGCTGCCCGTGTGCTCGACTCAATCTCTATCGCGGCTATGATTTGCCCGAACATCATTTGGTGCAACAGGCTTGGCTGGAGTTTCTTCGGCAGCCTATACCGCAATCGCCTGGCGATGGTCAACTCTGTGTGATGCACTATATGCAGCTGTACGAGCAGGTCTCGTTGCCCCGAGAGGTGCCAGCAGCAGTTTTGCGTCAACTTCAGGATACCTACGAGCAGATCTCAAACTCTACCATGGCCATGAAGCTACGCTGCGCTGTTCCGGGTTGCTATTCAAAGTACACGGATAACATTAGGTTAACCAAACTGCCCACCTGTGCAGACATGTTTGCCAAATGGCTGCACAATACTAAGATCAAATATGATCCAAACCGGCATTATATATATCGCATTTGTATGCTGCATTTTGAGCCACGTTGTCTGGGTCCAGTGCGTCCCAAACTTTGGGCGGTGCCGACATTGCTATTGAACCACAACGATGCGGATATTTATAGAAATCCTAAGCTAGAAGTCAGCGTAGGCAGTTCTCCAGCCCAGTTGCAAGACCAAGTTGAGCGCCAGCCTGTGCCTGTGCCCGTTCCCGTGGAGCTACCATTACGCATCAAGACTGAGCTAGCTTACACAGGCAGTCCCAGTGCCAGCGCCAGTCCTAGTCCACGTGGCAAGTTGCGTATTTGCTGCATTCCCAGTTGCGCTCAGCAGGCCAACTCGTTGATGCGTCTCTTTAGGTTTCCAACCACCGAGACAATGCTCCTCAAGTGGTTGGTTAATACACAGCAAAAGCCACGTCTGGCGGATCCACAGCATCTGTTTGTCTGCCAAGATCATTTTGAAAGCGAAGCTATTTGCAAGAAGCAGCTAAGCAGTTGGGCTGTGCCAACCCTGAAGCTAGGACACGATGGCCATATAATTCCGAATGCCAAGCATAATGGGAATATTGCCGACACAAATGAGAACAGGCATACACTGCAATTCATTTGGGCCAACTACTGCTCAGTACTTGACTGCTTTGAGCCACGAAGTGAACAGCTGCGTCTCTTTGCCTATCCCAAAGATCGTCCTACCATACGCAAGTGGGCTGCTAATTGCAAACATCGTTCAATGCAAGCCAGCAGCGATGGCTTCCAGGTCTGTCAATCGCATTTCGAGCCACATTGTTTCGATCCGGAAACGGGAGGGTTGCGGGAGAATGCAGTGCCCACGCTGGAGTTAAGTCGTGGCATTAACAAAATGGGCTGCGTGGTGTCTGGTTGTTTCAAGGAAGAGAATGCACCACGTCGTCGCTTTTTCAAGATGCCAAAACGAATGACACAGATAATGGATTGGTGTCATAATTTTCAACTGGATCCATCAGTTATGACCAACGAGAAAAGTGTTGTGTGTGATCGGCACTTTGAGCCGCATTGTTTGAACGTGCACAAGCAGCTGCGTCGTGGAGCACGACCCACGCTACACTTGGGTCACACAGCAGACATTGAGATATTGCAGAATCCGGAGAGTTGGACACGTTGCAGTGAAGTGCCTGAATCTTCAGCTGATGTTTGCTGTGTGCCCAATTGTGGACGTTACAAGGATGCAGAAGAGGAAGATGATATACAATTGTTTAACTTCCCCAAAGTGCGCACCGTAGCGGAGAAATGGTTAAGAAACATTCGCCTGGAAGCAAATAAAATGAAGCTGGCCGAATTGAAAATTTGCAATGTACACTTTGAGATCTATTGTCTGGAAAACGGAAGAATTCAATTGGGTGCCATGCCCACGCTACAGTTGGGACACGATGATCACCATGATGTATATCGCTCCACTTGTCTGTCGTCAGCAAAAATAAAACGTTACTGCAATAGGAACGGATGCAGCTATGACTGTTGTTTTCCACAATGCGTGGAGTTGCAGAAGAGTTATCTGAGAATAAGCTATGACTTGCCAGA
GTCGGAAGCCATGCGTAATGCTTGGTTGGCATATATGGAATTGGAGGAGACACAAGAACAAAAGCTCAAGCTTTGTCCTCTGCACTTGATCATACTCTATGAGCACAGCATCGACAATATTCCAGAGCACATACCGGAGGAATTGTTGGAGGATAACTATGCAGCTGCAAGGAATAGTGTGCGCATTCGTATCATAAGCTGTGCCGTGCGTGGCTGCAAGACGCTCAAGACACGCGACGGCGGCATTCTTCATGGTCTGCCCACACGACCGGATATACTGCAAATGTGGTTGCACAACATGCAGCTGGTGTTCTATGAGCAGCAGCGATACATGTATAAGATCTGTAGCAAACACTTTGAGTCGAATTGCTTTACGGATACAACACGCCGCCTGAAGCCATGGAGTATGCCCACATTGGAGTTACCGGAACCAGAGCCGGGCGAGCCAGATGTGTATCAGAATCCCACAGAAGCCGAATGGTTGCAGTTGAATGAACAGTGGGCAGCTGAGCAATTGCAGCTTGATCAGCCAATGATGGAGGATGAGGAAGTACAAGTGAAATTGGAACCAGAACTGGAGCTGGAACACGAAAATCTAATGGAATACGATGAGGATTACTCCCAGCCTGCACTGGAGGATGCCAGCTCACAGCAACCGCTGGAGATGCAACCCCTGGAGGTGCTGCTCGAGGTGGGTCATGTCGAGAAATGCAGAACGTACGAGCAAATGGACACCGAAGCAAATTTCAGTTATGCTGAACAACAGGCGGCGGCTGGACCGCCACCTAGTAATGGAATCCTTCCCAGCAATGAAGAGAAATATAGTGCGGGAAAGTGTAGTGTTCGCGGCTGCGGTGTGACCTTCAAAGATCTCTCTGGCAGTGTGAAGTTGCACAAGTTTCCTACATCGTGGGATGCCATGCAAAAGTGGATGCACAATACTCAGGTCAAGGTGTGGCAAAGGGTCGTATGGCGTTTCCGCATTTGCAGCTATCATTTTAATGACGATTGTTTTCAAAACTCACGCTTGAGACGCGGTGCCATGCCCACATTGCATTTGGGCCCAAAGCGACCCTCACACATCTATCAGAGTGAATTCAATGGGAATGCAGAGCTACCCATTGCCGATGAGGAGCAGCCCCATGGATGA
Protein Sequence
MAPPPAPANRHALNASGEMIIKSEPVDEHAYKSNYIDDNRPFADFSKFPEFGDDMLSPKMEMTVKDEAYQKNPLSYPRRKLQSERSSESLPICQRCKEVFFKKQIYLRHVAQSSCTIQEYDFKCNICPMSFMSTEELQKHKHMHRADKFFCHKYCGKYYDTIAECESHEYMQHEYESFVCNMCSVTFATREQLYEHLPQHKFQQRYDCPICRLWYQTELELHEHRLAAPYFCGKYYAGAQATAHQQQQQHQQHAQQHHHQSGNYKLQDCHMGTMEMSPSHHKTAGSALPATAALSSLLQQRQANADGAALFASSLKNEVNVKLERSYSNSTTSESGYSLHDSSNYNNAYGSDASLQGAAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFRVPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEIARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSKHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTASTPTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSASHYVDPEVSASYMGIGGSGGSASGMNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLKLGHDDDDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHENIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESSVLAKWAHNLEVDAAKLPSMRICNLHFESHCIGKRMRPWAIPTLNLSNNIENMYENPEHHMIYKRRTQIKTERGSSSGAGASMSKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNAPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHSPTQLRKWMYNIRMRPRGAMRQQYRICSTHFETHSFNGKRLSAGAIPTLELGHDDEDIYPNEAQSFVDEHCAIDGCDASKEQPDVRLFRFPTEDEDLLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTQELGHDDSQIELITNPKPEERYVDPVFKCCVPTCGKTRRFDEVQMNSFPKDPTLFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDETEDLFQVNPAELQSNLFGRQRRVQESMGIRIKQEEHSEQEEDNNRITGDVTMSEATDLNTKQVKIKKKLSEMKCCVPSCGRSRLEHGARLFPFPTGKQQQSKWRHNLRLEASDVDKSSQVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEELLEKWRVNLRLSPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKTGLQLDGAEPEPEEGDASRESLIKIKKEKVEPDEELEQTQLXXXXXXXXXXXXXXXXXXXXYFDPLELVETFAEDNAVDDDDDDIPGDDDQLLLPETQQVKLEHPFPPMRREKAVNNVTPICCLKHCRKERTATHQLSTFGFPKDRLQLRMWSTNLQIPESDCVGRVCIEHFESEVLGTRKLKQHAVPTLNLGHNAPLIYKCNGQTQPLGIFDEQPQHSVFRLWSLKHCRKRKLEVMEPPDHHHHRHPQAVIKQEEETDVHSCCLPHCGKRSDLHDVQLHGLPSNRFRLRKWLHNLNLPQSIPIGSHTRVCNEHFDLQPGEPIEDCIPTL
NLGHSDTNLYRNQTAPTSACLVPSCPCARLNLYRGYDLPEHHLVQQAWLEFLRQPIPQSPGDGQLCVMHYMQLYEQVSLPREVPAAVLRQLQDTYEQISNSTMAMKLRCAVPGCYSKYTDNIRLTKLPTCADMFAKWLHNTKIKYDPNRHYIYRICMLHFEPRCLGPVRPKLWAVPTLLLNHNDADIYRNPKLEVSVGSSPAQLQDQVERQPVPVPVPVELPLRIKTELAYTGSPSASASPSPRGKLRICCIPSCAQQANSLMRLFRFPTTETMLLKWLVNTQQKPRLADPQHLFVCQDHFESEAICKKQLSSWAVPTLKLGHDGHIIPNAKHNGNIADTNENRHTLQFIWANYCSVLDCFEPRSEQLRLFAYPKDRPTIRKWAANCKHRSMQASSDGFQVCQSHFEPHCFDPETGGLRENAVPTLELSRGINKMGCVVSGCFKEENAPRRRFFKMPKRMTQIMDWCHNFQLDPSVMTNEKSVVCDRHFEPHCLNVHKQLRRGARPTLHLGHTADIEILQNPESWTRCSEVPESSADVCCVPNCGRYKDAEEEDDIQLFNFPKVRTVAEKWLRNIRLEANKMKLAELKICNVHFEIYCLENGRIQLGAMPTLQLGHDDHHDVYRSTCLSSAKIKRYCNRNGCSYDCCFPQCVELQKSYLRISYDLPESEAMRNAWLAYMELEETQEQKLKLCPLHLIILYEHSIDNIPEHIPEELLEDNYAAARNSVRIRIISCAVRGCKTLKTRDGGILHGLPTRPDILQMWLHNMQLVFYEQQRYMYKICSKHFESNCFTDTTRRLKPWSMPTLELPEPEPGEPDVYQNPTEAEWLQLNEQWAAEQLQLDQPMMEDEEVQVKLEPELELEHENLMEYDEDYSQPALEDASSQQPLEMQPLEVLLEVGHVEKCRTYEQMDTEANFSYAEQQAAAGPPPSNGILPSNEEKYSAGKCSVRGCGVTFKDLSGSVKLHKFPTSWDAMQKWMHNTQVKVWQRVVWRFRICSYHFNDDCFQNSRLRRGAMPTLHLGPKRPSHIYQSEFNGNAELPIADEEQPHG

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-