Basic Information

Gene Symbol
-
Assembly
GCA_017165715.1
Location
Scf:18070-31100[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
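The C2CH consensus in the description can be written as a pattern for a quick scan of a protein sequence. This is a minimal sketch, not the method behind this record: the hits reported here come from a profile HMM search, which is more sensitive and more specific than a bare pattern. The function name `find_c2ch` is hypothetical.

```python
import re

# Consensus from the description:
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His.
# The pattern sits inside a lookahead so that overlapping candidates
# (for example two adjacent Cys residues) are each reported.
C2CH_LOOKAHEAD = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein):
    """Return 1-based inclusive (start, end) pairs for each C2CH candidate."""
    hits = []
    for match in C2CH_LOOKAHEAD.finditer(protein.upper()):
        start = match.start(1) + 1  # convert 0-based offset to 1-based
        end = match.end(1)          # exclusive 0-based end equals inclusive 1-based end
        hits.append((start, end))
    return hits
```

Calling `find_c2ch` on the protein sequence of this record gives candidate spans that can be compared with the alignment coordinates in the Hmmscan output. A span covers only the Cys-to-His core, so it will be shorter than the roughly 90-residue domain.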
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 22 3.1e-15 4.7e-12 46.1 4.3 1 86 340 412 340 413 0.85
2 22 1.9e-15 2.9e-12 46.8 4.6 1 87 440 509 440 509 0.83
3 22 4.2e-16 6.3e-13 48.9 0.4 1 87 531 603 531 603 0.84
4 22 4.2e-16 6.3e-13 48.9 5.2 1 86 701 770 701 771 0.82
5 22 3.9e-15 5.9e-12 45.8 3.1 1 86 795 866 795 867 0.82
6 22 7.3e-13 1.1e-09 38.5 0.9 1 87 902 970 902 970 0.82
7 22 6.7e-11 1e-07 32.2 1.4 1 86 1015 1084 1015 1085 0.76
8 22 4.1e-16 6.2e-13 48.9 0.1 1 86 1112 1181 1112 1182 0.83
9 22 4.7e-13 7e-10 39.1 1.4 1 87 1203 1273 1203 1273 0.80
10 22 3.6e-15 5.4e-12 45.9 1.9 1 86 1300 1371 1300 1372 0.85
11 22 2.3e-14 3.4e-11 43.4 2.6 1 85 1448 1516 1448 1518 0.82
12 22 2.2e-12 3.3e-09 37.0 0.1 1 86 1541 1609 1541 1610 0.81
13 22 5.2e-14 7.8e-11 42.2 1.2 1 87 1752 1821 1752 1821 0.80
14 22 3.1e-12 4.7e-09 36.5 0.1 1 86 1889 1955 1889 1962 0.80
15 22 0.0034 5.2 7.5 0.1 1 58 1980 2030 1980 2044 0.75
16 22 4.9e-12 7.4e-09 35.9 0.7 1 87 2069 2139 2069 2139 0.83
17 22 3.7e-15 5.6e-12 45.9 1.9 1 86 2215 2284 2215 2285 0.83
18 22 1.3e-12 2e-09 37.7 0.8 1 86 2320 2391 2320 2392 0.81
19 22 5.2e-13 7.8e-10 39.0 0.2 1 87 2402 2474 2402 2474 0.80
20 22 2.7e-16 4e-13 49.5 0.0 1 86 2503 2576 2503 2577 0.79
21 22 9.7e-05 0.15 12.5 0.0 1 58 2610 2660 2610 2694 0.77
22 22 2.2e-09 3.4e-06 27.3 0.0 1 58 2699 2751 2699 2751 0.85
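The rows above are whitespace-separated with 13 fields each, so they can be loaded into typed records and filtered. The sketch below assumes that layout. The field names, the helper names `parse_hit` and `significant_hits`, and the 1e-3 i-Evalue cutoff are illustrative choices, not part of this record or of HMMER.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # domain number ("#")
    total: int       # number of domains reported ("of")
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line):
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


def significant_hits(hits, max_i_evalue=1e-3):
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


# Three rows copied from the table above (domains 14, 15 and 21).
rows = [
    "14 22 3.1e-12 4.7e-09 36.5 0.1 1 86 1889 1955 1889 1962 0.80",
    "15 22 0.0034 5.2 7.5 0.1 1 58 1980 2030 1980 2044 0.75",
    "21 22 9.7e-05 0.15 12.5 0.0 1 58 2610 2660 2610 2694 0.77",
]
hits = [parse_hit(row) for row in rows]
kept = significant_hits(hits)
```

With the assumed cutoff, domains 15 and 21 are dropped. Both are partial matches that align only to HMM positions 1-58 and have low scores.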

Sequence Information

Coding Sequence
ATGCTGAGTCCGAAGGTGGAATTAACCGTCAAGGACGAGGGCTATGGCAGTCAAAAAGTTCCCAACCCGCTTAGCTATCCGCGGCGCAAGCTGCAAACGGATCGCTCATCGGAAAGTCTTCCCATATGCCAGCGTTGTAAGGAGGTGTTCTTTAAAAGATCGATCTACTTGCGACATGTGGCCGAGAGCAGTTGCAACATACAGGAGTATGACTTCAAGTGCAACCTCTGCACCATGTCCTTCATGACCAGCGATGAGCTGCAGAAACACAAGCATCTGCACAGAATGGACAAGTTCTTTTGCCACAAATACTGTGGCAAGTACTTTGATACAATTGCCGAATGCGAATCGCATGAGTACATGCAGCACGAATATGAAAACTTTGTGTGCAACATGTGCTCCATGACGTTTGTCACGCGGGAACAACTCTATGGTCATTTGCCGCAGCACAAATTCCAGCAGCGTTACGATTGTCCCATTTGTCGCCTGTGGTATCAAACCGCTTTGGAGTTGCACGAGCATCGTTTAGCGGCGCCGTACTTCTGCGGCAAGTACTACGCACCCGCTCAATCGGCAGTTCATCAGCAACAACCACATCAACAACATTCACATCAACAGCATCAGCAACAGGCCAACTACAAACTTCAGGATTGTCACATGGGCACCATTGAAATGCCTTCGCCGCAACACAAAACTAATACATCATCAGCAAACGCATTGCCGGCAACAGCAGCGCTCAATTCGTTGTTGCAACAGCGTCAAGCCAATGCCGATAATGCCGCATTGTTTGCTTCCACATTGAAGAACGAGGCGAATGTGAAGCTGGAGCGAAGCTACAGCAATTCAACGAGCGAATCTGGTTACAGTCTACACGATAACAGTAGCTACAACAATGCCTATGGCAGCGACAACTCGATTCATGCCGCAGCCGGAGGCGGTGGTGGCGGCGGCTCTGGTGGTGCCATTGGAGGTCCGCAGGCGCACTCCTCCACACTGGACGACTCGGAGGATGCTCTTTGCTGTGTGCCTTTGTGTGGAGTGCGCAAGAGCACCAGTCCAACGCTGCAGTTCTTCACGTTCCCCAAAGATGAGAAGTATCTGCATCAGTGGCTGCATAATCTCAAGATGTTTCACATTCCGGCATCGAGCTATGCGAGCTTTCGCATTTGCAGCATGCACTTTCCGAAGCGTTGCATCAATCGCTATTCGTTGTGCTATTGGGCGGTGCCCACATTCAATCTGGGCCACGATGATGTTGCCAATCTGTATCAGAATCGCGAGCTGACCAACACATTTACCACCGGCGAGGTGGCACGCTGCAGCATGCCCAACTGCACCAGCCAGCGGGGCGAGAGCAATCTCAAGTTCTACAATTTTCCCAAGGATATCAAGAGTCTGATTAAATGGTGCCAGAATGCCCGTCTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTTTGCAGTCGCCACTTTGAGGAGCGTTGCATTGGAAAGTTCCGGCTGAAACCGTGGGCAGTGCCCACCTTACATTTGGGCGCTCAGTACGGCAAGATCCATGACAATCCCAAGAATCTGTATGTGGAGGAGAAGCGTTGCTGCCTTAACTTTTGTCGTCGCAGTCGCTCGTCCGACTTCAATATGTCGCTTTATCGCTTTCCGCGCGATGAGGTGCTGCTTCGACGCTGGTGCTATAATTTACGACTCGATCCTTCCGTCTATCGCGGCAAGAATCACAAAATATGCAGCGCTCACTTTATCAAAGAGGCTTTGGGACTGCGCAAACTATCGCCGGGTGCTGTTCCCACGCTGCATTTGGGGCACAACGACACGTTCAACATCTACGAGAATGAGCTGTGGCCACCACCGACGCCTTCCACGCCCACCAACCACCATCAGCAGCAACTGCAGCAGCATCAGTTGCAGCAACACCAGCAGCAACAACAACAGCATCACAAATATCAACGTCAATCGGCAGCCTCAACATCTTCTTCAGCCAGCTCATC
GCACTATGTGGATGCTGGAGACATGGGTGGATCGTACATGGGCATGGGCAACTCAGGAGGCTCTTCGTCCGGCCTGAATGTGAGCGACAGTATGGACGTGTGCTGTGTGCCGAGCTGCGAGAGCAAGCGGCACAATAATGAGAACATCACATTCCACACGATACCGAGAAGGCCCGAGCAGATGCGCAAATGGTGTCACAATCTAAAGATACCCGAGGATAAGATGCATAAGGGAATGCGCATCTGCAGTCTGCACTTTGAACCCTATTGCATTGGCGGTTGCATGCGTCCGTTTGCCGTGCCCACGTTGCAATTGGGCCACGACGACGAGGACATTCATCGCAATCCGGATGTGATCAAGAAGCTGAACATAAGGGAAACCTGTTGTGTGGCTGTCTGCAAGCGTAATCGCGATCGTGATCATGCCAATCTGCATCGCTTCCCCAGCAATGTGGCGTTGCTGACCAAGTGGTGTGCGAACCTCCAGCGCCCGGTACCGGATGGCACTAAGCTCTTCAACGATGCCATCTGCGAGGTGCACTTCGAGGATCGCTGTCTACGCAACAAGCGGCTGGAGAAGTGGGCAGTGCCAACGTTGATACTCGGCCACGAGAACATCGCCTATCCGCTGCCCACGGCAGAGCAGGTGGCCGAGTTCTACGCTCGACCCAGTGCACCCAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGACACGTGCAAGCGTAATCCCAGCGTTGATGACATCAAACTCTATCGCCCGCCTGAAGAGTCGCAAGTGCTGGCCAAATGGTCGCACAATCTGCAGCTGGACGCTGCGAAGTTATCCAGCTTGAGGATCTGCAATCTGCACTTTGAGGCGCACTGCATTGGGAAGCGCATGCGTCCCTGGGCGATACCCACGCTCAATTTGGCGACAAACATTGAGAATCTCTACGAGAATCCCGAGCACCAAATGCTCTATAAGCGACGCACGCATCTCAAAACCAAACGTGCCGCTAATCACGAGGCGGGTGGCGTGAAACCGACCTGGGTGCCACGCTGTTGTCTGCCACATTGCCGCAAGGTGCGTGCACTGCATAATGTGCAACTGTATCGCTTTCCTAAGCTCAATCGCTCCACGCTGGCCAAGTGGGCGCACAATCTGCAAGTGCCGCTGGTGGGCAGCGCCCAAAGACGCCTCTGCTCGGCGCACTTTGAGCCGCACGTGCTCAGCAAGAAATGTCCGGTACCACTGGCTGTGCCTACGCTTGATCTGAACTCACCACCTGGCTACAAGATCTATCAGAATCCCGCCAAGCTCAAGGCCAACAAGTTGTGCCTGCAACGCGTCTGCATTGTAGAGAGTTGTCGCCGGCAGCGTGGTCAGGGCGTGCAGCTCTTCCGGCTGCCACACAATCCTACGCAGCTGCGCAAATGGATGCACAACATACGCATGCGACCCAGAGGCGCCATGCGGCAACAGTATCGTATGTGCTCCATTCACTTTGAGACGCACTCCTTCAATGGCAAGCGGTTGAGTGCGGGAGCCATCCCAACGCTCGAGTTGGGTCACAATGACGACGATATCTATCCGAATGAAGCGCAATCGTTTGTTGAGGAACACTGCACTGTTGAGGGCTGTGAGGCTTCCAAGGAGCAGCCCGATGTACGTCTCTTCCGCTTCCCCACCGACGACGAAGATCTGCTGTGGAAGTGGTGCAACAATCTCAAGATGAATCCCGTGGATTGCATTGGTGTGCGCATCTGCAACAAACACTTTGAGCTGGACTGCATCGGACCCAAGCATCTGTACAAATGGGCGATACCGACACTGCATCTGGGTCATGACGATGAGCAGATCGAGCTGATTGACAACCCCAAGCCCGAAGAACGCTATGTGGATCCCGTGTTCAAGTGCTGCGTACCGACATGCGGCAAGACGCGCAAGTTTGATGAGGTGCAGATGAATAGCTTTCCCAAGGATCCGAACATGTTTCAGCGCTGGCGACACAATTTGC
GACTGGACCATCTCAATTTCAAGGAGCGCGAACGCTACAAGATTTGCAATGCGCACTTTGAGGACATTTGCATAGGGAAGACGCGGCTTAACATTGGCTCGATACCCACACTCGAGTTGGGGCACGAGGAGACGGAGGATCTATTCCAAGTGAATCCCGAGGAGCTGCAAAGCAATCTCTTTGGACGTCAGCGTCGAGTGAATTCTTCGATGGGCATGAGCATCAAGCAGGAGGACAACTCGGAGGTGGACGAGGACATTAAGCCTGACTTGAATATGTTGGAGGCAAAAGACAGAAATATCGCACAGGTAAAGATCAAGCGTTCTCTAGCGGATTACAAGTGTTGTGTGCCAGACTGTGGACGCAGTCGCTTGGAGCATGGCGCTCGCCTGTTTCCCTTCCCCAATGGCAAGCAGCAGCAGAGCAAGTGGCGCCATAATCTTCGCCTGCAGCCCGATGAAGTGGATCGCAGCACACGAGTCTGCAGTGCGCATTTCAATCGTCGTTGCATTGATGGCAAGCAGCTAAGGAGTTGGGCCATGCCCACGCAGCAGCTGGGTCATCAGGAATTGCCCATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCTCATTGTCGCAAGCGTCGCAGCATTGACAACGATCTGCGTACTTATCGCTATCCGCGTAGCGAAGATCTGCTCGAGAAGTGGCGCGTTAATCTGCGCTTGGCGCCGGATCAGTGTCGTGGACGCATTTGTGCGGATCACTTTGAGCCTATGGTGCGCGGCAAGCTGAAGCTGAAGACGGGCGCAGTGCCCACGCTGAAGTTGGGTCACAACGAAGGCGTTGTCTTTGACAATGAGGCTATTAAGGCGGGACTGCAGCAAGAGGCGGAGGAGGGTGGCGATCAGGAGACCAGTATGGAATCGCTAGTCAAAGTGAAGCAGGAGAGACTCGATCCGGAAGAGGCGCCAGCTGATGATGTGGACCACGAGCAGCAGCACGACGACGACGATGAGCAGGCAGATCATGGCTACTTTGATCCTCTAGAGCTGGTCGAGACGTTTGCTGAGCAGCACAGCGCCGAGGATGAGAACGAAGTCAATGACGACGATGACGAATATGAAGATGAGGATGAGCCGGGCGACGATGATGAGCTGCTACTACCAGACACACCCCCTGTGAAGCGACTGCCGCCTTTGGTGCTGCCGCCGCGGCGCGAGAAAGCCGTGAACAATGTGACCCCCATCTGCTGCCTGAAACACTGTCGCAAGGAGCGCACGGCGAGCCATCAACTGAGCACCTTTGGTTTCCCAAAGGATCGGCAACAGCTGCTCAAATGGAGCGCCAATCTACAGCTATCGCTTGACGATTGTGTGGGACGCGTGTGCATCGAACACTTTGAGTCGGAGATGCTAGGCACACGCAAGCTGAAGCAGCATGCGGTGCCCACCTTGAATCTGGGTCACGCAACGCCCCTCAGCTACAGTTGCAATGGTCAGGCATTGAGCATCTACGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGCCGCAAACGGAAGCATCCAACGGAACCGCCGGATCAGCAGCAGAAGCACCGGCAGCTGGATCAGAACCCCACAACGATGATGACTAAGCGACGCTGTTGCTTGCCCAGCTGTGGCAAGCAGCCGGAAGTGCATGGCGTACAATTGCAGCGACTGCCCAGCAATCGCATTCAGCTTCGCAAGTGGCTGCACAATCTCAAGCTATCCCCAATGCTAGACAGCAGTCAGGCGCGTCTCTGCAGCGAACACTTTGAGCCGGAGCTGCTGGACCATGTGGAGGATGCGGTGCCCACACTGCGACTGGGACACGATGACACGCACATCTATCGCAATCGTGGCAACATCGCGGCAGCCTCCACGTCGAGTGCCTGCTTGGTGGCCAGTTGCCCGTGTGCTCGTCTCAATCTCTATCGCTGTTACGATCTGCCCGAA
CATCGTCTGGTGCAGCAAGCTTGGCTGCAGTGGTTAGAGCTGCCGCTGCCCCAACAAGCCAGCGATGGCAAGCTCTGTGTCATGCACTTCATGCAGCTCTTCGAGCAGGTGCCGCTGCCCGCGGAGTTGCCAGGCTCGGTGCTCCGTCAACTGCAGGAGACTTATGATCTCATTGGAGGCTCCACGATGGCCATGAAGTTGCGCTGCGCTGTGCCCGGCTGTTACTCGAAGTACACGGACAACATCAGGCTGACCAAGCTTCCCATGTGTCCCGGCATGTGCTCCAAATGGGTGCACAACACCAAGATCAACTACGATGCAACGCGTCACTATGTCTATCGCATCTGCATGCTGCACTTTGAGTCTCGCTGCTTGGGCCCTGTGCGTCCCAAGCTGTGGGCGGTGCCAACGCTGCACTTGAACCACAACGATGCGAATATCTATCAGAACCCGAAGTTGGATGGGCAATTCCCGTCAGCTCCAGTGCCGCCTCCTGTGCCCGTTGCCATGACGGCGTCGGTACCCGTTGCTTTGACAGCGTCGGTGCCAGTTGAGTTGCCGCTGCGCATCAAGACGGAGCTGGCCTTTAGTGGCAGTCCCAGCGCCAGTGCAAGTCCCAGTCCGCGTGGCAAGCTGCGCTTCTGCTGCATCCCCAGCTGCTTGCAACAGGCTACGTCGCAGACGCGACTCTTTCGTTTCCCCACCGCTGAAACGGCGCTGCTCAAGTGGCTGGTGAATACGCAGCAACAGCCGCGTTTGGTGGATACCCAGCAGCTGTTCATTTGCCAGGATCATTTCGAGGAGGAAGCCATCTGCAAGAAGCAGCTGCGCAGTTGGGCGGTGCCCACATTAAAGCTGGGTCACGATGGCCATGTCATACCGAATGCCAGGCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGTTACAGTACATCTGGGAGAACTATTGTTCCGTCTTGAGCTGCTTCCAGCCGCGTAGCGAGGAATTGCGTCTCTACGCTTATCCCACGGATCGACCCACCATTCGCAAGTGGGCGACCAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGATGGTTTCCAGGTCTGCCAGCTGCACTTTGCACCACATTGCTTTGACCGGGAGACGGGCGAGTTGAGAGAGGATGCGGTGCCCACGCTGGAACTGAGTCGATGCCTAAACGATGTACATTGCATCGTCGCTGGCTGTGTGAAAGACGAGGATGGACCGCGTCAACGCTTCTACAAGATGCCCAAGCGCAGTGCTCAACTGCTTAGCTGGTGTCACAATCTGCGTTTGGATGCGGCAACCATGGGCGGTGGGGAGCATCACGTCTGCGATCGCCACTTCGAAACGCAGTGCATCAATCAACAAAAACTGCTACGACCCGGCGCACGTCCTACTCTCCACCTGGGCCACGATGAGCCCATTGAATTGATGCCCAATCCAGCGGAATGGGATGCAACGGATGCTGCGGCTGCTGTAGACAATGTATGCTGTGTGCCCAACTGCGGTCTGGCCAAGGATGAGGAGGAGGATGTGCAGCTGTTTGCCTTTCCNAAGCTGCGGTCGCTCGCCGAAAAGTGGCTACAGAATATACGCCTCGAAAACATAAGTCGCGAGCAGCTGATGCGACTGAGGATCTGCAGCGCACACTTCGATGATAGCTGCCTGGAGAGCAACGGACGTCCACAGCTGGGCGCCATGCCCACGCTGCAACTGGGTCACGAGGACAACGGGAACATTCATCGCAGCACCGATGCTGCTGCCGTTAAGGCTAAGAAGTTCTGTAATCGAAGTGGCTCCAGCTATGACTGCTGCTATCCGCAATGTGTGGAGCTGCAGAAGAGTTACCTGAGGATTAGCTACGATCTACCACAATCGGAGGCACTGCGTCTCAAGTGGCTGGAGTACATGGGTCTGGAAAAGACGGAAGAGAAGCTCTTGAAGCTGTGCCCGCTGCACTTGGTGCTGCTCTACGATCACAG
CGTCGAACATTTTGCAGAGGAACACACGCCCGAGCAGCAGTTGGACGCCAACTACGAGGACAGTCGGAACAGTGTGCGATTGCGAGTCATCAGCTGCGCGGTGCCCGGCTGCCGTACGCTAAAGCCACGCGATGGTGGCATACTTTATGGATTGCCGCAGCGTCGCGATGTGCTTGAGATGTGGCTACACAACATGCAGCTGGTGTTCTATGAGCAGCAGCGTTACATGTACAAGATATGCAGCAAACACTTTGA
Protein Sequence
MLSPKVELTVKDEGYGSQKVPNPLSYPRRKLQTDRSSESLPICQRCKEVFFKRSIYLRHVAESSCNIQEYDFKCNLCTMSFMTSDELQKHKHLHRMDKFFCHKYCGKYFDTIAECESHEYMQHEYENFVCNMCSMTFVTREQLYGHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYAPAQSAVHQQQPHQQHSHQQHQQQANYKLQDCHMGTIEMPSPQHKTNTSSANALPATAALNSLLQQRQANADNAALFASTLKNEANVKLERSYSNSTSESGYSLHDNSSYNNAYGSDNSIHAAAGGGGGGGSGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPTNHHQQQLQQHQLQQHQQQQQQHHKYQRQSAASTSSSASSSHYVDAGDMGGSYMGMGNSGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEESQVLAKWSHNLQLDAAKLSSLRICNLHFEAHCIGKRMRPWAIPTLNLATNIENLYENPEHQMLYKRRTHLKTKRAANHEAGGVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGAMRQQYRMCSIHFETHSFNGKRLSAGAIPTLELGHNDDDIYPNEAQSFVEEHCTVEGCEASKEQPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFELDCIGPKHLYKWAIPTLHLGHDDEQIELIDNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPNMFQRWRHNLRLDHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHEETEDLFQVNPEELQSNLFGRQRRVNSSMGMSIKQEDNSEVDEDIKPDLNMLEAKDRNIAQVKIKRSLADYKCCVPDCGRSRLEHGARLFPFPNGKQQQSKWRHNLRLQPDEVDRSTRVCSAHFNRRCIDGKQLRSWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHNEGVVFDNEAIKAGLQQEAEEGGDQETSMESLVKVKQERLDPEEAPADDVDHEQQHDDDDEQADHGYFDPLELVETFAEQHSAEDENEVNDDDDEYEDEDEPGDDDELLLPDTPPVKRLPPLVLPPRREKAVNNVTPICCLKHCRKERTASHQLSTFGFPKDRQQLLKWSANLQLSLDDCVGRVCIEHFESEMLGTRKLKQHAVPTLNLGHATPLSYSCNGQALSIYDAQPQHSVFRLWSLKHCRKRKHPTEPPDQQQKHRQLDQNPTTMMTKRRCCLPSCGKQPEVHGVQLQRLPSNRIQLRKWLHNLKLSPMLDSSQARLCSEHFEPELLDHVEDAVPTLRLGHDDTHIYRNRGNIAAASTSSACLVASCPCARLNLYRCYDLPE
HRLVQQAWLQWLELPLPQQASDGKLCVMHFMQLFEQVPLPAELPGSVLRQLQETYDLIGGSTMAMKLRCAVPGCYSKYTDNIRLTKLPMCPGMCSKWVHNTKINYDATRHYVYRICMLHFESRCLGPVRPKLWAVPTLHLNHNDANIYQNPKLDGQFPSAPVPPPVPVAMTASVPVALTASVPVELPLRIKTELAFSGSPSASASPSPRGKLRFCCIPSCLQQATSQTRLFRFPTAETALLKWLVNTQQQPRLVDTQQLFICQDHFEEEAICKKQLRSWAVPTLKLGHDGHVIPNARHNGNIADSQENKQTLQYIWENYCSVLSCFQPRSEELRLYAYPTDRPTIRKWATNCKHRSMQASSDGFQVCQLHFAPHCFDRETGELREDAVPTLELSRCLNDVHCIVAGCVKDEDGPRQRFYKMPKRSAQLLSWCHNLRLDAATMGGGEHHVCDRHFETQCINQQKLLRPGARPTLHLGHDEPIELMPNPAEWDATDAAAAVDNVCCVPNCGLAKDEEEDVQLFAFPKLRSLAEKWLQNIRLENISREQLMRLRICSAHFDDSCLESNGRPQLGAMPTLQLGHEDNGNIHRSTDAAAVKAKKFCNRSGSSYDCCYPQCVELQKSYLRISYDLPQSEALRLKWLEYMGLEKTEEKLLKLCPLHLVLLYDHSVEHFAEEHTPEQQLDANYEDSRNSVRLRVISCAVPGCRTLKPRDGGILYGLPQRRDVLEMWLHNMQLVFYEQQRYMYKICSKHF
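To check that the coding sequence and protein sequence above agree, the coding sequence can be translated with the standard genetic code. This is a minimal sketch under stated assumptions: the function name `translate` is hypothetical, any codon containing an ambiguous base such as N is written as X rather than resolved, a trailing incomplete codon is ignored, and line breaks inside the sequence must be removed before calling it.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )
```

For example, `translate("ATGCTGAGTCCGAAGGTGGAATTA")`, the first 24 bases of the coding sequence, returns `MLSPKVEL`, which matches the start of the protein sequence.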

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-