Basic Information

Gene Symbol
-
Assembly
GCA_016746235.1
Location
CM028595.1:4991042-5004177[+]
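The Location string above packs a sequence accession, a coordinate range, and a strand into one field. A minimal sketch of splitting it apart follows. The function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings shaped like "CM028595.1:4991042-5004177[+]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text):
    """Split a location string into seqid, start, end, strand, and span length."""
    m = LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group("start")), int(m.group("end"))
    return {
        "seqid": m.group("seqid"),
        "start": start,
        "end": end,
        "strand": m.group("strand"),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("CM028595.1:4991042-5004177[+]")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the locus spans 13136 bp on the plus strand of CM028595.1.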

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) that probably also binds a zinc ion. It has a conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include its location at the N-terminus of the protein, its size of about 90 residues, a C-terminal AVPTIF box, and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably in worms and flies, but not in other eukaryotes or in any prokaryotes [1].
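The C2CH consensus in the description can be written as a regular expression for a quick scan of a protein sequence. This is only a rough sketch: the spacing ranges come from the consensus quoted above, the function name `find_c2ch` is hypothetical, and a literal pattern is far cruder than the PF05485 profile HMM behind the hmmscan results in this record. The demo sequence is synthetic, not taken from this entry.

```python
import re

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His
C2CH_RE = re.compile(r"C.{2,4}C.{35,50}C.{2}H")

def find_c2ch(protein):
    """Return (start, end) pairs, 1-based inclusive, for non-overlapping matches."""
    return [(m.start() + 1, m.end()) for m in C2CH_RE.finditer(protein)]

# Synthetic demo: Cys, 2 residues, Cys, 40 residues, Cys, 2 residues, His.
demo = "MA" + "C" + "AA" + "C" + "G" * 40 + "C" + "AA" + "H" + "K"
print(find_c2ch(demo))
```

Because `finditer` reports non-overlapping matches only, closely spaced or overlapping candidate motifs may be missed; a match is a hint to inspect the region, not evidence of a functional domain.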
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 22 4 7.3e+03 -2.3 2.4 38 62 311 338 303 353 0.61
2 22 7.7e-09 1.4e-05 25.6 0.2 1 40 528 565 528 575 0.86
3 22 0.0087 16 6.2 0.2 1 87 628 705 628 705 0.50
4 22 3.7e-16 6.7e-13 49.1 5.5 1 87 803 873 803 873 0.82
5 22 1.2e-14 2.1e-11 44.3 3.4 1 86 897 968 897 969 0.82
6 22 2.9e-13 5.1e-10 39.8 1.3 1 87 1004 1072 1004 1072 0.80
7 22 7.2e-11 1.3e-07 32.1 1.4 1 86 1112 1181 1112 1182 0.76
8 22 3.9e-17 7.1e-14 52.2 0.4 1 86 1209 1278 1209 1279 0.83
9 22 5.7e-13 1e-09 38.9 1.5 1 85 1300 1368 1300 1370 0.81
10 22 1.4e-13 2.5e-10 40.8 0.5 1 86 1397 1468 1397 1469 0.84
11 22 5.6e-13 1e-09 38.9 3.2 1 85 1545 1613 1545 1615 0.83
12 22 5.2e-13 9.4e-10 39.0 0.1 1 87 1638 1707 1638 1707 0.82
13 22 1.4e-11 2.4e-08 34.5 0.1 17 87 1833 1891 1825 1891 0.78
14 22 3.2e-10 5.7e-07 30.1 0.1 1 86 1993 2059 1993 2060 0.79
15 22 2.9e-06 0.0052 17.4 0.0 1 58 2075 2122 2075 2138 0.81
16 22 1e-13 1.8e-10 41.3 0.1 1 87 2151 2224 2151 2224 0.83
17 22 3.4e-16 6.2e-13 49.2 0.4 1 86 2340 2412 2340 2413 0.82
18 22 2.6e-07 0.00047 20.7 1.1 24 86 2463 2515 2445 2516 0.77
19 22 4.2e-14 7.6e-11 42.5 4.2 1 86 2613 2683 2613 2684 0.85
20 22 1.2e-11 2.1e-08 34.6 0.1 1 86 2763 2832 2763 2833 0.84
21 22 8e-11 1.4e-07 32.0 1.4 1 58 2856 2904 2856 2919 0.84
22 22 2.2e-10 4e-07 30.6 1.1 18 86 2922 2979 2911 2980 0.75
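Each row of the domain table above has 13 whitespace-separated fields, so it can be parsed directly. The sketch below reads the first four rows as copied from the table and keeps the hits whose independent E-value passes a cutoff. The field names, the `parse_hit` helper, and the 0.01 cutoff are illustrative assumptions, not something this record specifies.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int        # domain number ("#")
    total: int        # total domains reported ("of")
    c_evalue: float   # conditional E-value
    i_evalue: float   # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

def parse_hit(line):
    """Parse one 13-field row of the domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(int(f[0]), int(f[1]), float(f[2]), float(f[3]),
                     float(f[4]), float(f[5]), *map(int, f[6:12]), float(f[12]))

ROWS = """\
1 22 4 7.3e+03 -2.3 2.4 38 62 311 338 303 353 0.61
2 22 7.7e-09 1.4e-05 25.6 0.2 1 40 528 565 528 575 0.86
3 22 0.0087 16 6.2 0.2 1 87 628 705 628 705 0.50
4 22 3.7e-16 6.7e-13 49.1 5.5 1 87 803 873 803 873 0.82
"""

hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]
I_EVALUE_CUTOFF = 0.01  # assumed threshold, chosen only for illustration
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
for h in significant:
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With this cutoff, rows 1 and 3 of the sample are dropped because their independent E-values (7.3e+03 and 16) are far above it.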

Sequence Information

Coding Sequence
ATGTCACAACAACACCCCCATCACGCCCACCCCCACCACTACGCACACCACTACCCGCCGCCCGCGACGCCCATGCCCCTGCAGCAAGAGCAGCAGCAGCAAGCGCAGCTGCTGCCTCAGCAGCAGCACGCCAATTGGTACTCACATGTTGCTTCCTACCCCACACCCCACTCGGCCTTCGGCCCCGCCCCCTCTTGCAAGGCCACCAACAACAGTAGCAGCGGCAACAACAACAACATCATGGACGGCGGGGGATACGGGCCAGGGGGTGGTGGCACGCAGGGCTATTATGGCGCCGCTGGCGGCGGCCTCAATGTCAGTGGGGCGGTTGTAAGTGGTGGTGGTCCGAGCTACGGCCTTGGGGCCAATACAGTGGCATATGCCCACAACCAACTGCTGCAGTATCAACAACAGCAACAGCAGCAGCACCAGCATCTTCCACAGCATATAGGCCAACAAAGGCCCTATATGGGCCACAATATCATGACCGGCAGCTATCCTTATATCAAAAGCGAACCCATGGAGGCCTACCAGCAGCCCCCGAATCCCATGGCCCCGCCACCGGCACCAGAAGTATTAATAAAATCGGAACCCATTGACGAGCATTCCTATAAGTCTAACTATATAGACGACAATACGCCATTTGCTGACTTTAGCAAGTTCAGCGAATTCAGCGAGGACATGCTGAGCCCCAAAGTGGAGCTGACCGTGAAGGATGAGTCCTACGGAAAGAGCACTAGCAGTTTCCTGCGGCGAAAACAACAATCGGATCGGGGCACCGAAAGTCTGCCCATCTGCCAGCGCTGCAAGGAGGTCTTTTTTAAGAAGCAGGTATATGTTCGCCATGTGGCCGAGAGCAACTGCGGAATCCAGGAGTACGACTTCAAGTGCAGCACCTGCCCCATGTCGTTCATGACCACCGAGGAGTTGCAGCGGCACAAGCTCCACCACCGGGCCGACAGGTTCTTCTGCCACAAGTACTGCGGCAAGCACTTTGACACGATCGCTGAGTGCGAGGCGCACGAGTACATGCAACACGAGTACGATGGCTTTGTCTGCAACATGTGCTCCGGACCTTCGCTACGCGGGAGCAACTGTACGCCCACCTGCCGCAGCATAAAGCTTCACGAACATCGTCTGGCTGCGCCCTATTTCTGTGGCAAGTACTACACGGGGGGACAGTCATCCTCTGCCTCCCTGTCGCAGTCGCAGGCGCAGCAGCACCAGACAAATTACAAGCTACAGGACTGCCATATGGCGACCATGGAAATGACCACAACCACGCACCACAAAACAACTCCATCCGGCTCGTCCTTGCCGGCGACGGCTGCGCTGAATTCTCTACTGCAACAGCGTCAGGCAAATGCCGATGGAGCTGCCATGTTCGCAGCATCCGCAATAAAGAACGAAGTGAATGTTAAGATGGAGCGGAGCTACAGCAACTCTACCAGCGAGTCATCCTACAGCGTTCAGGACAGTGCGTACAATAATACCTACGGAAGCGACAGCTCAATGCACGCAGGAGCGATCGCCGGACCACAAGCGCACTCCTCAACGCTGGACGACTCCGAAGACGCCTTGTGCTGTGTTCCTCTGTGTGGAGTGCGCAAGAGCACAAGCCCTACGCTTCAGTTTTTCACGTTTCCCAAGGACGAGAAGTACCTTAATCAGTGGCTGCACAACCTCAAGATGTTCCATATACCCGCCGCCAGCTACGCCAATTTCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTGCTATTGGGCCGTTCCCACTTTTAATCTGGGTCATGACGACGTGGCCAATCTATACCAGAACAGGGAACTCACTAACACCTTCACAACCGGCGAGGTTGCGCGCTGCAGCATGCCTCACTGTACAAGTCAGCGGGGCAAAAGAGCCGCGTCATTTTTGCAGCCGCCACTTGAGGAGCGCTGTATCGGCAAGTTCCGACTGAAGCCGTGGGCGGTGCCTACTTT
ACATCTAGGTGCCCAGTATGGCAAGATCCACGACAACCCAAAGAATTTCGCCCACTTTATTAAAGAGGCGTTGGGTCTGCGGAAACTATCTCCGGGGGCAGTTCCCACACTTCATCTGGGTCACACTGACACCTTCAACATCTACGAGAACGAACTGTGGCCGCCTCCAACAACTCCAAACAGTCACAGCAGTGGCCTCCAGCACCACACACAACATCATCCATCGCAACACTCACTGCAGCAGCAGTTGCATAGCAAATCATACCAGCGCCAATCTGCGGCTTCCACGTCCTCCTCTGCCAGCTCGGCAAACTCTCATTACGTTGATCCTGATCTAAGCGCTTCTTACCTGGCCTTAAGTGCCAGTGGATCTTCTGCAATTAATGCCAGCGACAGCATGGACGTATGTTGTGTGCCCAGTTGCGAAAGCAAGCGCCACAATAATGAGAACATTACATTCCACACCATACCACGCCGACCGGAGCAGATGCGCAAGTGGTGCCATAATCTGAAAATACCCGAAGAAAAGATGCACAAGGGTATGAGGATTTGCAGCCTGCACTTCGAGCCCTATTGCATTGGCGGTTGCATGCGTCCATTTGCGGTGCCTACGCTTAATTTGGGTCACGATGACGACGATATTCATAAAAATCCGGATGTGATTAAAAAATTGAACATCCGGGAAACATGTTGCGTTGCCGTATGTAAACGAAATCGGGACAGAGACCATGCCAATCTGCACCGTTTCCCAAGCAATGTGTCTCTATTGACCAAGTGGTGTGGCAATCTTCAGCGTCCTGTTCCGGATGGGAGTAAACTTTTCAACGACGCTATTTGTGAAGTGCACTTTGAGGAACGATGTCTGCGCAACAAAAGGCTTGAGAAGTGGGCGGTACCCACACTATCGTTAGGCCATGAAAGCATCCCATACCCGCTACCAACGCCGGAACAGGTTACAGAGTTCTACGCTCGACCCACTGCACCCAATAATGGCGATGAACAGGGAGAGTGCTGTGTGGAGACGTGCAAGAGAAATCCCAGTGTGGACGACATCAAGCTTTATCGGCCTCCGGAAGAAGCTTCCGTGTTGGCCAAATGGGCGCACAACCTGCAGACGGAGAGTAGTCAACTGACTAGCATGAGGATCTGTAATCTTCACTTTGAGGCACACTGCATCGGCAAGCGGATGAGACCTTGGGCTATTCCAACACTTAATCTGGCTGGTAACATAGAAAATCTCTACGAAAATCCGGAGCATTCGATGCTGTACAAACGGCGAACTCACTTAAAAACCAAGCAGTCGGCTTCCGTGAAGCCCACTTGGGTGCCCAGGTGCTGTCTTCCGCATTGCCGCAAAGTTCGGGCTCTCCACAACGTTCAGCTTTATCGCTTCCCCAAGCTAAATCGGTCCACTCTGGCTAAGTGGGCGCACAATCTGCAGGTTCCTATGGTTGGCAGTGCCCAACGCCGTCTATGCTCTGCTCATTTTGAGCCGCACGTGCTGAGCAAGAAGTGTCCGGTGCCGCTGGCGGTACCCACATTGGACTTGAACGCGCCGCCCGGTTTGAAGATTTACCAGAATCCTGCCAAGCTCAAGGCAAGCAAGCTGTGTCTGCAGCGCGTGTGTATTGTCGAGAGCTGCCGCAAGACGCGGGCGCAGGGCGTCCAGCTCTTTCGGCTGCCGCATAGTCCAACGCAACTGCGCAAGTGGATGCATAACATCAAAACGCGTCCTAGAGCGGCGATGAGGGCCCAATACCGGGTGTGTTCCCGCCACTTTGAGACACACTCTTTCAATGGTAGGAGACTGAGCGCTGGCGCCATTCCCACCCTGGAATTGGGTCATGACGACGAGGACATCTATCCTAACGAAGCGCAGGCCTTTGCCGATGAACACTGCGTGGTGGAGGGATGCGAGGCCTCCAAAGAACAGCCGGACGTACGCCTGTTCCGTTTTCCCACAGAGGATGACGATATGCTATGGAAATGGTGCAACAATC
TCAAAATGAACCCGGTGGATTGCACCGGCGTGCGCATATGCAATAAACACTTCGAGGCGGATTGCATCGGGCCCAAGCACTTGTACAAGTGGGCTATTCCCACCGAGGAACTGGGCCACGACGATGCCGAAATCGAGCTAATACTAAATCCCAAGCCGGAGGACAGGTATGTGGATCCGGTGTTCAAGTGCATCGTACCGACCTGCGGAAAGACGCGTCGCTTCGACGAGGTACAGATGAACAGCTTTCCCAAGGATGCGAATCTCTTTGAGCGGTGGAAACACAACTTGCGGCTGGAACACCTCAGCTTCCAGGAACGCGATCGGTACAAGATATGCAACTCTCACTTTGAGGATATATGTATCGGAAAGACTCGGCTAAATATAGGTTCGATCCCGACTCTAGAATTGGGTCACGACGACACAGACGACCTGTTCCAGGTAAATCCAGCGGAATTGCAAAGCAACCTTTTCGGACGACAACGGCGAATACAGGACGAGTTGGGCGGAACAATTATCAAGCAGGAGTTTTCCGAGTCGGAAGACGTCAAAACGGATGTGACAACCATGTCTGATGCCAAAGATTTGAATACGAGACAGGTTAAGCTCAAAAGGACTATGTCCGATCTGAAGTGTTGTGTGCGCAGTTGTGGGCGCAGTCGACTGGAGCACGGAGCACGCCTCTTTCCATTTCCCAATGGTAAGCAACAGCACCTAAAGTGGCGCCATAACCTGCGTCTGGAACCCGACGAGGTGGAGCGTTCGACACGGATTTGCAGTGCGCACTTTAACCGCCGCTGCATTGATGGCAAGCATTTGAGAAGCTGGGCAATGCCCACGCAGCAACTGGGCCACCAGGAGCAGCCGATCTACGAGAATCCGAAAAACATACCAGGATTCTTTACGCCCACCTGTGCTTTGAGTCATTGCCGCAAGCGTAGGAGCATTGACAACGATCTCCGCACTTATCGATATCCAAGGAGTGAGGATCTTCTGGAAAAATGGCGGGCGAATCTGCGTCTGGCACCGGATCAGTGTCGCGGCAGGATATGTGCGGACCATTTTGAGGCACAGGTGCGTGGAAAGTTGAAGCTGAAAACGGGAGCGGTGCCTACTTTAAATCTGGGCCATGATGAGGGCTTAATATACGACAATGAGGCTATAAAGGTGGGAATGGCCGAAGAAGAGGAAGGTAGCTCGGAGTTGCCACGACTGAAAACCAAAAGGGAGCTGATTGATGAAGAGGAAGAAGAACTAGAGGCTGAAGAGGAGCACCATGAGCAAGATATTTACGATGAAGATGAGAAAGACGGCCACTATTTCGATCCCCTCGAACTGGTCGAGACGTTTGCTGAACATCCAAGTGATGATGAAGGCGAGTATTGTGGAGATGAGGAGGATGAACGCGTCGAAGAAGAGGACGTGGAGGAAGCGGAGCACTTTCATCCGGACGCCCCACCCACTCCTCCTATTATCTCCTCTGCAAAAGAACGCACGGCTTTTCATCTACTGAGCACCTTTGGCTTTCCCAAGGATCGACAGCTGCTGCTCAAGTGGTGCTCCAATCTACACCTGAATCCTGATGACTGCATCGGCCGCGTTTGCATAGAGCACTTTCAGCCGGAGGTACTGGGAACTCGAAAGCTTAAGCAAAATGCAGTGCCCACTCTCAATGTGGGCCACGATGATCCACTTAGATACTCTTGTAATGGAGTAGATCTTGACCAAGAGCAATTACAGCCACAGCACTCGGTTTTTCGGCTTTGGAGCCTAAAACACTGCCGCAAGAGGAAGCTGACAGAGCCGCCGGACATCCGCCAAGACAAGTGGAGTGCGCTGGAAGTGCGGAAGATGCAGACGTTGAGGATGGAGATGAAGGTAGGGAGGAAGATCAAGCTGGAGGTGCAGACGGAGAGGGAAATGATGATGAAGGAAAAGACTCCAACGAAGAATCACAGAGAGAGTAATTTGGAAAGATGTTGCATCAGCTACTGTGCGAAT
GCGGACGTTAACCAGCTACTTCCGCTGCCTGAGGAGCAAAATCTTTTAAAAAAGTGGCAGCTTAACCTAAAGCTTTCCCAAGACACGGACTTCAAGGAAATTCGAGTGTGTCTTAAGCACTTTGAGGCACAAGTGGTCGAAAACGGAAAGCCCTTGGAGCAGGCAGTACCCACCTTAATGCTAGATCAAAAAAGTTGGAACATCTACAGAAACCATGGCAGTTGTCTCTTTCCCGAGTGCGGAAATTCTTCATCGGATCACCTAAGCTTTGTTAATTTACCCGAAAATGTGGTCATAAAAGATGCCTGGATAAGACACCTTAATTTGCCACCCAGCAGTGAGGGTCTTCTTTGTGGTGACCATTTTATGCATCTGTTCAAACGGGTGGAGCTTCCCAAAGTATTGGCCGCACAAGATTTGGAAGATTTGAAACGCGTCAAGATCGTTAGCTGCGCGGTAAAAGGTTGTGAAATGATCAGGCCCCGGGATAAGGTGCTCCTTCATGGGTTGCCGCAAAGGGAAGACATACTGAGGATGTGGGTTGAGAATGGCCAACTAGAGATTACGGAGCCGCAGCAGCAATATATGCTTAGAGTATGCCGCAATCACTTTGAGTCCCGATGCTCGTTTGACGACAGGCGGCTGCATCCCTGGAGTGTTCCCACTCTGAATTTACCTGGAAATCCGGTGCACCAAATTCCCACTAAGGAAGAGTGGCAGGAAATGACAGTAAAATTTACCCAGGAAGCCGATACAATTAAATTGGAGAGAGAGGAGGAGCTGGAAGAGGAACAGCTTCGCGAGGAGGAAGAGGGTGACAGTTTCCTTCTAGAGCCCATTGTAAGAATGGAGCACATCGAGTCTGATGAGGAAGACTCGGAGATGCAGGCCCTGGAGGTCCTTCTGGAGGTTGGTCACGTTGAGCGAATGGACAGCTACGAAAGAGTGGACAAGTCCTATACCGAACATGCCGTCTGCCACTCTAATAGCATTCGAAACCAGTACAATGCCAACCACTGCGCCGTTGAAGGGTGCGAAGTGACAGTAGAAGACGTGGACGGGACCATCAAGCTGCACAAGTTCCCCGCATCGTCCGAGGCCGCTCGGAAGTGGATGCACAACACCCAAGTTGACATGGATGAGAAGTTCTGGTGGCGGTATCGAATTTGCAGTTATCATTTCGAACAGGAGTGCTTTCAGAGCGCTAGAATAAAGAAGGGAGCGATGCCCACTCTACTCCTAGGACCTAAGAGACCAGAAGAGGTATATGAGAATGAATTCGCATTGCAGGAGACTGAGGAGCTTATTTTGCCCGAAGAACTTCAGTTTGAGGAGCACAAAATGGTTAAAAGTGAGGTAATAAAAATGTGCCTGCCAACGCCGGCTCCGCCCCGAAAGTCCAAGGACATGTGCCTTAAGTGGCAGCACAACACACAGGTGCCTTTCGATCCCTTGCACCGCTGGCGATACCGTATCTGTAGTGCCCATTTTCATCCCGTGTGCCTGCTTAACATGAGACTAGTCCACGGCAGTGTACCCACCTTAAAACTAAGTAGCAAGGCTCCCACCGACCTATTTAATAACGACTTCGAAGCCATTAACCTCAGGCTGGACAAGAAGTTGGGAGCGGACACAAACACCGTGCAGATAAAGGAGGAGGATGAAGATTCAATGCCGTCCCTGGAACCAGAACTTCAGCTTCATGAGGACCAAGAGGCGGAGGATTCAGCAGCTACGCAAATACCCTTTACCCAGACAAACTGGAAGGGTCAACTGAGCTTGCCAGTGAAGCAGGAAACGGTGACCTACAACCAGGTGAAGTCTGGATACGATAAGTGCTCATTGGCTCATTGCCAGCGGCAAAGATCGAAGCATGGCGTCCACATCTACAAGTTTCCCAAATCGAGGATCCAACAGGAGCGTTGGATGCACAACCTCCGCATCCGCTACGATGAGCGAAGGCCATGGAAGTTTATGATCTGCAGCGTGCACTTCGAGCC
ACACTGCATCAGCCTGAGGAAGCTGCGCCCCTGGGCGGTTCCCACCCTGGAGTTGGGCGACAATGTGCCCGAGAAGATCTTTACGAACGAACAGTGCCATGAACTCTTCACCGATCGCAGTGAGGTGGAAAGCAATGCCGATGAGGACGATGGACTGCAGGAGGATGAGGACGAGGAGGATGAGTACGAGGAGGATATGGGGACGGAGGTACGCATCAAGCGGGAGCGGCGTTCCAAGCTGGATCCTTGGCTGCCTGGGCAGGTTCCGCCGTGGAAAGTTAAGCAGTGTTGTCTTCCTTACTGCCGGGCCTTTCGTGGCGATGGCATCAAATTATTCCGGCTGCCCAATAACCGCACCTCCATACGAAACTGGGAGCTGGCGACGGGCATGGTGTTCAAGGAGTCTCAACGAAACACTCGCCTGATTTGCAGCCGGCACTTTGAGCCCGAGTTGATTGGGGTAAGACGTCTCATGCGTAACGCCATTCCCACGAGGCACTTAAATCCCCAGGCATCCTCAGACAAAGTTAAGAGAAAGCCGGATCCACCCCTCCCAATTGCAACCTGTTGCATGGCTGACTGTCATCATAATGGAAATGTTAAACTGCACAAGTTTCCCAGTGACCCGGCGCTGCTTAGACAGTGGTGCCAGGCACTTAGGCTCACGGATACCCAGCGATATCGGGGCAAACATATTTGCTCGGTCCACCTGCCCTCTGACAGGACGGTTAGCTGTGTCATCTGCGGAGTGGACAATGTCCAGTTGCCGATGCTGGACTTTCCAGAGCAGCGCAATCAGCGCGCCAAGTGGTGCTACAATCTCAAGATCGAGGCCATATCCAAGTGGGACCACTCCAAGCACATTTGCTGTCGTCACTTCGAGTCGCAATGCTTCCTCCAGCCAGGTGAACTGCGTCCAGGTGCGATTCCCACGCTGCAGCTGAACCACAACGACACGAATATATTCCTCAGCGACTTCGCCACTGGTTCGACCGGCAATCGGATCAAGGATGAGCCGCAGGACAACGACGACATGCTGCTGGTTTAG
Protein Sequence
MSQQHPHHAHPHHYAHHYPPPATPMPLQQEQQQQAQLLPQQQHANWYSHVASYPTPHSAFGPAPSCKATNNSSSGNNNNIMDGGGYGPGGGGTQGYYGAAGGGLNVSGAVVSGGGPSYGLGANTVAYAHNQLLQYQQQQQQQHQHLPQHIGQQRPYMGHNIMTGSYPYIKSEPMEAYQQPPNPMAPPPAPEVLIKSEPIDEHSYKSNYIDDNTPFADFSKFSEFSEDMLSPKVELTVKDESYGKSTSSFLRRKQQSDRGTESLPICQRCKEVFFKKQVYVRHVAESNCGIQEYDFKCSTCPMSFMTTEELQRHKLHHRADRFFCHKYCGKHFDTIAECEAHEYMQHEYDGFVCNMCSGPSLRGSNCTPTCRSIKLHEHRLAAPYFCGKYYTGGQSSSASLSQSQAQQHQTNYKLQDCHMATMEMTTTTHHKTTPSGSSLPATAALNSLLQQRQANADGAAMFAASAIKNEVNVKMERSYSNSTSESSYSVQDSAYNNTYGSDSSMHAGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPAASYANFXXXXXXXXXXXXXXXXXCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGKRAASFLQPPLEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNFAHFIKEALGLRKLSPGAVPTLHLGHTDTFNIYENELWPPPTTPNSHSSGLQHHTQHHPSQHSLQQQLHSKSYQRQSAASTSSSASSANSHYVDPDLSASYLALSASGSSAINASDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEEKMHKGMRICSLHFEPYCIGGCMRPFAVPTLNLGHDDDDIHKNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCGNLQRPVPDGSKLFNDAICEVHFEERCLRNKRLEKWAVPTLSLGHESIPYPLPTPEQVTEFYARPTAPNNGDEQGECCVETCKRNPSVDDIKLYRPPEEASVLAKWAHNLQTESSQLTSMRICNLHFEAHCIGKRMRPWAIPTLNLAGNIENLYENPEHSMLYKRRTHLKTKQSASVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNAPPGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIKTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDEDIYPNEAQAFADEHCVVEGCEASKEQPDVRLFRFPTEDDDMLWKWCNNLKMNPVDCTGVRICNKHFEADCIGPKHLYKWAIPTEELGHDDAEIELILNPKPEDRYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDANLFERWKHNLRLEHLSFQERDRYKICNSHFEDICIGKTRLNIGSIPTLELGHDDTDDLFQVNPAELQSNLFGRQRRIQDELGGTIIKQEFSESEDVKTDVTTMSDAKDLNTRQVKLKRTMSDLKCCVRSCGRSRLEHGARLFPFPNGKQQHLKWRHNLRLEPDEVERSTRICSAHFNRRCIDGKHLRSWAMPTQQLGHQEQPIYENPKNIPGFFTPTCALSHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICADHFEAQVRGKLKLKTGAVPTLNLGHDEGLIYDNEAIKVGMAEEEEGSSELPRLKTKRELIDEEEEELEAEEEHHEQDIYDEDEKDGHYFDPLELVETFAEHPSDDEGEYCGDEEDERVEEEDVEEAEHFHPDAPPTPPIISSAKERTAFHLLSTFGFPKDRQLLLKWCSNLHLNPDDCIGRVCIEHFQPEVLGTRKLKQNAVPTLNVGHDDPLRYSCNGVDLDQEQLQPQHSVFRLWSLKHCRKRKLTEPPDIRQDKWSALEVRKMQTLRMEMKVGRKIKLEVQTEREMMMKEKTPTKNHRESNLERCCISYCAN
ADVNQLLPLPEEQNLLKKWQLNLKLSQDTDFKEIRVCLKHFEAQVVENGKPLEQAVPTLMLDQKSWNIYRNHGSCLFPECGNSSSDHLSFVNLPENVVIKDAWIRHLNLPPSSEGLLCGDHFMHLFKRVELPKVLAAQDLEDLKRVKIVSCAVKGCEMIRPRDKVLLHGLPQREDILRMWVENGQLEITEPQQQYMLRVCRNHFESRCSFDDRRLHPWSVPTLNLPGNPVHQIPTKEEWQEMTVKFTQEADTIKLEREEELEEEQLREEEEGDSFLLEPIVRMEHIESDEEDSEMQALEVLLEVGHVERMDSYERVDKSYTEHAVCHSNSIRNQYNANHCAVEGCEVTVEDVDGTIKLHKFPASSEAARKWMHNTQVDMDEKFWWRYRICSYHFEQECFQSARIKKGAMPTLLLGPKRPEEVYENEFALQETEELILPEELQFEEHKMVKSEVIKMCLPTPAPPRKSKDMCLKWQHNTQVPFDPLHRWRYRICSAHFHPVCLLNMRLVHGSVPTLKLSSKAPTDLFNNDFEAINLRLDKKLGADTNTVQIKEEDEDSMPSLEPELQLHEDQEAEDSAATQIPFTQTNWKGQLSLPVKQETVTYNQVKSGYDKCSLAHCQRQRSKHGVHIYKFPKSRIQQERWMHNLRIRYDERRPWKFMICSVHFEPHCISLRKLRPWAVPTLELGDNVPEKIFTNEQCHELFTDRSEVESNADEDDGLQEDEDEEDEYEEDMGTEVRIKRERRSKLDPWLPGQVPPWKVKQCCLPYCRAFRGDGIKLFRLPNNRTSIRNWELATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTRHLNPQASSDKVKRKPDPPLPIATCCMADCHHNGNVKLHKFPSDPALLRQWCQALRLTDTQRYRGKHICSVHLPSDRTVSCVICGVDNVQLPMLDFPEQRNQRAKWCYNLKIEAISKWDHSKHICCRHFESQCFLQPGELRPGAIPTLQLNHNDTNIFLSDFATGSTGNRIKDEPQDNDDMLLV

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-