Dnas008194.1
Basic Information
- Insect
- Drosophila nasuta
- Gene Symbol
- -
- Assembly
- GCA_017165715.1
- Location
- Scf:18070-31100[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 22 3.1e-15 4.7e-12 46.1 4.3 1 86 340 412 340 413 0.85 2 22 1.9e-15 2.9e-12 46.8 4.6 1 87 440 509 440 509 0.83 3 22 4.2e-16 6.3e-13 48.9 0.4 1 87 531 603 531 603 0.84 4 22 4.2e-16 6.3e-13 48.9 5.2 1 86 701 770 701 771 0.82 5 22 3.9e-15 5.9e-12 45.8 3.1 1 86 795 866 795 867 0.82 6 22 7.3e-13 1.1e-09 38.5 0.9 1 87 902 970 902 970 0.82 7 22 6.7e-11 1e-07 32.2 1.4 1 86 1015 1084 1015 1085 0.76 8 22 4.1e-16 6.2e-13 48.9 0.1 1 86 1112 1181 1112 1182 0.83 9 22 4.7e-13 7e-10 39.1 1.4 1 87 1203 1273 1203 1273 0.80 10 22 3.6e-15 5.4e-12 45.9 1.9 1 86 1300 1371 1300 1372 0.85 11 22 2.3e-14 3.4e-11 43.4 2.6 1 85 1448 1516 1448 1518 0.82 12 22 2.2e-12 3.3e-09 37.0 0.1 1 86 1541 1609 1541 1610 0.81 13 22 5.2e-14 7.8e-11 42.2 1.2 1 87 1752 1821 1752 1821 0.80 14 22 3.1e-12 4.7e-09 36.5 0.1 1 86 1889 1955 1889 1962 0.80 15 22 0.0034 5.2 7.5 0.1 1 58 1980 2030 1980 2044 0.75 16 22 4.9e-12 7.4e-09 35.9 0.7 1 87 2069 2139 2069 2139 0.83 17 22 3.7e-15 5.6e-12 45.9 1.9 1 86 2215 2284 2215 2285 0.83 18 22 1.3e-12 2e-09 37.7 0.8 1 86 2320 2391 2320 2392 0.81 19 22 5.2e-13 7.8e-10 39.0 0.2 1 87 2402 2474 2402 2474 0.80 20 22 2.7e-16 4e-13 49.5 0.0 1 86 2503 2576 2503 2577 0.79 21 22 9.7e-05 0.15 12.5 0.0 1 58 2610 2660 2610 2694 0.77 22 22 2.2e-09 3.4e-06 27.3 0.0 1 58 2699 2751 2699 2751 0.85
Sequence Information
- Coding Sequence
- ATGCTGAGTCCGAAGGTGGAATTAACCGTCAAGGACGAGGGCTATGGCAGTCAAAAAGTTCCCAACCCGCTTAGCTATCCGCGGCGCAAGCTGCAAACGGATCGCTCATCGGAAAGTCTTCCCATATGCCAGCGTTGTAAGGAGGTGTTCTTTAAAAGATCGATCTACTTGCGACATGTGGCCGAGAGCAGTTGCAACATACAGGAGTATGACTTCAAGTGCAACCTCTGCACCATGTCCTTCATGACCAGCGATGAGCTGCAGAAACACAAGCATCTGCACAGAATGGACAAGTTCTTTTGCCACAAATACTGTGGCAAGTACTTTGATACAATTGCCGAATGCGAATCGCATGAGTACATGCAGCACGAATATGAAAACTTTGTGTGCAACATGTGCTCCATGACGTTTGTCACGCGGGAACAACTCTATGGTCATTTGCCGCAGCACAAATTCCAGCAGCGTTACGATTGTCCCATTTGTCGCCTGTGGTATCAAACCGCTTTGGAGTTGCACGAGCATCGTTTAGCGGCGCCGTACTTCTGCGGCAAGTACTACGCACCCGCTCAATCGGCAGTTCATCAGCAACAACCACATCAACAACATTCACATCAACAGCATCAGCAACAGGCCAACTACAAACTTCAGGATTGTCACATGGGCACCATTGAAATGCCTTCGCCGCAACACAAAACTAATACATCATCAGCAAACGCATTGCCGGCAACAGCAGCGCTCAATTCGTTGTTGCAACAGCGTCAAGCCAATGCCGATAATGCCGCATTGTTTGCTTCCACATTGAAGAACGAGGCGAATGTGAAGCTGGAGCGAAGCTACAGCAATTCAACGAGCGAATCTGGTTACAGTCTACACGATAACAGTAGCTACAACAATGCCTATGGCAGCGACAACTCGATTCATGCCGCAGCCGGAGGCGGTGGTGGCGGCGGCTCTGGTGGTGCCATTGGAGGTCCGCAGGCGCACTCCTCCACACTGGACGACTCGGAGGATGCTCTTTGCTGTGTGCCTTTGTGTGGAGTGCGCAAGAGCACCAGTCCAACGCTGCAGTTCTTCACGTTCCCCAAAGATGAGAAGTATCTGCATCAGTGGCTGCATAATCTCAAGATGTTTCACATTCCGGCATCGAGCTATGCGAGCTTTCGCATTTGCAGCATGCACTTTCCGAAGCGTTGCATCAATCGCTATTCGTTGTGCTATTGGGCGGTGCCCACATTCAATCTGGGCCACGATGATGTTGCCAATCTGTATCAGAATCGCGAGCTGACCAACACATTTACCACCGGCGAGGTGGCACGCTGCAGCATGCCCAACTGCACCAGCCAGCGGGGCGAGAGCAATCTCAAGTTCTACAATTTTCCCAAGGATATCAAGAGTCTGATTAAATGGTGCCAGAATGCCCGTCTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTTTGCAGTCGCCACTTTGAGGAGCGTTGCATTGGAAAGTTCCGGCTGAAACCGTGGGCAGTGCCCACCTTACATTTGGGCGCTCAGTACGGCAAGATCCATGACAATCCCAAGAATCTGTATGTGGAGGAGAAGCGTTGCTGCCTTAACTTTTGTCGTCGCAGTCGCTCGTCCGACTTCAATATGTCGCTTTATCGCTTTCCGCGCGATGAGGTGCTGCTTCGACGCTGGTGCTATAATTTACGACTCGATCCTTCCGTCTATCGCGGCAAGAATCACAAAATATGCAGCGCTCACTTTATCAAAGAGGCTTTGGGACTGCGCAAACTATCGCCGGGTGCTGTTCCCACGCTGCATTTGGGGCACAACGACACGTTCAACATCTACGAGAATGAGCTGTGGCCACCACCGACGCCTTCCACGCCCACCAACCACCATCAGCAGCAACTGCAGCAGCATCAGTTGCAGCAACACCAGCAGCAACAACAACAGCATCACAAATATCAACGTCAATCGGCAGCCTCAACATCTTCTTCAGCCAGCTCATCGCACTATGTGGATGCTGGAGACATGGGTGGATCGTACATGGGCATGGGCAACTCAGGAGGCTCTTCGTCCGGCCTGAATGTGAGCGACAGTATGGACGTGTGCTGTGTGCCGAGCTGCGAGAGCAAGCGGCACAATAATGAGAACATCACATTCCACACGATACCGAGAAGGCCCGAGCAGATGCGCAAATGGTGTCACAATCTAAAGATACCCGAGGATAAGATGCATAAGGGAATGCGCATCTGCAGTCTGCACTTTGAACCCTATTGCATTGGCGGTTGCATGCGTCCGTTTGCCGTGCCCACGTTGCAATTGGGCCACGACGACGAGGACATTCATCGCAATCCGGATGTGATCAAGAAGCTGAACATAAGGGAAACCTGTTGTGTGGCTGTCTGCAAGCGTAATCGCGATCGTGATCATGCCAATCTGCATCGCTTCCCCAGCAATGTGGCGTTGCTGACCAAGTGGTGTGCGAACCTCCAGCGCCCGGTACCGGATGGCACTAAGCTCTTCAACGATGCCATCTGCGAGGTGCACTTCGAGGATCGCTGTCTACGCAACAAGCGGCTGGAGAAGTGGGCAGTGCCAACGTTGATACTCGGCCACGAGAACATCGCCTATCCGCTGCCCACGGCAGAGCAGGTGGCCGAGTTCTACGCTCGACCCAGTGCACCCAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGACACGTGCAAGCGTAATCCCAGCGTTGATGACATCAAACTCTATCGCCCGCCTGAAGAGTCGCAAGTGCTGGCCAAATGGTCGCACAATCTGCAGCTGGACGCTGCGAAGTTATCCAGCTTGAGGATCTGCAATCTGCACTTTGAGGCGCACTGCATTGGGAAGCGCATGCGTCCCTGGGCGATACCCACGCTCAATTTGGCGACAAACATTGAGAATCTCTACGAGAATCCCGAGCACCAAATGCTCTATAAGCGACGCACGCATCTCAAAACCAAACGTGCCGCTAATCACGAGGCGGGTGGCGTGAAACCGACCTGGGTGCCACGCTGTTGTCTGCCACATTGCCGCAAGGTGCGTGCACTGCATAATGTGCAACTGTATCGCTTTCCTAAGCTCAATCGCTCCACGCTGGCCAAGTGGGCGCACAATCTGCAAGTGCCGCTGGTGGGCAGCGCCCAAAGACGCCTCTGCTCGGCGCACTTTGAGCCGCACGTGCTCAGCAAGAAATGTCCGGTACCACTGGCTGTGCCTACGCTTGATCTGAACTCACCACCTGGCTACAAGATCTATCAGAATCCCGCCAAGCTCAAGGCCAACAAGTTGTGCCTGCAACGCGTCTGCATTGTAGAGAGTTGTCGCCGGCAGCGTGGTCAGGGCGTGCAGCTCTTCCGGCTGCCACACAATCCTACGCAGCTGCGCAAATGGATGCACAACATACGCATGCGACCCAGAGGCGCCATGCGGCAACAGTATCGTATGTGCTCCATTCACTTTGAGACGCACTCCTTCAATGGCAAGCGGTTGAGTGCGGGAGCCATCCCAACGCTCGAGTTGGGTCACAATGACGACGATATCTATCCGAATGAAGCGCAATCGTTTGTTGAGGAACACTGCACTGTTGAGGGCTGTGAGGCTTCCAAGGAGCAGCCCGATGTACGTCTCTTCCGCTTCCCCACCGACGACGAAGATCTGCTGTGGAAGTGGTGCAACAATCTCAAGATGAATCCCGTGGATTGCATTGGTGTGCGCATCTGCAACAAACACTTTGAGCTGGACTGCATCGGACCCAAGCATCTGTACAAATGGGCGATACCGACACTGCATCTGGGTCATGACGATGAGCAGATCGAGCTGATTGACAACCCCAAGCCCGAAGAACGCTATGTGGATCCCGTGTTCAAGTGCTGCGTACCGACATGCGGCAAGACGCGCAAGTTTGATGAGGTGCAGATGAATAGCTTTCCCAAGGATCCGAACATGTTTCAGCGCTGGCGACACAATTTGCGACTGGACCATCTCAATTTCAAGGAGCGCGAACGCTACAAGATTTGCAATGCGCACTTTGAGGACATTTGCATAGGGAAGACGCGGCTTAACATTGGCTCGATACCCACACTCGAGTTGGGGCACGAGGAGACGGAGGATCTATTCCAAGTGAATCCCGAGGAGCTGCAAAGCAATCTCTTTGGACGTCAGCGTCGAGTGAATTCTTCGATGGGCATGAGCATCAAGCAGGAGGACAACTCGGAGGTGGACGAGGACATTAAGCCTGACTTGAATATGTTGGAGGCAAAAGACAGAAATATCGCACAGGTAAAGATCAAGCGTTCTCTAGCGGATTACAAGTGTTGTGTGCCAGACTGTGGACGCAGTCGCTTGGAGCATGGCGCTCGCCTGTTTCCCTTCCCCAATGGCAAGCAGCAGCAGAGCAAGTGGCGCCATAATCTTCGCCTGCAGCCCGATGAAGTGGATCGCAGCACACGAGTCTGCAGTGCGCATTTCAATCGTCGTTGCATTGATGGCAAGCAGCTAAGGAGTTGGGCCATGCCCACGCAGCAGCTGGGTCATCAGGAATTGCCCATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGCTGGCTCATTGTCGCAAGCGTCGCAGCATTGACAACGATCTGCGTACTTATCGCTATCCGCGTAGCGAAGATCTGCTCGAGAAGTGGCGCGTTAATCTGCGCTTGGCGCCGGATCAGTGTCGTGGACGCATTTGTGCGGATCACTTTGAGCCTATGGTGCGCGGCAAGCTGAAGCTGAAGACGGGCGCAGTGCCCACGCTGAAGTTGGGTCACAACGAAGGCGTTGTCTTTGACAATGAGGCTATTAAGGCGGGACTGCAGCAAGAGGCGGAGGAGGGTGGCGATCAGGAGACCAGTATGGAATCGCTAGTCAAAGTGAAGCAGGAGAGACTCGATCCGGAAGAGGCGCCAGCTGATGATGTGGACCACGAGCAGCAGCACGACGACGACGATGAGCAGGCAGATCATGGCTACTTTGATCCTCTAGAGCTGGTCGAGACGTTTGCTGAGCAGCACAGCGCCGAGGATGAGAACGAAGTCAATGACGACGATGACGAATATGAAGATGAGGATGAGCCGGGCGACGATGATGAGCTGCTACTACCAGACACACCCCCTGTGAAGCGACTGCCGCCTTTGGTGCTGCCGCCGCGGCGCGAGAAAGCCGTGAACAATGTGACCCCCATCTGCTGCCTGAAACACTGTCGCAAGGAGCGCACGGCGAGCCATCAACTGAGCACCTTTGGTTTCCCAAAGGATCGGCAACAGCTGCTCAAATGGAGCGCCAATCTACAGCTATCGCTTGACGATTGTGTGGGACGCGTGTGCATCGAACACTTTGAGTCGGAGATGCTAGGCACACGCAAGCTGAAGCAGCATGCGGTGCCCACCTTGAATCTGGGTCACGCAACGCCCCTCAGCTACAGTTGCAATGGTCAGGCATTGAGCATCTACGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGCCGCAAACGGAAGCATCCAACGGAACCGCCGGATCAGCAGCAGAAGCACCGGCAGCTGGATCAGAACCCCACAACGATGATGACTAAGCGACGCTGTTGCTTGCCCAGCTGTGGCAAGCAGCCGGAAGTGCATGGCGTACAATTGCAGCGACTGCCCAGCAATCGCATTCAGCTTCGCAAGTGGCTGCACAATCTCAAGCTATCCCCAATGCTAGACAGCAGTCAGGCGCGTCTCTGCAGCGAACACTTTGAGCCGGAGCTGCTGGACCATGTGGAGGATGCGGTGCCCACACTGCGACTGGGACACGATGACACGCACATCTATCGCAATCGTGGCAACATCGCGGCAGCCTCCACGTCGAGTGCCTGCTTGGTGGCCAGTTGCCCGTGTGCTCGTCTCAATCTCTATCGCTGTTACGATCTGCCCGAACATCGTCTGGTGCAGCAAGCTTGGCTGCAGTGGTTAGAGCTGCCGCTGCCCCAACAAGCCAGCGATGGCAAGCTCTGTGTCATGCACTTCATGCAGCTCTTCGAGCAGGTGCCGCTGCCCGCGGAGTTGCCAGGCTCGGTGCTCCGTCAACTGCAGGAGACTTATGATCTCATTGGAGGCTCCACGATGGCCATGAAGTTGCGCTGCGCTGTGCCCGGCTGTTACTCGAAGTACACGGACAACATCAGGCTGACCAAGCTTCCCATGTGTCCCGGCATGTGCTCCAAATGGGTGCACAACACCAAGATCAACTACGATGCAACGCGTCACTATGTCTATCGCATCTGCATGCTGCACTTTGAGTCTCGCTGCTTGGGCCCTGTGCGTCCCAAGCTGTGGGCGGTGCCAACGCTGCACTTGAACCACAACGATGCGAATATCTATCAGAACCCGAAGTTGGATGGGCAATTCCCGTCAGCTCCAGTGCCGCCTCCTGTGCCCGTTGCCATGACGGCGTCGGTACCCGTTGCTTTGACAGCGTCGGTGCCAGTTGAGTTGCCGCTGCGCATCAAGACGGAGCTGGCCTTTAGTGGCAGTCCCAGCGCCAGTGCAAGTCCCAGTCCGCGTGGCAAGCTGCGCTTCTGCTGCATCCCCAGCTGCTTGCAACAGGCTACGTCGCAGACGCGACTCTTTCGTTTCCCCACCGCTGAAACGGCGCTGCTCAAGTGGCTGGTGAATACGCAGCAACAGCCGCGTTTGGTGGATACCCAGCAGCTGTTCATTTGCCAGGATCATTTCGAGGAGGAAGCCATCTGCAAGAAGCAGCTGCGCAGTTGGGCGGTGCCCACATTAAAGCTGGGTCACGATGGCCATGTCATACCGAATGCCAGGCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGTTACAGTACATCTGGGAGAACTATTGTTCCGTCTTGAGCTGCTTCCAGCCGCGTAGCGAGGAATTGCGTCTCTACGCTTATCCCACGGATCGACCCACCATTCGCAAGTGGGCGACCAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGATGGTTTCCAGGTCTGCCAGCTGCACTTTGCACCACATTGCTTTGACCGGGAGACGGGCGAGTTGAGAGAGGATGCGGTGCCCACGCTGGAACTGAGTCGATGCCTAAACGATGTACATTGCATCGTCGCTGGCTGTGTGAAAGACGAGGATGGACCGCGTCAACGCTTCTACAAGATGCCCAAGCGCAGTGCTCAACTGCTTAGCTGGTGTCACAATCTGCGTTTGGATGCGGCAACCATGGGCGGTGGGGAGCATCACGTCTGCGATCGCCACTTCGAAACGCAGTGCATCAATCAACAAAAACTGCTACGACCCGGCGCACGTCCTACTCTCCACCTGGGCCACGATGAGCCCATTGAATTGATGCCCAATCCAGCGGAATGGGATGCAACGGATGCTGCGGCTGCTGTAGACAATGTATGCTGTGTGCCCAACTGCGGTCTGGCCAAGGATGAGGAGGAGGATGTGCAGCTGTTTGCCTTTCCNAAGCTGCGGTCGCTCGCCGAAAAGTGGCTACAGAATATACGCCTCGAAAACATAAGTCGCGAGCAGCTGATGCGACTGAGGATCTGCAGCGCACACTTCGATGATAGCTGCCTGGAGAGCAACGGACGTCCACAGCTGGGCGCCATGCCCACGCTGCAACTGGGTCACGAGGACAACGGGAACATTCATCGCAGCACCGATGCTGCTGCCGTTAAGGCTAAGAAGTTCTGTAATCGAAGTGGCTCCAGCTATGACTGCTGCTATCCGCAATGTGTGGAGCTGCAGAAGAGTTACCTGAGGATTAGCTACGATCTACCACAATCGGAGGCACTGCGTCTCAAGTGGCTGGAGTACATGGGTCTGGAAAAGACGGAAGAGAAGCTCTTGAAGCTGTGCCCGCTGCACTTGGTGCTGCTCTACGATCACAGCGTCGAACATTTTGCAGAGGAACACACGCCCGAGCAGCAGTTGGACGCCAACTACGAGGACAGTCGGAACAGTGTGCGATTGCGAGTCATCAGCTGCGCGGTGCCCGGCTGCCGTACGCTAAAGCCACGCGATGGTGGCATACTTTATGGATTGCCGCAGCGTCGCGATGTGCTTGAGATGTGGCTACACAACATGCAGCTGGTGTTCTATGAGCAGCAGCGTTACATGTACAAGATATGCAGCAAACACTTTGA
- Protein Sequence
- MLSPKVELTVKDEGYGSQKVPNPLSYPRRKLQTDRSSESLPICQRCKEVFFKRSIYLRHVAESSCNIQEYDFKCNLCTMSFMTSDELQKHKHLHRMDKFFCHKYCGKYFDTIAECESHEYMQHEYENFVCNMCSMTFVTREQLYGHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYAPAQSAVHQQQPHQQHSHQQHQQQANYKLQDCHMGTIEMPSPQHKTNTSSANALPATAALNSLLQQRQANADNAALFASTLKNEANVKLERSYSNSTSESGYSLHDNSSYNNAYGSDNSIHAAAGGGGGGGSGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPTNHHQQQLQQHQLQQHQQQQQQHHKYQRQSAASTSSSASSSHYVDAGDMGGSYMGMGNSGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVDTCKRNPSVDDIKLYRPPEESQVLAKWSHNLQLDAAKLSSLRICNLHFEAHCIGKRMRPWAIPTLNLATNIENLYENPEHQMLYKRRTHLKTKRAANHEAGGVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGAMRQQYRMCSIHFETHSFNGKRLSAGAIPTLELGHNDDDIYPNEAQSFVEEHCTVEGCEASKEQPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFELDCIGPKHLYKWAIPTLHLGHDDEQIELIDNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPNMFQRWRHNLRLDHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHEETEDLFQVNPEELQSNLFGRQRRVNSSMGMSIKQEDNSEVDEDIKPDLNMLEAKDRNIAQVKIKRSLADYKCCVPDCGRSRLEHGARLFPFPNGKQQQSKWRHNLRLQPDEVDRSTRVCSAHFNRRCIDGKQLRSWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHNEGVVFDNEAIKAGLQQEAEEGGDQETSMESLVKVKQERLDPEEAPADDVDHEQQHDDDDEQADHGYFDPLELVETFAEQHSAEDENEVNDDDDEYEDEDEPGDDDELLLPDTPPVKRLPPLVLPPRREKAVNNVTPICCLKHCRKERTASHQLSTFGFPKDRQQLLKWSANLQLSLDDCVGRVCIEHFESEMLGTRKLKQHAVPTLNLGHATPLSYSCNGQALSIYDAQPQHSVFRLWSLKHCRKRKHPTEPPDQQQKHRQLDQNPTTMMTKRRCCLPSCGKQPEVHGVQLQRLPSNRIQLRKWLHNLKLSPMLDSSQARLCSEHFEPELLDHVEDAVPTLRLGHDDTHIYRNRGNIAAASTSSACLVASCPCARLNLYRCYDLPEHRLVQQAWLQWLELPLPQQASDGKLCVMHFMQLFEQVPLPAELPGSVLRQLQETYDLIGGSTMAMKLRCAVPGCYSKYTDNIRLTKLPMCPGMCSKWVHNTKINYDATRHYVYRICMLHFESRCLGPVRPKLWAVPTLHLNHNDANIYQNPKLDGQFPSAPVPPPVPVAMTASVPVALTASVPVELPLRIKTELAFSGSPSASASPSPRGKLRFCCIPSCLQQATSQTRLFRFPTAETALLKWLVNTQQQPRLVDTQQLFICQDHFEEEAICKKQLRSWAVPTLKLGHDGHVIPNARHNGNIADSQENKQTLQYIWENYCSVLSCFQPRSEELRLYAYPTDRPTIRKWATNCKHRSMQASSDGFQVCQLHFAPHCFDRETGELREDAVPTLELSRCLNDVHCIVAGCVKDEDGPRQRFYKMPKRSAQLLSWCHNLRLDAATMGGGEHHVCDRHFETQCINQQKLLRPGARPTLHLGHDEPIELMPNPAEWDATDAAAAVDNVCCVPNCGLAKDEEEDVQLFAFPKLRSLAEKWLQNIRLENISREQLMRLRICSAHFDDSCLESNGRPQLGAMPTLQLGHEDNGNIHRSTDAAAVKAKKFCNRSGSSYDCCYPQCVELQKSYLRISYDLPQSEALRLKWLEYMGLEKTEEKLLKLCPLHLVLLYDHSVEHFAEEHTPEQQLDANYEDSRNSVRLRVISCAVPGCRTLKPRDGGILYGLPQRRDVLEMWLHNMQLVFYEQQRYMYKICSKHF
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -