Basic Information

Gene Symbol
-
Assembly
GCA_018903425.1
Location
JAEIGD010000032.1:305456-317175[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
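The consensus spacing quoted above can be written as a regular expression. The following is a minimal sketch only: the pattern is a literal reading of the stated consensus, the names `C2CH_PATTERN` and `find_c2ch` are hypothetical, and real THAP domains are identified with profile HMMs (as in the Hmmscan output of this record), not with a pattern like this.

```python
import re

# Literal reading of the consensus in the description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His.
# The lookahead allows overlapping candidates to be reported.
C2CH_PATTERN = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein):
    """Return 1-based inclusive (start, end) spans that fit the consensus."""
    spans = []
    for match in C2CH_PATTERN.finditer(protein):
        start = match.start(1)
        spans.append((start + 1, start + len(match.group(1))))
    return spans


# Synthetic toy sequence (hypothetical, not taken from this record).
toy = "M" + "C" + "AA" + "C" + "A" * 35 + "C" + "AA" + "H"
print(find_c2ch(toy))
```

A pattern this loose may well report spans that are not THAP domains, so any hit would need to be checked against the profile-based coordinates.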
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 24 9.2e-16 2.8e-12 47.5 3.8 1 86 364 436 364 437 0.85
2 24 2.6e-15 7.9e-12 46.0 5.0 1 87 464 533 464 533 0.83
3 24 4.1e-16 1.2e-12 48.6 0.4 1 87 555 627 555 627 0.85
4 24 3.1e-16 9.5e-13 49.0 5.5 1 87 657 727 657 727 0.82
5 24 4.9e-15 1.5e-11 45.1 3.5 1 86 751 822 751 823 0.82
6 24 6.6e-13 2e-09 38.3 1.0 1 87 858 926 858 926 0.81
7 24 5.8e-11 1.8e-07 32.1 1.4 1 86 972 1041 972 1042 0.76
8 24 3.9e-16 1.2e-12 48.6 0.1 1 86 1069 1138 1069 1139 0.82
9 24 2.2e-12 6.8e-09 36.6 0.7 1 85 1160 1228 1160 1230 0.79
10 24 7.6e-15 2.3e-11 44.5 1.6 1 86 1257 1328 1257 1329 0.84
11 24 5.8e-14 1.8e-10 41.7 1.5 1 85 1403 1471 1403 1473 0.81
12 24 1.6e-12 4.9e-09 37.1 0.1 1 86 1496 1564 1496 1565 0.81
13 24 1e-13 3.1e-10 40.9 1.6 1 87 1712 1781 1712 1781 0.80
14 24 1.1e-12 3.5e-09 37.5 0.4 1 61 1857 1910 1857 1928 0.78
15 24 0.029 86 4.2 0.0 1 58 1943 1993 1943 2014 0.74
16 24 3.7e-12 1.1e-08 35.9 0.3 1 86 2032 2101 2032 2102 0.84
17 24 1.4e-15 4.3e-12 46.9 1.4 1 86 2166 2235 2166 2236 0.84
18 24 6.4e-13 1.9e-09 38.3 1.5 1 86 2271 2342 2271 2343 0.81
19 24 2.4e-12 7.2e-09 36.5 2.3 1 87 2353 2425 2353 2425 0.79
20 24 6.6e-15 2e-11 44.7 0.1 1 86 2455 2529 2455 2530 0.75
21 24 1.6e-05 0.048 14.6 0.1 1 58 2566 2620 2566 2640 0.84
22 24 6.1e-15 1.8e-11 44.8 0.5 1 86 2658 2730 2658 2731 0.81
23 24 5.8e-14 1.8e-10 41.7 3.0 1 86 2876 2948 2876 2949 0.82
24 24 0.024 73 4.5 0.1 1 24 3011 3031 3011 3043 0.77
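The rows above are whitespace-separated with 13 fields each, so they can be parsed and filtered by independent E-value. This is a sketch under stated assumptions: the field names in `DomainHit` are illustrative labels, and the 0.01 cutoff is an arbitrary choice for the example, not a threshold given by this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # domain number (first column)
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein


def parse_hits(text):
    """Parse hmmscan per-domain rows; header and malformed lines are skipped."""
    hits = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != 13 or not fields[0].isdigit():
            continue
        hits.append(DomainHit(int(fields[0]), float(fields[2]), float(fields[3]),
                              float(fields[4]), int(fields[8]), int(fields[9])))
    return hits


def significant(hits, max_i_evalue=0.01):
    """Keep hits at or below the (assumed) independent E-value cutoff."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]


# Three rows copied from the table above.
ROWS = """
1 24 9.2e-16 2.8e-12 47.5 3.8 1 86 364 436 364 437 0.85
15 24 0.029 86 4.2 0.0 1 58 1943 1993 1943 2014 0.74
21 24 1.6e-05 0.048 14.6 0.1 1 58 2566 2620 2566 2640 0.84
"""
hits = parse_hits(ROWS)
print([hit.index for hit in significant(hits)])
```

Applied to all 24 rows, this example cutoff would drop domains 15, 21 and 24, whose independent E-values are 86, 0.048 and 73.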

Sequence Information

Coding Sequence
ATGATAATAAAATCGGAACCCATTGACGAACATGCGTATAAGTCCAACTATATTGATGATAATACGCCGTTTGCCGATTTTAGTAAATTTCCAGAATTTGGCAACGACATGCTGAGTCCCAAGGTAGAGCTGTCCGTCAAGGATGAGGCGTATCAAAAGAACCCGCTTAGTTATCCGCGGCGCAAGCTGCAATCGGAACGTTCGGCGGAGAGCTTGCCGATTTGTCAGCGCTGCAAGGAGGTGTTCTTCAAGAAGCAAATCTACTTGCGGCACGTGGCAGAGAGTAGTTGCACCATACAGGAGTATGACTTCAAGTGCAACATATGCCCCATGTCCTTTATGAACACCGAAGAACTGCAAAAGCACAAGCATCTGCACAGGGCCGATAAGTTCTTCTGCCACAAATATTGTGGCAAGTACTTTGACACCATTGTAGAGTGCGAGTCACATGAGTACATGCAGCACGAGTATGAGAGCTTCGTTTGCAACATGTGCTCGGTTACATTTGCCACACGGGAGCAGCTCTATGCGCACTTGCCACAGCACAAATTCCAGCAACGTTTCGATTGTCCCATTTGCCGGCTATGGTATCAAACGGCATTAGAGCTGCATGAGCATCGCCTGGCGGCGCCATATTTCTGTGGCAAGTATTACGCAGGCGCACAGACCACACAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCATGGCCACCATGGAAATGCCGACGACACATCAAAAGGCAACTGCTGGCAGCGCATTGCCGGCAACGGCAGCGTTGAACTCTCTGTTGCAACAACGTCAGGCAAATGCCGATGGTGCCGCATTGTTTGCATCCTCTCTGAAGAACGAAGTGAACGTTAAACTGGAGCGAAGCTATAGCAACTCTACAACAAGCGAATCCGGCTACAGTCTGCATGACAGCAGCAACTATAACAATGCATATGGAAGCGATGCCTCGTTGCATGGCGGTGCAGGTGCCGGAGCTATTGGTGGTCCACAGGCGCATTCCTCAACGCTGGACGATTCAGAGGATGCGCTTTGCTGCGTGCCCTTATGTGGTGTGCGCAAGAGCACCAGCCCCACGCTGCAGTTCTTTACATTCCCCAAAGACGAGAAGTACTTGCATCAATGGCTACATAATCTCAAGATGTTTCATATCCCGGCCTCGAGCTATGCCACTTATCGCATCTGTAGCATGCACTTTCCGAAGCGCTGCATCAATCGTTATTCATTGTGCTATTGGGCTGTGCCCACGTTCAATTTGGGGCACGATGATGTTGCCAATCTATATCAGAATCGCGAGCTAACGAACACCTTTACCACCGGCGAGGTGGCACGTTGCAGCATGCCCAACTGTACCAGTCATCGTGGGGAAAGCAATCTCAAGTTTTACAACTTCCCGAAAGACATTAAGAGTCTTATCAAGTGGTGCCAGAACGCACGTTTGCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGACACTTTGAGGAGCGATGCATTGGCAAGTTCCGGCTAAAGCCATGGGCGGTGCCAACGCTGCATTTGGGTGCCCAATACGGCAAGATTCACGACAATCCCAAGAATTTGTATGTGGAAGAGAAGCGCTGTTGCCTTAATTTCTGCCGTCGCAGTCGCTCCTCGGACTTTAACATGTCGCTGTATCGCTTCCCCAGAGATGAAGTGCTCTTGCGACGTTGGTGCTACAATTTACGCCTTGATCCTGCCGTCTATCGTGGCAAGAATCACAAGATATGTAGTGCTCACTTCATCAAGGAAGCACTGGGATTGCGAAAACTATCGCCGGGTGCTGTTCCTACGCTTCATTTGGAGTTGAGTGCATCGTATATGGGCATGGGAAGCGGATCGGGAGGATCCGCATCGGGCCTGAATGTGAGCGACAGCATGGATGTGTGCTGTGTGCCAAGTTGTGAGAGCAAGCGACA
CAAAAACGAGAACATCACATTCCATACTATACCGAGGCGACCCGAACAGATGCGCAAATGGTGTCACAATCTCAAGATACCCGAGGATAAGATGCATAAGGGAATGCGAATTTGTAGTCTACACTTTGAGCCCTATTGCATTGGCGGCTGTATGCGTCCGTTTGCGGTGCCCACACTGAATCTCGGGCACGACGACGATGATATACATCGCAATCCGGATGTAATTAAGAAACTGAATATACGCGAAACCTGCTGTGTGGCTGTGTGCAAGCGTAATCGGGATCGCGATCATGCCAATCTGCATCGTTTCCCCAGCAATGTCTCCCTGCTGACGAAATGGTGCGCCAATCTGCAGCGACCTGTGCCCGATGGCACTAAACTCTTCAACGATGCCATTTGTGAGGTGCACTTTGAGGATCGTTGCCTGCGCAACAAGAGATTGGAGAAATGGGCAGTGCCTACGCTGGTGCTGGGACATGACAACATTGCATATCCACTGCCAACGGCAGAACAGGTGGCCGAGTTCTATGCCCGTCCCAGTGCACCCAACAATGGTGAGGAGCAGGGCGAGTGCTGTGTGGAGACCTGTAAGCGTAATCCCAGTGTCGATGACATCAAACTCTATCGCCCGCCCGAGGAGTCGTCTGTGCTGGCGAAATGGGCACATAATCTTGAAGTGGATGCTGCCAAGTTGTCCAGCATGCGTATATGTAATCTGCACTTTGAATCCCATTGCATTGGCAAGCGTATGCGTCCCTGGGCCATACCCACACTCAATCTGTCCAACAATATTGAGAACATGTACGAGAATCCCGAGCATCATATGATCTACAAGCGTCGCACACAAATTAAAACCGAGAGGGGTGCATCGGGAGCGGGGGCAAGCATGTCGAAACCCACTTGGGTGCCACGTTGCTGCTTGCCACATTGTCGCAAGGTGCGTGCTCTGCACAATGTGCAGCTCTATCGCTTCCCCAAACTCAATCGTTCCACGTTGGCCAAGTGGGCGCATAATCTGCAAGTGCCACTGGTGGGCAGTGCCCAGAGGCGACTCTGCTCTGCCCACTTTGAGCCGCATGTGCTGAGCAAAAAGTGTCCGGTACCGCTGGCGGTGCCCACACTGGATCTCAATGCGCCACCCGGTTATAAAATCTATCAGAATCCCGCCAAGCTAAAGGCCAACAAGCTGTGTCTGCAGCGTGTCTGCATTGTGGAGAGCTGCCGTCGTCAACGTGGTCAGGGCGTGCAGCTCTTCCGTCTGCCGCACAGTCCCACACAGCTGCGCAAATGGATGTACAACATACGGATGCGCCCCAGAGGTGCCATGCGGCAACAGTATCGCATCTGCTCGACGCACTTTGAGACGCATTCGTTCAATGGAAAGCGACTGAGTGCGGGAGCGATACCGACACTGGAGTTGGGCCATGACGACGAGGACATCTATCCCAATGAGGCGCAATCATTTGTGGAGGAACACTGTGCCATCGATGGGTGCGATGCATCCAAAGAGCAGCCAGATGTGCGTCTCTTCCGCTTTCCCACCGACGATGAGGACTTGCTCTGGAAATGGTGTAACAATCTGAAAATGAATCCTGTCGATTGTATTGGCGTGCGCATCTGCAACAAACACTTCGAGGGAGACTGCATAGGACCGAAGCATCTCTACAAGTGGGCCATACCCACCCAGGAGTTGGGCCACGATGACTCTCAGATTGAGCTCATTACCAATCCCAAGCCAGAGGAGCGTTATGTTGATCCCGTGTTCAAGTGTTGTGTTCCCACCTGCGGCAAGACGCGTAAATTCGATGAAGTGCAAATGAATAGCTTCCCCAAAGATCCTACACTCTTCCAGCGCTGGCGACACAATCTGCGACTGGAGCATCTCCAGTTCAAGGAGCGCGAACGCTATAAGATCTGCAATGCACACTTTGAGGATATTTGCATTGGCAAGACACGTCTGAATATTGGCTCCTTACCCACACTGGAGTTGGGGCACGACG
ATACCGAAGATCTCTTCCAGGTGAATCCTGCAGAGTTGCAAAGCAATTTGTTTGGTCGTCAGCGACGTGTACAGGAATCAGGTCTGGGGGTCAGAATAAAGCAGGAGGAGCACTCCGAGCAAGAGGAGACGGATGTTACCATGTCGGAGGCTACGGATTTGAATACCAAGCAGGTAAAAATAAAGAAACGCCTTGCGGACTTGAAGTGCTGCGTGCCCAGCTGTGGTCGCAGTCGTCTGGAGCACGGCGCACGTTTGTTTCCCTTTCCAACTGGCAAACAACAGCAGAGCAAGTGGCGCCACAATTTGCGTCTGGATGCCAGCGATGTGGACAAGACGTCGCAAGTTTGCAGCGCACACTTCAATCGACGTTGCATCGATGGCAAGCAGCTGCGTGGCTGGGCAATCCCCACACAGCAGCTGGGACATCAGGAGCAACCGATCTATGAGAATCCAAAGAATATACCGGGCTTCTTTACACCCACCTGTGCGCTGGCGCACTGCCGCAAGCGGCGAAGTATTGACAATGATTTGAGGACTTATCGTTATCCCCGCAGCGAGAAGTTGTTGGAGAAATGGCGAGTGAATCTACGACTGGCTCCAGATCAGTGTCGCGGACGCATCTGTGCGGATCATTTTGAGCCCATGGTACGTGGCAAGCTGAAGCTGAAGACGGGTGCAGTGCCAACACTTAAACTGGGACATGATGAGGGCGTGGTCTTTGACAATGAGGCCATTAAGGCGGGACTGCAGCTGGATGGAGCCGAAGAGCCAGAGGCGGAGGAGGGTGAGGCGAGCAGAGAATCGCTGATTAAGATTAAAAAGGAAAAACTAGAACCAGATGAGGAGGAGGCGGAGGCGGAGGTAGAGCAGACACAGCTTGACAATGGAGACGATGATGATGAACAGGAAGAAGATGTGGATGCGGATGAGGACGATGATCATGGCTACTTCGATCCATTGGAACTGGTTGAAACCTTTGCTGACGACAATGCAGCCGATGATGATGAAGGTGATATTCCTGGTAATGATGATGAGCTTCTGCTGCTGCCTGATACGCCGCCAGTGAAGCTAGAACATCCGCTCCCGCCGCTGCTACGTCGCGAGAAGCCCGTGAATAATGTGACGCCAATTTGTTGTCTAAAGCATTGCCGCAAAGAACGCACTGCAACACATCAGCTGAGCACCTTTGGCTTTCCAAAGGATCGCCTGCAGCTACGCAAGTGGAGTAGCAATCTACAAATTGCCGAGAGCGATTGCGTGGGACGTGTTTGCATTGAGCATTTCGAGGCGGAGGTGCTGGGCACGCGTAAGCTCAAACAGCATGCGGTGCCCACATTGAATTTGGGCCATGACACGCCCCTAATCTACAGATGCAATGGGCAGGCGCAGCCCATGGGAGGCATCTTCGATGAGCAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGGTGGAAGCAATGGAACCGCCCGATCATCATCATCATCAACGAGCAGTGATTAAGCAGGAGATAGAGGAGGAGGAGGAGCAGGAGGATGGGCTCAGCTGTTGCCTGCCCCACTGTGGCATGCGGGCGAATGACGTGCAACTGCATCGTCTGCCCAGCAATCGCATGCGACTGCGCAAATGGTTGCACAATCTCAATCTGCCCCAGAGCATTCCCATCGGCAGTCAGACACGTGTCTGCAGCGAACATTTTGATGTGCAACCGAGTGAGGCACTCGAGGATTGTATGCCCACCTTGAAACTGGGTCACAATGATATCAATATATACAGGAATCAAAGCTCTTCTGCCTCCTCATCCTGCTTAGTGCCAAGCTGCCCGTGTGCACGACTCAACCTCTATCGAGGCTATGATTTGCCTGAGCATCATCTGGTACAGCAGGCTTGGCTGCAATTCCTGCGACAACCCATGCCTCAGCTGCCCGCCGATGGTCAGCTCTGCGTGATGCACTACATGCAGCTGTACGAGCAGGTC
TCGCTGCCCCAGGAGGTGCCAGCGACAGTGTTGCGTCAACTGCAGGAAACCTACGAACAGATTGCCAGCTCCACCATGGCCATGAAGTTGCGCTGCGCCGTGCCCGGCTGCTACTCCAAGTATACGGATAACATAAGGTTGACCAAGCTGCCCACCTGCTCGGACATGTTTGCCAAGTGGCTGCACAACACCAAGATCAAATACGATGCCAACAGGCATTATATCTATCGCATTTGTATGCTCCACTTTGAGGCCCGTTGTCTGGGTCCAGTGCGTCCCAAATTGTGGGCGGTGCCGACGCTGCAATTACATCACAACGATGCAAATATCTATAGGAATCCCAAACTAGATGGGAGTGGACAAGAAGTACAGCAAGTCGTTCCACAAACTGTACCCGTGCCCGTGGAGCTGCCGTTGCGCATCAAGACGGAGCTGGCTTACACTGGCAGTCCGAGTGCCAGCGCCAGTCCTAGTCCACGTGGCAAGTTGCGCATTTGCTGCATTCCCGGCTGTGCCCAACAGGCCAACTCTCAGGGGCGACTCTTTCGCTTCCCCACCTCCGAAACAATGCTGCTCAAGTGGCTGGTGAACACACAGCAGAAGCCCCGTCTGGTGGATCCACAGCATCTGTTTGTCTGCCAGGATCATTTCGAAACGGAAGCCATTTGCAAGAAGCAGCTAAGCAGTTGGGCTGTGCCCACATTGAAGCTGGGACACGATGGTCATATCATTCCCAATGCCAAGCACAATGGAAACATTGCGGACACACATGAGAATCGGCACACGCTGCAATTTATTTGGGCCAATTACTGCTCTGTGCTGGACTGTTTTGAGCCCCGCAGTGAGCAACTGCGTCTCTTCGCATATCCCACAGATCGACCTACCATACGCAAGTGGGCTGCCAATTGCAAGCATCGTTCCATGCAGGCCAGCAGCGATGGCTTTCAGGTTTGTCAATCGCACTTTGAGCCACATTGTTTTCATCCGGAAACGGGAGAGTTGCGAGAGGATGCAGTGCCCACGCTGGAGCTAAGTCGTGGCATCAACAAGACGGGTTGCATGGTGCCCGGCTGCGTCAAGGAGGAGAATGGACCGCGTCGTCGTCTGTTTAAGATGCCCAAGCGTTCATCGCAATTGTTAGATTGGTGTCACAATTTCCAACTGGATCCATCGAGCATCTCAATTGAGAAGAGTGTTGTGTGTGATCGTCACTTTGAGTCGCATTGCTTGAATGTGCATAAGCAGGTGCGTCGTGGAGCACGTCCCACACTTCACTTGGGTCACACTACGGATGTTGAGATATTGCAGAACCCTTCCAATTGGGCCCGCTGCACTGAGGTGCCTGAGACTCCTGCTGATGTCTGCTGTGTGCCCAATTGTGGACGGTACAAGGATGCCGAGGAAGAGGATACTGATATGCAGCTCTTTAGCTTTCCCAAGGTGCGAACGCTGGCAGAAAAGTGGCTGAGGAACATTCGACTCGAGGCAAACAAGGAACAGTTGGCTGAGCTGAAAGTCTGCAATGCGCACTTTGAGATCTACTGTCTGGAGAATGGACGCCCACAGTTGGGTGCCATGCCCACACTGCAGTTGGGACACGATGATCATCATGATATACATCGCTCCAGTTGTCTCACGTCCTTGTCGGCCTCCAAAATGAAGCGTTATTGCAATAGAAATGGATACAGCTATGATTGCTGCTTTCCTCAGTGTGTGGAGCTGCAGAAGAGTTATCTACGGATTAGCTACAATTTGCCACAGTCGCAAGCAATGCGTGATGCCTGGATAGCCTACATGGAGTTGGAGGAACCGAAGGAGCAAGAAAAGGAGCAGCAGCTGAAATTGTGCCCGCTGCACTTGATCATTCTTTACGAGCACAGCGTAGACAATTTCCCCGAGCACACAACAGAGGAACTGTTGGAGGATAACTATGCAGCTGCGAGAAATAGTGTGCGGATACGTATCATCAGTTGTGCAGTGCGTGGCTGCAAGACTCTGAA
GACCCGCGATGGCGGTTGCCTTCACGGTCTGCCCCAGCGTCGGGATATACTGCAAATGTGGTTGCACAATATGCAGTTGGTGTTCTACGAGCAACAGCGGTACATGTACAAGATCTGTAGCAAACACTTTGAGCCCAATTGCTTTACGGACACAACACACCGCCTCAAGCCATGGAGTATGCCAACGTTGGAGTTGCCTGTGCCCGAACCGGGTGAACCAGAAGTCTATCAGAATCCCACAGAAGCTGAATGGCAGCAAATGAATAAACAATGGGCAGCAGAGCAGCTGCAAACAGCTCAGCCAATGATGGAGGATGAAGAGCATGAGGAAGTTGTGCAAGTGAAGCTTGAGCCGGAGACGGAGATGGAAGCGGATAATCTGATGGCATACGAGGAAGAGGATTACTCGCAGCCTCCCGTGGATGGAGAGGAGGATGCCAGTTCGCAGCAGCCGCTGGAGATGCAACCCTTGGAGGTGCTGCTTGAGGTGGGGCATGTGGAGAAGTGCAGAACATACGAGCAAATGGACACCAAGGCAAATCTAAGTTATGCCGAGCAACAGGCTGGACCGCTGCCTAGTAATGGATTCTTTGCCAGTAATGGCAACAAGTACACTGCACGCAATTGCAGTGTACAAGGCTGCCACGTGACCTTGAACGATATCTGTGGCAATGTGAAGTTGCACAAATTTCCCACGTCGTGGGAGGCAATGCAAAAGTGGATGCACAACACCCAAGTGAAGGTGTGCAGAAGTGTCTCGTGGCGTTTTCGCATTTGCAGTTATCATTTTATGGAAGATTGTTTTCAAGGCGCTCGCCTAAGGCGTGGTGCCATGCCCACATTGCAGTTGGGCCCCAAGCGACCTTCGCATATCTATGAGAGTGAGTTCAATGTTGGCGATGGCGACGAGGAGCAGCCCATTGAAGAGTCATCAAAACAAACACCAAACACCAAGGCAAGGGTGGTAGGTGGCGATAATATTAGTCTGTGCCTGCCTAGCCCAGGACCGCCTCGCAAGTCGAGCAAGTTCTGCCAAGTCGATGGTTGTCCTAATCATTTGACCAGCGAGAATCTGACGCTGCACAAGTTCCCCCCACTCGCCGGACATGTGTGCCAAGTGGCAGCACAATACACAGGTGCCATTTGA
Protein Sequence
MIIKSEPIDEHAYKSNYIDDNTPFADFSKFPEFGNDMLSPKVELSVKDEAYQKNPLSYPRRKLQSERSAESLPICQRCKEVFFKKQIYLRHVAESSCTIQEYDFKCNICPMSFMNTEELQKHKHLHRADKFFCHKYCGKYFDTIVECESHEYMQHEYESFVCNMCSVTFATREQLYAHLPQHKFQQRFDCPICRLWYQTALELHEHRLAAPYFCGKYYAGAQTTQXXXXXXXXXXXXXXXXXXXXXXXXXXMATMEMPTTHQKATAGSALPATAALNSLLQQRQANADGAALFASSLKNEVNVKLERSYSNSTTSESGYSLHDSSNYNNAYGSDASLHGGAGAGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATYRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSHRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLELSASYMGMGSGSGGSASGLNVSDSMDVCCVPSCESKRHKNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLNLGHDDDDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHDNIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESSVLAKWAHNLEVDAAKLSSMRICNLHFESHCIGKRMRPWAIPTLNLSNNIENMYENPEHHMIYKRRTQIKTERGASGAGASMSKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNAPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHSPTQLRKWMYNIRMRPRGAMRQQYRICSTHFETHSFNGKRLSAGAIPTLELGHDDEDIYPNEAQSFVEEHCAIDGCDASKEQPDVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFEGDCIGPKHLYKWAIPTQELGHDDSQIELITNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPTLFQRWRHNLRLEHLQFKERERYKICNAHFEDICIGKTRLNIGSLPTLELGHDDTEDLFQVNPAELQSNLFGRQRRVQESGLGVRIKQEEHSEQEETDVTMSEATDLNTKQVKIKKRLADLKCCVPSCGRSRLEHGARLFPFPTGKQQQSKWRHNLRLDASDVDKTSQVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQPIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRSEKLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKAGLQLDGAEEPEAEEGEASRESLIKIKKEKLEPDEEEAEAEVEQTQLDNGDDDDEQEEDVDADEDDDHGYFDPLELVETFADDNAADDDEGDIPGNDDELLLLPDTPPVKLEHPLPPLLRREKPVNNVTPICCLKHCRKERTATHQLSTFGFPKDRLQLRKWSSNLQIAESDCVGRVCIEHFEAEVLGTRKLKQHAVPTLNLGHDTPLIYRCNGQAQPMGGIFDEQPQHSVFRLWSLKHCRKRKVEAMEPPDHHHHQRAVIKQEIEEEEEQEDGLSCCLPHCGMRANDVQLHRLPSNRMRLRKWLHNLNLPQSIPIGSQTRVCSEHFDVQPSEALEDCMPTLKLGHNDINIYRNQSSSASSSCLVPSCPCARLNLYRGYDLPEHHLVQQAWLQFLRQPMPQLPADGQLCVMHYMQLYEQV
SLPQEVPATVLRQLQETYEQIASSTMAMKLRCAVPGCYSKYTDNIRLTKLPTCSDMFAKWLHNTKIKYDANRHYIYRICMLHFEARCLGPVRPKLWAVPTLQLHHNDANIYRNPKLDGSGQEVQQVVPQTVPVPVELPLRIKTELAYTGSPSASASPSPRGKLRICCIPGCAQQANSQGRLFRFPTSETMLLKWLVNTQQKPRLVDPQHLFVCQDHFETEAICKKQLSSWAVPTLKLGHDGHIIPNAKHNGNIADTHENRHTLQFIWANYCSVLDCFEPRSEQLRLFAYPTDRPTIRKWAANCKHRSMQASSDGFQVCQSHFEPHCFHPETGELREDAVPTLELSRGINKTGCMVPGCVKEENGPRRRLFKMPKRSSQLLDWCHNFQLDPSSISIEKSVVCDRHFESHCLNVHKQVRRGARPTLHLGHTTDVEILQNPSNWARCTEVPETPADVCCVPNCGRYKDAEEEDTDMQLFSFPKVRTLAEKWLRNIRLEANKEQLAELKVCNAHFEIYCLENGRPQLGAMPTLQLGHDDHHDIHRSSCLTSLSASKMKRYCNRNGYSYDCCFPQCVELQKSYLRISYNLPQSQAMRDAWIAYMELEEPKEQEKEQQLKLCPLHLIILYEHSVDNFPEHTTEELLEDNYAAARNSVRIRIISCAVRGCKTLKTRDGGCLHGLPQRRDILQMWLHNMQLVFYEQQRYMYKICSKHFEPNCFTDTTHRLKPWSMPTLELPVPEPGEPEVYQNPTEAEWQQMNKQWAAEQLQTAQPMMEDEEHEEVVQVKLEPETEMEADNLMAYEEEDYSQPPVDGEEDASSQQPLEMQPLEVLLEVGHVEKCRTYEQMDTKANLSYAEQQAGPLPSNGFFASNGNKYTARNCSVQGCHVTLNDICGNVKLHKFPTSWEAMQKWMHNTQVKVCRSVSWRFRICSYHFMEDCFQGARLRRGAMPTLQLGPKRPSHIYESEFNVGDGDEEQPIEESSKQTPNTKARVVGGDNISLCLPSPGPPRKSSKFCQVDGCPNHLTSENLTLHKFPPLAGHVCQVAAQYTGAI
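One way to check that the coding and protein sequences above agree is to translate the coding sequence with the standard genetic code. The sketch below assumes the wrapped sequence lines are joined first, and it reports any codon containing N as X, which is this example's assumption about how the run of X in the protein relates to the run of N in the coding sequence. The function name `translate` is hypothetical.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence up to the first stop codon.

    Whitespace (including line breaks from wrapped lines) is removed,
    and any codon not in the table, such as one containing N, becomes X.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the coding sequence above.
print(translate("ATGATAATAAAATCGGAACCC"))
```

The printed residues can be compared with the start of the protein sequence; the same comparison over the full joined sequences would show whether the two fields are consistent.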

Similar Transcription Factors

Clusters of similar sequences computed with MMseqs2 at the identity thresholds below

100% Identity
iTF_01548887;
90% Identity
iTF_01557594;
80% Identity
-