Basic Information

Gene Symbol
-
Assembly
GCA_018152115.1
Location
JAECXI010000002.1:20313023-20327691[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 6.4 1.2e+04 -3.0 3.5 38 62 10 37 4 52 0.58
2 29 2e-15 3.8e-12 46.7 4.1 1 86 246 318 246 319 0.86
3 29 1e-14 2e-11 44.4 5.0 1 87 346 415 346 415 0.83
4 29 9.5e-16 1.8e-12 47.8 0.2 1 87 437 509 437 509 0.85
5 29 5.9e-16 1.1e-12 48.4 5.4 1 86 605 674 605 675 0.82
6 29 1.2e-14 2.3e-11 44.2 2.8 1 86 699 770 699 771 0.82
7 29 5.2e-13 9.7e-10 39.0 0.7 1 87 806 874 806 874 0.81
8 29 1.5e-10 2.8e-07 31.1 1.6 1 86 914 983 914 984 0.76
9 29 4.6e-17 8.7e-14 52.0 0.4 1 86 1011 1080 1011 1081 0.83
10 29 4.6e-13 8.7e-10 39.2 1.4 1 85 1102 1170 1102 1172 0.80
11 29 6.1e-14 1.1e-10 42.0 0.8 1 86 1199 1270 1199 1271 0.85
12 29 7e-13 1.3e-09 38.6 3.0 1 85 1347 1415 1347 1417 0.82
13 29 1.4e-12 2.7e-09 37.6 0.1 1 86 1440 1508 1440 1509 0.81
14 29 2.6e-13 5e-10 39.9 2.2 1 87 1638 1707 1638 1707 0.79
15 29 6.1e-12 1.1e-08 35.6 0.7 1 86 1809 1875 1809 1876 0.80
16 29 6.5e-05 0.12 13.0 0.0 1 58 1891 1937 1891 1951 0.81
17 29 3.5e-10 6.5e-07 29.9 0.8 1 87 1967 2037 1967 2037 0.80
18 29 5.6e-13 1.1e-09 38.9 0.4 1 87 2093 2163 2093 2163 0.82
19 29 1.1e-10 2e-07 31.6 0.1 1 86 2198 2269 2198 2270 0.80
20 29 1.9e-11 3.5e-08 34.0 0.0 1 86 2280 2351 2280 2352 0.79
21 29 2e-10 3.7e-07 30.7 0.0 1 86 2376 2446 2376 2447 0.78
22 29 2.8e-05 0.053 14.2 0.0 1 58 2480 2527 2480 2549 0.81
23 29 2.6e-12 4.9e-09 36.7 0.1 1 86 2565 2637 2565 2638 0.82
24 29 1.5e-16 2.8e-13 50.4 0.5 1 86 2748 2820 2748 2821 0.83
25 29 3.8e-13 7.1e-10 39.4 3.2 1 86 2881 2951 2881 2952 0.80
26 29 1.9e-14 3.5e-11 43.6 4.8 1 86 3052 3122 3052 3123 0.85
27 29 6.6e-12 1.2e-08 35.5 0.2 1 86 3204 3273 3204 3274 0.84
28 29 8.7e-11 1.6e-07 31.9 1.3 1 58 3297 3345 3297 3359 0.84
29 29 2e-10 3.8e-07 30.7 2.8 17 87 3362 3421 3353 3421 0.74

Sequence Information

Coding Sequence
ATGTCCTTCATGACCACCGAGGAGCTGCAGAGGCACAAGCACCAGCACCGGGCAGACAGGTTCTTCTGCCACAAGTACTGCGGCAAGCACTTTGACACGATCGCAGAGTGCGAGGCGCACGAGTACATGCAGCACGAATACGAGAGCTTTGTGTGCAACATGTGCTCAGGAACCTTCGCCAATCGAGAGCAGCTATATGCACATCTTCCTCAACACAAGTTCCAGCAGCGATTCGACTGTCCCATCTGCCGTTTATGGTACCAGACAGCCCTCGAGCTTCACGAGCATCGCCTGGCAGCACCATACTTTTGTGGCAAATACTACACGGGCGGACAGTCCTCCTCAGCTTCACAGCCGCAGACACAGCAGCACCAAAACCAGACGAATTACAAGCTACAAGATTGCCATATGGCAACCATGGAAATGCCCACCGCCCCGCATCACAAATCTAACCCATCCAGCTCATCCTTGCCGGCTACAGCTGCACTAAATTCACTGCTGCAGCAGCGGCAGGCGAATGCCGATGGAGCGGCCATGTTTGCCGCATCCGCGCTGAAGAATGAGGTGAATGTTAAGCTAGAGCGTAGCTATAGCAACTCGACCAGTGAGTCATCCTACAGCGTTCAGGATAGCAGCTACAACAATGCCTATGGCAGCGATAGCTCCATCCACGGCGGAGCCATCGCGGGTCCGCAGGCCCATTCTTCCACGCTGGACGACTCCGAGGACGCCCTGTGCTGTGTACCGCTGTGCGGAGTGCGCAAAAGCACAAGTCCCACGCTGCAGTTTTTCACGTTCCCCAAGGACGAGAAGTACCTCAACCAGTGGCTGCACAATCTCAAGATGTTCCACATACCCGCCGCAAGCTACACCAATTTCCGCATCTGCAGCATGCACTTCCCGAAGCGCTGCATCAACCGCTACTCGCTGTGCTATTGGGCGGTGCCCACCTTCAACCTGGGCCACGACGACGTAGCCAATCTCTACCAGAACCGGGAACTTACCAACACCTTTACTACCGGCGAAGTAGCCCGCTGCAGCATGCCACACTGCACCAGCCAACGGGGTGAGAGCAACCTGAAGTTCTACAATTTTCCCAAGGACATCAAGAGCCTGATCAAGTGGTGCCAGAACGCACGACTTCCTGTGCAGGCGAAGGAGCCGCGACACTTCTGCAGCCGCCACTTTGAGGAGCGCTGCATCGGCAAGTTCCGACTGAAGCCCTGGGCGGTGCCCACTTTGCATCTGGGTGCCCAATATGGCAAGATACACGACAATCCAAAGAATCTCTACGTGGAGGAAAAACGCTGCTGCCTGAACTTCTGCCGCCGAAGCCGATCCTCTGACTTCAATATGTCGCTATATCGATTCCCCAGAGACGAGGTGCTCCTGCGGCGCTGGTGCTATAATCTCCGCCTGGATCCGGGAGTGTATCGCGGAAAGAACCATAAAATATGCAGCGCCCACTTTATCAAGGAGGCGTTGGGTCTGCGGAAACTATCACCGGGTGCCGTTCCAACGCTCCACCTGGGTCACACTGACACCTTTAACATCTACGAGAACGAGCTGTGGCCCCCGCCGACAGCCTCTGGCAGTGGCCTCCAGCAGCACCAGGCGCAACATCACACCTCGCAACACTCATTGCAGCAGCAGCTGCACAGCAAATCCTACCAGCGTCACTCGGCGGCCTCTACGTCGTCCTCGGCCAGTTCTGCCACCTCGCACTACGTGGATCCGGAAATGAACGCCTCCTATCTGGCCATGGGTGCCGGCGGATCCTCGGCGATTAACGCCAGCGACTGCATGGACATCTGCTGCGTGCCTAGTTGCGAGAGCAAGCGGCACAACAACGAGAACATTACATTCCACACCATCCCACGCCGGCCGGAGCAAATGCGCAAGTGGTGCCACAACCTGAAGATACCCGAGGAAAAGATGCACAAGGGCATGCGGATCTGCAGTCTGCACTTCGAGCCCTACTGCATCGGCGGCTGCATGCGTCCGTTTGCGGTGCCCACCCTTCAGTTGGGTCACGATGACGAGGATATTCATCGGAATCCGGACGTGATTAAGAAGCTCAACATCCGGGAAACGTGCTGCGTCGCCGTTTGCAAACGAAATCGGGACAGGGACCATGCCAACCTGCATCGGTTCCCTAGCAACGTGTCGCTACTGACAAAGTGGTGTGGCAATCTGCAGCGGTCTGTTCCGGATGGCAGTAAGCTTTTCAACGACGCCATCTGCGAGGTGCACTTCGAGGATCGATGCCTGCGCAACAAAAGGCTAGAAAAGTGGGCGGTGCCCACCCTGATTCTGGGTCACGAGAACATCCCCTATCCGCTGCCAACGCCGGAACAGGTGGCCGAGTTCTACGCTCGACCCACGGCTCCCAATAATGGCGAGGAGCAGGGCGAGTGCTGTGTAGAGAGCTGCAAGAGGAATCCCAGTGTGGACGACATCAAGCTATACCGGCCGCCTGAGGAAGCCTCTGTGCTGGCCAAGTGGGCGCACAACCTGCAAACGGAGGCCGGTCAGCTGACGAGTATGAGGATCTGTAATCTTCACTTCGAGGCACACTGCATTGGCAAGCGAATGCGGCCGTGGGCCATACCCACACTAAATCTGGCAGGTAACATAGAGAATCTCTACGAGAATCCGGAGCACTCAATGCTGTACAAGCGACGAACTCATGCAAAGACTAAGCTTCCGGCCTCGGTGAAGCCCACTTGGGTGCCCAGGTGCTGTCTTCCACATTGCCGCAAAGTTCGAGCTCTTCACAACGTCCAACTGTATCGCTTCCCCAAGCTCAATCGCTCCACTCTGGCCAAGTGGTCGCATAATCTGCAGGTTCCCATGGTGGGTAGTGCCCAACGCCGGCTATGTTCGGCTCATTTTGAGCCGCATGTGCTGAGCAAGAAGTGTCCGGTGCCGCTGGCGGTGCCCACGCTAGATCTTAACTCTCCGCCCGGCCTGAAAATCTACCAGAATCCGGCCAAGCTGAAGGCAAGCAAGCTGTGCCTACAGCGGGTGTGCATAGTAGAAAGCTGTCGCAAGACGAGGGCGCAGGGCGTCCAGCTATTCCGACTTCCCCATAGTCCCACGCAGCTGCGCAAGTGGATGCACAACATCAAGACGCGTCCGAGGGCGGCGATGAGGGCTCAGTACCGGGTTTGTTCCCGCCATTTTGAGACGCACTCGTTCAATGGCCGGAGACTAAGTGCTGGCGCCATTCCCACCCTGGAACTGGGTCATGACGACGAGGACATCTATCCCAACGAAGCTCAGGCCTTTGCGGACGAGCACTGTGTTGTGGAGGGCTGCGAGGCATCCAAGGAACAGGCGGATGTGAGACTCTTCCGTTTTCCGACCGATGACGACGACATGCTGTGGAAGTGGTGCAACAACCTCAAGATGAATCCCGTGGATTGCACCGGAGTGCGCATATGTAACAAACACTTCGAAGCGGATTGTATCGGACCCAAGCACCTGTACAAGTGGGCTATTCCCACCCAAGAACTGGGCCACGACGACGCCCAGATCGAGCTGATACCGAATCCCAAGCCAGAGGACAGGTATGTGGACCCGGTGTTCAAGTGCATCGTACCCACTTGCGGAAAGACGCGTCGCTTCGACGAGGTGCAGATGAACAGCTTCCCCAAGGACCCGAATCACTTCCAACGCTGGCAGCACAACCTGCGGCTGGAGCACCTTAATTTCCTGGAACGCGAGCGCTATAAGATCTGCAATGCCCACTTTGAGGATATCTGTATTGGCAAGACACGGCTGAACATAGGTTCAATTCCCACTTTGGAGCTTGGTCACGACGAAACGGAGGACCTGTTCCAGGTAAATCCGGAGGAGTTGCAGAGTAACCTTTTCGGACGCCAGCGGCGGGTTCACGAGGAATCAGGCGGAATAAGCATCAAGCAGGAACTTTCGGAGCCGGAAGACATTAAGCCGGATGTGACGATCATGTCCGGTCTCAGAGATCCCATTACCAGACAGGCTAAGTTCAAGAAAACGATGTGCGAACTAAAGTGCTGTGTGCACAGCTGTGGACGCAGTCGCCTGGAGCACGGAGCTCGGCTCTTTCCGTTTCCCACCGGCAAGCAGCAGCATCTCAAGTGGCGCCACAATCTGCGCCTGGAGCCCGACGAGGTGGACCGTTCGACGAGGGTTTGCAGTGCCCACTTCAATCGGCGCTGCATCGACGGCAAGCAATTGAGGAGCTGGGCAATGCCCACCCAGCAATTGGGTCATCAGGAGCAGCCGATCTACGAGAACCCAAAGAACATTCCAGGATTCTTCACGCCCACCTGTGCCCTCGGTCACTGTCGCAAGCGGAGGAGCATCGACAACGACCTGCGCACGTATCGGTACCCAAGAAGTGAGGATCTGCTGGAGAAATGGAGGGCGAATCTGCGACTATCGCCGGATCAGTGCCGCGGTCGCATTTGTGCGGATCATTTTGAGGCCCAAGTGCGAGGCAAGCTGAAGCTCAAGACTGGGGCGGTGCCCACTTTAAAACTGGGCCATGATGAGGGCTTAATATTCGACAATGAGGCCATTAAGGTGGGCGTGCCAGAAGATGAGGAAGGCAGCTCGGAGTTGCCGCGACTGAAGCCCAAAAGGGAGCCAATCGATGAGGAGGAGGAAGACGGCGAGGTTGACGAGGAGCACCATGACCAGGATAACGAGGATGAAGATGAGAAAGACGACCACTATTTTGATCCCCTTGAACTGGTTGAGACCTTTGCCGAGCATGCATCCGATGAGGAAGCGGAATATCGTGGGGATGAGGAGGACGATCGAGACGAAGAGGACCTGGAGGAAGTCCTGCCGGACATGCCACCCACTCCTCCAATCGTGCCTCTGCGTCGCGAAAAGCCCGCCAACAATGTGACGCCCATTTGCTGCCTCAAGCACTGCAGAAAGGAACGCACTGCCTTTCACTTGCTCAGCACCTTTGGTTTCCCCAAGGATCGCCAGCTGCTGCTTAAGTGGTGCGCAAATCTCCACCTCAATCCGGATGACTGCATCGGCCGAGTTTGCATCGAGCACTTCCAGTCGGAGGTACTCGGAACCCGTAAGCTTAAGCAGAATGCAGTGCCCACTCTCAATGTAGGACATAATGAGCCACTCAAATACTCATACAACGGCGTGGACCAGGATCAGGAGCAATCGCCGGCACAGCAGTCGGTTTTTCGGCTTTGGAGCCTAAAACACTGCCGCAAGAGGAAACTAACAGAGCCGCCGGACATTCGCCCAAGCAAGTGGAATTCGATGGAGGTTCAGATGATGCGGAGATTGAAGATGGAAATGCAGATAGAGAGGGAGATCAAGATGGAGATGAAGATGGAGCGGGAGATAAAGACAGAGGCGAAGACTCCAACGAAGAATCTGAGGGAGAGTGAAGTGGAAAGATGTTGCATCAGCACCTGCGAAAACAAGGATGTTAGCCAGTTACTTCCAATGCCTGAGGACCCAACTCTTTTAAGAAAATGGCAGCATAACCTGAAGCTGTCAATGGACACGGTGCCAAGCAAGATTCGCGTATGCCTGGATCATTTTGAGCCGAAAGTTGTGCGAAATAGAAAGCCCACAGAGCAGGCAGTACCGACCTTAAAACTGGACCAAATAAGTTCGAACATCTACAGAAACAACGGGTCGTGTTTGGTGCCCGACTGCGCCAGTTCCCTTTCGGATCGCTTGAGCTTTGTGGATTTGCCGGAAAATATGGTCATAAGAGCGGAATGGATGTCCTACCTAAAACTGTCTCTCAGTGAGGGACTATTATGTGGTATCCACTTTATAGAACTGTATGAGAATGTGGTTTTGCCAAAGGTTTTGCCCGCACAAGATTTCGAAATGCTGCATTTGACTTTTGATGAATTTAAATGCGCTGTGCCGGACTGCTCCTCCAAAATTGACCATGATCTTCAGCTTGTCCCGCTGCCAGACAAGGAAGTGACTCTTTTTAAGTGGCTGCATAACACTAAGATTGCATACGATCAATCAAGGCATAAGAGCTATCGCATCTGTCTGCATCACTTTGAGCCAACGTGCCTAGAGTCGGATTTACCGAAGGTTTGGGCTTTACCCACTTTGCATCTGAACCATGAGGACAAGATTCATTTGAATCCTGGCCAGGAATCCCGCAGTGAAACCCCTAATAGCAACTCCAGACTGACTCCTCTAAGAATAAAGACAGATCTTGCCTCACTAGGCAGTCCCTGCGCGAGTGCAAGTCCCAGTCCTCGAGGCAGGATCAGGATATGCTGCATTCCTTCATGCGGACAGTTTGGGAATAGTCAAATACGTCTCTACCGCTTCCCCACCGAAGAGCAGGCCTTGCTACGTTGGCTGGTGAACACACAGCAACAGCCTCGCCTGGTGGATCCCCTCGAGCTTTACGTGTGCCAGTCCCACTTCGAACCCGATGCCATATGCAAGAAGCAGCTTCGCAGCTGGGCGGAGCCCACCTTGAACCTGGGCCACGACGGTCATGTCATCCCCAATGCCAGGCACAACGGGAACATCGCCGACAGCCAGGAGACTGAGCAAGCGATGACGTTCATCCGGGAACGCTATTGCTCCGTACTGACCTGCTTCCAGGCTGAAGGTAATGGAGTGAGGCTCTATGAGTATCCCAAGGATATGGCAACTATACGAAAGTGGGCAGCAGCATGTCGACATCGCTCCATGCAAGCCAGCAGCCATGGGTTCAAAGTGTGCCAGTCTCACTTTGCTCCGGAGTGCTTCGATCCTATCACTTTAGATCTGATTGACGGATCGGTTCCCACTCTGGAGTTGAGTAGAGATGACATTGAACGACACTGCCTAGTGTCGGGATGTGTGAAGGATGCGACTGGAGAACGTCTGCGCTACTACAAGGTGCCAAAGACTGCGGCTCAGCTGGATGCCTGGAGTAATAACCTGAAGATCAATTCTATGGACCTCCTTCAGGGAGATCAGCTCATCTGCGAGCGTCACTTTGAATCCTTTTGCTTTGGTGCCAGCAAGGGACTGCGGCCTGGAGCGCTTCCAACTCTCCAGCTAGGTCATGACGAAGAGGTGGAGATGTTACCCAATCCTGAAAGCCTCTGCCAGAGCAAGTCGCAGGTTTGCTGCGCCCCAGGATGCGGTAGTATTTGGCAGCTTGGAGAAACTCACTTCAGTGGATTTCCCAAAGTATGGGCGTTGGCCGACAAATGGATGTATAATCTCCGACTGAAAGCATCTAAGGATCAGATAGGCAAGCTGAAAGTCTGCGGTTCCCATTTCGAAGCCTCCCTATTCGATAAAAGCGGATTAATATTTGGTGCGATCCCCACTCTGGAGCTGGGTCATTCCTCTCCGGACATTTTTCAAACGGATAAGCAAAGCCTTGGCAAGAGCTTAAAGTTGTCCAAGCGTTCCCAGATAGCGGAGACCGATTGCTGTTATCCTGAGTGCAAGGAACTGTCTAAAAGCTTGTCATTCAATCTGCCCCAGGATGAGCACCTTAGAATAGCATGGTTGCGTCATTTGGGCATGGAAGAGCCGTCGAATGGTGTCGCACTGCTTTGCCCGCTGCACTACGTCATTCTATATGAGCTTAGTGTCAAAAGTTTTCCAGAGCATGTTCCAAATCGAATACTTGAGGATAATTACCATTCTGCTAGGAGCAACAGGCGTGTAAAGATCGTTAGCTGTGCGGTCAGAGGTTGTGATATGATCCGGCCCCGGGATAAGGTTCTCCTGCACGGATTGCCGCAAAGAAAAGACATCCTGATGATGTGGGTAGAAAATGGCCAGTTGGAGATCACCGAGCAGCAGCAACAGTACATGCTCAAGGTATGCCGCAATCACTTTGAGTCACGATGCTCGTTTGACGACAGGCGTTTGCATCCTTGGAGCGTTCCCACTCTACTTTTACCAGCTGATCCAGTGTATCAGATTCCCACCAAAGAGGAGTGGCTGGAGATGACGGCCAAAATTAGACAGGATCCAGGGGAAATCAATGAGGAAAGGGAGGAAGAGCTGCCGGTGCAGGAAGAGGATGAGGCGGATAACTCTCTGTTGGAGCCAATTGTGAGAATGGAGCACATCGAGTCGGAGGAGGAAGACTCGGAGATGCAGGCCCTGGAGGTGCTCTTGGAGGTCGGTCACGTTGAGCGGATGGACAGCTACGAACACGTGGACACCGAACATGCAACCTACCAATCTGGTGGCATTCGGAACCAGTATAATGCGAACCACTGCGCCGTTGAGGGATGTGAGGTGACCGCCGAGGACGTGGACGGAACCATCAAGCTGCACAAGTTTCCCGCCTCTTCGGAGGCCGCCCAAAAGTGGATGCACAACACCCAAGTTCACATGGACGAGAAGTTCTGGTGGCGCTATCGCATTTGCAGTTACCATTTCGACCAGGAGTGCTTCCAGAGCGCCAGGATTAAGAAGGGAGCGATGCCCACACTGCTTTTGGGACCTAATAGACCGGACAAGCTGTACGACAATGAATTCGCATTGCAGGAGGCAGAAGAGCCGCCTCCCTTGCCAGCAGAAGTTCTGTTAGAGGAGCAAGAGGAGCCGCTAATTGAAAAGGTGACCAATCTGTGCCTGCCGCCGCCAGCTCCGCCTCGAAAGTCTAGCAAGTTTTGCCAGGTAGACGGATGTACGAATCACCTGACCACCGAGAACATGACGCTACACAAGTTTCCCCACTCAGAGGACATGTGCCTCAAGTGGCAGCACAACACGCAGGTGCCATTCAATCCCTACTACCGGTGGCGGTATCGCATCTGCAGTGCCCACTTCCATCCCGTGTGCCTGGTGAACATGCGCCTAGTCCACGGCAGCGTGCCCACCTTGAAGCTGGGAACTAAGGCTCCCACCGAGCTCTTCGACAACGACTTCGAGGCCATCAACCTAAGGCTGGACAAGCGGTCGGGAGCGGAGCCAAGTTCCGTGCAGATCAAGGAGGAACGGGATGAGGATGAAGACTCAATGCTGTTCCTAGAGCCCGAACTGCAGCTGCACGAAGACAAAGAGGAGGAGGAAATGGCAGAGACGCAAATACCCTTCAATCACTCCAACTGGAAGAGCCAGCTCCGCCTGCCCGTAAAACAGGTGAAGGTGTACTACAACCAGGTGAAGTCAGGCTACGATAAGTGCTCGTTGAGTCATTGCCAGCGACAGAGATCTCAGCACGGCGTCCACATCTACAAGTTTCCGAAATCAAAGCAACAACAGGACCGTTGGATGCACAACCTCCGCATTCGTTACGATGAGCGGCGACCCTGGAAGTTCATGATCTGCAGTGTGCACTTTGAGCCACACTGCATCAGTCTGAGAAAGCTGCGTCCCTGGGCGGTGCCCACGCTGGAACTGGGCGACAATGTGCCGGAGAAGATCTTCACAAACGAGCAATGCCAGGAACTTTCCACTGATCGCAGCGAGGCGGACAGTGAAGGCGACGAGGAGGAGGATGGCCTGCAGGAGGACGAGGAAGAGGAGGAGGAAGAGTATGACGAGGACATGGAGCCGGAGGTTCGCGTCAAGCGGGAGCGACGTTCCAAGCTGGATCCTTGGCCTCCCGGGCAGGTTCCGCCCTGGAAGGTCAAGCAGTGCTGCCTTCCATATTGCCGTGCTTTCCGGGGTGATGGCATCAAGCTTTTCCGACTGCCTAACAATCGCAACTCCATCCGCAACTGGGAAGTGGCAACAGGAATGGTTTTCAAGGAGTCACAGCGAAACACTCGTCTGATCTGTAGCCGCCATTTCGAGCCCGAGCTGATTGGAGTAAGGCGTCTCATGCGAAACGCCATCCCCACAAGGCATTTGAATTCCCAAGGAGCACCTGACGATGTGAAGAGGAAGCCGGATCCTCCTGTTCCCATTGCAACTTGCTGCATGGCTGACTGCCATCACAATGGAAACGTCAAGCTGCACAAGTTTCCCAGTGATCCCGCGCTGCTTAGGCAGTGGTGCCAAGCACTCCGACTAACGGATACGCAACGATACCGGGGCAAACATATCTGCTCGGTCCACCTGCCCACCGACAAGACGATTAGCTGCGTTGTCTGCGGGGTGGACAACGTTCAGCTGCCGATGCTGGACTTTCCGCAGCAGCGCAATCAACGCGCAAAGTGGTGCTACAATCTCAAGATCGAGGCCATACCAAAGTGGGACCACTCCAAGCACATCTGCTGTCGTCACTTTGAGTCCCATTGCTTTGTCCAGCCGGATGAACTACGACCAGGAGCGATTCCCACGCTGCATCTGAATCACGATGATACAAACATATTCCTTCTCGACTATGCCGCTGGTCCGACGGGCAATCGGATCAAAGACGAGCCCTTGTTCAACGACGACATGCTGCTGGTTTAG
Protein Sequence
MSFMTTEELQRHKHQHRADRFFCHKYCGKHFDTIAECEAHEYMQHEYESFVCNMCSGTFANREQLYAHLPQHKFQQRFDCPICRLWYQTALELHEHRLAAPYFCGKYYTGGQSSSASQPQTQQHQNQTNYKLQDCHMATMEMPTAPHHKSNPSSSSLPATAALNSLLQQRQANADGAAMFAASALKNEVNVKLERSYSNSTSESSYSVQDSSYNNAYGSDSSIHGGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPAASYTNFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHTDTFNIYENELWPPPTASGSGLQQHQAQHHTSQHSLQQQLHSKSYQRHSAASTSSSASSATSHYVDPEMNASYLAMGAGGSSAINASDCMDICCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEEKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVSLLTKWCGNLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHENIPYPLPTPEQVAEFYARPTAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEEASVLAKWAHNLQTEAGQLTSMRICNLHFEAHCIGKRMRPWAIPTLNLAGNIENLYENPEHSMLYKRRTHAKTKLPASVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWSHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIKTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDEDIYPNEAQAFADEHCVVEGCEASKEQADVRLFRFPTDDDDMLWKWCNNLKMNPVDCTGVRICNKHFEADCIGPKHLYKWAIPTQELGHDDAQIELIPNPKPEDRYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPNHFQRWQHNLRLEHLNFLERERYKICNAHFEDICIGKTRLNIGSIPTLELGHDETEDLFQVNPEELQSNLFGRQRRVHEESGGISIKQELSEPEDIKPDVTIMSGLRDPITRQAKFKKTMCELKCCVHSCGRSRLEHGARLFPFPTGKQQHLKWRHNLRLEPDEVDRSTRVCSAHFNRRCIDGKQLRSWAMPTQQLGHQEQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLSPDQCRGRICADHFEAQVRGKLKLKTGAVPTLKLGHDEGLIFDNEAIKVGVPEDEEGSSELPRLKPKREPIDEEEEDGEVDEEHHDQDNEDEDEKDDHYFDPLELVETFAEHASDEEAEYRGDEEDDRDEEDLEEVLPDMPPTPPIVPLRREKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCANLHLNPDDCIGRVCIEHFQSEVLGTRKLKQNAVPTLNVGHNEPLKYSYNGVDQDQEQSPAQQSVFRLWSLKHCRKRKLTEPPDIRPSKWNSMEVQMMRRLKMEMQIEREIKMEMKMEREIKTEAKTPTKNLRESEVERCCISTCENKDVSQLLPMPEDPTLLRKWQHNLKLSMDTVPSKIRVCLDHFEPKVVRNRKPTEQAVPTLKLDQISSNIYRNNGSCLVPDCASSLSDRLSFVDLPENMVIRAEWMSYLKLSLSEGLLCGIHFIELYENVVLPKVLPAQDFEMLHLTFDEFKCAVPDCSSKIDHDLQLVPLPDKEVTLFKWLHNTKIAYDQSRHKSYRICLHHFEPTCLESDLPKVWALPTLHLNHEDKIHLNPGQESRSETPNSNSRLTPLRIKTDLASLGSPCASASPSPRGRIRICCIPSCGQFGNSQIRLYRFPTEEQALLRWLVNTQQQPRLVDPLELYVCQSHFEPDAICKKQLRSWAEPTLNLGHDGHVIPNARHNGNIADSQETEQAMTFIRERYCSVLTCFQAEGNGVRLYEYPKDMATIRKWAAACRHRSMQASSHGFKVCQSHFAPECFDPITLDLIDGSVPTLELSRDDIERHCLVSGCVKDATGERLRYYKVPKTAAQLDAWSNNLKINSMDLLQGDQLICERHFESFCFGASKGLRPGALPTLQLGHDEEVEMLPNPESLCQSKSQVCCAPGCGSIWQLGETHFSGFPKVWALADKWMYNLRLKASKDQIGKLKVCGSHFEASLFDKSGLIFGAIPTLELGHSSPDIFQTDKQSLGKSLKLSKRSQIAETDCCYPECKELSKSLSFNLPQDEHLRIAWLRHLGMEEPSNGVALLCPLHYVILYELSVKSFPEHVPNRILEDNYHSARSNRRVKIVSCAVRGCDMIRPRDKVLLHGLPQRKDILMMWVENGQLEITEQQQQYMLKVCRNHFESRCSFDDRRLHPWSVPTLLLPADPVYQIPTKEEWLEMTAKIRQDPGEINEEREEELPVQEEDEADNSLLEPIVRMEHIESEEEDSEMQALEVLLEVGHVERMDSYEHVDTEHATYQSGGIRNQYNANHCAVEGCEVTAEDVDGTIKLHKFPASSEAAQKWMHNTQVHMDEKFWWRYRICSYHFDQECFQSARIKKGAMPTLLLGPNRPDKLYDNEFALQEAEEPPPLPAEVLLEEQEEPLIEKVTNLCLPPPAPPRKSSKFCQVDGCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFNPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGTKAPTELFDNDFEAINLRLDKRSGAEPSSVQIKEERDEDEDSMLFLEPELQLHEDKEEEEMAETQIPFNHSNWKSQLRLPVKQVKVYYNQVKSGYDKCSLSHCQRQRSQHGVHIYKFPKSKQQQDRWMHNLRIRYDERRPWKFMICSVHFEPHCISLRKLRPWAVPTLELGDNVPEKIFTNEQCQELSTDRSEADSEGDEEEDGLQEDEEEEEEEYDEDMEPEVRVKRERRSKLDPWPPGQVPPWKVKQCCLPYCRAFRGDGIKLFRLPNNRNSIRNWEVATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTRHLNSQGAPDDVKRKPDPPVPIATCCMADCHHNGNVKLHKFPSDPALLRQWCQALRLTDTQRYRGKHICSVHLPTDKTISCVVCGVDNVQLPMLDFPQQRNQRAKWCYNLKIEAIPKWDHSKHICCRHFESHCFVQPDELRPGAIPTLHLNHDDTNIFLLDYAAGPTGNRIKDEPLFNDDMLLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00605601;
90% Identity
iTF_00529594;
80% Identity
-