Basic Information

Gene Symbol
-
Assembly
GCA_008042485.1
Location
VNJN01001702.1:55255-68540[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 4.2 9.7e+03 -2.7 1.9 49 60 197 213 181 231 0.56
2 29 2.3e-15 5.2e-12 46.2 4.0 1 86 426 498 426 499 0.85
3 29 8.6e-15 2e-11 44.3 5.0 1 87 526 595 526 595 0.83
4 29 7.9e-16 1.8e-12 47.7 0.2 1 87 617 689 617 689 0.85
5 29 5.4e-16 1.2e-12 48.2 5.3 1 87 787 857 787 857 0.82
6 29 1.9e-15 4.3e-12 46.5 3.6 1 86 881 952 881 953 0.82
7 29 9.9e-13 2.3e-09 37.7 1.5 1 87 988 1056 988 1056 0.81
8 29 7e-11 1.6e-07 31.8 1.4 1 86 1098 1167 1098 1168 0.76
9 29 4.3e-17 9.8e-14 51.7 0.4 1 86 1195 1264 1195 1265 0.82
10 29 1.2e-12 2.6e-09 37.5 1.6 1 85 1286 1354 1286 1356 0.79
11 29 1.2e-14 2.7e-11 43.9 0.6 1 86 1383 1454 1383 1455 0.85
12 29 1.5e-12 3.5e-09 37.1 3.7 1 85 1529 1597 1529 1599 0.82
13 29 1.5e-12 3.4e-09 37.2 0.1 1 86 1622 1690 1622 1691 0.83
14 29 1.8e-13 4.1e-10 40.1 2.2 1 87 1838 1907 1838 1907 0.80
15 29 1.7e-10 3.8e-07 30.6 0.2 1 87 2002 2076 2002 2076 0.79
16 29 0.014 32 5.2 1.3 1 61 2091 2143 2091 2162 0.69
17 29 1.1e-13 2.5e-10 40.8 0.1 1 87 2170 2241 2170 2241 0.79
18 29 7.2e-13 1.6e-09 38.2 0.2 1 87 2293 2363 2293 2363 0.81
19 29 3.5e-12 8e-09 36.0 0.1 1 86 2398 2472 2398 2473 0.80
20 29 9.2e-13 2.1e-09 37.8 0.0 1 86 2483 2556 2483 2557 0.80
21 29 5.4e-11 1.2e-07 32.2 0.0 1 61 2582 2637 2582 2654 0.76
22 29 1e-05 0.024 15.2 0.6 1 58 2683 2733 2683 2753 0.84
23 29 3.5e-11 7.9e-08 32.8 0.8 1 87 2773 2845 2773 2845 0.81
24 29 3.5e-16 7.9e-13 48.8 0.4 1 86 2957 3029 2957 3030 0.81
25 29 2.8e-12 6.3e-09 36.3 3.5 1 86 3090 3160 3090 3161 0.80
26 29 7.9e-14 1.8e-10 41.3 4.6 1 86 3253 3323 3253 3324 0.84
27 29 3.6e-12 8.3e-09 35.9 0.2 1 86 3405 3474 3405 3475 0.85
28 29 3.1e-10 7.1e-07 29.7 0.7 1 58 3501 3549 3501 3560 0.82
29 29 3.4e-10 7.8e-07 29.6 1.1 18 87 3567 3625 3556 3625 0.77

Sequence Information

Coding Sequence
ATGTCACAACACAACCACAATCACGCCCACCCACACTACCAATACCCCataagcaacaacaacagtatGGGCGCCTATGGGGGAGGAGTGGGAGGGGGTGGAGGCTCGCATGGATATTTCGGCGCCGCTGGCGGTGGCCTCAATAGCGAACCCTTGGAGGGGTTCCAGCAGCCGCCCAACCCAATGGCCCCACCCCCGGCCCCagaaatgataataaaatCGGAACCCATTGACGACCTGGCCTACAAGTCAAACTACATAGACGACAATACGCCATTTGCGGACTTCAGCAAGTTTAGCGATTTCAGCGAGGACATGCTGAGTCCCAAAGTGGAGCTGACAGTCAAGGATGAGTCCTTCGTTAGGAACCCCAATAGCTTTTTACGCCGTAAACAACAATCGGATCTGACGACAGCAGAGAGCCTGCCCGTCTGCCAACGATGCAAAGAGGTGTTCTTCAAGAAGCAGACATACCTGCGTCACGTTGCCGAGAGTAGCTGCGGCATCCAGGAGTATGACTTTAAGTGCACCATATGCCCCATGTCCTTCATGAACGCCGAGGAACTACACGAGCATAAGCAACAGCATCGAGCGGACAGATTCTTCTGCCACAAGTACTGCGGAAAACACTTTGGCACGATCACAGAGTGCGAGGCGCATGAGTACATGCAACATGAATACGAAAACATTGTGTGCAACATGTGCTCGGGATCCTTTGCCACGCGGGAACAACTGTATGCTCATTTGCCGCAGCACAAGTTCCAGCAGCGCTTTGACTGCCCCGTATGCCGCCTATGGTACCAAACGGCTCTGGAGCTGCACGAGCATCGGCTGGCTGCACCCTACTTCTGCGGTAAATACTACACGGGCGGACAGTCCCCGTCCCCGTCCTCCcaacagcatcagcaccaGAGTCAGACGAACTACAAGCTGCAGGACTGTCATATGGCCACCATGGAAATGCCAAACGCACCGCTCCTTAAGGCAAACTCGTCCAACTCGCCTGCCttgccagcaacagcagcgctTAACTCACTGTTGCAACAGCGCCAGGCCAATGCCGATGGAGCGGCTATTTTTGCCGCATCTTCGCTGAAGAACGAGGTCGCTGTGAAACTGGAGCGCAGCTACAGTAACTCGACCAACGAATCGTCTTATAGCGTCCAGGAGAGCGGCTACAATAATGTGTATGGCAGCAGTGACAGCTCAGGCCACGGTGCCATCGCCGGACCACAGGCACACTCTTCGACGCTTGACGATTCCGAGGATGCGCTGTGCTGTGTGCCGCTGTGCGGCGTGCGGAAGAGTACGAGTCCCACCTTGCAGTTCTTCACGTTCCCAAAGGATGAAAAATATCTCAACCAGTGGCTGCATAATCTCAAGATGTTCCACATACCTGCTTCCAGTTACGTAAGCTTCCGGATCTGCAGTATGCACTTCCCCAAGCGATGCATCAACCGCTATTCGCTGTGCTACTGGGCGGTGCCGACGTTTAACCTCGGCCACGATGACGTAGCCAATCTCTACCAGAACCGGGAGTTGACCAACACGTTTACCACTGGCGAAGTGGCgcgctgcagcatgcctcactGTACCAGTCAGCGGGGTGAGAGCAACCTCAAGTTTTACAACTTCCCAAAAGATATTAAAAGCCTGATTAAGTGGTGCCAAAACGCCCGTCTTCCGGTGCAAGCAAAGGAGCCGAGACATTTCTGCAGCCGCCACTTTGAGGAGCGGTGCATTGGCAAATTCCGACTGAAACCTTGGGCAGTGCCCACCTTGCACCTGGGCGCCCAGTATGGGAAGATCCACGACAATCCAAAGAATCTATACGTGGAAGAGAAACGCTGTTGCCTCAACTTTTGCCGCCGGAGCCGCTCTTCCGATTTCAATATGTCGCTATATCGATTTCCTAGAGACGAAGTCCTGCTACGTCGCTGGTGCTATAATCTTCGCCTCGATCCGGGAGTGTACCGCGGCAAGAATCACAAAATATGCAGCGCTCACTTTATAAAAGAGGCGTTGGGTCTTCGGAAACTATCTCCTGGTGCCGTGCCCACACTTCATCTGGGCCACAATGATACCTTCAATATCTACGAGAACGAACTGTGGCCGCCGCCAGCACCGACACCCTCCTCTTGTCATctccaacagcaacagcagtcaTCCCTTCATTCGCTTCAACAGCAAATGCACAGCAAGTCCTACCAGCGCCGTTCAGCAGCATCCACATCGTCATCGGCAAGCTCGGCAGCTTCGCATTACGTGGACCCAGAGATGAGTGCCTCTTAccatctagccatgtccgccTCCGCCGGCGGCTCTGCAATGATAAACGCCAGCGACAGCATGGATGTATGTTGCGTGCCCAGTTGCGAGAGCAAGCGACACAATAGCGAGAACATTACATTCCACACGATTCCGCGACGGCCCGAGCAGATGCGCAAATGGTGTCACAATCTTAAGATTGCCGAGGAcaagatgcacaagggcatgCGAATCTGTAGCCTTCACTTCGAGCCCTACTGCATCGGCGGCTGTATGCGACCGTTTGCTGTGCCCACTCTTCACTTGGGCCACGACGATGAGGATATCCACCGCAATCCGGACGTGATCAAGAAGCTGAACATCCGGGAGACATGCTGTGTGGCTGTGTGCAAGCGGAATAGGGACAGGGATCATGCGAACCTGCATCGTTTCCCTAGCAACGTGGCGTTACTGAAGAAATGGTGCGCCAATTTGCAGCGCAGCGTGCCCGATGGCAGTAAACTTTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGTTGTTTGCGCAACAAAAGGCTCGAGAAGTGGGCAGTGCCTACTCTGATCCTGGGACACGATGACATTGCCTATCCGCTGCCTACGCCAGAGCAAGTAACCGAGTTCTATGCCCGGCCGACGGCTCCCAACAATGGTGAGGAACAGGGCGAGTGCTGTGTGGAGACCTGCAAGAGGAATCCGAGCGTGGACGATATAAAGCTATACCGCCCACCGGAGGAAGCCGCCGTGCTGGCCAAGTGGGCTCACAACCTGCAAACGGAGGCCAACCAACTGACAAGCATGAGGATCTGCAATCTACACTTTGAGGCACATTGCATCGGCAAGAGGATGCGCCATTGGGCCATACCGACTTTAAATCTAGCCGGCAACATTGAGAATCTTTATGAGAATCCAGAGCAATCGCTGCTGTACAGGCGTCGCACTACTCACTTGAAGGCGAAGCTGCCGCAATCCTCCGTGAAACCCACCTGGGTGCCCAGGTGCTGTCTTCCGCACTGTCGCAAGGTCAGAGCTCTGCACAATGTCCAGCTTTATCGGTTCCCCAAACTCAATCGCTCCACATTGGCCAAGTGGGCGCATAATCTCCAGGTTCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCGGCCCATTTCGAGCCACATGTGCTGAGTAAAAAGTGTCCGGTGCCGCTGGCGGTGCCTACGCTTGACCTAAATTCACCACCCGGCTTGAAAATCTACCAGAATCCGGCCAAGCTAAAGGCTAACAAACTGTGCCTGCAGCGGGTTTGCATCGTCGAAAGCTGCCGCAAGACGCGGGCGCAGGGCGTTCAGCTTTTCCGGCTGCCGCACAGCCCCACACAGCTGCGAAAATGGATGCACAACATTAGGACGCGGCCACGAGCAGCTATGCGGGCTCAGTACCGGGTGTGTTCCCGTCACTTTGAGACGCACTCCTTCAATGGCCGAAGACTGAGTGCAGGTGCAATTCCGACTTTAGAACTGGGCCACGATGACGACGATATCTATCCCAATGAAGCGCAGGCATTTGTGGACGAGCATTGTGCTGTCGAAGGCTGCGAGGCATCCAAGGAGCAGCCGGAGGTGCGACTGTTCCGCTTCCCCACCGACGACGACGATATGTTGTGGAAGTGGTGCAACAACCTGAAAATGAATCCTGTGGACTGCATTGGGGTACGCATCTGCAACAAGCACTTCGAGACCGATTGCATCGGTCCCAAGCATCTGTATAAGTGGGCTATTCCCACGCAGGAGCTGGGCCACGACGACGCCCAGATCGAGCTAATTCCGAATCCCAAGCCAGAGGATAGGTATGTGGATCCAGTCTTCAAGTGCATCGTTCCCACCTGCGGCAAGACACGACGGTTTGACGAGGTGCAAATGAACAGCTTCCCCAAGGACCCGGATCTATTCCAGCGTTGGCGGCACAACCTCCGCCTCGATCATCTCAGTTTCCAGGAGCGTGAGCGCTACAAGATCTGCAACTCACACTTTGAGGAGATTTGCATTGGAAAGACACGGCTAAACATTGGATCCGTTCCAACCTTGGAACTTGGTCATGACGATGAGGAGGATATTTTCCAAGTAAATCCAGCGGAGCTGCAGAGCAATTTATTCGGACGGCAGCGTCGACTGCTGCTCGAGGGATCCGGCGAACAGAGTATCAAGCAAGAGCTGTCCGAGACGGAGGACAGCAACAAAGCGGATGTGACGGCCACAGGCTCTATTTCCAAACAGaTCAAGATCAAGAGATCTTCTTCGGATCTAAAGTGTTGCGTGCACAGTTGTGGAAGAAGTCGCTTGGAACACGGGGCACGCCTGTTTCCCTTTCCTACAGGCAAGCAGCAACACCTAAAGTGGCGTCACAATCTGCACCTGGAACCGGAGGAGGTGGACCGTTCGACGCGCGTTTGCAGTGCTCACTTTAATCGACGTTGCATCGAGGGCAAACAACTGAGGAGCTGGGCGATGCCCACCCAACAGTTGGGACACAACGACCAGCCAATCTACGAGAACCCAAAGAACATACCTGGATTCTTCACACCTACCTGTGCCCTGGGACACTGTCGGAAACGGAGGAGTATTGACAACGATCTGCGTACCTATCGATATCCCAGGAGCGAAGATCTTCTAGAAAAATGGCGAGCTAATCTGCGGCTGGCTCCCGATCAGTGTCGTGGTCGGATCTGTGCCAATCACTTTGAGCCACAGGTGCGCGGCAAGCTAAAGTTGAAGACGGGAGCCGTTCCCACACTACAACTGGGACACGATGAGGATTTAATCTATGACAATGAAGCTATAAAGGCGGGCATGACCGAAGAAGAGGAGGCCATTTCCACAGACTTCCCGcgattaaaaccaaaaaaagagttgttcgaagaggaggaggaggagtgcgAAGGGAATGATGGCGAGCAGCAGCATACAGATGACCTGGATGATAATGCAGATGAAGAAGACAAAGATGATCAGTACTTTGACCCACTTGAGCTAGTTGAGACTTTTGCTGAACATCGTAGTGATGACGAAGCCCAAGACTATGAGGATGAAGAAGACGAAGAACGAGTTGAAGACTCCCCCTCCGGTTATGATGTCAAGGAGGAGATTGAACCGCCTCCAAGCTCTTCACCTTCTCCGCTTCGCCGACGGCATCATGTTCCGCGTCGAGACAAGCCGGCCAACAATGTGACTCCCATTTGCTGCCTGAAGCACTGCAGGAAGGAACGCACTGCCTTCCACCTGCTGAGCACTTTCGGCTTCCCAAAGGATCGCCAGTTGCTGCTGAAATGGTGTGCCAATCTGCATTTAAACCCGGACGACTGCATCGGTAGGGTTTGCATTGAGCACTTTCAGCCGGAGGTACTCGGCACCCGCAAGCTCAAGCAGAATGCAGTGCCCACTCTTAATGTGGGACATGATGAACCGCTCAGGTACTCGTGCCATGGAGTGGACCAGAATCTCGAGGAGCGGGAGCCCCAGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGAAAAAGGAAGCTAACGGAGCCGCCGGATATTCCCCTAGCCAAGAGGAGAGTGCTGGAGATGCCGATGATGAAGCGGGAGTGGGAGATGGAGATGccgatgcagatgcagatggagCAGAAGAAGGAGGCAAAAAAGATGACTCAAACTGAAAGTAAATCACTTATATGCTGTATTAGCAGTTGCGGAAGGCAGGAACTTAACCAATTGGTGGCATTTCCTAAAGAGAAGTCCTTGTTAAGAAAGTGGATGCATAATTTAAGGCTGCCCACTGAGATTGAGTCCACTTGCCTAAGCCTGAAAAGAGCTTGTTTGGCGCATTTCGAAACGCAGCTCTTGGAGAATGGAAAGCTCATAAAGGAAGCAGAGGCAGTGGCTGTGCCGACTTTAAACCTGGGCCATAGCAGCTGGAACCTATACAGGAGCAATGGGATCTGCCTAGTGCCAAACTGCGCCTTTAATAACTTCAGAAACATTAGCTTTATTGACCTGCCGGATAACAGTATTATTAGGGACGCTTGCTTCTCCTGCCTGAACCTACCTGAATCCTGTGAGGAGCTGGCAAAGCTATGTTGTATCCACTTTATGGAGGCTTACAAAAAGTTTGATCTTCCTAATGTTCTGCACCCTGAAGTCATGACGATGCTACAAAGTGTTGTGGCCGAGCTGCAATGCGCGGTGCCAGGCTGCAATTTCGAAGATGCTGATCCGGACTTTCAACTAATACAGTTTCCCGATAACAAGGAGGCGCTGTCACAGTGGCTGCACAACACCAAGGTCCCGTATGATCCTTCTAGCCACCACAGTTATCGCATCTGCACGCGTCACTTTAAATCAGAGTATTTAGAGACGAATGGCCCGCTAGAAGGGGCTATACCGACGCTCCATCTAAACCATGAAGATGAGATTCACTTGAATACTAGCTCTTTGCCAGGGGATTCGAATTCTATATTAACTCCACTGCGTATAAAGACGGATCCGGCCTTTTTGGGCAGTCCCTGTGCAAGTGCAAGCCCCAGTCCCCGGGGCAAGATCCGGATTTGCTGTATTCCCTCATGCGGCCAGTTTGGCAGCAGTCAAGTGAGACTGTTTCGTTTTCCCACCGAGGAGCAGGCGTTGCTCCGGTGGCTGGTGAACACACAGCAGCAACCGCGGCTGGTTGATCCCATGGACTTGTATGTGTGCCAGTCGCATTTTGAGCCCGAGGCCATTTATATGAAGCATCTTCGAAACTGGGCTGAGCCCACCTTAAACTTGGGCCACGACGGCCATATAATCCCGAATGCCAAACACAATGGAAACATTTCCGACAGCCAAGATACTGAGCAAGCCATGAGGTTTATTCGCGAGCGATTCTGCTCTGTCCTTTCTTGCTTTCAGGCAGGCGGTCAGGAAGAGGAGGGAGTGAGGCTATTTGATTATCCCGAGGATATGGCGACTACTCGAAAGTGGGCAGCCGCATGCAGACATCGCTCCATGCAGGCCAGGAGCCATGGGTTCAAGGTGTGCCAGTTGCATTTCGCCAAGGAGTGCTTTGATCCAAATTCTGGAGCATTGATTGAGGGCGCTGTGCCCACTTTGGAGTTGAGCAGAGATGAAATGGAGAGGCAATGTCTGGTGGCTGGATGTGTAAAAAATGATGCCACTGGAACCCGCCTTCGCTACTTTAAGATACCAAAAGTGGCTGCTCAATTGGAAGCGTGGAGCAACAACCTTAAAGTCCATCCAACGGATCTCATGCAAGGAGAGCAGCAATACATCTGCGAGAAACATTTTGAGTCGTTCTGCTTTGGAGCCAACAAGGGACTGCGTTCTGGTGCTCTTCCAACCCTCTTGCTAGGCCATGATGAGGAGGTGGATATGCTTCCAAATCCGGAAAGCTTTATCTGCCAGAATAAGGCCGATAAATGCTGCGTACCTGGTTGCGGGCGTGTCTGGCAGGCTGGCGATCGTAAATTTCGTGGATTTCCCAAATTGCTGGCCATGGCCAATAAATGGAGGCATAATCTTCGCTTGGACGAGCCTGTGGAGCAACTCGGCAAGCTGAAGGTCTGCAGTGCACACTTTGAGGCCACCTCACCCAACCTGGGTACAAATGGTCTAAGTGTCTCGATACCAACTTTGGAATTGGGCCACTCTTCTCCGGATATTTTCCCAGCGGAAATTAGCTTAAAGTTCCAAAAGCGCTCTGGAATGCCGGCGAAAATTTATTGCTGTTATCCCAAATGCGAGGAAGTCTGTTTATCCAAGAATTTTTCTTACAGCCTTCCCCAGGAGGAGCATCTGAGAAATGCCTGGCTAAGCCATATGGACATAGAAGATCCAAAAGATGAAGAAATCGCACGGGTTTGCCCGCTGCACTATGTCATTCTCTACCAGCACAGTGCCGCACTCTATCCGGAGCTTCATGCTTCAAGACGACAGCTTCTTGACTTCAACTACAAGGAGGCGTGGAACAACAGGCGCGTAAAGATTGTGAGTTGCACGATTAAGGGCTGCGACATGGTTAGGCCTCGAGATGGCGTACCACTGCACGGGATGCCGCAAAGCGATGAAATCCTGCAGATGTGGATAGACAATGGCCAGTTTGAGTTTTTGGAGCAACAGCGGTATATGTTCAAGGTGTGTCACAATCATTTTGAGCCATGCTGTTTCTTCGACGACAGACGTTTGCATTCATGGAGCGTGCCCACTTTGCATCTACCTGGAGATGTAATTCACCAAAATCCCACTCCCGAGCAGTGGCAGAACATGATCAATaagcaagcagcagcaaaaacacaCGCTGAAGAGAACGAGGAGCCAGATCCATATGAGGATGTGGTTAAAACCGAACCCGTTGTAAAGATGGAGCATATCGAATCGGAATATGAAGATGAAAACCCTGAGATGCAGGCCCTAGAGGTCCTCCTAGAAGTTGGTCATGTCGAGCGAATGGAGAGCTATGAGGAAATGGATAAATCACCAGCGATATACGCCGATAGTGCGCCCTTTCGATCCTCACCCATACGCTGCCAATATAATGCTAATCATTGTGCCGTTGAAGGATGCCAAGTGACTGTCGAAGATGTGGACGGCACGATTAAGCTGCATAAATTCCCGGCGTCGCAGGAAGCCGCACAGAAGTGGATGCACAACACCCAAGTTGATATGGATGAAAAGTTCTGGTGGCGCTATCGCATATGCAGTTACCACTTCGATCAAGAATGCTTTCAGAGTGCTAGAATTCGTAAAGGCGCGATGCCCACGCTTTTGCTAGGGCCACGGCGACCGGACGAGGTGTACGATAATGAGTTTTCACTGCCAGAGGCGGAGGAACCTTTTCCAGAGACACCGGAAGAGGAAAGTTCGACTGTTGCGTCCAAAGTTCAAAAGGAGGTAACCAATTTATGCCTGCCGCCACGGGCGCCGCCTCGAAAGTCAAGCAAGTTTTGCCAAATTGATTCCTGCACAAACCACCTGACCACTGAGAACATGACACTTCACAAGTTTCCACACTCGGAGGACATGTGCCTCAAGTGGCAGCACAACACGCAAGTGCCATTTGATCCCTACTACCGCTGGCGTTACCGCATTTGCAGTGCACATTTTCATCCGGTGTGTTTGGTCAACATGCGTCTAGTCCATGGAAGCGTTCCCACTTTAAAGCTGGGTCCCAAGGCACCTTCCGAGCTGTTTGACAACGATTTCGAAGCCATCAACCTAAGATTGGATAAAAGGTTGACAGAGTCCAATGCTAACGTGTATATCAAGCATGAAAAAagggaggaggatgaggattCGATGATGTTCCTTGAGCCCGAGCTTCAGTTACATGAGGACCAAGATGATAAGGTATCAAGCTGGAACAGCAAACTGCAGTTGCCACCTGTGAAGCAAGAGAAAATGATATACAGCCAAATCAAGTCTGGCTACGACAAGTGTTCGCTGGCTCACTGCCAGCGCCAAAGGTCCCAGCATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGTCAGCAGGAGCGGTGGATGCACAACCTACACATCCGCTATGATGATCGGACACCGTGGAAATTCATGATTTGCAGCGTTCACTTCGAGCCGCACTGCGTCAGCCTAAGGAAGCTGCGACCTTGGGCGGTGCCCACACTGGAACTGGGTGACAATGTACCAGAGACAATCTTTACGAACGAACAGTGCGAGAAGGAGCTGGTGATCGAGCGCAGTGATCCGGATAGCGACGCGGAAGAAGAAGACGGCTTGCAGgaggacgacgaggatgatgacgacgaAGACGATGTAAGGCCCGATGTTATTGGCATAAAAAGGAGGAAACGTTCCAAAATAGATTCCACCGGCCCTCCTAGCCAGATTCCACCCTGGAAAGTCAAGCAGTGCTGCTTACCCTATTGCCGGGCCTTTCGAGGCGATGGTATCAAGCTGTTTCGGCTTCCGAACAACCGAAACTCCATTAGCAACTGGGAACGAGCCACCGGAATGGTATTCAAGGAGTCGCAACGGAACACTCGCCTGATCTGCAGCCGTCACTTTGAGCCAGAGCTGATTGGAGTCAGGCGTCTAATGCGTAACGCCATTCCCACGAAACACTTGAGCCCTCAATCTGTGGACCAGATCCGTACTAAAAAGGAGAAGAATCCTCCTCCGGCCACTATTATACCCATCTGCTGCATGGCCGATTGCCACTACAACGGAAATGCGAAGCTGCACAAGTTTCCAAGTGATCCCACTCTTCTCAAACAGTGGTGCCAGGCTCTCCGGCTCACGGATACGCAGCGGTATTTGGGCAAGCACATTTGCTCCATGCACCTGCCAATGAACAAGACGCTGAGCTGTGTCATCTGCGGTGGAGACGACGTAGAGCTGCCGATGCTTGGGTTTCCGGAAAACCGCAATCAGCGCGCCAAATGGTGTTACAATCTCAAAATTGAGGCAATACCAAAGTGGGACCACTCAAAGCATATTTGCTGCCGGCACTTTGAGTCCCATTGCTTCGACAAGCCGGGTGAGCTACGTCCAGGAGCGGCTCCCACGCTCCATCTCAATCACGACGACACAAACATATTCTTCAGCGACTATGCCACTGGTCTTCCGTCCTCGCCACTAGGCAATCGAATTAAAGACGAGCCCCTGGAATCGGAGTCCGACGAGACACTGCTGGTGTAG
Protein Sequence
MSQHNHNHAHPHYQYPISNNNSMGAYGGGVGGGGGSHGYFGAAGGGLNSEPLEGFQQPPNPMAPPPAPEMIIKSEPIDDLAYKSNYIDDNTPFADFSKFSDFSEDMLSPKVELTVKDESFVRNPNSFLRRKQQSDLTTAESLPVCQRCKEVFFKKQTYLRHVAESSCGIQEYDFKCTICPMSFMNAEELHEHKQQHRADRFFCHKYCGKHFGTITECEAHEYMQHEYENIVCNMCSGSFATREQLYAHLPQHKFQQRFDCPVCRLWYQTALELHEHRLAAPYFCGKYYTGGQSPSPSSQQHQHQSQTNYKLQDCHMATMEMPNAPLLKANSSNSPALPATAALNSLLQQRQANADGAAIFAASSLKNEVAVKLERSYSNSTNESSYSVQESGYNNVYGSSDSSGHGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPAPTPSSCHLQQQQQSSLHSLQQQMHSKSYQRRSAASTSSSASSAASHYVDPEMSASYHLAMSASAGGSAMINASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCANLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDDIAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEEAAVLAKWAHNLQTEANQLTSMRICNLHFEAHCIGKRMRHWAIPTLNLAGNIENLYENPEQSLLYRRRTTHLKAKLPQSSVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKANKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDDDIYPNEAQAFVDEHCAVEGCEASKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFETDCIGPKHLYKWAIPTQELGHDDAQIELIPNPKPEDRYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRLDHLSFQERERYKICNSHFEEICIGKTRLNIGSVPTLELGHDDEEDIFQVNPAELQSNLFGRQRRLLLEGSGEQSIKQELSETEDSNKADVTATGSISKQIKIKRSSSDLKCCVHSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHNDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLQLGHDEDLIYDNEAIKAGMTEEEEAISTDFPRLKPKKELFEEEEEECEGNDGEQQHTDDLDDNADEEDKDDQYFDPLELVETFAEHRSDDEAQDYEDEEDEERVEDSPSGYDVKEEIEPPPSSSPSPLRRRHHVPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCANLHLNPDDCIGRVCIEHFQPEVLGTRKLKQNAVPTLNVGHDEPLRYSCHGVDQNLEEREPQPQHSVFRLWSLKHCRKRKLTEPPDIPLAKRRVLEMPMMKREWEMEMPMQMQMEQKKEAKKMTQTESKSLICCISSCGRQELNQLVAFPKEKSLLRKWMHNLRLPTEIESTCLSLKRACLAHFETQLLENGKLIKEAEAVAVPTLNLGHSSWNLYRSNGICLVPNCAFNNFRNISFIDLPDNSIIRDACFSCLNLPESCEELAKLCCIHFMEAYKKFDLPNVLHPEVMTMLQSVVAELQCAVPGCNFEDADPDFQLIQFPDNKEALSQWLHNTKVPYDPSSHHSYRICTRHFKSEYLETNGPLEGAIPTLHLNHEDEIHLNTSSLPGDSNSILTPLRIKTDPAFLGSPCASASPSPRGKIRICCIPSCGQFGSSQVRLFRFPTEEQALLRWLVNTQQQPRLVDPMDLYVCQSHFEPEAIYMKHLRNWAEPTLNLGHDGHIIPNAKHNGNISDSQDTEQAMRFIRERFCSVLSCFQAGGQEEEGVRLFDYPEDMATTRKWAAACRHRSMQARSHGFKVCQLHFAKECFDPNSGALIEGAVPTLELSRDEMERQCLVAGCVKNDATGTRLRYFKIPKVAAQLEAWSNNLKVHPTDLMQGEQQYICEKHFESFCFGANKGLRSGALPTLLLGHDEEVDMLPNPESFICQNKADKCCVPGCGRVWQAGDRKFRGFPKLLAMANKWRHNLRLDEPVEQLGKLKVCSAHFEATSPNLGTNGLSVSIPTLELGHSSPDIFPAEISLKFQKRSGMPAKIYCCYPKCEEVCLSKNFSYSLPQEEHLRNAWLSHMDIEDPKDEEIARVCPLHYVILYQHSAALYPELHASRRQLLDFNYKEAWNNRRVKIVSCTIKGCDMVRPRDGVPLHGMPQSDEILQMWIDNGQFEFLEQQRYMFKVCHNHFEPCCFFDDRRLHSWSVPTLHLPGDVIHQNPTPEQWQNMINKQAAAKTHAEENEEPDPYEDVVKTEPVVKMEHIESEYEDENPEMQALEVLLEVGHVERMESYEEMDKSPAIYADSAPFRSSPIRCQYNANHCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWMHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPDEVYDNEFSLPEAEEPFPETPEEESSTVASKVQKEVTNLCLPPRAPPRKSSKFCQIDSCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFEAINLRLDKRLTESNANVYIKHEKREEDEDSMMFLEPELQLHEDQDDKVSSWNSKLQLPPVKQEKMIYSQIKSGYDKCSLAHCQRQRSQHGVHIYKFPRSRRQQERWMHNLHIRYDDRTPWKFMICSVHFEPHCVSLRKLRPWAVPTLELGDNVPETIFTNEQCEKELVIERSDPDSDAEEEDGLQEDDEDDDDEDDVRPDVIGIKRRKRSKIDSTGPPSQIPPWKVKQCCLPYCRAFRGDGIKLFRLPNNRNSISNWERATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTKHLSPQSVDQIRTKKEKNPPPATIIPICCMADCHYNGNAKLHKFPSDPTLLKQWCQALRLTDTQRYLGKHICSMHLPMNKTLSCVICGGDDVELPMLGFPENRNQRAKWCYNLKIEAIPKWDHSKHICCRHFESHCFDKPGELRPGAAPTLHLNHDDTNIFFSDYATGLPSSPLGNRIKDEPLESESDETLLV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00525910;
90% Identity
iTF_00594581;
80% Identity
-