Basic Information

Gene Symbol
-
Assembly
GCA_037044495.1
Location
JBAMBQ010005379.1:1446185-1461614[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 32 2.5e-15 4.7e-12 46.6 3.8 1 86 533 605 533 606 0.86
2 32 7.6e-15 1.5e-11 45.0 3.8 1 87 633 702 633 702 0.83
3 32 1.2e-16 2.3e-13 50.8 0.3 1 87 724 796 724 796 0.85
4 32 4.1e-17 8e-14 52.3 3.4 1 87 878 948 878 948 0.82
5 32 1.1e-13 2e-10 41.4 4.9 1 87 972 1044 972 1044 0.81
6 32 4e-12 7.7e-09 36.3 0.6 1 87 1079 1147 1079 1147 0.80
7 32 6.3e-10 1.2e-06 29.3 2.3 1 86 1188 1257 1188 1258 0.74
8 32 4.9e-16 9.4e-13 48.8 1.1 1 87 1285 1355 1285 1355 0.81
9 32 2.6e-11 4.9e-08 33.7 3.0 1 86 1376 1445 1376 1446 0.80
10 32 1.9e-15 3.7e-12 46.9 2.4 1 87 1473 1545 1473 1545 0.84
11 32 5.8e-14 1.1e-10 42.2 3.1 1 87 1639 1709 1639 1709 0.83
12 32 5.1e-12 9.8e-09 36.0 0.4 1 86 1731 1798 1731 1799 0.79
13 32 2.6e-14 5e-11 43.3 1.4 1 87 1896 1965 1896 1965 0.82
14 32 5.8e-13 1.1e-09 39.0 0.6 1 86 2057 2127 2057 2128 0.80
15 32 3.4e-07 0.00066 20.5 0.0 1 60 2156 2208 2156 2229 0.80
16 32 4e-12 7.6e-09 36.3 5.2 1 86 2254 2320 2254 2321 0.79
17 32 6.7e-13 1.3e-09 38.8 1.1 1 86 2356 2426 2356 2427 0.82
18 32 1.7e-13 3.3e-10 40.7 2.5 1 87 2458 2530 2458 2530 0.83
19 32 1.3e-14 2.4e-11 44.3 0.2 1 87 2555 2626 2555 2626 0.80
20 32 2e-06 0.0038 18.0 0.7 1 58 2667 2716 2667 2740 0.80
21 32 9.9e-16 1.9e-12 47.9 0.3 1 86 2765 2837 2765 2838 0.83
22 32 2e-15 3.8e-12 46.9 0.4 1 86 2864 2934 2864 2935 0.81
23 32 2.5e-14 4.8e-11 43.4 1.8 1 86 3083 3153 3083 3154 0.81
24 32 1.2e-13 2.3e-10 41.2 2.9 1 87 3215 3286 3215 3286 0.82
25 32 3.8e-13 7.3e-10 39.6 2.7 1 86 3376 3446 3376 3447 0.78
26 32 6.2e-16 1.2e-12 48.5 0.0 1 87 3494 3571 3494 3571 0.82
27 32 4e-13 7.7e-10 39.5 7.2 1 87 3615 3686 3615 3686 0.83
28 32 2.1e-12 4.1e-09 37.2 0.3 1 86 3768 3837 3768 3838 0.83
29 32 8e-14 1.5e-10 41.7 0.0 1 87 3935 4005 3935 4005 0.86
30 32 1.6e-12 3.1e-09 37.5 0.6 1 86 4025 4099 4025 4100 0.78
31 32 7.4e-15 1.4e-11 45.1 1.3 1 87 4112 4181 4112 4181 0.80
32 32 4e-12 7.6e-09 36.3 1.5 1 87 4189 4257 4189 4257 0.88

Sequence Information

Coding Sequence
atgtcacaacaacaaccacagcagcagcagaaccaCTCGCACTACGCACATGTTGCTTCTTCCGCGTCTGGTAAAAgtagcaacggcaacaacgacaTGCACATGTACGGAAGCTTTTATGGAGGCAGTATGTCTGGAGGAATCGGTATAGGCATGGGCGGTAGCTGTGCTCTTAACAGTGGCAACGATTGCATTGGTGGCGCAGCAGCATTCGACCTTGAAACGAGTGGTTCCAACTCGGTTGCTTACGCTCATAATCAGCTGCTCcagtatcaacaacaacaaagcacactTATGCCGCAGCATAATCCACCAGGAGTAGCCGAAGCTACCATGCAGCGCCATGAATACATGAATCCAAAAGTTATGAACTTGCAGCATCACGGGAATCAATACCAAATGCATAATCCGATGTCTTGTTCACCTATTGAAGTCGAAGAATTGATAATCAAATCCGAACCACCAGACGAGAAtaactataaaacaaattttattgatgACAATGCACCTATCGTGgactttaaaaaatttcctGACTTCGGTGACGAAATGCTAAATCCTAAGGTGGAGCTAAATATTAAGGAGGAACATCCCAGCGACATGCAGCAAAAGTCGTTGAATTTTCCACGGCGTAAAGTACAAACGGAGCGTTCAGAGACTTTGCCAATATGTCAAAGATGCAAGCAGGTTTTCTTCAAAAAACATGTTTATATAAACCATGTAGCTGAAAGTTCCTGCGATATCGTCGAATATGACTTAAGATGTAGTATATGTCCAATGTCATTCATGTCTATGGAAGAGCTGGagaaacataaacatttacatcgtgaaaataaattcttttgcCATAAGTACTGTGGTAAATACTATGACACTATATCGGATATTGAAACACATGAGTACATGCATCATGAATATGATACATTCGTTTGCAATATATGTTCTCTTACATTCCCTACACGAGAGATGTTGTATGCTCATTTACCGCATCATAAGTTTCAACAGCGTTTTGATTGCCCTGTTTGCCGCTTGTGGTTTGCGACGCCACAAGAACTCCATGAACATCGCATTGGTGCTCCTTACTTTTGCGGCAAGTACTATATAATGGGTGGCACTCAAAATCATACACAACGTTCTAACTTTCAAAACTCGTCGGTTTCGAAGTTTTCTTCAGAGCCCCAAGAAAACTACAAACTACAAGACTGTCAAATGGGAGTCATGGAAATGGCGCCGCACCAATTTCAGTCTTCCACTGCTACATCCAACTCAACCATCACTAGTTCCTTTATGACACCAAGCCACACGTATCAACATGcacattacaaaaacaacaacttctTCCAAACGCCAGCTATAAAGACGGAGATAAAAGTGGAACCAGACTTTTATTCAAATATCTCCGATTACCCCCAGCATCATACGCAAATGACACCGGCAAACTTTAGCGATTATACAAATGATTCCTACACTAGTTCGCATAACTCATCTAATTTTACGAATGATTACAATGAAATAGGCAGTTCTATGCTtaatgcacaacaacagcaagctcAGTCAGCTAACGCATCTACACTTGACGATTCCGAAGACGCAGTTTGTTGTGTGCCCCGTTGCGGTGTACGTAAGAGTACCAGTCCGACTTTACAGTTTTTTACATTTCCTAAGGACGAAAAGTATTTACATCAGTGGTTAAGTAATCTGAAAATGCTGCATGTGTCCGTGACGACATATCGAAGCTATCGTATATGTAGCTTGCATTTTCCAAAACGCTGCATAAATCGTTACTCATTGTGCTATTGGGCTGTGCCTACCTTCAATTTGGGCCATGATGATGTCGCAAATTTGTATCAAAATCGAGAACTTACGAATACATTTACAACAGGCGAAGTAGCACGCTGCAGCATGCCCAATTGTAAGAGCCAACGTGGTGAGAGTAATgtgaaattttacaattttccaAAGGATATAAAAAGTTTGATTAAGTGGTGCCAAAATGCAAGATTGCCTGTGCAAGCTAAAGAGCCACGCCACTTTTGTGGACGGCATTTTGAAGAAAGGTGTATCGGTAAATTTCGCCTTAAACCGTGGGCGGTGCCTACGCTTAACTTGGGCACACCTTACGGCAAAATACATGACAATCCTCAGAATCTCTATGTAGAAGAAAAACGatgttgtttgccattttgccGTAGAAGTCGCTCGTCCGACTTCAATATGTCGTTATATCGTTTTCCACGTGATGAAGTGCTTTTACAGAGGTGGTGCTATAACTTGCGTTTGGATCCTTCGGTATATCGTgggaaaaatcataaaatttgcaGTGCCCATTTCATCAAAGAGGCTTTGGGTCTAAGAAAACTTTCACCaggTGCAGTACCGACACTAAATTTGGGTCATAATgacacttttaatatttatgaaaatgagTTGTACACTCCACctccaccaccaccgccacctcCTTCATCAACTACTCAGAAAGGATATTACCGTCAACCGAATGCAACTTCAACAGTACCTTCGCCTTCTAGCAGTTTGTCATCAATGTTCTATGCGTCGGATGCAGTGTCACCATTTTCAAGTATTACAAATGCAAGTTCAACGCCTAATGTCACCGAATCGATGGATGTGTGCTGCGTACCTGGTTGTGATAGTAAACGCCATAACTCTAATGATATCACTTTCCATACCATTCCACGTCGGCCagagcaaatgcaaaaatggtGTCACAATCTAAAAATGGATGAAGACAAAATGCACAAAGGCCTAAGGATCTGTAGCTTACATTTTGAGCCTTACTGCATCGGCGGTTGTATGCGCCCTTTTGCTGTGCCAACGCTTAACCTTGGCCACAAtgatgaaaatatttacaagaaTCCGGACGTTATTAAAAAGCTTAATATTCGTGAAACATGCTGTGTTCAAATTTGCAAACGAAACCGTGAAAGGGACCATGCGAATTTGCATCGCTTTCCAGCGAATCCAGTTATGATGGCGAAGTGGTGTGATAATCTGCATAAGCCAGTTCCGGATGGATCAAGGTTATTTAATGATGCCATATGCGAAGTGCACTTCGAAGACCGTTGTTTGCGAAATAAACGTTTGGAAAAATGGTCTGTACCAACTTTAAATTTAGGCTATGATGACTTGGTCCATAAACTACCTACTGAAGCCGAAGTTGCAGAGCTATTTGCAAAACCAAGTGCACCGAATAATGGCGATGAGGAAGGAGAATGTTGTGTAGATTCGTGCAAGCGAAATCCGCAAGTCGACGATATAAAACTTTATCGTGCACCTGAGGATCTAGAGATACTTGCTAAATGGGCGCACAATTTACAAATTGATGTCGACGACCTTTCCAATCTACGTATTTGTAATCTACATTTTGAATCGCATTGCATTGGAAAACGTATGCATTTTTGGGCAATTCCAACTCTTAATCTTGGAACAAACATTGAGAATATGTACGAGAATCCTGAGCAACCCGTTTTTATAAAAAGGGAGAAATCGCTTAAATTCTATGGCGATACTTCAACGAGCAAAGCTACTTGGGTGCCTCGGTGCTGCCTAGGGCATTGTCGCAAAATGCGTGCAGTCCACAATGTGCAGTTGTTCCGGTTTCCTAACTTGAATCGTCCTATGCTTGCAAAGTGGTGTCATAATCTCCAAATGCCTCTTGTAGGCAGCGCTCAGCGGCGAGTTTGTTCAGTACATTTTGACCCGGCGGTGTTAAGTAAAAAGTGTCCTATACCACACTCGGTGCCAACACTGGACCTAAATACTCCGCCAGGATATAAAATTTATCAGAACCCAGCGCGACTTAAAGCTCCTAAACAATGTTTGCAACGCGTTTGTATAATTGAGACTTGTCGTAGAAAGCGAACCGATGGAGTTCAACTATTTCGTTTTCCACACAACTCCTCTGTACTGCGTAAATGGttacataatataaaacaacgtCCAAAGGGATCTTCTCGTGCGCAATGGAGAGTTTGTACAATGCATTTCGAAACACATTCGTTTAATGGAAAACGTCTTAGCAATGGTGCCATACCAACACTTAATCTGGGCCACGACGACGAGGACATTTATCCTAATGAAGCACAGTCTTTTATGGATGAGCAGTGCGTAATCCAGGGTTGTGATTCGGTTAAGGATATGCCTGATGTGCGCCTCTTTAGATTTCCTACTGATGACGAAGATCATCTTTGGCAATGGTGCAACAATTTAAAGATGAATCCAATTGATTGTCGTGGCGTCAGGATATGTAATAAGCATTTTGATCCAGAGTGTATTGGGCCAAAACATCTCTTCAAATGGGCTATTCCAACTCACAATTTGGGACATTCTGACACAGAAATAGAATTTATTCCGAACCCTAAACCCGAAGATCGTTATTCGGAGTTAATATTCAAGTGTTGTGTGCCAACATGTGGCAAAACGCGCAAATATGATGAGGTACAGATGAATAGCTTCCCAAAGGATCCAAAACAATTCCGGCGATGGAAACATAACCTTAAATTAGACCATTTAGATTTCAAAGAACgtgataaatacaaaatatgcaattcaCATTTTGAGGATATTTGTATAGGCAAAACGCGACTTAATATTGGCGCAATACCAACTGTAAATCTTGGACATAATGACATAGACCACATATATGAAGTCAATCCGGAAAAACTGCAGAGTAATCTATTTGGCAAAAATCGGCCGATTTCCGATTATGCAACCAGTGAAGACAGCGCTGATGAAGAGTCTATTGAAAATGACTCGGAAGCTATTTTGCCAAATTCATATAGTGGCAATGAGTATTTTTCTGATAATCAACTGGATTCCGAGGATGTGATCAATGCTTCTTCAGACTTAAATATGACTCAAGTTAAAATGAAGACATCGCTTGATGTGCTTAAATGTTGCGTACAAAGTTGCCAAAAAAGTCGCTTACAGCATGGTGTCcacctttttaattttcctagTGGTCCCATACAATTGAAGAAGTGGCGCCACAATCTCCAACTACCAAATGAATATATGAAGAAACtcttaaaaatatgcaatctTCATTTCAACAAGCGTTGCATAGACGGCAAGCAATTGCGTTCGTGGGCCGTACCAACCATGAACTTGGGTCACTCTGATCCAATCTatgaaaatccaaaaaacATTCCAGGATTTTTCTTGCCCGTATGTGCATTAAATCATTGTCGCAAACGTCGCACAATTGACAATGACCTACGCACATATCAGTTTCCTAAAGGATATTTGCTCGAGAAATGGTGTGCCAATTTGAAGCTTCAACCAGAGAATTGTCGTGGTCGTATATGCGCTGACCATTTTGAAGCCGAAGTGCGTGGtaaattgaaactaaaaaCCGGCGCTGTGCCAACTCTGAAGTTAGGACACAATGATCCGTTAACTTTTAATAATGAAACCATTAAGTCATCATATGAGAAGCGTTACGATACCTACGAAAAACAATCTGAAATGTCAATCGGTGGAACTGAACCCGATGTGGAAGAATCTATTTCCGATACGGAAGAGGGAGAAGAGTTTTACGATCCACTAAAGTTTGTAGAAACTGTAGATGAACAACTTGAAGGAGATACTATTGAAGAGCCTATTTTCATGCCAATTGCCACATgcatcaaaaaagaaaaaccagtAAACAATGTTTCTcctatttgttgtttaaaacaTTGCCGTAAAGAAAAAACGGCTCAGCAACATCTAAGCACTTTTGGCTTTCCCAAAGATCGGCAGCTACTGTTAAAATGGTGTGCTAATCTACATCTTAATCCTGATGATTGTATTGGTCGTGTTTGCATTGAGCATTTCGATCCTGAAGTTCTGGGAAGTAGAAAACTTAGACAGGGTGCTGTACCCACGATTAATGTTGGCCATGACGAACCCCTGCGTTATTCAAACAATGGATGCGAACTACCCATTGAACTTTTTGATAATGATTCATTTGGAAGTGTCAATAGAAATGATCATACTAGTGATTCGCCGGGGCACTATGCTGAAACACTTGATCATTCGGTTTTTCGGCTTTTTAGCCTAAAACATTGCCGAAAAAGGGAAAATCTTGAAAAATATGATGAAACGAGTAATAGGAACATTTTCCTAAACCCATGTTCACTTGAGGTAGCAAAAGACCTTTCAAGCTGCTGTGCTTTAAACTGTGGAAAATCTCGTCTTAAAGACGGCGCACGTCTTAATCGTTTTCCAAAGAGTCGCATACTTTATCTTAAATGGACATATAACTTGAAACTAAAGTCCACAACGCAAATAATAAAGACGCACAAAGTTTGCAACGAACACTTTGAGTCGTACTGCTTGAAGAATGGTTTTCTTAAGGAAGGTGCTGTGCCCACTTTGAAATTGGGTTATACTGATTCCTTAATTTACCGTAACTCAAAGAAATTGCagagcaaaataaatgtatacaacagaaaaactaaatgcaTGGTACCGGGTTGTTTATCTGGAAATTTGCAAACTAACAAAATGTACCCAATACCTTCTTGGGCTCCCTTAAGTTCTGCTTGGCAACGCTTTTCCAACCTTAAAGACGATGAGAAGCAAAGTAAGCTACAAGTTTGTGGTTTGcattttcttgaatttttcgAGGATACCTCTTTGCCTCAAGAATATTCTACACAagtcaaaaatattatgaaaccaattgaaatcaaaataGAACCGAATTTGGATGGGAAACCAAAAAGTGAGTCACCAAAACGAGTTTCCAAAATCTGTTGCATTCCTCACTGTAACggtcaaaatatttttcgccAACTTTTCTCGTTTCCCACTGCTGAATGCCAGTTACAAAAATGGTGTGTTAACACGCAAAAACCTATTGCACAAGTCCGTAGTCTTTACATTTGTGCTAAACACTTTGAACCCGATGCCATTTGCAAACAGCAACTACGTCCTTGGGCTCTGCCTACTATTGATTTGGGTCACACCGACACCataatacaaaatgcaaaacaccTGGGAAATTTTCACAGCGAGGAGCAGGAAAGCAATGATATGAAATTTATACGCAGCAACTATTGTGCTGTCTTTAGCTgttttcaaaagaaaagcgatACCGTCAGGCTCTATGAGTACCCTACGGAGATGACGTTAATACGAAAGTGGGCAGCTAACTGTAAGCATCGTTCGTTTCAAGCTAGACAACACGGTCTACGCGTTTGTAAAGATCATTTCGACCGCGACTGTTTTTTGGAAAGCGACATATTACGCATCGGGGCTGTTCCTACGTTGAAGCTTGGTCTAGGGGACGGTCAAAATATTCAGCAAAGCGAGTGGATAGAAATGCGACGGAAGACACCGATACCGCATGGACAATTGATTCATTGTATAGTGCCAGATTGCAGTTCGGTCAGTGGCACAAGTAAaagattttataaatttccaaAGATTGGCGAATTGCTAGAAATGTGGtgtaaaaatttgcaaattgttggattaaatgacaatgacaataagGAGAAACATATATGCCAACTACATTTCGAGTCCCGTTGTTTCACTGAACACCAACGTCTGCACAAATGTGCTTTACCGACACTGAACTTAGGACACGAAGACATCAACAACGTTATTCCAAACCCCGACAGTTTTGAACGCATCGATGTTGTTCCGAAATGTTGCGTACCTGGGTGCAACAAGACAAAACAGAGTGATGGTGTTCAATTAAGTGCGTTTCCGCGTGTTCGAAATCTTTTTGAAAAATGGGCACTTAACCTTAAATTGCCCATTAGTAATCAACTTTGGCAGAATGGAAAAGTATGCAGTATTCATTTCGAAACTTATTGCTACGAATATGGTCGTCTTAAGACTGGTGCTATGCCCACTCTGAATTTGGGTCACGATGATGCGGATATAAATCTTACTAACGAATTGACACTAGGAAAAAAACGCAAGGCTCATTTTGCACACCGTACCAACACTCCAACACTGAGAAAGATTGAtgattttaaatgttgttttccaACTTGTAAAGAGCTTGACAACATAAAGATGAGGCAGGAGTTCAAGTTGCCCGCTTTGAAATCACTTAGAATAAAATGGTGTGAAATTATGGATTGTGATCCAGATGATACGGAGCTAAGGTTATGCCCTCTGCACTTCATTATTATCTATGAAGCTAATATTGACATGATTAAAAATCTTCAATCGAACGAATCCCAAAATGAAGAGACGATAGAATGTGTTTCACAAATCCAAACATCGTATGATACAGCAAAATGTAATTCACGAATACTCGGAATGCGTTGCTCTGTACCTGGTTGTTTGACACTCGCTTCTAGGGACAACAAAAAGCTATTCGTTTTGCCTTATAATCAGGACTTGCTCGAAAAGTGGctgcacaacacaaaaatcgAATTGGTGGAATCTCAAAGATACCTAACTAAAGTATGTTCTGACCATTTCGAGAAACGTTGTGTATCTGAAAGCTCAACACGATTAAAGATGTGGGCCATACCTACACTTAAAGTACACACACCCATTGATGAtgaaattttgtatgaaaaccCATCTGCGGCGGCGCTAGAGAGAAGTACCGCCAAATGCTGTATACATAATTGTGAAAATGCAAGTAAAAATGTTGATGTCATTACTGTGGCTCTCTACCGGTTCCCAAAGGAGAATGACGTTTTACAGAAATGGTTGTACAACACGCAATTAAAGGCACGCGATACTATTGGTGGCCGAGTCTGTGCGCTTCACTTTGAGAAATTTTGCATCGGCAAGCGATTGCGTAGCTGGGCTGTGCCCACATTATATCTTGGCCATGATTTACCGGATATTTACCAAAATTCAAATGATAAGCGAGTAGCTCAATTGAAGGTAGAAGCTGTAGATAGTGACGAAAGTCGCATGACCAGTTACAGTGACCAGAAAACGGAGACGGAAACTATGGATAGTTCAACTTTCGAACCAATCGTTGTAATGGAATCTTTAGAGACCGGAGAATTCCACGATCAATCTGTTAATTCCACAATTTCAAACATCTATGACAATTATAAGGGTAAACAAAAAGACTTGACTGCAGATACAATTGAAGATGACTTAAATTCAACAGATATGGCTCTTGAAGTTATGCTAGAGGTTGGTCACGTTGAGAAGTGTGCTACTTATGAGCAAACACATTCTTCTCAGCCACACACTCCTGTCAGTGAACAATCATTACCAAGTCCAGGCGAACGGATCGCAAATGCTAGACGATGTTGCATTGTGGGTTGCACAGTGCGCAACACAGATCCAGGTATGAAGCTTCATAAATTCCCTCAATCAAAGGAAGTGTTAGATAAGTGGATGCATAATGCGCAAATTGAAGTCGATCTACGTTGCCCGTGGCGTTATCGGATTTGCAGTAGACACTTCGATCCTGAGTGTTTTAATGGCTACCGGTTTCGACACGGAACGATGCCAACCTTATACCTTGGACCCAATCGCCCACGGAACATTTTTGAAAACGAATTTGAACAAGTAATTTGTGCgcaagacaaaagcaaatcacTTTGCCAAGACAGCGATATTGAAGACAATTCCGACGTAGATTTCGAACAATCTAATCAATGTCTCGACTTCGATAGAAGTATGAACAAATCGGGAAAGTTTTGTCAAATTAAAGGTTGCCGAAACCATTTAAAAACGGATGGCATTAGTCTGCACAAATTTCCACAATCTCCAAGGCTATGGAGGAAATGGCAACATAATACACAAATACCTATTAATATCAACTATCCGTGGCGTTTCCGCGTTTGCAGCGAACACTTTCATCCGCAATGTCTTTCAAATTCTCGACTTTTGTTCGGAAGTATTCCGACTTTGAATTTAGGTGCCAATGCGCCAGAAGAACTTTATGAAAATGAGTTCCAGATTGAATCGATACCGAAATTGGAAATGCCATTGTTTGAAGAAAAAACCTCTGCGGATGAATATGACGAAGAATACGATGAAGTATTGGCTATAGAACCGGAAATTGAGTTGCAATTAAAAGAAGATCGTTCAATGAAGTGTAGTAGTAGTAATGAATTCGAGTATGATAATGAAAACTCGGATTCTTTCGACGATTATTTCACAAAAGCCGCGATGGGCAAATGCTGCATAGCTGGTTGCAATATGAGCAAAGCTTCACCTGGCATTACGTTACACAAATACCCTAGCCCACCGGAACACCTTCGTAAATGGCTGCACAATACGCAAGTTAAAGTCGACCTGACCTGCAGGTGGCGTTATCGTGTATGTAGTCGCCACTTTGAACCAAGTTGCTTTCGCGGACCACGCATTATTTACGGATGTGTACCAACATTGAATTTAGGTCGTAACGCACCCAGTTTCGTTTATACAAACGAGGATGTTTTCAAACGTATGGAAGGAGAAACCCTGACAAACCATATTCCAGATGAAAAGCCTAATGTAACGTTTAGTGCTACATCCTTTACCAAATTGCACTGTTCAATATTGACTTGTGGTCGTGTTTCTGGGCAAGATGGAGTGTCCCTTCATCGTTTGCCACAAGATGAACTAACCAGGAGAAAGTGGCTAGTACATTGTAATTTCGGCAATGGCAGCTTTCAGAATAATGTGTATTCACTGCGCATTTGCAGTCGACATTTTGAGAAACATTGCTATATAGGCGCGCCAGGGAAGCGATACTTAAAACCAGGTGTCATTCCaacattaaatttgcaaagtACTGTCTGCGAAAAGTTTGCAATGCAACCGATTAAGCGGGAAATAACCTTAAAGcatagcaaaatgaaaacaatcgAAATCAATTTTGAGTATAATGAAATCAAATCGGGGTATGGAAAATGTTCGCTGATACactgtcaaaaacaaaaatcgcagCATtgcgttaaaatttataaatttccaaagtctcaacaacaacaggaaagATGGTCTCATAATCTGCGCATACAATACGATCCTGCGCGACCATGGAAATTCTTAATTTGCAGTGACCATTTCGAGGAGCAGTGCGTCAGTAAAAGGAAACTCTATAGATGGGCAGTGCCGACTCTTAACTTGGGTAGCAATGTTCCAGCTACACTCTTTACTAATGAGGAGTGTAAAGTTTTGTGTGCAGGTTTTGGCAGTTCTTCAGATGACAGTGAGAACGATGATCAGTCTGGTAATGAAATTAAGTTCAACCATTCCGTTTCTGTGTGTAAAACAGAGACGAAATCAACTTTTGATAAGACATCGACTTCACCAAATGTTTTTTACAATTCGCCACCCAAACATAATTTCgcttataaaactaaaatatgctGTTTGCCCCATTGCCGCAGACCACGCGGGGAGGGCGTTAAACTATTTCGCTTGCCAACTAGCGTTCACTTGATTCGTAAATGGGAATATAATACTGGCATTGCTTTTAAAGAATCTCAACGAAATACGAAGTTAATTTGCAGCGATCATTTTCCACCTTCCCTTATTGGCATTAGACGTTTATCCAAAAATGCAGTGCCAACGAGAAATCTAGGACCGAATGCACCACAGAATccccataaaaacaaaatagttgaAACTAATGAAGAGTCTGATGATATTGTCGATAGAGAAACCGGAACCAAAGAGCTGGAAATAGCCGAAGTAGACACATTGGAAAACTCTGAAGATttcgatgatgatgaaaaattgtttgatCCAGTGTCATTTCGTTCACATTGGCAAATGTCAAATTTTGTCAAACAAGAACATAACGATTGGCAGTCAAATACAGACTTGCGCCTACCTCCCAAAAATATGACCTATAAGCAACATCATTGTTGTGTTCCACATTGTGGCTTGGCCCGCAGAGGTAATCTAAAACTTTTTCGAGTGCCATTAGATTCAAATCTCCGGGcgaaatgggaaaaaaatttacgcaTGGTATTCGATATAAACGACCGCAACATCAACCTTGTGTGTAGTCAGCATTTTGAGCCAGAATTAATTGGTGCCACACGATTGGCGCGCGGTTCTGTGCCAACTTTGAATTTGAAGCCCATCGATGCTACTAGTTCATGTACTACTCACAGTTCTGTGCTTGTATTCTGCAATGTGCCAGGCTGTGATAACACAAACGCCCATCAGAACATTACTATTTTTGACACATATATGAGAGAACCAGATATTCTTAATAAGTGGTGCGAGAACTTGAATATTGACGCGAATAATCCAACAATTAAGGAAGGCGGCTACAACATTTGTAGTGCGCACTTCGAACCTCAATGCTTTACTCCACAAGGCAACCTTCACAAATGGGCCCTGCCTACATTAAAGTTAGGACATGAAGTCGAAGCAAAACACCAGCAATGTTGTGTACCCAAATGCCAAGGCAGTAAAGGCTTAAATACTACAATGTTTAGTTTCCCCACAGatgaGTTTCTTCGAAAGAAATGGTGCAAAGCTCTACAAATtgacgaaaaaaatatttttttaaatgaaaaggTGTGTAAAAGCCATTTCGCAGATGACAATTTAAACGGATATAATCTGAAACCAGGCGCTATACCCTGCCTAAATGTATGCACATCCCAAGGACACTGCATtgtaaaaaattgtgaaactGTCAttggtttaattaattttccgGAAAATCGTAACACTGCTGCCAAGTGGtgtcataatttaaaaattgaaccGCTGCCGAAACTTCATTTGCACCAAGACTACAAAGTATGTCGTAGACACTTTGAACCAGACTGCTTTCGGTTAGGTAATCTATATCCGGACGCAGTACCAAGTTTACATTTAGGCCACGGGGACccgaatatatttttaaatgaagatATGTCTTCAAACAACTATTTTGAATCTGGTAGTGTGCAAACCTATTCAAACAGCTGTGAAACAGAAACTATAATTAAGAAAGAACCCGTTGGCATTGACGAAGTCAGTATACATGCATGGCAAGCACACTCTTGGTATACACCTGAAGAAGATATGGTTCTTGGTGGGTCGTAA
Protein Sequence
MSQQQPQQQQNHSHYAHVASSASGKSSNGNNDMHMYGSFYGGSMSGGIGIGMGGSCALNSGNDCIGGAAAFDLETSGSNSVAYAHNQLLQYQQQQSTLMPQHNPPGVAEATMQRHEYMNPKVMNLQHHGNQYQMHNPMSCSPIEVEELIIKSEPPDENNYKTNFIDDNAPIVDFKKFPDFGDEMLNPKVELNIKEEHPSDMQQKSLNFPRRKVQTERSETLPICQRCKQVFFKKHVYINHVAESSCDIVEYDLRCSICPMSFMSMEELEKHKHLHRENKFFCHKYCGKYYDTISDIETHEYMHHEYDTFVCNICSLTFPTREMLYAHLPHHKFQQRFDCPVCRLWFATPQELHEHRIGAPYFCGKYYIMGGTQNHTQRSNFQNSSVSKFSSEPQENYKLQDCQMGVMEMAPHQFQSSTATSNSTITSSFMTPSHTYQHAHYKNNNFFQTPAIKTEIKVEPDFYSNISDYPQHHTQMTPANFSDYTNDSYTSSHNSSNFTNDYNEIGSSMLNAQQQQAQSANASTLDDSEDAVCCVPRCGVRKSTSPTLQFFTFPKDEKYLHQWLSNLKMLHVSVTTYRSYRICSLHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCKSQRGESNVKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCGRHFEERCIGKFRLKPWAVPTLNLGTPYGKIHDNPQNLYVEEKRCCLPFCRRSRSSDFNMSLYRFPRDEVLLQRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLNLGHNDTFNIYENELYTPPPPPPPPPSSTTQKGYYRQPNATSTVPSPSSSLSSMFYASDAVSPFSSITNASSTPNVTESMDVCCVPGCDSKRHNSNDITFHTIPRRPEQMQKWCHNLKMDEDKMHKGLRICSLHFEPYCIGGCMRPFAVPTLNLGHNDENIYKNPDVIKKLNIRETCCVQICKRNRERDHANLHRFPANPVMMAKWCDNLHKPVPDGSRLFNDAICEVHFEDRCLRNKRLEKWSVPTLNLGYDDLVHKLPTEAEVAELFAKPSAPNNGDEEGECCVDSCKRNPQVDDIKLYRAPEDLEILAKWAHNLQIDVDDLSNLRICNLHFESHCIGKRMHFWAIPTLNLGTNIENMYENPEQPVFIKREKSLKFYGDTSTSKATWVPRCCLGHCRKMRAVHNVQLFRFPNLNRPMLAKWCHNLQMPLVGSAQRRVCSVHFDPAVLSKKCPIPHSVPTLDLNTPPGYKIYQNPARLKAPKQCLQRVCIIETCRRKRTDGVQLFRFPHNSSVLRKWLHNIKQRPKGSSRAQWRVCTMHFETHSFNGKRLSNGAIPTLNLGHDDEDIYPNEAQSFMDEQCVIQGCDSVKDMPDVRLFRFPTDDEDHLWQWCNNLKMNPIDCRGVRICNKHFDPECIGPKHLFKWAIPTHNLGHSDTEIEFIPNPKPEDRYSELIFKCCVPTCGKTRKYDEVQMNSFPKDPKQFRRWKHNLKLDHLDFKERDKYKICNSHFEDICIGKTRLNIGAIPTVNLGHNDIDHIYEVNPEKLQSNLFGKNRPISDYATSEDSADEESIENDSEAILPNSYSGNEYFSDNQLDSEDVINASSDLNMTQVKMKTSLDVLKCCVQSCQKSRLQHGVHLFNFPSGPIQLKKWRHNLQLPNEYMKKLLKICNLHFNKRCIDGKQLRSWAVPTMNLGHSDPIYENPKNIPGFFLPVCALNHCRKRRTIDNDLRTYQFPKGYLLEKWCANLKLQPENCRGRICADHFEAEVRGKLKLKTGAVPTLKLGHNDPLTFNNETIKSSYEKRYDTYEKQSEMSIGGTEPDVEESISDTEEGEEFYDPLKFVETVDEQLEGDTIEEPIFMPIATCIKKEKPVNNVSPICCLKHCRKEKTAQQHLSTFGFPKDRQLLLKWCANLHLNPDDCIGRVCIEHFDPEVLGSRKLRQGAVPTINVGHDEPLRYSNNGCELPIELFDNDSFGSVNRNDHTSDSPGHYAETLDHSVFRLFSLKHCRKRENLEKYDETSNRNIFLNPCSLEVAKDLSSCCALNCGKSRLKDGARLNRFPKSRILYLKWTYNLKLKSTTQIIKTHKVCNEHFESYCLKNGFLKEGAVPTLKLGYTDSLIYRNSKKLQSKINVYNRKTKCMVPGCLSGNLQTNKMYPIPSWAPLSSAWQRFSNLKDDEKQSKLQVCGLHFLEFFEDTSLPQEYSTQVKNIMKPIEIKIEPNLDGKPKSESPKRVSKICCIPHCNGQNIFRQLFSFPTAECQLQKWCVNTQKPIAQVRSLYICAKHFEPDAICKQQLRPWALPTIDLGHTDTIIQNAKHLGNFHSEEQESNDMKFIRSNYCAVFSCFQKKSDTVRLYEYPTEMTLIRKWAANCKHRSFQARQHGLRVCKDHFDRDCFLESDILRIGAVPTLKLGLGDGQNIQQSEWIEMRRKTPIPHGQLIHCIVPDCSSVSGTSKRFYKFPKIGELLEMWCKNLQIVGLNDNDNKEKHICQLHFESRCFTEHQRLHKCALPTLNLGHEDINNVIPNPDSFERIDVVPKCCVPGCNKTKQSDGVQLSAFPRVRNLFEKWALNLKLPISNQLWQNGKVCSIHFETYCYEYGRLKTGAMPTLNLGHDDADINLTNELTLGKKRKAHFAHRTNTPTLRKIDDFKCCFPTCKELDNIKMRQEFKLPALKSLRIKWCEIMDCDPDDTELRLCPLHFIIIYEANIDMIKNLQSNESQNEETIECVSQIQTSYDTAKCNSRILGMRCSVPGCLTLASRDNKKLFVLPYNQDLLEKWLHNTKIELVESQRYLTKVCSDHFEKRCVSESSTRLKMWAIPTLKVHTPIDDEILYENPSAAALERSTAKCCIHNCENASKNVDVITVALYRFPKENDVLQKWLYNTQLKARDTIGGRVCALHFEKFCIGKRLRSWAVPTLYLGHDLPDIYQNSNDKRVAQLKVEAVDSDESRMTSYSDQKTETETMDSSTFEPIVVMESLETGEFHDQSVNSTISNIYDNYKGKQKDLTADTIEDDLNSTDMALEVMLEVGHVEKCATYEQTHSSQPHTPVSEQSLPSPGERIANARRCCIVGCTVRNTDPGMKLHKFPQSKEVLDKWMHNAQIEVDLRCPWRYRICSRHFDPECFNGYRFRHGTMPTLYLGPNRPRNIFENEFEQVICAQDKSKSLCQDSDIEDNSDVDFEQSNQCLDFDRSMNKSGKFCQIKGCRNHLKTDGISLHKFPQSPRLWRKWQHNTQIPININYPWRFRVCSEHFHPQCLSNSRLLFGSIPTLNLGANAPEELYENEFQIESIPKLEMPLFEEKTSADEYDEEYDEVLAIEPEIELQLKEDRSMKCSSSNEFEYDNENSDSFDDYFTKAAMGKCCIAGCNMSKASPGITLHKYPSPPEHLRKWLHNTQVKVDLTCRWRYRVCSRHFEPSCFRGPRIIYGCVPTLNLGRNAPSFVYTNEDVFKRMEGETLTNHIPDEKPNVTFSATSFTKLHCSILTCGRVSGQDGVSLHRLPQDELTRRKWLVHCNFGNGSFQNNVYSLRICSRHFEKHCYIGAPGKRYLKPGVIPTLNLQSTVCEKFAMQPIKREITLKHSKMKTIEINFEYNEIKSGYGKCSLIHCQKQKSQHCVKIYKFPKSQQQQERWSHNLRIQYDPARPWKFLICSDHFEEQCVSKRKLYRWAVPTLNLGSNVPATLFTNEECKVLCAGFGSSSDDSENDDQSGNEIKFNHSVSVCKTETKSTFDKTSTSPNVFYNSPPKHNFAYKTKICCLPHCRRPRGEGVKLFRLPTSVHLIRKWEYNTGIAFKESQRNTKLICSDHFPPSLIGIRRLSKNAVPTRNLGPNAPQNPHKNKIVETNEESDDIVDRETGTKELEIAEVDTLENSEDFDDDEKLFDPVSFRSHWQMSNFVKQEHNDWQSNTDLRLPPKNMTYKQHHCCVPHCGLARRGNLKLFRVPLDSNLRAKWEKNLRMVFDINDRNINLVCSQHFEPELIGATRLARGSVPTLNLKPIDATSSCTTHSSVLVFCNVPGCDNTNAHQNITIFDTYMREPDILNKWCENLNIDANNPTIKEGGYNICSAHFEPQCFTPQGNLHKWALPTLKLGHEVEAKHQQCCVPKCQGSKGLNTTMFSFPTDEFLRKKWCKALQIDEKNIFLNEKVCKSHFADDNLNGYNLKPGAIPCLNVCTSQGHCIVKNCETVIGLINFPENRNTAAKWCHNLKIEPLPKLHLHQDYKVCRRHFEPDCFRLGNLYPDAVPSLHLGHGDPNIFLNEDMSSNNYFESGSVQTYSNSCETETIIKKEPVGIDEVSIHAWQAHSWYTPEEDMVLGGS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-