Basic Information

Gene Symbol
-
Assembly
GCA_035047445.1
Location
JAWNPP010000016.1:13633944-13649072[+]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 29 9 1.6e+04 -11.1 8.2 56 72 32 44 10 60 0.40
2 29 5.1e-15 8.9e-12 45.4 4.3 1 86 609 681 609 682 0.85
3 29 2.9e-15 5e-12 46.2 4.6 1 87 709 778 709 778 0.83
4 29 6.3e-16 1.1e-12 48.3 0.4 1 87 800 872 800 872 0.84
5 29 6.3e-16 1.1e-12 48.3 5.2 1 86 978 1047 978 1048 0.82
6 29 6.2e-15 1.1e-11 45.2 3.2 1 86 1072 1143 1072 1144 0.82
7 29 4.4e-13 7.7e-10 39.2 0.9 1 87 1179 1247 1179 1247 0.82
8 29 9.9e-11 1.7e-07 31.7 1.4 1 86 1300 1369 1300 1370 0.76
9 29 6.1e-16 1.1e-12 48.4 0.1 1 86 1397 1466 1397 1467 0.83
10 29 2e-13 3.5e-10 40.3 1.5 1 86 1488 1557 1488 1558 0.81
11 29 7.4e-15 1.3e-11 44.9 1.7 1 86 1585 1656 1585 1657 0.85
12 29 5.6e-14 9.8e-11 42.1 2.6 1 85 1728 1796 1728 1798 0.82
13 29 3.8e-12 6.7e-09 36.2 0.1 1 86 1821 1889 1821 1890 0.81
14 29 8.3e-14 1.4e-10 41.6 1.1 1 87 2041 2110 2041 2110 0.80
15 29 2.7e-12 4.7e-09 36.7 0.2 1 85 2193 2257 2193 2267 0.79
16 29 0.037 64 4.2 0.0 1 58 2294 2344 2294 2359 0.75
17 29 2.9e-12 5.1e-09 36.6 0.7 1 86 2383 2452 2383 2453 0.83
18 29 7.4e-15 1.3e-11 44.9 2.2 1 86 2518 2587 2518 2588 0.84
19 29 1.4e-12 2.5e-09 37.6 0.5 1 86 2623 2694 2623 2695 0.81
20 29 3.7e-13 6.4e-10 39.5 0.4 1 87 2705 2777 2705 2777 0.82
21 29 1.8e-13 3.1e-10 40.5 0.0 1 86 2806 2879 2806 2880 0.79
22 29 0.0001 0.18 12.4 0.0 1 57 2913 2962 2913 2974 0.81
23 29 1.1e-14 1.9e-11 44.3 0.1 1 86 3002 3074 3002 3075 0.80
24 29 1.9e-14 3.4e-11 43.6 0.4 1 86 3246 3318 3246 3319 0.85
25 29 2.1e-13 3.7e-10 40.2 1.6 1 86 3379 3449 3379 3450 0.81
26 29 7.7e-13 1.3e-09 38.4 5.1 1 86 3545 3615 3545 3616 0.85
27 29 4.7e-13 8.2e-10 39.1 0.0 1 87 3693 3763 3693 3763 0.85
28 29 2.6e-09 4.6e-06 27.1 0.4 1 58 3782 3831 3782 3848 0.81
29 29 3.1e-10 5.4e-07 30.1 0.6 18 87 3848 3906 3836 3906 0.75

Sequence Information

Coding Sequence
ATGTCACAACATAACAATCTCCCTCCGCATCATCATCACTactaccagcaacagcagcagcagcaacaactacagcaacagcaacaacatcaccaccaccatcaacaacaacaacaacaacatcaccagcagcagcaacagcagcagctacaacataaacaaatacagcagcaacacagtTGGTACTCACATGTTGCTTCCTACCCTCCCCACCATCCGCATGCCGCCGCAGCCTTTGCGACGCCCTgcaaagccagcaacaacaataacaacaacaacaacaacagcattatGAATGCATACGGCACAGGAGTTGTTGCAAGTGGCACGCAGGCAACATATTATGGGGCAGCAGCTGGTGGTGGGGTGGGATATAACCTTGAGGCCAATACTGTTGCCTATGCGCACAACCAGCTGCTGCagtaccagcaacaacaacaacaacagcagcagcatcagcagcagcagcaacaacaacaccagcttAACCAACGTTCGTATATGTCGCATGGAGTAATGCACAGCTCTTATCCCTACATCAAGAGCGAACCATTGGAGTTGCCGGATGATAGACAGCgtcagccacaacaacaacagcagcaacatcatcagcatcaccaacaacagcagcagcagcaacatttccAAACCCCTATGGCACCGCCGCCAGCTCCCGCCAATCGTCATAGTCTCGATGCCAGCGGcgaaatgataataaaatCGGAACCCATTGACGAACATGCCTACAAGTCTAACTATATCGATGATAATACGCCCTTCGTTGACTTTAGTAAATATCCCGAATTCGGTGACGACATGCTGAGTCCCAAGGTCGAGTTAACCGTCAAGGATGAGGCCTATGGCAGTCAGAAAAACCCGCTAAGCTATCCGCGGCGCAAGCTGCAATCGGATCGCTCAACGGAAAGTCTTCCCATTTGCCAGCGTTGCAAGGAGGTGTTCTTCAAAAGACCCATCTACTTGCGGCATGTGGCCGAGAGCAGTTGCAACATACAGGAGTATGACTTCAAGTGCAGCCTCTGCACCATGTCCTTCATGACCAACGATGAGCTGCAGAAACACAAGCATCTCCATCGAGCGGACAAGTTCTTCTGCCACAAATACTGTGGCAAGTACTTTGACACGATTGCCGAATGTGAATCGCATGAATACATGCAACACGAATATGAAAACTTTGTGTGCAACATGTGTTCCATGACGTTTGCCACACGGGAACAACTCTATGCCCATTTGCCGCAGCACAAATTCCAGCAGCGTTACGATTGTCCTATTTGCCGGTTGTGGTATCAAACCGCTTTGGAGTTGCACGAACATCGTCTAGCGGCGCCGTACTTCTGCGGCAAGTATTATCCACCCgctcatcagcaacaacagcagcagcagcagcagcagcaacacgcacagcatcagcaacagacCAACTATAAACTGCAGGACTGTCACATGGGCACCATGGAAATGCCAACGTCGCAGCATAaggcaaatgcagcagcagcagcagtcaacgCATtgccggcaacagcagcgctcagctcgttgttgcagcagcgTCAAGCGAATGCCGACGGGGCCGCATTGTTTGCCTCAACGCTGAAAAACGAGGCGAACGTGAAGCTGGAGCGGAGCTATAGCAATTCAACGAGCGAGTCCGGTTACAGTCTGCAcgacagcagctacaacaatgcCTACGGCAGCGACACCTCGATACATGCGTCAGCtggaggcggcggtggtggcggaaCTGGTGGCGCAATTGGCGGTCCACAGGCGCACTCCTCGACGCTGGACGATTCGGAGGACGCACTCTGCTGTGTGCCGCTGTGCGGTGTGCGCAAGAGCACCAGCCCAACGTTGCAGTTCTTCACGTTTCCCAAGGACGAGAAGTATCTGCATCAGTGGCTGCACAATCTCAAGATGTTCCACATTCCGGCATCGAGCTATGCGACCTTTCGCATCTGCAGCATGCACTTCCCGAAGCGTTGCATCAATCGCTACTCGTTGTGCTATTGGGCGGTGCCCACTTTCAATCTGGGCCACGACGATGTCGCCAATTTGTATCAGAATCGCGAGCTGACCAACACCTTTACCACCGGCGAAGTGGCACGCTGCAGCATGCCCAACTGCACCAGCCAGCGGGGCGAGAGCAATCTCAAGTTCTACAATTTTCCCAAGGATATCAAGAGTCTGATCAAATGGTGCCAGAATGCCCGTCTCCCCGTCCAGGCCAAGGAGCCGCGTCACTTCTGCAGTCGCCACTTTGAGGAGCGCTGCATTGGCAAGTTCCGACTGAAACCGTGGGCGGTGCCCACGCTACATTTGGGCGCCCAGTACGGCAAGATCCATGACAATCCCAAGAATCTGTACGTGGAGGAGAAGCGTTGCTGCCTTAACTTctgtcgtcgcagtcgctcgTCCGACTTCAACATGTCGCTGTATCGCTTTCCACGGGACGAGGTCCTGTTGCGACGCTGGTGCTATAATCTGCGGCTCGATCCGTCGGTCTACCGTGGCAAGAATCACAAAATATGCAGCGCTCACTTTATCAAAGAGGCTTTGGGACTGCGCAAACTATCGCCGGGTGCTGTTCCCACGCTGCACTTGGGACACAACGACACCTTCAACATCTACGAGAACGAGCTGTGGCCGCCACCGACGCCCTCCACGCCCACCAatcaccaccatcagcagcagctgcagcagcatcagctgcaacagcaccagcaacagcagcagcaacatccaCAGCACCACCACAGTCACCACAAATACCAGCGTCATTCGGCGGCTTCCACATCCTCGTCAGCCAGCTCCTCGCACTACGTGGATGCTGGCGACATGAGTGGCTCCTACTTGGGCATGGGCAACTCGGGCGGCTCCTCGTCCGGCCTGAATGTGAGCGACAGCATGGACGTGTGCTGTGTGCCCAGCTGCGAGAGCAAGCGgcacaacaacgagaacatcACATTCCACACGATACCGCGCAGGCCCGAGCAGATGCGCAAGTGGTGCCACAATCTCAAGATACCCGAGGACAAGATGCACAAGGGCATGCGCATCTGCAGCCTGCACTTTGAGCCCTACTGCATTGGCGGCTGCATGCGTCCGTTCGCGGTGCCCACGCTGCAGCTGGGCCACGACGACGAGGACATTCATCGCAACCCCGATGTGATCAAGAAGCTGAACATTCGGGAGACCTGCTGCGTGGCTGTCTGCAAGCGCAATCGCGACCGCGATCACGCCAATCTGCATCGCTTCCCCAGCAATGTGGCGCTGCTGACCAAGTGGTGTGCGAATCTGCAGCGGCCCGTGCCCGACGGCAGCAAGCTCTTCAACGATGCCATCTGCGAGGTGCACTTCGAGGATCGCTGTCTGCGCAACAAGCGGCTGGAGAAGTGGGCGGTGCCCACGCTGATACTGGGCCACGACAACATTGCGTATCCGCTGCCCACCGCCGAGCAGGTGGCCGAGTTCTATGCGCGTCCCAGTGCGCCCAACAATGGCGAGGAGCAGGGCGAGTGCTGTGTGGAGACGTGCAAGCGCAATCCCAGCGTGGACGACATCAAGCTCTATCGCCCGCCCGAGGAGTCGCAGGTGCTGGCCAAATGGGCGCACAATCTGCAGCTGGACGCCGCGAAGCTGACTAGCCTGCGCATCTGCAATCTGCACTTTGAGGCGCACTGCATTGGCAAGCGCATGCGACCCTGGGCCATACCGACGCTCAATCTGGCGACGAACGTTGAGAATCTCTACGAGAATCCGGAGCATCAGATGCTCTACAAGCGACGCACGCATCTCAATGCGAATCGTGGCGCTGGCCGCGAGgcgggcggcggtggcggtggcggcggcggcggcgtgaAGCCGACGTGGGTGCCgcgctgctgcctgccgcaCTGCCGCAAGGTGCGTGCGCTGCACAACGTGCAGCTGTATCGCTTCCCCAAGCTCAATCGCTCCACGCTGGCCAAGTGGGCGCACAATCTGCAGGTGCCGCTGGTGGGCAGCGCCCAGCGACGCCTCTGCTCGGCGCACTTTGAGCCCCACGTGCTCAGCAAGAAGTGCCCGGTGCCCTTGGCCGTGCCGACGCTGGATCTGAACTCGCCGCCCGGCTACAAGATCTACCAGAATCCCGCCAAGCTCAAGGCCAACAAGCTGTGCCTGCAGCGCGTCTGCATTGTGGAGAGCTGTCGCCGCCAGCGTGGCCAGGGCGTGCAGCTCTTCCGGCTGCCGCACAATCCCACGCAGCTGCGCAAGTGGATGCACAACATCAGGATGCGGCCCAGGGGCGCCATGCGGCAGCAGTATCGCATGTGCTCCATACACTTTGAGACGCACTCGTTCAACGGCAAGCGGCTGAGTGCGGGCGCGATTCCAACGCTGGAGCTGggccacgacgacgacgacatctATCCGAATGAGGCGCAGTCGTTTGTCGAGGAGCACTGCACCGTCGAGGGCTGCGAGGCGTCCAAGGAGCAGCCCGAGGTGCGTCTCTTCCGCTTCCCCACCGACGACGAGGATCTGCTGTGGAAGTGGTGCAACAATCTCAAGATGAACCCCGTCGACTGCATTGGTGTGCGCATCTGCAACAAGCACTTCGAGCCGGACTGCATCGGGCCCAAGCATCTCTACAAATGGGCGATTCCCACACTGCAGCTGGGCCATGACGACGAGCAGATCGAGCTGATCGACAATCCCAAGCCCGAGGAGCGCTACGTGGACCCCGTGTTCAAGTGCTGTGTGCCGACGTGCGGCAAGACGCGCAAGTTCGATGAGGTGCAGATGAACAGCTTCCCCAAGGATCCGAGCATGTTCCAGCGCTGGCGGCACAATCTGCGGCTGGAGCATCTGAGCTTCAAGGAGCGCGAACGCTACAAGATCTGCAATGCGCACTTCGAGGACATTTGCATTGGAAAGACGCGACTGAACATTGGCTCGATACCCACGCTGGAGCTGGGCCACGAGGAGACTGAGGATCTGTTCCAGGTGAATCCCGAGGAGCTGCAGAGCAATCTCTTTGGGCGCCAGCGACGCGTGAACTCAGCTCTCAGCGTCAGCATCAAGCAGGAGGAGAACTCCGAGCTGGACGAGGACATCAAGCCCGACTTGGCGGACAACAATGCCTCACAGGCGCGGACCAAGCGTTCCCTGGCGGATTCCAAGTGCTGTGTGCCCGACTGCGGGCGCAGTCGCCTCGAGCACGGCGCCCGCCTGTTTCCCTTCCCCaacggcaagcagcagcagagcaagtGGCGCCACAATCTGCGCCTGCAGCCGCACGAAGTGGACCGCAGCTCACGCGTCTGCAGCGCGCACTTCAATCGTCGCTGCATCGATGGCAAGCAGCTGCGCGGCTGGGCCATGCCCACCCAGCAGCTGGGCCACCAGGAGCTGCCCATCTATGAGAATCCAAAGAACATACCGGGCTTCTTCACGCCCACCTGTGCGCTGGCCCAttgtcgccgccgtcgcagcATTGACAACGATCTGCGCACCTATCGCTATCCGCGCAGCGAGGATCTGCTGGAGAAGTGGCGCGTTAATCTGCGACTGGCGCCGGATCAGTGTCGTGGTCGCATCTGCGCCGACCACTTCGAGCCCATGGTGCGTggcaagctgaagctgaagacGGGCGCTGTGCCCACGCTGAAGCTGGGCCACAGCGAGGGCGTCGTGTTCGACAATGAGGCCATCAAGGCGGGCCTGCAGCAGGAGGCAGAGGAGGGCGGCGATCACGAGACCAGCCTGGAATCGCTGGTGAAAGTGAAGCAGGAGAAACTGGATCCAGACGAAGAGCCAGCTGAGAATGCGCAACAGGAGcaggacgaggacgatgagcAGGCGGATCACGGCTACTTCGATCCGCTCGAGCTGGTCGAGACCTTTGCGGAGCAGCACAGCGCCGAAGACGATGAGGAGGGCAACGAACACGATctcgacaacgacgacgacgacgacgacgatgtggaCGAAGACGAGGACGAGcccggcgacgacgatgagctgctgctgccggacACGCCGCCGGTGAAGCAGCTGCCACGTCCGCCTGTCGTGTTGCCGCCGCGTCGCGAGAAAGCCGTGAACAATGTGACGCCCATCTGCTGTCTGAGGCACTGTCGCAAGGAGCGCACAGCCAGCCACCAGCTGAGCACCTTCGGCTTCCCCAAGGaccgccagcagctgctcaagtgGAGCGCCAATCTGCAGCTCTCGCTGGACGATTGCGTGGGACGCGTCTGCATCGAGCACTTTGAGTCAGAGATGCTGGGCACACGCAAGCTGAAGCAGCACGCGGTGCCCACGTTGAATCTGGGGCATGCAACGCCGCTGAGCTACAGCTGCAATGGGCAGGCGTTGAGCATCTACGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGTCGCAAACGGAAGCATCCAACGGAGCCGCCGGATcacaagcagcagaagaagcagcagcagcagctggatcAGACAgtaacagcgacaacgacagcgacaacgacagcgacagcgacagcgatgaCGAGTAAGCGACGCTGCTGCCTGCCCAGCTGCGgcaagcagccgcagctccACGGCGtccagctgcagcggctgcccAGCAATCGCATCCAGCTGCGCAAGTGGCTGCACAATCTCAAGCTCTCGCCCACGCTGGACACCAGCCAGGCACGTCTCTGCAGCGAACACTTTGAGCCCGAGCTGCAGCACGCCGAGGAGGCGGTGCCCACGCTGCGACTCGGCCACGACGACACGCACATCTATCGCAATCGTGGCACtgctgccaacagcagcagcagcagcctcgcCGCCGCCAGCTCCACGGCCACGTCGAGCGCCTGCCTGGTGGCCAGCTGCCCCTGTGCGCGTCTCAATCTCTATCGCTGCTACGATCTGCCCGAGCATCGGCTGGTGCAGCAGGCCTGGCTGCagtggctgcagctgccgctgccgcagcaggCCAGCGACGGCAAGCTGTGCGTCATGCACTACATGCAGCTCTACGAGCAGGTGCCGCTGCCCGACGAGCTGCCCGGCTCGgtgctgcggcagctgcaggagACCTACGATCTCATTGCCAGCTCCACGATGGCCATGAAGCTGCGCTGCGCCGTGCCCGGCTGCTACTCGAAGTACACGGACAACATACGACTGACCAAGCTGCCCATGTGCCCCGAGATGTGCGCCAAGTGGGTGCACAACACCAAGATCAGCTACGACGCGACGCGTCACTACGTCTATCGCATCTGCATGCTGCACTTTGAGTCGCGCTGTTTGGGCCCCGTGCGTCCAAAGCTGTGGGCGGTGCCCACGCTGCAGCTGAACCATCAGGACGCCAGCATCTATCAGAATCCGAAGCCGGATGCGCTGTCGACGTCTGCGCCGGTGCCCATTGCCATGTCCCTCGCCTCGGTGCCCGTGGAGCTGCCGCTGCGCATCAAGACGGAGCTGGCGTTCAGCGGCAGTCCCAGCGCCAGTGCGAGTCCCAGTCCGCGTGGCAAGCTGCGCTTCTGCTGCATACCCAGCTGCCTGCAGCAGGCCACAACGCAGACGCGCCTCTTTCGCTTCCCCAGCGCCGAGACGACGCTGCTCAAGTGGCTGGTGAAcacgcagcaacagccgcgTCTGGTGGACACGCAGCAGCTGTTCATTTGCCAGGATCACTTCGAGCCGGAGGCCATCTGCAAGAAGCAGCTGCGCAGCTGGGCGGTGCCCACGCTGAAGCTGGGCCACGACGGCCACGTGATACCGAATGCACGCCACAATGGCAACATTGCCGACAGCCAGGAGAACAAGCAGACGCTGCAGTACATCTGGGAGAACTACTGCTCGGTGCTGAGCTGCTTCCAGCCGCGCGGCGAGCAGCTGCGTCTCTACGCATACCCCACGGACCGACCCACCATACGCAAGTGGGCCGCCAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGACGGCTTCCAGgTCTGTCAGCTGCACTTTGCGCCGCACTGCTTTGACCGGGAGACGGGCGAGTTGAAGGAGGATGCGGTGCCCACGCTGGAGCTGAGTCGCTGCCTCAACGACGTGCGCTGTGTGGTCGTCGGCTGCGTGAAGGACGAGGACGGGCCGCGTCAGCGCTTCTACAAGATGCCCAAGCGCAGTGcgcagctgctcagctggTGCCACAATCTGCGCCTGGACGCCGCAGCCATGAGCAGCGGCGAGCATCACGTGTGCGATCGTCACTTCGAGGCGCAGTGCCTCAATCAGCAGAAGCTGCTGCGACCAGGCGCACGTCCCACGCTGCACCTGGGCCACGACGCTGCCATTGACCTCATGCCCAATCCAGCCGAATGGGAGGCAGCTGAAGCGACGCCCGCCGTCGATCTGGTGTGCTGTGTGCCCAACTGTGCGCTGAGCAGGGACGAGGACGCGGATGTGCAGCTGTTTGCCTTCCCCAAGCTGCGCCTGCTCGCCGAGAAGTGGCTGCAGAACATACGGCTCGAGCATCTAGGCCGCGAGCAGCTGATGCGTCTGCGCATCTGCGGCGCACACTTCGATCCCGGCTGCCTGGAGGGCAGCGGGCGGCCGCAGCTGGGCGCCATGCCCACGCTGCAGCTGGGCCACGAGGAGCGCGACAACATACATCGCAGCACCGATGCCGCCGCCGTGAAGGCGAAGAAGTTCTGCAATCGCAGTGGCTCCAGCTACGACTGCTGCTATCCGCAGTGCGTGGAGCTGCAGAAGAGTTATCTGCGCATCAGCTACGAGCTGCCGCAGTCAGAGGCGCTGCGTCTCAAGTGGCTGGAGTACATGGGCATCGAGGAGACGGACGAGAAGCCGCTCAAGCTCTGCCCCCTGCACTTGGTGCTGCTCTACGATCACAGCGTGCAGCACTTGGCCGCAGACCACGCgcccgagcagctgctggacgCCAACTACGAGGACGCCCGGCACAGCGTGCGCCTGCGTGTCATCAGCTGTGCCGTGCCAGGCTGCCGCACGCTGAAGCCGCGCGACGGCGGCATATTGCACGGGCTGCCGCAGCGTCGCGATGTGCTCGAGATGTGGCTGCACAACATGCAACTGGTGTTCTACGAGCAGCAGCGCTACATGTACAAGGTGTGCAGCCGGCACTTTGAGCCCAGCTGCTTCATGGACACCACGCGGCGCCTGAAGCCGTGGACAATGCCGACGCTGGAGCTGCCGCCGCGTGCTGACGACGAGGCGCCCATCTACGACAATCCCAGCGAGTTGGAGTGGCAGCGCATGAACGAGCTGCTGGcggtggagcagctgcagctgcagcagcagcagcagcaggaacagccAGAGGAGCTGTGCGATCTGCTGGAGCCCATTGTGAAGATGGAGCACATCGACAGGGACGACGAGGAGTACGaagagcaggagcaggagccaGAGCAGGAGCCAGAGCAGGAGCAAGAGCAGGAGCCAGAGCAAGAGTTGCAGCCCGACTTCGATGGCGACTACGACAACTCGCAGCAGCCGCTGGCGCTCGAGGTGCTGCTCGAGGTCGGCCACGTGGAGAAGTGCACCACGTACGAGCAGATGGACAACGAGGCCAACCTCGGCTatgccgagcagcagcaacagcagcagctgctcagcgaCCCCCCGCCCCGTGCTGGCTACGGAGCAGCCGCAAGTGGGCAGCTGGGCAGCAATGGTTTCAAGTACACGGCGCGGCATTGCAGCGTGCGTGGCTGCGATGTGACGGCCAACGATGTGAACGGCAGCATTAAGCTGCACAAGTTCCCCACCTCGCTGGACGCCATGGAGAAGTGGAAGCACAACACGCAGGTGGACGTCGATGTGAATTTCTCGTGGCGCTTTCGCATCTGCAGCTATCACTTCACCGACGAGTGCTTTCACGGTGCGCGCATCAAGCGCGGTGCGATGCCCACGCTGAGCTTGGGACCGCGTCGGCCGGCCAAGATCTATGACAACGAGTTCAACGCCACGCAGCCGCTGGAAGCGGAACAGGAGACGGGTGAGGAGCTGCTGCCCAAGCGCACAAAAGGCGGCGGTGACGGCGAGATTAGTCTGCGTTTGCCGGAGCCGGCGCCGCCGCGCAAGTCGAGCAAATTCTGCCAGATCGATGGCTGCCCCAATCATCTGACCAGCGAGAATCTGACGCTGCACAAGTTCCCGCACTCGGTGGACATGTGCGCCAAGTGGCAGCACAACACGCAGGTGCCCTTCGATCCCGACTACCGTTGGCGCTATCGCATCTGCAGCGCCCACTTTGAGCCCATCTGCCTGTTGAATATGCGGCTGATGCACGGCAGCGTGCCCACCTTGAAGCTGGGACCGCGTGCTCCGCGTCAGCTCTTTGACAGCGACTTCGAGGCCATCAACTTGCGGCTGGACAAGCAGAAGAACAGCTCCGAGCAGCAACTGCCCATCAAGCAGGAGCACGACGGAGACGTCGaagcggaggaggaggacgaggcgGAGCTGAGCTATCTGGTGCCCGAGATGCAGCTGCACGAGGACACCGATCAGCTGCAGGAGCTGCGCTTGCCCCACATCaagcaggaggagcaggagacCAGCTACAATCCGGTCAAGTCGGGCTACGACAAGTGCTCGCTGGTGCATTGCCAGCGCCAGCGTTCGCAGCACGGTGTGCACATCTACAAGTTTCCACGCTcccgccagctgcagcagcactgGATGCACAATCTGCGCATCAAGTACGACGAGCGCCGGCCCTGGAAGACGATGATCTGCAGCGTGCACTTCGAGCCCAACTGCATCAGGCTGCGCAAGCTGTGCGCGTGGGCGGTGCCCACGTTGGAGCTGGGCGACAATGTGCCGCTGGAGATCTACACGAATGAGCAGAGtcgccagcagctggaggcggGCAGCGATTGCGAGGACATGCCGCTGGAGGATGCCTACGAGGACGAGGATTACGACGACGACTTGGCCGAGCAGCTGGCCAATGAGCCGCTGGTGAAGCGCGAGCGTCGCTCACGTTTCGATCCCCTGCCGCCGGGTCAGCTGCCGCCCTGGAAGATCAAGGTGTGCTCCCTGCCCTACTGCCGCAGTCCGCGTGGCGATGGCATCAAGCTCTTCAGGCTGCCCAACAACGTCAGCTCCATACGCAAGTGGGAGCAGGCGACGGGCATGCGCTTCACTGAATCGCAGCGCAACACGAAGCTCATCTGCAGTCGTCACTTTGATCCGCAGCTGATTGGCGTGCGTCGGCTCATGTACAATGCGGTGCCCACCCTGAATCTGGGTCCCATGAGTGCCGAGAGCTCTGCTGTGCAGCCTGCGGCTGGACCTCGCTGCTGTATGCCCGACTGCCAGGCGGAGGGCAAGGCTGGCAAGCTGCACAAGTTTCCCAGTGATCCCATGCTGCTGCATCAGTGGTGTCATGCGCTGAATCTCGCGGACATTCAACGCTATCGCGGCAAGCACATCTGTGCCCAGCATCTGCCCGACAAGACGCCCAGCTGCATTGTGTGCGGCATggagcagttgcagttgccgatGCTCGACTTTCCGGAGAATCGCAATCTGCGCGCCAAGTGGTGCTATAATCTCAAAATCGAGCCCATTGCCAAGTGGGACAACTCAAAGCAGATCTGCAGCAAGCACTTTGAGAGCTATTGTTTCACGCAGCCGGGTGAACTGCAGCCGGAGGCGGCGCCCACGTTGCATTTGCGGCACAACGATAGCAATATATTCCTAAACGACTATGCCATAACAGATCAGAGCAAGATGCTGCGCATCAAGGATGAGCCGCTGGACAGCGATGATCTGATGCtgtaa
Protein Sequence
MSQHNNLPPHHHHYYQQQQQQQQLQQQQQHHHHHQQQQQQHHQQQQQQQLQHKQIQQQHSWYSHVASYPPHHPHAAAAFATPCKASNNNNNNNNNSIMNAYGTGVVASGTQATYYGAAAGGGVGYNLEANTVAYAHNQLLQYQQQQQQQQQHQQQQQQQHQLNQRSYMSHGVMHSSYPYIKSEPLELPDDRQRQPQQQQQQHHQHHQQQQQQQHFQTPMAPPPAPANRHSLDASGEMIIKSEPIDEHAYKSNYIDDNTPFVDFSKYPEFGDDMLSPKVELTVKDEAYGSQKNPLSYPRRKLQSDRSTESLPICQRCKEVFFKRPIYLRHVAESSCNIQEYDFKCSLCTMSFMTNDELQKHKHLHRADKFFCHKYCGKYFDTIAECESHEYMQHEYENFVCNMCSMTFATREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYPPAHQQQQQQQQQQQHAQHQQQTNYKLQDCHMGTMEMPTSQHKANAAAAAVNALPATAALSSLLQQRQANADGAALFASTLKNEANVKLERSYSNSTSESGYSLHDSSYNNAYGSDTSIHASAGGGGGGGTGGAIGGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYATFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPSVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPTNHHHQQQLQQHQLQQHQQQQQQHPQHHHSHHKYQRHSAASTSSSASSSHYVDAGDMSGSYLGMGNSGGSSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDNIAYPLPTAEQVAEFYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDAAKLTSLRICNLHFEAHCIGKRMRPWAIPTLNLATNVENLYENPEHQMLYKRRTHLNANRGAGREAGGGGGGGGGGVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPLVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGYKIYQNPAKLKANKLCLQRVCIVESCRRQRGQGVQLFRLPHNPTQLRKWMHNIRMRPRGAMRQQYRMCSIHFETHSFNGKRLSAGAIPTLELGHDDDDIYPNEAQSFVEEHCTVEGCEASKEQPEVRLFRFPTDDEDLLWKWCNNLKMNPVDCIGVRICNKHFEPDCIGPKHLYKWAIPTLQLGHDDEQIELIDNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPSMFQRWRHNLRLEHLSFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHEETEDLFQVNPEELQSNLFGRQRRVNSALSVSIKQEENSELDEDIKPDLADNNASQARTKRSLADSKCCVPDCGRSRLEHGARLFPFPNGKQQQSKWRHNLRLQPHEVDRSSRVCSAHFNRRCIDGKQLRGWAMPTQQLGHQELPIYENPKNIPGFFTPTCALAHCRRRRSIDNDLRTYRYPRSEDLLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHSEGVVFDNEAIKAGLQQEAEEGGDHETSLESLVKVKQEKLDPDEEPAENAQQEQDEDDEQADHGYFDPLELVETFAEQHSAEDDEEGNEHDLDNDDDDDDDVDEDEDEPGDDDELLLPDTPPVKQLPRPPVVLPPRREKAVNNVTPICCLRHCRKERTASHQLSTFGFPKDRQQLLKWSANLQLSLDDCVGRVCIEHFESEMLGTRKLKQHAVPTLNLGHATPLSYSCNGQALSIYDAQPQHSVFRLWSLKHCRKRKHPTEPPDHKQQKKQQQQLDQTVTATTTATTTATATAMTSKRRCCLPSCGKQPQLHGVQLQRLPSNRIQLRKWLHNLKLSPTLDTSQARLCSEHFEPELQHAEEAVPTLRLGHDDTHIYRNRGTAANSSSSSLAAASSTATSSACLVASCPCARLNLYRCYDLPEHRLVQQAWLQWLQLPLPQQASDGKLCVMHYMQLYEQVPLPDELPGSVLRQLQETYDLIASSTMAMKLRCAVPGCYSKYTDNIRLTKLPMCPEMCAKWVHNTKISYDATRHYVYRICMLHFESRCLGPVRPKLWAVPTLQLNHQDASIYQNPKPDALSTSAPVPIAMSLASVPVELPLRIKTELAFSGSPSASASPSPRGKLRFCCIPSCLQQATTQTRLFRFPSAETTLLKWLVNTQQQPRLVDTQQLFICQDHFEPEAICKKQLRSWAVPTLKLGHDGHVIPNARHNGNIADSQENKQTLQYIWENYCSVLSCFQPRGEQLRLYAYPTDRPTIRKWAANCKHRSMQASSDGFQVCQLHFAPHCFDRETGELKEDAVPTLELSRCLNDVRCVVVGCVKDEDGPRQRFYKMPKRSAQLLSWCHNLRLDAAAMSSGEHHVCDRHFEAQCLNQQKLLRPGARPTLHLGHDAAIDLMPNPAEWEAAEATPAVDLVCCVPNCALSRDEDADVQLFAFPKLRLLAEKWLQNIRLEHLGREQLMRLRICGAHFDPGCLEGSGRPQLGAMPTLQLGHEERDNIHRSTDAAAVKAKKFCNRSGSSYDCCYPQCVELQKSYLRISYELPQSEALRLKWLEYMGIEETDEKPLKLCPLHLVLLYDHSVQHLAADHAPEQLLDANYEDARHSVRLRVISCAVPGCRTLKPRDGGILHGLPQRRDVLEMWLHNMQLVFYEQQRYMYKVCSRHFEPSCFMDTTRRLKPWTMPTLELPPRADDEAPIYDNPSELEWQRMNELLAVEQLQLQQQQQQEQPEELCDLLEPIVKMEHIDRDDEEYEEQEQEPEQEPEQEQEQEPEQELQPDFDGDYDNSQQPLALEVLLEVGHVEKCTTYEQMDNEANLGYAEQQQQQQLLSDPPPRAGYGAAASGQLGSNGFKYTARHCSVRGCDVTANDVNGSIKLHKFPTSLDAMEKWKHNTQVDVDVNFSWRFRICSYHFTDECFHGARIKRGAMPTLSLGPRRPAKIYDNEFNATQPLEAEQETGEELLPKRTKGGGDGEISLRLPEPAPPRKSSKFCQIDGCPNHLTSENLTLHKFPHSVDMCAKWQHNTQVPFDPDYRWRYRICSAHFEPICLLNMRLMHGSVPTLKLGPRAPRQLFDSDFEAINLRLDKQKNSSEQQLPIKQEHDGDVEAEEEDEAELSYLVPEMQLHEDTDQLQELRLPHIKQEEQETSYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQQHWMHNLRIKYDERRPWKTMICSVHFEPNCIRLRKLCAWAVPTLELGDNVPLEIYTNEQSRQQLEAGSDCEDMPLEDAYEDEDYDDDLAEQLANEPLVKRERRSRFDPLPPGQLPPWKIKVCSLPYCRSPRGDGIKLFRLPNNVSSIRKWEQATGMRFTESQRNTKLICSRHFDPQLIGVRRLMYNAVPTLNLGPMSAESSAVQPAAGPRCCMPDCQAEGKAGKLHKFPSDPMLLHQWCHALNLADIQRYRGKHICAQHLPDKTPSCIVCGMEQLQLPMLDFPENRNLRAKWCYNLKIEPIAKWDNSKQICSKHFESYCFTQPGELQPEAAPTLHLRHNDSNIFLNDYAITDQSKMLRIKDEPLDSDDLML

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00601834;
90% Identity
-
80% Identity
-