Basic Information

Gene Symbol
-
Assembly
GCA_037075305.1
Location
JBAMCE010000577.1:7846087-7860149[+]
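
The location string above packs contig, start, end and strand into one field. A minimal sketch of how it could be split apart, assuming the layout is always `contig:start-end[strand]` and that coordinates are 1-based and inclusive (neither is stated in this record; `Location` and `parse_location` are hypothetical names used for illustration only):

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], strand being "+" or "-".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("JBAMCE010000577.1:7846087-7860149[+]")
# Genomic span of the locus, under the 1-based inclusive assumption.
span = loc.end - loc.start + 1
print(loc, span)
```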

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
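
The C2CH consensus quoted in the description can be written as a regular expression. The sketch below is only a rough illustration of that spacing rule, not the profile HMM (PF05485) used to produce the hits in this record; `find_c2ch` is a hypothetical helper, and the test sequence is synthetic. A plain spacing pattern like this will both miss divergent domains and match unrelated cysteine-rich stretches.

```python
import re

# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His,
# wrapped in a lookahead so that overlapping candidates are also reported.
C2CH_RE = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) spans that fit the C2CH spacing."""
    spans = []
    for match in C2CH_RE.finditer(protein.upper()):
        # match.start(1) is 0-based; match.end(1) is exclusive, so it already
        # equals the 1-based inclusive end position.
        spans.append((match.start(1) + 1, match.end(1)))
    return spans


# Synthetic sequence built to satisfy the spacing rule exactly once.
toy = "M" + "C" + "AA" + "C" + "A" * 40 + "C" + "AA" + "H" + "K"
print(find_c2ch(toy))
```

Running the same function over the protein sequence of this record would give candidate spans that could be compared with the `ali coord` ranges reported by hmmscan.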
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 29 4.1 5.1e+03 -1.6 2.4 49 64 341 360 320 375 0.57
2 29 7.5e-15 9.4e-12 45.6 4.3 1 86 566 638 566 639 0.85
3 29 4.7e-15 5.8e-12 46.3 4.6 1 87 666 735 666 735 0.83
4 29 1.1e-15 1.4e-12 48.2 0.4 1 87 757 829 757 829 0.85
5 29 1e-15 1.2e-12 48.4 5.7 1 87 926 996 926 996 0.83
6 29 1e-14 1.3e-11 45.1 3.3 1 86 1020 1091 1020 1092 0.82
7 29 7.2e-13 8.9e-10 39.2 1.0 1 87 1127 1195 1127 1195 0.80
8 29 1.4e-10 1.8e-07 31.9 1.2 1 86 1244 1313 1244 1314 0.76
9 29 7.7e-15 9.5e-12 45.6 0.1 1 86 1341 1410 1341 1411 0.81
10 29 9.1e-14 1.1e-10 42.1 0.8 1 86 1432 1501 1432 1502 0.81
11 29 1.2e-14 1.5e-11 44.9 1.8 1 86 1529 1600 1529 1601 0.86
12 29 4.7e-13 5.8e-10 39.8 1.3 1 85 1673 1741 1673 1743 0.82
13 29 1.4e-12 1.8e-09 38.3 0.1 1 86 1766 1834 1766 1835 0.81
14 29 5e-14 6.3e-11 42.9 0.7 1 87 1994 2063 1994 2063 0.80
15 29 1.1e-11 1.4e-08 35.4 0.1 1 62 2118 2177 2118 2194 0.79
16 29 0.003 3.7 8.4 0.1 1 58 2199 2249 2199 2276 0.79
17 29 6.6e-13 8.2e-10 39.4 1.5 1 87 2288 2358 2288 2358 0.86
18 29 4.4e-14 5.5e-11 43.1 1.1 1 86 2417 2486 2417 2487 0.82
19 29 7.8e-13 9.7e-10 39.1 0.5 1 86 2522 2593 2522 2594 0.82
20 29 8.2e-13 1e-09 39.1 2.4 1 87 2604 2675 2604 2675 0.82
21 29 2.1e-12 2.7e-09 37.7 0.0 1 86 2698 2768 2698 2769 0.81
22 29 0.00025 0.31 11.9 0.1 1 58 2800 2850 2800 2872 0.82
23 29 1.1e-14 1.3e-11 45.1 0.3 1 86 2889 2961 2889 2962 0.80
24 29 1.1e-13 1.4e-10 41.8 0.2 1 86 3102 3174 3102 3175 0.84
25 29 4.4e-14 5.5e-11 43.1 1.5 1 86 3240 3310 3240 3311 0.82
26 29 5.7e-14 7.2e-11 42.8 4.5 1 86 3423 3493 3423 3494 0.85
27 29 8.5e-13 1.1e-09 39.0 0.1 1 86 3585 3654 3585 3655 0.84
28 29 8.7e-10 1.1e-06 29.4 2.2 1 58 3672 3720 3672 3736 0.86
29 29 6.6e-07 0.00082 20.1 2.7 19 87 3738 3795 3726 3795 0.75
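
Each row of the table above carries thirteen whitespace-separated fields. A minimal sketch of how the rows could be parsed and filtered by independent E-value follows; the field names in `DomainHit` and the `1e-3` cutoff are assumptions made for illustration, not settings taken from this record. Three rows are copied from the table as sample input.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    index: int       # domain number ("#")
    of: int          # total number of domains reported ("of")
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the per-domain table."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


rows = """\
1 29 4.1 5.1e+03 -1.6 2.4 49 64 341 360 320 375 0.57
2 29 7.5e-15 9.4e-12 45.6 4.3 1 86 566 638 566 639 0.85
16 29 0.003 3.7 8.4 0.1 1 58 2199 2249 2199 2276 0.79
"""

hits = [parse_domain_row(line) for line in rows.splitlines() if line.strip()]

# Keep domains whose independent E-value is below an illustrative cutoff.
significant = [h for h in hits if h.i_evalue < 1e-3]
for h in significant:
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```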

Sequence Information

Coding Sequence
ATGTCACAACACAACAATCAACCGCATtcgcatcagcatcatcactactatcagcagcagcagcaccacctccagcaacaacaacaacaccaccaccagcagcagcagcagcagcaacagcagcagcagcatttgcagcataaacaaatacaacagcagcacagtTGGTACTCACATGTTGCTTCCTACCCGCCCCACCAACCGCACGCCGCCGCTGCCTATGCGGCGCCCtgcaagaataacaacaacaataacaacaacaatattatgaATGCATACGGCACGGGCGCTGCCAGCGCGCACTATTATGGCGCTGCTCCTACTGCTGGGGCTGGGGTGGGCTATAACCTTGAGGCCAATACTGTGGCCTATGCGCACAACCAGCTGCtgcaataccaacaacaacagcagcagcagcaacagctcagtCAACGCTCGTATATGCCGCACGGTTTAATGCATGGCTCGTATCCTTATATCAAGAGCGAGCCATTGGAACTGCCAGATGATAGACAACGccatcaacaacatcaacatcaacagcagcaacaacaacatttccaGAACCCAATGGCCCCGCCACCAGCGCCCCCCGTCAACCGTCACACGCTCGATGCCAGCGGtgaaatgataataaaatcGGAACCCATTGACGAACATGCGTTCAAGTCCAACTATATCGATGacaatataccctttgccGATTTTAGTAAGTTTTCCGAATTTGGCGACGACATGCTAAGTCCCAAGGTTGAGTTAACGGTTAAGGACGAGGCTTATGGCAACCAAAAGaaccCGCTCAGCTATCCGCGACGCAAGCTGCAAAACGAGCGACCTTCGGAGAATCTGCCCATTTGCCAACGCTGCAAGGAGGTCTTCTTCAAGAAGCAGGTCTATCTACGTCATGTGGCcgagagcagctgcagcatacACGAGtatgaatttaaatgcaacatCTGCCCCATGTCCTTCATGGGCGCTGAGGAGCTGCAGAAGCACAAGCAACTGCATCGCGCGGACAAGTTCTTTTGCCACAAATACTGTGGCAAGCACTTCGACAACATCGCCGAATGCGAGTCGCATGAGTACATGCAGCATGAATACGATAGCTTTGTGTGCAATATGTGCTCTGTAACGTTTTCAACTCGGGAACAGCTTTATGCTCATCTGCCGCAGCACAAGTTTCAGCAGCGTTACGATTGCCCTATTTGCCGCTTGTGGTATCAAACGGCACTAGAGTTGCACGAGCATCGACTGGCGGCGCCCTACTTTTGTGGCAAGTATTATCCAGCagcacatcagcagcagcagcaacaacaacatcaacagcagcaacacccgcagcagcagcaaggcaACTACAAACTGCAGGACTGCCACATGGGCACCATAGAAATGACAGCACCGCACCACAAGACAAATGCTTTGCCTGCAACGGCGGCGCTTAGTTCCttgttgcagcagcggcaagcgAATGCAGATGGTGCCGCGCTGTATGCCTCGACCCTGAAGAGCGAGGCTAATGTCAAGTTGGAGCGCAGCTATAGCAACTCCACTAGCGAGTCTGGCTACAGTCTGCACGAGAGTAGTTATAATAATGCCTACGGCAGCGATAATTCGTTGCATGGTGGCGGCGCAGCAATTGGTGGTCCACAGGCACACTCCTCCACGCTGGACGAATCGGAGGATGCGCTGTGCTGTGTGCCGCTGTGCGGTGTGCGCAAAAGCACCAGTCCCACGCTGCAGTTCTTTACGTTTCCCAAGGATGAAAAGTATCTGCATCAGTGGCTGCACAATCTCAAAATGTTCCATATTCCGGCCTCAAGCTATGCCAGCTTTCGTATCTGCAGTATGCATTTTCCTAAGCGATGCATCAATCGTTACTCGTTGTGTTATTGGGCGGTGCCCACATTCAACCTGGGTCACGATGATGTAGCTAATCTGTATCAGAATCGCGAGCTGACTAACACCTTCACCACCGGCGAGGTGGCGCGCTGCAG
CATGCCCAACTGCACTAGCCAACGCGGCGAGAGCAATCTcaagttttataattttcccaAGGACATCAAGAGCCTGATCAAGTGGTGCCAGAATGCACGTTTACCCGTCCAGGCTAAGGAGCCGCGTCATTTTTGCAGTCGCCATTTTGAGGAGCGCTGCATTGGCAAGTTCCGGCTGAAGCCGTGGGCTGTACCTACTCTACATCTGGGCGCCCAGTACGGCAAGATTCATGACAATCCCAAAAACCTGTATGTGGAAGAGAAACGCTGCTGCCTTAACTtttgccgtcgcagtcgctctTCGGACTTTAACATGTCATTGTATCGCTTTCCCAGAGATGAAGTACTGTTGCGACGCTGGTGCTATAATCTGCGCCTCGATCCGGCTGTCTATCGCGGCAAGAACCACAAAATTTGCAGCGCTCACTTCATTAAGGAAGCCCTCGGATTGCGCAAACTGTCACCAGGCGCTGTGCCCACACTGCATCTGGGCCACAATGACACCTTCAACATATACGAGAACGAATTATGGCCACCACCGACGCCCTCTACACCCACCCACAATcatcagcagcaattgcagcagcatcagctgcagcagcatcaacagcaactgcagcaacatgtGCATCATAAATATCAGCGTCATTCGGCGGCATCCACATCATCGTCGGCCAGCTCGGCGTCGCACTATGTGGATCCAGAGCTGAGTGCATCCTACATGGGCTTGAGCGCTTCATCCTCTGGCCTGAATGTCAGCGACAGCATGGACGTGTGCTGTGTGCCCAGCTGCGAGAGCAAACGGCACAACAATGAGAACATCACATTCCATACAATACCCAGGCGGCCAGAGCAGATGCGTAAATGGTGCCACAATCTGAAGATACCCGAGGACAAGATGCACAAGGGCATGCGTATATGCAGTCTTCACTTTGAACCCTATTGCATTGGCGGTTGTATGCGGCCGTTTGCGGTGCCTACACTGCATCTGGGCCACGATGACGAAGACATTCATCGTAATCCGGATGTGATTAAGAAGCTGAACATTCGCGAAACCTGTTGCGTTGCCGTTTGCAAGCGCAATCGAGATCGGGATCATGCCAATCTGCATCGTTTCCCCAGCAATGTCGCCTTGCTGACCAAGTGGTGCGCCAATCTGCAGCGACCCGTGCCGGATGGCACCAAACTTTTCAATGATGCCATCTGCGAGGTGCACTTCGAGGATCGCTGTCTGCGCAACAAGCGGCTGGAGAAGTGGGCAGTGCCCACACTTGTGCTAGGCCACGAGAATATTGCCTACCCACTGCCCACGCCCGAGCAGGTGGCCGAGTCCTATGCGCGTCCCAGTGCGCCAAACAATGGCGAGGAGCAGGGTGAATGCTGCGTGGAGACCTGTAAGCGCAATCCTAGCGTAGATGACATCAAGCTCTATCGTCCGCCCGAAGAATCacaggtgcttgccaaatgGGCGCACAATCTGCAGCTGGACATTGCCCAGCTTCCTAATATGCGAATCTGTAATTTACACTTTGAATCCCACTGCATTGGCAAACGCATGCGACCCTGGGCCATACCCACCCTCAATTTGGCCAGAAACATTGAGAATCTCTTTGAGAATCCCGAACACCATATGCTCTACAAGCGTCGCGCGCATCTCAACGCGGACAGAGCCACCGCTCGCAGCGCTGGCGCTGACGGAGCCACCATGAAGGCCTCTTGGGTGCCACGCTGTTGCCTGCCGCACTGCCGCAAGGTGCGCGCATTGCACAATGTCCAGCTGTATCGCTTTCCGAAGGTCAATCGCACAACGTTGGCTAAATGGGCGCATAATCTACAAGTGCCGCTGGTCGGCAGCGCCCAAAGGCGTTTATGCTCCGCCCACTTTGAGCCTAATGTGCTGAGCAAGAAATGCCCGGTGCCCTTGGCGGTGCCCACGCTGGATCTCAATACGCCACCGGGCTAcaagatttaccaaaacccagCCAAGGTGAGGGCTAACA
AGCTGTGTTGGCAGCGCGTCTGCATTGTGGAGAGCTGCCGTCGACAGCGGGCACAGGGCGTACAGCTCTTCCGGCTGCCGCACAGTCGCAGCCAGTTGCGCAAGTGGATGCACAATCTTCGCATGCTGCCGAGAGGCGCCATGCGGCAACAGTATCGCATCTGCTCGCTGCACTTTGAGGCGCACTCGTTTAACGGCAAGCGTCTGAGCACAGGCGCTATTCCAACGCTGGAGCTGGGCCATCAGGATGACGATATTTATCCCAATGAGGCGCAGTCCTTTGTCGAGGAACACTGCGCCGTAGAAGGCTGCGATGCGTCCAAGGAGCAGCCAGATGTGCGTCTCTTCCGCTTTCCCAATGACGACGAGGATCTGCTCTGGAAGTGGTGCAACAATCTCAAAATGAATCCCGTTGACTGCTATGGCATGCGCATCTGCAACAGGCACTTCGAACCGGACTGCATTGGGCCCAAACACCTGTACAAGTGGGCCATACCCACTTTGGTTTTGGGGCACGATGATAGCCAGATCGAGCTGATACCCAATCCCAAGCCGGAGGAACGCTATGCTGATCCTGTGTTCAAGTGCTGTGTGCCCACCTGCGGCAAAACGCGTAAATTTGATGAGGCGCAAATGAATAGCTTTCCCAAGGACCCATCGCTCTTCCAGCGCTGGCGCCACAATCTGCGGTTGGAACATCTCAACTTTAAGGAACGCGAGCGCTACAAAATTTGCAATGCGCACTTTGAGGACATTTGCATTGGCAAAACGCGTCTCAATATTGGCTCCATACCCACGCTAGAGCTGGGCCATGAAGTGACCGAAGATCTGTATCAGGTTAATCCCGAGGAGCTGCAGAGCAACTTGTTTGGACGCCCGCGGCGCGTGCATGAGAATCAGCGTCTGAGCATCAAGCAGGAGCTGGATGAGGACATCAAGCCGGACATAAGTATGTCAGAGGCCACGGACACAATCACAACACAGgTGAAGATCAAGAAGTCTGTGTTAGACTTGAAGTGTTGTGTGGCCAGCTGCGGTCGCAGCCGGCTGGAGCATGGTGCTCGCCTGTTTCCCTTTCCCActggcaagcagcagcagaccaaGTGGCGCCACAATCTCCGCCTTAGCGCCGCCGATGTGGACAGAACAACGCGCGTTTGCAGCGCTCACTTCAATCGACGCTGCATCGATGGCAAACAGCTTCGAGGCTGGGCAATTCCCACACAGCAGTTGGGCCACCAGGAACAAAACATATATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGTTGGCGCACTGTCGTAAGCGACGAAGCATTGACAATGATTTGCGCACCTACCGCTATCCGCGCAACGAGGAGCTGCTCGAGAAATGGCGCGTGAATCTGCGTCTGGCTCCGGATCAGTGTCGAGGACGCATTTGCGCGGATCACTTTGAGCCGATGGTGCGCggcaagctgaagctgaagacGGGCGCGGTGCCCACGCTGAAGCTAGGCCATGATGAGGGCGTAGTCTTTGACAATGAGGCCATTAAAGTTGGAatgcagcaggaggaggaagaggaggaggaggcgggcAGCTTGGAGTCGGAGGGAAAGATAAAAATTGAGAAGCAGGAGAAGGAAACCCTGGAGCCGGAGTTGGAaaatgatgatgaggatgaagatgccgagcagcagcagaaagtgGAGTTTCCTGATGACGAtatggagcaggagcaggaacagAATGAGGAGGAAGAGGAGCTGCAGGATCATGGCTATTTTGATCCCCTAGAGCTAGTGGAAACCTTTGCCGAACAGCACAGCGATGACAATTCCGCTGACAATTATCATCTTgaagctgatgatgatgatgatgaagaagatATACCTGGCAATGatgatgagctgctgctgccagacaCTGTTCCAATACAGTTGCCGCCACGCCGCGAAAAGGCGGTGAACAATGTGACGCCTATTTGTTGCTTGAAACATTGCCGC
AAGGAGCGCACCGCAAGTCATCAGCTAAGTACTTTTGGCTTTCCCaaggatcagcagcagctgcttaaatGGAGTGCCAATCTGCAGCTGGATCTCGTCGATTGTGTGGGACGCGTGTGCATCGAGCATTTCGAGGCGGAGATGCTCGGCACTCGTAAGCTAAAGCAGAATGCGGTGCCCACTTTGAATCTGGGACATGCCACGCCGTTGAGCTATAGCTGCAATGGCCAATCATTGAGCATATACGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTGCTGACGATGCCTCCGGATCCGGCGATGACTAAACGACGTTGTTGCCTGCCCAGTTGTGGTAAGGAGCCGGAGCTGCATGGCGTTCAATTGAAGCGACTGCCCAAGGAtcgtctgctgctgcgcaaGTGGCTGCACAATCTGAAGCTGCCGCCGCACATGGACACCAAACACTCGTTTCTTTGCGAGGAGCACTTTGAGCCACATGCGACGCTGCCTACCCTGAAGCTGGGCCACGCGGCTAGCAACATTTATCGCAATGGAAGTTCGGCCTTATCCAGTGGCTGCCTAGTGCCCAGCTGTCCGTGTGCACGGCTCAATCTATATCGCTGCTATGCTCTACCCGAGCATCCGCAGGTGCAGCAGGCCTGGCTGAAGTGGCTGCAACTGCCGCCGCCACAGCTGGCTagccttgcccagctctgCGTCATGCATTATATGCAGCTATTTGAGCAGGTGCCGCTACCCGCGGATCTGCCAGAGTCTGTGCAGCGCCAACTGCAGGAAACCTACGAACAAATATCCAGCTCCAGCATGGCCATGAAACTGCGCTGTGCTGTGCCCGGCTGCTACTCCAAATACACGGACAACGTGCGTCTCACCAAGCTGCCCGTGTGCCCGCAAATCTGCGCCCAGTGGGTGCACAATACCAAAATTAAGTACGATCCGGAGCGCCATTACATGTATCGCATCTGCATGCGCCACTTTGAGCCGCAATGCCTGGGTGCAGTACGTCCTAAGCTGTGGGCGGTGCCTACGCTGCATCTTAACCATAGCGATGCggatatatatcaaaataccATGTTGGATAGCTCTGATGCCATGCCGGTGGCCGAGTCTGTACCGCTGACTTTGCCGCTGCGCATCAAGACAGAGCTGCCATTAACACTATCAGTCAGTCCCAGTGCCAGTCCCAGTCCACGCGGCAAACAGCGCACCTGTTGCATTCCCACTTGCGGCCAGCAGGCCAATGCCCTAACGCGTCTGTTTCGCTTTCCCAGCGCCGAGACGGCCCTGCTTAAATGGCTGGTGAacacacaacagcagccacgccTCGTTGATACGCAGAATCTATTTGTATGCCAGCGTCACTTCGCGGCGGAGGCGATTTGCAAGAAGCAGCTACAAAGTTGGGCAGTGCCCACCCTAAGTCTAGGCCATGAGGGCCACATCATACCGAATGCCAAGCACAATGGCAATATTGCCGACAGCCAGGAGAACAAGCAGGCGCTGCAATACATCTGGGCCAATTACTGCTCGGTGCTCACCTGCTTCCAACAGCGCAGCGAGCAGGTTCGTCTCTACGCCTATCCTACAGATCGGCCCACTATACGAAGGTGGGCGGCAAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGATGGGTTTCAGGTCTGCCAGTTACATTTTACACCCGACTGCTTTGATCCCGATACTGGGGATCTGAAGGAAGACGCGGTGCCCACATTGGAGCTGAGCCGGCCTGTCCATGAGTTACGCTGTTTGGTAAATGGCTGCGTTAGGGAGAAGGATGCAGCGCGATGTCGGTTTTTCAAAGTGCCCAAGCGTGCCTCACAGTTGGAGGATTGGTGTCACAATCTGCGCATCGATGCTGCGTCAATAAGCGGCCAGGAGGTCCACGTATGTGAGCGTCACTTCGAGGCGCACTGTTTCAGTGCGTACAAGCTGCG
TCCGGGTGCACGACCTACTCTTCATTTGGGCCACGACGATGAGCTGGATTTGTTGCCCAATCCGGCAAAGTGGGAGGAGGATGTGAATGTATGCTTTGTGCCTAGCTGTGGACGGTCCAAAGATGTGGATAATGTGGAACTATTCGGGCTGCCCAGGATAAGGGGGGTCTTGGAGAAATGGCTGCATAATTTCCGCCTCGACCCGAGCAGGGAGCAGCTGCAAGGCATGCGGATATGCAGCGCACATTTCGAGGCCAGTTGCATAGAGAACGGCCGTCTACACTTAAGTTCGGTGCCCACGCTGCAGCTGGGCCACGATGAGTTGGACAATATTCATCAAAGCGCGGAACTGCCTTCATCGCAGCTTAAAGGCAAACGATTAGCCATGAACTACGACTGCTGCTATCCACAGTGTATGGAGCTGCAGAAGAGCTATCAAAGAATTGCATATGAGCTACCCCAGCAAGAGGCATTGCGTAACTTGTGGATGTCGTATCTGGGTCTGGAGCAGCAAAATCTGCAACCGCTTAAGCTCTGTCCGCTGCACTTGATAATGCTATATGAACACAGTGTCAACCATTTTCCAGAGCATTCATcggaggagcagctgctggacgCCAATTACGAGGCTGCGCGAAATAGCGTGCGCATACGGATTATCAGCTGTGCGGTGCGTGGCTGCAGGACACTCAAACCACGCGACGACTACCGCCTGCACGCCATGCCTACGCGTCGGGATGTACTCCGGATGTGGCTAGACAACATGCAGCTTGTGTTCTACGAGCAGCAGCGTTATATGTACAAGGTATGCAGCAGACACTTTGAGGCCACCTGCGTAACAGAGACTACTCGCCGTCTAAAACCCTGGAGCATGCCGACGTTGGAGTTGCCGGAACGTGACCCAGACGCTCCGCCGTTGCATCAGAATCCCACGGAGGAGGAGTGGCAGCGCATGAATGAGCAAATAGGCAGCAGCGAGGCATTGGCTTTGTTAGAGCCCGCGTTTAAGCTGGAGCCGGAGCCCATTGTCAAGCAGGAGCTGCACTCTATTGTCAAGCTGGAGCCGAAGCCGCAGCCAGAACAGCATGAGGGGGAGGAGTACGAGGCTAACGATCAACAGCAAGCGCTTGAGGTGCTGCTCGAAGTGGGTCACGTTGAGAAGTGCACCACATACGAGCAAATGGACACAAAGCCAATTATAGGCTATGCTGATACCCTGTCACATAATTCACTAGGCCCAACGACAACAGTTGGCAGCGCCTGTATTGTCAACGGTAACGGACTCACCTACAGTGCGCGCCACTGCAGCGTGCGGGGTTGCGATGTGACCTCTCTGGATGTAAATGACAGTCTCAAGCTACACAAGTTTCCCACATCGCTGGATGCCATGGAAAAATGGATGCACAACACCCAGGTGAATGTGGACGTCAACTTTGCGTGGCGGTTTCGCATCtgcagtttgcattttctacCCGAGTGCTTTAATGGTTCGCGTATCAGACGTGGAGCCATGCCCACGCTGCGTCTGGGATCACGCCGCCTAGGGGATATCTATGACAATGAGTTCAATGTGCAGCCCGAGCAGACGAGTGTGGATCAGCCGGTTGAGGCGTCGGCAGACGCTATGGTGCCCACTGAACCGCACGATGGCGCGACGGAGTTTAATATAAATCTGCATTTGCCCTGCCCCGCACCACCGCGCAAGTCCAGCAAATTCTGTCAGATCGATGGCTGCTCTAATCATTTGACCAGCGAAAATCTTACTCTGCACAAGTTTCCACACTCGGCGGACATGTGCGCCAAGTGGCAGCACAATACACAGGTGCCGTTCGACCCGGAGTACCGCTGGCGTTATCGTATCTGCAGCGCACACTTCGAGCCAATCTGCCTGGGAAACATGCGGCTGATGCATGGCAGCGTGCCTACACTGAAACTGGGCGCCCGGGCGCCCAAGCAGCTCTTCGGCAATGACTTTGCGGCGTTAAGCTTGCGTCTAGATA
AGGAAAAGCGCAGTGCCGACCAGAGCTTGCCCGTGAAGCTGGAGCAAGTGGAAGATGATCAAGAGCAGTATGATCAGGAGGATCTTAGCATGCTGGTAccagagctgcagctgcacgaGGGCGACGACGAGCATGAAGACAATCAGTTAAATTACACCAACAGTTGGACAGATTCGCAGCAAcaggtgcagctgcagctacgtCTGCCCAGCATTAAACAGGAGAAGGGCACCATCTATAATCCTGTCAAGTCTGGCTATGACAAGTGCTCGCTAGTGCACTGTCAGCGCCAGCGTTCACAGCATGGCGTCCACATCTATAAATTCCCACGCTCACGGCAGCTACAGCATCGATGGATGCACAATTTACGAATCAGATACGACGAGCGGCGACCTTGGAAGACAATGATATGCAGTGTACACTTCGAGCCGCACTGCATACGCCTGCGTAAGCTGCGTCCTTGGGCGGTACCCACACTAGAACTGGGCGACAATGTTCCGCAGGATCTGTACAGGAACgagcaaagccaacaacagtttgtgcagcagcgcagcagcgacGCGGAAGCGGGCAGTGAGGGCGAGGACTATGATGCGGAGCTAGAGGACACTATACTGGAGGAGTACGACGATGAgtatgatgataatgataatgctgaGCAATATCCGGCTGAGCCACACATCAAGCGGGAGTATCGCTCACGCTGCGATCCACAGCCGCCTGGTCAACTGCCACCATGGAAAATCAAGCAATGCTGTTTGCCCTATTGTCGCAGGCCACGCGGCGATGGCATCAAGCTTTTCCGTCTGCCCAACAATATCAGCGCCATACGCAAATGGGAGCAGGCGACGGGTATGCGCTTCTATGAGTCCCAGCGCAATACAAAACTCATCTGCAGTCGTCACTTTGATCCGCAACTTATTGGTGTGCGTCGTCTTATGTCCAATGCGGTACCTACCCGTAATCTGGGCCCAAACAGCGAGGAATCCGAGCTGCCAGCGACCAGTCCACGCTGCTGCATTAAGGATTGCCAACCAGATGCACATGTCAAGCTGCACAAGTTTCCCAGTGATCCCGAGCTGCTGCACCAGTGGTGTCAGGCGCTTAATTTGCGGGATGAGCAGCGCCACGCCGACAAGTACATTTGTGCCGTGCACCTGCCCACCAAAGCGATGAGCTGTCTCATTTGCGGCGTGGAGGATGTACAGTTACCCATGCAGGACTTTCCCGAGCATCGCAATCAGCGAGTGAAATGGTGCTACAATTTGAAAATCGAACCAATAGCCAAGTGGGACAACTCAAAGCACATTTGCTGCAAGCACTTTGAGAGCTATTGCTTCATTAAGCCGGGTCATCTGTTGCCGGACTCTTTGCCCACGCTGCATTTAAAGCACAACGACAGCAATATATTCCTCAACGAATCTGCCATAGAGAGCAGCAGGCTGCTGCACGTCAAGGATGAGCCTATGGAGTGTGAGGATCTGATGCTGTAA
Protein Sequence
MSQHNNQPHSHQHHHYYQQQQHHLQQQQQHHHQQQQQQQQQQQHLQHKQIQQQHSWYSHVASYPPHQPHAAAAYAAPCKNNNNNNNNNIMNAYGTGAASAHYYGAAPTAGAGVGYNLEANTVAYAHNQLLQYQQQQQQQQQLSQRSYMPHGLMHGSYPYIKSEPLELPDDRQRHQQHQHQQQQQQHFQNPMAPPPAPPVNRHTLDASGEMIIKSEPIDEHAFKSNYIDDNIPFADFSKFSEFGDDMLSPKVELTVKDEAYGNQKNPLSYPRRKLQNERPSENLPICQRCKEVFFKKQVYLRHVAESSCSIHEYEFKCNICPMSFMGAEELQKHKQLHRADKFFCHKYCGKHFDNIAECESHEYMQHEYDSFVCNMCSVTFSTREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYPAAHQQQQQQQHQQQQHPQQQQGNYKLQDCHMGTIEMTAPHHKTNALPATAALSSLLQQRQANADGAALYASTLKSEANVKLERSYSNSTSESGYSLHESSYNNAYGSDNSLHGGGAAIGGPQAHSSTLDESEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPTHNHQQQLQQHQLQQHQQQLQQHVHHKYQRHSAASTSSSASSASHYVDPELSASYMGLSASSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHENIAYPLPTPEQVAESYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDIAQLPNMRICNLHFESHCIGKRMRPWAIPTLNLARNIENLFENPEHHMLYKRRAHLNADRATARSAGADGATMKASWVPRCCLPHCRKVRALHNVQLYRFPKVNRTTLAKWAHNLQVPLVGSAQRRLCSAHFEPNVLSKKCPVPLAVPTLDLNTPPGYKIYQNPAKVRANKLCWQRVCIVESCRRQRAQGVQLFRLPHSRSQLRKWMHNLRMLPRGAMRQQYRICSLHFEAHSFNGKRLSTGAIPTLELGHQDDDIYPNEAQSFVEEHCAVEGCDASKEQPDVRLFRFPNDDEDLLWKWCNNLKMNPVDCYGMRICNRHFEPDCIGPKHLYKWAIPTLVLGHDDSQIELIPNPKPEERYADPVFKCCVPTCGKTRKFDEAQMNSFPKDPSLFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHEVTEDLYQVNPEELQSNLFGRPRRVHENQRLSIKQELDEDIKPDISMSEATDTITTQVKIKKSVLDLKCCVASCGRSRLEHGARLFPFPTGKQQQTKWRHNLRLSAADVDRTTRVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQNIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRNEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKVGMQQEEEEEEEAGSLESEGKIKIEKQEKETLEPELENDDEDEDAEQQQKVEFPDDDMEQEQEQNEEEEELQDHGYFDPLELVETFAEQHSDDNSADNYHLEADDDDDEEDIPGNDDELLLPDTVPIQLPPRREKAVNNVTPICCLKHCR
KERTASHQLSTFGFPKDQQQLLKWSANLQLDLVDCVGRVCIEHFEAEMLGTRKLKQNAVPTLNLGHATPLSYSCNGQSLSIYDAQPQHSVFRLWSLKHCRKRKLLTMPPDPAMTKRRCCLPSCGKEPELHGVQLKRLPKDRLLLRKWLHNLKLPPHMDTKHSFLCEEHFEPHATLPTLKLGHAASNIYRNGSSALSSGCLVPSCPCARLNLYRCYALPEHPQVQQAWLKWLQLPPPQLASLAQLCVMHYMQLFEQVPLPADLPESVQRQLQETYEQISSSSMAMKLRCAVPGCYSKYTDNVRLTKLPVCPQICAQWVHNTKIKYDPERHYMYRICMRHFEPQCLGAVRPKLWAVPTLHLNHSDADIYQNTMLDSSDAMPVAESVPLTLPLRIKTELPLTLSVSPSASPSPRGKQRTCCIPTCGQQANALTRLFRFPSAETALLKWLVNTQQQPRLVDTQNLFVCQRHFAAEAICKKQLQSWAVPTLSLGHEGHIIPNAKHNGNIADSQENKQALQYIWANYCSVLTCFQQRSEQVRLYAYPTDRPTIRRWAANCKHRSMQASSDGFQVCQLHFTPDCFDPDTGDLKEDAVPTLELSRPVHELRCLVNGCVREKDAARCRFFKVPKRASQLEDWCHNLRIDAASISGQEVHVCERHFEAHCFSAYKLRPGARPTLHLGHDDELDLLPNPAKWEEDVNVCFVPSCGRSKDVDNVELFGLPRIRGVLEKWLHNFRLDPSREQLQGMRICSAHFEASCIENGRLHLSSVPTLQLGHDELDNIHQSAELPSSQLKGKRLAMNYDCCYPQCMELQKSYQRIAYELPQQEALRNLWMSYLGLEQQNLQPLKLCPLHLIMLYEHSVNHFPEHSSEEQLLDANYEAARNSVRIRIISCAVRGCRTLKPRDDYRLHAMPTRRDVLRMWLDNMQLVFYEQQRYMYKVCSRHFEATCVTETTRRLKPWSMPTLELPERDPDAPPLHQNPTEEEWQRMNEQIGSSEALALLEPAFKLEPEPIVKQELHSIVKLEPKPQPEQHEGEEYEANDQQQALEVLLEVGHVEKCTTYEQMDTKPIIGYADTLSHNSLGPTTTVGSACIVNGNGLTYSARHCSVRGCDVTSLDVNDSLKLHKFPTSLDAMEKWMHNTQVNVDVNFAWRFRICSLHFLPECFNGSRIRRGAMPTLRLGSRRLGDIYDNEFNVQPEQTSVDQPVEASADAMVPTEPHDGATEFNINLHLPCPAPPRKSSKFCQIDGCSNHLTSENLTLHKFPHSADMCAKWQHNTQVPFDPEYRWRYRICSAHFEPICLGNMRLMHGSVPTLKLGARAPKQLFGNDFAALSLRLDKEKRSADQSLPVKLEQVEDDQEQYDQEDLSMLVPELQLHEGDDEHEDNQLNYTNSWTDSQQQVQLQLRLPSIKQEKGTIYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQHRWMHNLRIRYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGDNVPQDLYRNEQSQQQFVQQRSSDAEAGSEGEDYDAELEDTILEEYDDEYDDNDNAEQYPAEPHIKREYRSRCDPQPPGQLPPWKIKQCCLPYCRRPRGDGIKLFRLPNNISAIRKWEQATGMRFYESQRNTKLICSRHFDPQLIGVRRLMSNAVPTRNLGPNSEESELPATSPRCCIKDCQPDAHVKLHKFPSDPELLHQWCQALNLRDEQRHADKYICAVHLPTKAMSCLICGVEDVQLPMQDFPEHRNQRVKWCYNLKIEPIAKWDNSKHICCKHFESYCFIKPGHLLPDSLPTLHLKHNDSNIFLNESAIESSRLLHVKDEPMECEDLML

Similar Transcription Factors

Clusters of similar sequences, computed with MMseqs2

100% Identity
iTF_00534610;
90% Identity
iTF_00490809;
80% Identity
-