Basic Information

Gene Symbol
-
Assembly
GCA_035578135.1
Location
JAQJVK010000026.1:1380359-1408505[-]

Transcription Factor Domain

TF Family
THAP
Domain
THAP domain
PFAM
PF05485
TF Group
Zinc-Coordinating Group
Description
The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 40 4.6e-17 4.2e-14 55.4 0.0 1 87 5 94 5 94 0.91
2 40 8.1e-07 0.00074 22.5 0.1 18 60 391 430 379 442 0.85
3 40 4.4e-07 0.0004 23.4 0.1 18 60 492 531 474 543 0.85
4 40 4.4e-07 0.0004 23.4 0.1 18 59 694 732 680 744 0.85
5 40 4.4e-07 0.0004 23.4 0.1 18 60 795 834 777 846 0.85
6 40 4.5e-07 0.00041 23.4 0.1 18 59 944 982 929 994 0.85
7 40 1.6e-06 0.0015 21.6 0.7 18 60 1045 1084 1031 1100 0.84
8 40 3.7e-07 0.00034 23.6 0.1 18 60 1146 1185 1136 1197 0.83
9 40 7.9e-07 0.00072 22.6 0.6 18 59 1249 1287 1237 1292 0.85
10 40 1.8e-05 0.017 18.2 0.4 23 59 1294 1327 1290 1338 0.86
11 40 0.00079 0.71 13.0 0.8 20 58 1392 1427 1383 1438 0.87
12 40 2.2e-06 0.002 21.1 0.1 18 59 1493 1531 1480 1544 0.84
13 40 6.5e-06 0.0059 19.6 0.0 23 60 1586 1620 1569 1632 0.81
14 40 6.5e-07 0.00059 22.8 0.8 18 60 1684 1723 1670 1739 0.84
15 40 0.00079 0.71 13.0 0.8 20 58 1836 1871 1827 1882 0.87
16 40 9.1e-07 0.00082 22.4 1.0 18 59 1937 1975 1927 1985 0.84
17 40 1.3e-05 0.012 18.7 0.3 23 60 1982 2016 1976 2041 0.85
18 40 2.5e-05 0.023 17.8 3.9 20 60 2088 2125 2077 2141 0.86
19 40 4.7e-07 0.00043 23.3 0.1 18 60 2187 2226 2177 2238 0.83
20 40 0.0016 1.5 11.9 0.0 18 59 2336 2374 2326 2387 0.83
21 40 4.8e-07 0.00043 23.3 0.9 17 60 2437 2478 2425 2494 0.80
22 40 4.2e-07 0.00039 23.4 0.1 18 60 2540 2579 2530 2591 0.84
23 40 1.2e-06 0.0011 22.0 0.0 18 59 2641 2679 2623 2692 0.85
24 40 7.9e-06 0.0071 19.4 0.2 23 59 2734 2767 2718 2777 0.81
25 40 3.7e-06 0.0033 20.4 0.6 21 60 2782 2818 2769 2830 0.85
26 40 0.0016 1.5 11.9 0.0 18 59 2928 2966 2918 2979 0.83
27 40 4.6e-07 0.00042 23.3 0.4 17 59 3029 3069 3017 3075 0.80
28 40 1.9e-05 0.017 18.2 0.4 23 59 3076 3109 3071 3118 0.87
29 40 1.6e-05 0.015 18.3 0.3 23 60 3116 3150 3112 3162 0.86
30 40 1.2e-06 0.0011 22.0 0.0 18 59 3212 3250 3194 3263 0.85
31 40 6.5e-07 0.00059 22.8 0.4 18 59 3509 3547 3492 3557 0.86
32 40 4.1e-06 0.0038 20.3 0.7 21 59 3562 3597 3550 3609 0.85
33 40 5e-07 0.00046 23.2 0.2 18 60 3658 3697 3640 3710 0.85
34 40 4.1e-07 0.00037 23.5 0.1 18 60 3757 3796 3747 3808 0.83
35 40 5e-07 0.00046 23.2 0.2 18 60 3856 3895 3838 3908 0.85
36 40 5.2e-07 0.00047 23.1 0.9 18 61 4004 4044 3986 4050 0.84
37 40 1.2e-05 0.01 18.8 0.4 23 61 4047 4082 4044 4091 0.83
38 40 6e-07 0.00054 23.0 0.5 18 59 4092 4130 4081 4139 0.84
39 40 2.1e-05 0.02 18.0 0.4 23 59 4137 4170 4132 4175 0.87
40 40 5.4e-10 4.9e-07 32.7 0.2 23 87 4177 4232 4172 4232 0.82

Sequence Information

Coding Sequence
ATGCCTTGCTCCTGTAGCGCCATCGGGTGTCGCAGCAACTACGGCGTTTACGAAAACATTTCGGTGTTCAAACTGCCCAAGGACCCAGTGCGGAGGCAGCTATGGTTGGACGCGATTGGGCGACCCGACTTCAGGCCCACCGAATACACGCGGCTGTGCGTGCGTCACTTCGAGCCGCGGTTCGTGCTGACGCACGACTCGGTGGTGCGAGACGACGGGAGCGTGCTGACGGTGAAGAGGGACAGGGTCCGGCTGACCCAGGACGCCGTGCCGACGCTCTTCGGGGACCGCGCCGAGACGCCGCTCAAGAGTAGGCGAACTGAACCTGCTGTGAAATATTCTGAAGGGTCGCCTTACATCATCTGCGCGGTGTCCCGTCCAGGTAGCCTCCGCCTGCCGACAACAATTCGCCAACAGCCGCCCTCCCGCACGTACCACATGAGGCGGGTCATAGCAAAAGTGGTTAGCTGTATTCGCCCTTGTTCGGCGATGATGAACTCCCGCGCGGATGAAAGTTTCTGCCCGAGGACGAGGGACAAAACAGCTGATCGATGGGGGCAGCTAGCTACTAGTGCATATTTGACGTGGTTTCTCCCCTCCCCTATATCGACGTGTGCGGAGTTGGAGGGGAACCGTGTGCGCTGTTGCCAGTTCGACCAGCGCCGTCCGAGAACGCGCCACAGCGCAACCCTGCCGGAGCACTTCCACCCTTCAGCCACTGTCATTGTCACTGTCAATGCCACTGTCATTGTCACTGTCAATGCCACTGTCATTGTCACTGTCATTGTCACTGTCAATGCCACTGTCACTGTCAACGCCACTGTCAacgtcactgtcactgtcaatGCCACTTTCACGTTTACTGTGAAAGTGGCATTGACAGTGGTCACTGCCAATGctactgtcactgtcactgtcaatGCCACTGTCACTGTCAATTCCATTTTCACTGTAAACACTACTGTCACTGTCAATGCCACTGTCACTGTCAATGCCACTGTCAACGCCACTGTCACTGTCAATGCCACTGTCATCGCCACTGTCACTGTCAACGCCACTGTCACTGTCAATGCCATTTTCACTGTAAACGCTACTGTCACTGGCAATGCCACTATCACTGCAAAAGTTACCATCAATGTCAATACCACtatcactgtcactgtcactgtccgCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTAATTCTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcactgcccTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCAGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGTGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGctactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCAACCCTCCAGccactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtccgCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTGCGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGTAGCGACCACTTCCACCCTCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCGTCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgcccTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGAAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCcagctactgtactGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCCAGCTACtgtactgtcactgtcactgtcactgcccTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGTAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTCCTGCAGCGACCACATCCACCCTCCAGCTACTGTACTGttactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgccACTGTCACTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACATCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgcccTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTGCTACTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACCTCTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGTAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTCCTGCAGCGACCACATCCACCCTCCAGCTACTGTACTGttactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGTAGGCTACGTCGCCAGTGGCTGCTAGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCcagctactgtactGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCACAGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCCACtgtactgtcactgtcactgtcctcTCTACTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGccactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACATCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgcccTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCAACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCCAGCTACTGTACTATCACTGTCACTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGAGACCATTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCCGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGGACCCCAGCAAACACACCTACTTCTGCAGCGACTACTTCCACCCTCCAGCTACTGTACTGTCACTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTACACCCTCCAGCTACAGTACtgtactgtcactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTAGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACTACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGTCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGTAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCTGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgcccTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGTCTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACATTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCAACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCCCCAGCTACTGTACTATCACTGTCACTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGAGACCATTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCCGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGGACCCCAGCAAACACACCTACTTCTGCAGCGACTACTTCCACCCTCCAGCTACTGTACTGTCACTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTACACCCTCCAGCTACAGTACtgtactgtcactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTAGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGTAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCTGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgcccTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGTCTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACATTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCAACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTATAAAGGATCCCCCAACACGGGAGGCTACGTCGCCAGTGGTTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCATCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCATGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACATTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCAACCACTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCATGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCATTTCCACCCTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCCAGCTACTGTACTGTCACCGTCACTGTCCTCTCTACTTGCAGGATCCCCCAGCATGGGAGGCTACGTCGCCAGTGGCTGCTGGTCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCATGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactgtactgtcacTGTCCTCTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCAGCTACtGATCCCCCAGCACGGGAGGCTGCGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAATCACACCTACTTCTGCAGCGACCACTTCCACCCGTCCAGCTACtgtactgtcactgtcactgtcctcTCTACTTGCAGGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTCcagctactgtactGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGTAGCGACCACTTCCACCCTCCAGCTACtgtactGATCCCCCAGCACGGGAGGCTACGTCGCCAGTGGCTGCTGGCCTGTGGCCGCCTGGACTGGAACCCCAGCAAACACACCTACTTCTGCAGCGACCACTTCCACCCTTCCAGCTACGTTCACCGGCCCACCTCCGCCAAACTGCTGCCCAACGCCGTCCCTTCCATCTTCTACGGTGTGCCGCCCCGGGTGGGTGAATCTATCCTGGGGGTCAACTGTGTTTGA
Protein Sequence
MPCSCSAIGCRSNYGVYENISVFKLPKDPVRRQLWLDAIGRPDFRPTEYTRLCVRHFEPRFVLTHDSVVRDDGSVLTVKRDRVRLTQDAVPTLFGDRAETPLKSRRTEPAVKYSEGSPYIICAVSRPGSLRLPTTIRQQPPSRTYHMRRVIAKVVSCIRPCSAMMNSRADESFCPRTRDKTADRWGQLATSAYLTWFLPSPISTCAELEGNRVRCCQFDQRRPRTRHSATLPEHFHPSATVIVTVNATVIVTVNATVIVTVIVTVNATVTVNATVNVTVTVNATFTFTVKVALTVVTANATVTVTVNATVTVNSIFTVNTTVTVNATVTVNATVNATVTVNATVIATVTVNATVTVNAIFTVNATVTGNATITAKVTINVNTTITVTVTVRSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTPQLLYCTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCHCHCHCPLYLQDPPAREATSPVAAGLWPPGLEPQQTHLILQRPLPPPSYCTVLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSLPSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTPQLLYCTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQPLYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSALLAGSPSTGGCVASGCWPVAAWTGTPANTPTSVATTSTLQLLYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPVKHTYFCSDHFHPPATVLYCHCHCPLYLQDPPAREATSPVAAGLWPPGLEPQQTHLLLQRPLPPSSYCTVTVTALSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSLSSLLAGSPSTGGYVASGCWPVAAWTETPANTPTSAATTSTPQLLYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSLPSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSVATTSTLQLLYCHCHCPLYCRIPQHGRLRRQWLLACGRLDWNPSKHTYSCSDHIHPPATVLLLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTATVTVLSTCRIPQHGRLHRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLPSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPATAATTSTPQLLYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLYCHCHCPLYLQDPPAREATSPVAAGLWPPGLEPQQTHLPLQRPLPPPSYCTVLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSVATTSTLQLLYCHCHCPLYCRIPQHGRLRRQWLLACGRLDWNPSKHTYSCSDHIHPPATVLLLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLYCHCPLYLQDPPAREATSHRSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQPLYCHCHCPLYCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLYCHCHCPLYLQDPPAREATSPVAAGLWPPGLEPQQTHLLLQRPLPPSSYCTVTVTALSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSNHFHPPATVLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTPQLLYYHCHCPLYLQDPPAREATSPVAAGLWPPGLEPQQTHLLLQRPFPPSSYCTVTVTVLSTCRIPQHGRLRRPWLLACGRLDWDPSKHTYFCSDYFHPPATVLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTYTLQLQYCTVTVTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLYCHCHCPLYLQDPPAREATSPVAASLWPPGLEPQQTHLLLQRLLPPSSYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTVTVLSTCRIPLHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLPSLLAGSPSTGGYVASGCWPVAAWSGTPANTPTSAATTSTLQLLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSSLHCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSNHFHPPATVLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTPQLLYYHCHCPLYLQDPPAREATSPVAAGLWPPGLEPQQTHLLLQRPFPPSSYCTVTVTVLSTCRIPQHGRLRRPWLLACGRLDWDPSKHTYFCSDYFHPPATVLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTYTLQLQYCTVTVTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTVTVLSTCRIPLHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLPSLLAGSPSTGGYVASGCWPVAAWSGTPANTPTSAATTSTLQLLYCHCHCPLYLQDPPAREATSPVAAGLWPPGLEPQQTHLLLQRPLPPSSYCTVTVTVLSTLQDPPAREATSPVAAGLWPPGLEPQQTHLLLQQPLPPSSYCTVTVTVLSIKDPPTREATSPVVAGLWPPGLEPHQTHLLLQRPLPPSSYCTVLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSLSLSSLHCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSNHFHPPATVLSLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLYCHCPLYLQDPPAWEATSPVAAGLWPPGLEPQQTHLLLQRPFPPSSYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLSPSLSSLLAGSPSMGGYVASGCWSVAAWTGTPANTPTSAATTSTLQLLYCTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLYCHCPLYLQDPPAWEATSPVAAGLWPPGLEPQQTHLLLQRPLPPSSYCTVLSLSSLLAGSPSTGGYVASGCWPVAAWTGTPANTPTSAATTSTLQLLYCTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPQLLIPQHGRLRRQWLLACGRLDWNPSNHTYFCSDHFHPSSYCTVTVTVLSTCRIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPPATVLIPQHGRLRRQWLLACGRLDWNPSKHTYFCSDHFHPSSYVHRPTSAKLLPNAVPSIFYGVPPRVGESILGVNCV

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-