Ssub026883.1
Basic Information
- Insect
- Sarcophaga subvicina
- Gene Symbol
- -
- Assembly
- GCA_936440885.2
- Location
- CAKZFJ020000511.1:168146-192508[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 34 1.8e-16 3.6e-13 51.0 1.4 1 86 889 961 889 962 0.84 2 34 3.7e-15 7.6e-12 46.7 4.7 1 87 989 1058 989 1058 0.80 3 34 1.7e-15 3.5e-12 47.8 0.2 1 87 1079 1151 1079 1151 0.83 4 34 3e-14 6.2e-11 43.8 3.3 1 86 1234 1302 1234 1303 0.78 5 34 8.6e-16 1.8e-12 48.8 5.6 1 87 1327 1399 1327 1399 0.81 6 34 1.7e-12 3.4e-09 38.3 1.1 1 87 1434 1502 1434 1502 0.81 7 34 9.3e-11 1.9e-07 32.7 3.3 1 84 1543 1610 1543 1614 0.74 8 34 4.1e-15 8.3e-12 46.6 0.4 1 86 1640 1709 1640 1710 0.80 9 34 4.6e-14 9.4e-11 43.2 0.7 1 86 1732 1801 1732 1802 0.80 10 34 5.9e-13 1.2e-09 39.7 3.0 1 87 1830 1902 1830 1902 0.85 11 34 1e-06 0.0021 19.7 0.1 1 59 1975 2027 1975 2046 0.78 12 34 2.8e-11 5.8e-08 34.3 0.3 1 87 2073 2145 2073 2145 0.79 13 34 1.2e-15 2.4e-12 48.3 2.9 1 87 2175 2246 2175 2246 0.81 14 34 1.8e-12 3.6e-09 38.2 9.8 1 87 2291 2364 2291 2364 0.86 15 34 7.2e-14 1.5e-10 42.6 0.8 1 87 2386 2454 2386 2454 0.80 16 34 6.4e-14 1.3e-10 42.8 0.2 1 87 2784 2853 2784 2853 0.81 17 34 2.7e-10 5.5e-07 31.2 4.6 1 86 2911 3005 2911 3006 0.71 18 34 1.2e-09 2.5e-06 29.1 2.1 1 87 3038 3111 3038 3111 0.78 19 34 2.1e-12 4.3e-09 37.9 3.0 1 87 3139 3208 3139 3208 0.83 20 34 1.2e-13 2.4e-10 41.9 2.5 1 87 3228 3298 3228 3298 0.80 21 34 5.8e-14 1.2e-10 42.9 1.7 1 86 3321 3391 3321 3392 0.79 22 34 2.7e-06 0.0056 18.3 0.2 1 59 3408 3455 3408 3480 0.78 23 34 1e-12 2.1e-09 38.9 5.0 1 86 3496 3565 3496 3566 0.81 24 34 1.3e-13 2.7e-10 41.8 5.5 1 86 3591 3661 3591 3662 0.83 25 34 8.4e-14 1.7e-10 42.4 3.1 1 86 3682 3754 3682 3755 0.79 26 34 1e-10 2.1e-07 32.5 5.9 1 86 3775 3844 3775 3845 0.79 27 34 2e-12 4e-09 38.0 2.8 1 86 4136 4206 4136 4207 0.81 28 34 5.6e-06 0.011 17.3 1.4 1 86 4226 4295 4226 4296 0.71 29 34 3.9e-12 8e-09 37.1 4.4 1 86 4319 4392 4319 4393 0.80 30 34 0.059 1.2e+02 4.5 4.4 1 59 4421 4477 4421 4499 0.65 31 34 4.3e-14 8.7e-11 43.4 1.3 1 87 4518 4593 4518 4593 0.86 32 34 2.1e-14 4.3e-11 44.3 1.4 1 87 4622 4696 4622 4696 0.85 33 34 1.2e-13 2.5e-10 41.9 4.1 1 87 4829 4902 4829 4902 0.80 34 34 6.9e-13 1.4e-09 39.5 0.4 1 86 4927 4996 4927 4997 0.82
Sequence Information
- Coding Sequence
- atgtCACAAAATAACCAACGTAAACATTATCACATACATGCTCCCTATCAACATCCCCATCAGCAgctgcagcagcaacaacaatcaaatcatcatcatcatttgacAACatcacagcaacaacaacaacagcaacaacaacatcatcatccaCAATGGTACGCACATGGTTCTATGACACCACAACAGCAGCTAgtacatcagcagcagcagcaacaacaacaacaacaacagcagcagcagcaacagcaacattatcAGCATGGTTTGCATTTAAGAGATTCGCGCCATATACAACATCCACAACATCATCATCCTCATCCCCATGcacatcagcaacaacagcaagcaCATCACAACCACACAATGCCTCATATGTTTACAAGTGGTGGTTATGTTGGTATGTCAGCAGCATCATCAGTAGGCGGAGGAGGAGGCGCAAGCGGCGTTGGTGGTATAAGTTCAGCACATAGTACATCAACAATGGCTACACCACACAATATACCGGCTGCTGCTCCAGCATCTGCACATTATTCCTCGACACTGGCTGGTGTTTCTGGAGTAAATGTAAATGCAATTGCTGGTGTTGCAAGTAATGCcaacagtagtagtagtagtagtgctGCTAGTAGCAGTGCTAGTAGTAGTCGCCATCGTATGTTTGACCTTGAAATgttacatcaacaacaacagcaacatccgCAACATGGTTTAGCGGCAGCCTCCCATTCCCATTCAAATGCCATGCTAGCAAGTGCTGGCAGTGCAAGTGCCAGGGCTAGTTATGATGCCTATTCACACAGCTCCTTGTACGCTTCACAGCATAGTCAACGGCATCATTCTGCTCCTGCGTCACATCATCGCGCCGGTGTTACACATTCAGCAGCTCATGCTTTGCATCATCATCAGTCGCTGCATCCGCAGCAACTTCACCACCAGCCTCAACCGCCATCCCTAACATCTCACCATcttcaacagcagcagcagcaacaacaacaacaattccatCAGCAGcaacattattatcatcatacGGCACCAACTCCTTTACATCGATCACATCCGCATGCATTGCCGCAAATGATGCCACACATTAAATCTGAGCCAGTAGAGCAAATAACAATAACGCCATCGATACAAACCGAAGAAGTCATCATCAAAGcTGAACCCATGGATGATGTCGGTTATCACAAAATTGCGCCACAAATTGGTAACAATTCGTTTCACATGgaagacaaacgtaaacaatatgaacagcaacaacaacaacaacaacagcagcagcaacaattacaacaacgcCGGTCAGAACTGCTATTgcgacaacagcaacagcaacaacaacaagagcaacaatTACGCCAACAAcagctgcagcagcagcaacaacaacaacaacaacggcaggAAGAAAatcgtcattatcatcatcaacaacaacagcaacaacaacaacaacaacaacagcagcagcaacaagaaGAGCAACAactaaaaaggaatgaaaattcGCATCATAATGAAGATGTTGTTGGCCAACAAGCACGGGGCACAAATTCCGAGAATTCTACAACATTACTACCAGAAGAACAAGAAGACATAAAACCAACAGAGCAGCAGCTACAAGAACAtcaacaccaccaccaacagcagcagcaagagCAGCAGCAAATATCCTTGACTAATATAAAAACAGAAGCAaagCCTCTTAACTTTCCTCGTCGTAAATTACAAACAGAACGTTCCTCCACTCTGCCCATATGCCAGCGatgtaaacaagtttttttgaaaCGTCAAAACTATGCACAACATGTTGCTCTATCCAGTTGCAATATTGTCGAATACGACTTTAAGTGCTCCGTATGTCCCATGTCCTTTATGTCGAATGAGGAGCTATTGGCCCATGAACAATTGCATCGTATGAATCGGTACTTCTGTCAAAAGTATTGCGGCAAATTTTATGACACCATAAACGAATGTGAACAACACGAGTATCGCCAGCATgagtatgaaatatttaaatgtaatATTTGTTGTATATCGGTTACTGAACGTGACCAATTGTTTGGCCACTTGAATGAGCACAAATATCAACCGCGCTTTGATTGCTGTATTTGTCGTTTATGTTTTCAAACTTCCTTGGAACTACACGATCATTATATGgcgaatgaaaatttttgtggtaaattttatgataaagaagCCTTTAAAAAACCAATTACCTCGTCTTCAACCTCATCTTATATGGGAAAAATGGAAAGCCCGAAACTGGAAATTGCAAATAGTTTCTCGCTAAAAGATATGCCTCCCGCCAACAGCCATCATTTGGAGCCTCTCTacacaaaatcaaaaacttccaTGAAGCCACCTAATGAGCCTACAACACCGCCTTCATCGTCGTCGTTTAGTAGTGCGGTAAATGACTTTTCTTTAGAACCTCAGGTAGAggtaaaaactgaaattaaagtGGAACCAGATTTTTATCCCCCCCTGGATCATGCTGATTACACACCCTACGACAATGATTATGCCACACCGGACTATTCTACTGGGTCTAATCAAAACATACCATTTTTACAAGACTATCAAGATAATGCTTCCAATTCTACAAATTCATCATATTCCTTCAATAACAACGATGCCGCACCTGATGACGAAGCCATCTGTTGTGTGCCTAAATGTGGTGTAAGCAAATTATCATCTCCTACTTTGCAATTCTTTAGCTTTCCCAGAGACGAAAAGTATTTGTCGCAATGGttgcataatttaaaaatgactTACGATCCAAACACTAACTATTCCATGTACCGTATTTGTAGTTTACACTTTCCCAAACGTTGCATAGCAAAGTATTCCTTAAGTTATTGGGCAGTGCCCACTTTTAATTTAGGTCACGACGATGTAGGAAATCTATATCAGAATAGGGAAAGTTCTGGGGGTTTTCCTGGCGGCGAAATGGCCAAATGTAGTATGCCCGGCTGCCCTTCTCAGCGAGGcgaaacaaatgtaaaatttcACGTATTTCCCCGCGATCTGAAGACACTCATTAAATGGTGTCAAAACTCTCGCCTTCCCGTACACAGTAAGGATAATAGATTCTTCTGTTCCAGACATTTTGAAGAGAAATGTTTTGGCAAATTCCGTTTAAAACCCTGGGCCATACCCACCCTGAATTTAGGCACCGTTTACGGTAAGATTCACGATAATCCGAATATCTACCAGGaggaaaagaaatgttttctacCATATTGCCGTCGCAGTCGTTCATATGATTGCAATTTATCTTTGTATAGATTTCCCCGAGATGAGACTTTGTTGCGCCGTTGGTGTTATAATTTAAGATTAGATCCCAACATGTATCGAGGCAAGAATCACAAAATCTGTTCGTCTCATTTTATTAAAGAAGCCTTAGGATTGCGTAAACTTAATCCCGGAGCAGTGCCCACTTTGAATTTAGGACATAATGatagatttaatatttatgaaaatgaacTTTATACACCACCACCGCCGCCACCACCTCCTCAACCATCAACCTCATCTAAAGCTCATAAATTTGCCGAAATGTTTAAACAAGAAATGGGCAACTCTTCGCGTTCGTCTCGTCTTTACGATGGCGTCTTCATGAGTTCTATGAGCCATAAATTCTCCAATTCAAGTTCCTCGAATAGTTCTAACACTTTGGATTTGGGTGATGTATGCTTAGTGCCCTCTTGCAAGAGAACTCGCCATTCAGATGACATTACACTGCATACCATACCCAAGCGGGAAGAACAATTGAAGAAATGGTGCCATAATTTGAAGATGGATTTGCGTAAAATGCACAAGAGTGTGCGCATTTGTAGTGCACATTTCGAAAAGTACTGTATAGGCGGCTGTATGAGACCATTTGCAGTGCCCACATTAGAGTTGGGCCATGAAGACTTAAATATATACCGTAATCCTGatgttataaagaaattaaatataagaGAGACCTGTTGTGTGGCCTCGTGTAAGAGAAATCGTGATCGTGATCATGCCAACCTACATAGATTCCCCACACATCCGGAGTTGCTTCAAAAATGGTGTGACAATTTGCAAAAACCCGTACCAGATGGCACAAAACTTTTCAATGATGCCGTCTGTGAAGTGCATTTTGAGGAacgttgtttaaaaaacaaacgtttAGAGAAATGGGCCATACCCACATTAAATCTAGGCTGGGATGGTGCCCCTCACCATTTACCCTCTGAGGAAGAGATTAACGAATACTGGGTTAAACCATTTGCCCCCAACAATGGCGAAGAACAGGGCGAGTGTTGTGTGGCTAGCTGTAAACGTAATCCTCAAATTGACGATGTGAAGTTGTACAGACCTCCCGAGGATTCCGAACAATTGGTTAAATGGGCTCATAACTTACAAGTGGATGCGAATGATTTAcccaatttgaaaatttgtaatttacaCTTCGAACAGCATTGCATAGGCAAGCGTTTGTTAAATTGGGCAATGCCTACGCTGAATTTGGGTGGCAAGGTCGAGCATCTTTTTGAAAATCCACCTCCAATGCCGAATATATACAAGAGGAAAGAAAAGCCTGCAAGGATACTATCCAGCCACGAAGGCATCAAATGGTCTCCCAGATGTTGTCTTTCCCACTGTCGCAAAATGCGTTCTGTACATAACGTCCATCTATTCCGTTTTCCATACAGTAATCGCCAGACTTTGGCGAAATGGTGCCACAATTTGCAGCTACCCTTAGTGGGTAGTTCGCATCGCCGTATATGTTCGACTCATTTTGAACCCTCTGTCTTAACTAAACGATGTCCCATGACATTGGCTGTGCCCACATTGGATTTGAATACACCACCCGGCTATAAAATCTACCAAAATGCAGCTCgtctaaaacaaataaaactcgGATCTCAAAGGCAGTGTGTAATAGAGTCTTGTCGCAAGACTAAACTGGATGGTATATCGCTTTTCCGTTTCCCCAATAACCGTTCAATGTTATATAAATGGCGacacaatattaaaaactgGCCCAAGGGAAAGCTAACATCTACGTTCAGAATTTGCACCGAACATTTTGAGCCTCATTCGGTGGGTGAAAGGAGACTGTCGCCTGGTGCTATACCCACTTTAAAGTTAGGCCATGATTCCACCGATTTGTATCCCAATGAGACACGTTCGTTCTTTGATTTGGAAAAATGTGTAGTCAGCGGTTGTGATTCACGCAAGGAAATGGAGGACGTAAGACTGTTTCGTTTTCCACGCGATGATGACGATGTGCTAAGGAAATGGTGCAATAATCTCAAGATGAATGCCAACGATTGTGTGGGtattaaaatatgcaacaaacatTTCGAGGACGAATGTTTAGGTCCACGCCTGCTTTACAAATGGGCCGTACCTACACTTAAACTGGGCCACAAAGAAGATGAATTAGTGGAAATCATACCGAACCCGCCACCTGAACAGCGAACCGgagaatttctttataaatgttGTGTGCCTACGTGCGGCAAGTCACGCAAATACGACGATGCCCAAATGAATAGTTTCCCCAAGCATTTGAAAATGTTCCGCAAATGGAAACACAATCTGAAATTGGATTTTCTGAATTTCAAAGAAAGGGAAAAATACAAAATCTGCAACGATCACTTTGAGCCAGTGTGTGTGGGTAAAACGCGACTTAATTTTGGTGCTTTGCCCACTTTAAATTTGGGTCACGATGATGTAGACGATTTATATCAAATCAATCCGGATAGAATAAGACCCAATCTATTCATAAGACAGAAGGATGTGGAAAGGCTAGAGAGAAAAAGACTACGCGAAACAACCAAAGAGCAATATGAGTGTGACAATCCTgagcatgatgatgatgatgatgatgaaaatgctGTAGATCCTTTGAGTTTGGAACCGTCGGACATTAAATGTTGTGTTATGGATTGCACTGCACCCAAGTCCATAATGAGAGAACCTTATGAGCTGCCCGAGTCATTAGAATTCAGACAACTTTGGTTTAAGGAATACCAAAAAAGCGATGACGAAGAACAACCAATTGAAACGAAAATCTGTGGCCTACATTTCCAAATATACTTCCAACAGCTTAAGACTAAAATGCTTAGTATGTTAGAGGAGGGAAATGAAGTTTTGCAAACTGATTTCAATAACTTGCAATACAATTATCAAAAGTCTACCATATCGCTGGTAGTAAACAGTTACCAGTGTCGTGTGGAGGGCTGTACTTCCAATTTATTAAATCCAAACATAAGACTGTATTTCTTTCCATATGGCAAAAATTTAGTTTCCAAATGGTCTCACAACACCGGCATCATACCCGATGAGCATCGCCGCTATATGAATAAGGTGTGCGCTTTACATTTCGAATCCTATTGTGTAACAGAGAACCAAAGATTACGTTCGTGGGCAATACCAACACTTAATTTGCCTGACAGTGAGAACAAACGAGTCTACAAAAACCCCGATCTAACGAAACTTGATAAACGAATGTTAGGACCACAAATATCAAAATGTGCAGTTGCCAACTGTAATTCCGATAAAACGAATGATGATGAAtccataaaattgtttaacttcCCTACTGATGAAAATCTACTCAAGAAGTGGTGTGACAATTTGAAAATGTCCCATCGTTTGACGCCTTTGCAAAAGATCTGctctttacattttgaaaagtCATGCTTGGGCAGCTGCCGCATACGTTCTTGGGCAATACCCACCTTGAATTTGGGCCACGACGAAGCACCAGAACATCCGAATAAAAAAGTCACAAGCCGAGAGGTTTATGATGCATCAGAAGATACAACCGAAGTACAACTGaaacaagttaaaattaaaCGGCCCCTGGACTCAACCAAATGCTGTATAGCCAGTTGTCGGAAGAGTCGTTTAAAACACGGCGTAAGATTGTATTGTTTGCCTAGCAATTCAAAAATGAAACGTAAATGGAtgcacaatttaaaaataaatcatttaaaaaacaatcccAAATTGCACAGCATCAAGGTTTGTAATCATCATTTTCATAAAAGATGTTGGGatggcaaaaatttaaaacaatgggCTGTGCCCACCTTGCATTTGGGACATTCAGAGGCTATTTTTGATAATCCTCGCCGTATACATGCGGTGCCTATAGTACGCTGCGCCCTCAGTAGTTGTAAGAACCACAAGGCAATCAAGGATGTGCAAGCATTTGTATTTCCCAAATCCCCTGAATTATTGGAGAAATGGTCAAAGAATCTAAAATTAGATTTGGAACAGTGCACGGGTAAAATATGTTACGAACACTTTGAAAAAGAGGTTTTAGccgagaaaaaattaaaggccaATGCAGTACCCACACTAAATTTAGGGCACGAGGatattattttcgataacacGGACTTAATCGATAAATTACAGAGAAAGCAAACAGAACAGTTAAATGCTAAGAAACGACGTTACGAAGACGACGATGATTATATGGATGTTGAGTATGAAGATCGagtggaggaggaggaggaggataaTGACATGTGGGAATATGAGGAGTATAATGAAGATTTGgaagatgaggaggaagatgaagACGACGAATATGAAGAGTGTTATTATGATGACGgcgaagatgatgatgatgatgatgacgtggATGAAGAGGAGGACGACGAAGATGAAGAATTTGAAGAGAACGATGACGAACACAGCATATCAAATTCAATAACTGATTGGAGTGGCATTAAATTCAAAGAACTGCGCGTCTCCCTTACTCCTTTAACGCCCGAAGATCTTATGGATTTATGTTCACGCTCCTCATATGAAAGAGAATTTGGTTCTATAACCTCTGGCAGCAGCTTAAGGGGACGTAGATCGATAACACCAGCGGCAAGTTTAAAAGATTTACGTTCAGAAACTCCAGAACAAAACTCAAGATCCGAAACTCCCAATCAAAAGCAATTCAATTGTTTCAGAGAACCTCTTatcattgctgctgctgctgatcaAATTAAATCGGATAACTTCCGTGAACCTAATGCTCTAACGCCTGAACAAAAACCTGATTCAAGAGATTCCCTAAATGAGTACTCAATAGATGCTATAATACATAGCTCAACTCCAgttggaaaacaaaacaattcaaatgaaatttcaaaagcAATTACAACGCCAGAGAATTTCGAAATACATGAAACGTCGAACAATTCAAAAACACCGGCTTTAGATACGTGCAATGAAAATGCCAAACGGGAACGTTTTGCAGATGAAAACACCACAAGTTCAGACCATATTGATTTGGAACATAATGCCAGCACAAATCTTCGAACCGATAAACGACTCAATCCTATATCtccatgttgttgtttaaaacattGCGGCAAAGAAAAAACACCTGAACAACATTTAACGACATACGGTTTCCCCAAAGACCCACAGCTTTTAAGGAAATGGTGTGACAATCTAGGTTTACAACCAGAAGAATGTATAGGGCGTGTGTGTATAGATCACTTTGAATTACGCGTGGTGGGCGTAAGACGTCTTAAGCTGGGTGCTGTACCAACCTTAAATTTAGGACCAAATTGTATAGCCAAACATACTAACTCAGAGGAAACACCACAAAAGAAAGCCATAATCAAAGAATTTACCGAAACGGGTAATATGCAGGAAACGGACACCAGCTCTAAGCCACTACCACCATATAAGACAACGAAACCTGGTAAGCAATCGGTTTTTCGGCTATGTTGCCTCAAACATTGTCGTCGCAAGAAATTTATAAAGCCGAACAAGAAGACGAAACGACCGGATTTGAATCAGACTTCAATGGAATGTCAGAAGAACGCAGCAGTAGCCCCGAATATATTATTTAAGTTCCCCTCGGatacaaaaattctgaaaaaatggTGCAAAAACTTAAGGTTACCGGAAAAACTTTGTTTACCTTCCGACCTGGAAATATGTGCGAGACATTTTGAAGCTAAAGCCATACAAGATGGTAAATTACATCCCAAAGCCATACCCACATTGGAGTTAAGTTATGCTAACAGGGCgcccatttataaaaataacccCAAAGACTTTGAGCAACTACCGCAAAACGCTAAACAGGAAACTAAAGAGAAATGTTTTCTTGAACACTGTGGCAAAACAGAAAACGACGACAATACATTCCTTATATCGTTTCCTTTAAACGAACCCCGAACATTACGTAGAtggtgtaaaaatttaaaaatcgattGCGATAAAAACAAGCTAAAGACCCTGAAAATATGCAACGAACATTTCGAAACGTATgtgtttttcaaaaagaaacatttaagaGTTGGCGGATTACCCACCCTCAACTTGGGCCATAATGGCGCTATCGTTAGAAATTGCCGTAAATTACGTTTAAAGAAAACGAATGGAGGCGCGGTCAAAGAGAAATGTTGCGTACAACAGTGCCAGGAAACCaatcttaaattgttttcattcccCAGAAGTTCGGATTTACGTAAGATTTGGTGTAACAATCTGCAGCTGGATTTGCGTCAAGTACTGATTAATCATTTGAAAGTATGTGCACGACATTTTAACATTGAATGCTTTACCGTGGGCACCGACCATCTAAAACTGAATGCAGTGCCTATGCTGCACTTGGGTTTACAAAGTGAAACGCACATGGTGTTGGAAATCATGCCAAGCGAAAGGAAGTGTATGGTTGAAAATTGTCAAAAGACACCCAGTGTAGATCgagttaaattgtttaattttccacAAAAGAAGGACATACTCAAGAAGTGGCTTTTCAATCTAAACTTATCACCCGATACATACAATCCGAATGCCTTCATTTGCAGTAGACATTTCGATAAGAGCTGCATAAAGAACGGCATgctacatgaaaattctataCCAACGCACTTTTTACAAATCACACCCAAAGGCTGGTTCTATAAAAACAACGAGGAATTGTATGAAATGCCTAAAAAATGTTGTGTCCTTAACTGTGGTCAAACCTCGGAAGAAGCCAAACATTTGTATAGATTTCCCAAACATAAGGAGGATTTGGAAAAGTGGTTATACAATCTCAAACTGCAGGTGGATGAGAATGATGTCAAGGACTTAAGGGTATGCGACAGACATTTCGAACAGAATTGCAAGATTTCCAATAAGGATTTGATTACACAGTCTTTGCCCACATTAAATTTGGGCCATACAGATACTGATATTTATGGCAATAATTTCATCAAGTGTTGCCTGAACGCATGCAACATAGAGGGCTTCTATTTCCATAAGCTGCCTGAGGATTTGATGCTGCAAAGCTATTGGTTTCAGGAACTCGAAATGGAAAGCACATACAATAGTTCTTTGTATATATGCTCAGTACACTTTGTTGCGTTTTTCGAACGGATATTGGAAAAGTACAGTGCTTTCCTTAAAGAATCCAAAGAGTATGTAAAACTAGCTTTAACCTATAATGAGCTGAAGTCTTTACCGGCCTTGCAATGCTACAGATGTTTCATACCCAAATGCAATTCAGGTTTTAAGCTTATttggaaattgtttaaatttcccAAAGATGAGACTTTGTTTAATAAGTGGCTTCACAACACGGGCTTGCAAATTGAACACAGTCAAAGGCCTTGCTATCGCATATGTGCCCAGCATTTTGAAGAGAGATGTTTAAGTGAAAAGAAATTACATCGTTGGTCTTTGCCTACCTTGAAATTGCCTTTCAATAATAGTTTGTATGTCAATCCACCTGAAGCTTTACCCTCTAATCATGAAAACCTCAAACACTGTTGTGTCTCAAACTGTCTTAACGAGAAAGGGCCATTTTTCAAGTTTCCTATTAAGCAATTGGAGGTTAAAAAATGGATCCACAACTTGGATTTGGGTCACCAACAATGTACACTTAATTTAAGGGTGTGTTATAAACATTtcgaaaattattgtttttcgaaaatcaacaataaaatcCGAAGTCTTAAAAGCTGGTCAGTGCCtactttgaaattgaaaagaaaatccgAACTTTATCTTAATCCCGCGGATAAGATAGCCTTCTATGTGTGCTGCATCAACAGCTGTAGGCAGACTTTAAACAAAACCAAGCAAATCTTTCTGTATAAATTTCCACAAAGCAATACCCTTAGGCAGAAATGGttacacaatttaaatttaacgccACAACAGTACAAGGAAACTATGAGGATATGCGGCGTACACTTTGAAATGGATTGTTTCTATAAAGATTTTAAGCTAATGCGCAAACACTCAGTGCCCACTTTGGCTTTGGCCACCAACGTAAAAGAGTTGTACCGAAATCCAGTTCGAAGGCCATATCTAAAGTGTTGTGTTAAGCTATGCAAGGGTCCTTGGAAGAATCTAATCAACTTTCCTAAACACAAAACATTATTAAGGAAATGGTgccataatttaaatttcaacaaggATATCACTCTGGAGTCCCtcagagaatacaaaatatgtgAAAATCATTTTGAAAAGCAATGCTTCAATAGAAATGGTGTAATAAGGCCTACAGCTATGCCCACAGTGAAACTGGGCCATAATAGGAAATTGTTTCTAAATCCAGATTTCACTCGTATTCCggcaataactacaacaacaactaaaaagttaattaaagaTAACGAAGGAGatttagaaaaatcaaaagaaatgcaAGTGCCggttaaagaagaaaaactgaaaacgaatcttaaaaaagaaaaagaggagaAGAAGGATAAGGAGAAGGGGGAGAAGgagattaaaacaaaattaacaaccCTCAGTCATCGTAAACTAAACTcagtgaagaaaacaattaaaacggAAAGCCTACCACAAAAACTGGGACGACGAAAGAAActtattaaaacgaaaattttaacaaaGCAGAGTAAAATGAAAAAGCCAAGCAAAACCTTAGACTCAATCTGTAATGAAACTAAAAACGATGAATTGAATAGCGATGAACCAATTAAAGAACAGTTAATTGCTTCCACTTTACCGGTAAATATTAAACAAGAAACTAATGATGAGCAGCCTCAGCAAATGGAAAAAGAACATATTGAACCCAAAAATATTAATGATGTTCCACAGCCAGCCAATATTAAACACTTGCCAACAACAGGCAATGAAGATGATTATTTGGAAAGTTTGCTGGAGATTTTAACTGAAACTTCAGAATTAGAAGGAAATCAAAatgctaaaacaacaacaacaaaaccagaACCTTTAGAGGaaacaactttaaatataattcctgcaaataaaatggaaaaagaaaagtcTGCTGTAAACAAGGATAATAAATCACCACCTGCTCATGATCATAGCAAGAGTAATGAGGAGAACAATATGTTGAACATAATTACTGAGGCTCCTATTGTTAAAGCGCCAAAGCCAACACAAAATCTTAACGCCTGTTGTGTGAAAACCTGTCCAAATTACAATAATCCCAATGCCTCAGTTGCATtatttaaaataccaaaaataaatggcctacGCCATCATTGGATCTCTAATTGTCATCTTAAAAGTGGTATTTTAAAGCCTGTTAAAGTCTGTATAGAACACTTTGAATCGCACTGTCTTAAGGATAACAATCGTTTGCTATTTGGTGCAGTACCTACTCTGAAGCTGGGCGTTAAACTAGACTCTAaggagattttaaaaacttttagttATTCAAGGTGTCGGATCGAAAGCTGCCAAAGATCAATTTATTATGATAAAATCAATCGTATACCATTTCCCAAAGGTTTAATGAAAACCAAATGGTGCTgcttattgaatttaaatgaagAGGAAATCTCTAACAAAGACTGGATTTGTCACAGGCATTTTGAGAAGGGGACCTTAATTGATTGTCGTAAATTGAAACCGGGCACACAACCTACTTTACTCTTGGATATCAAGGCTAAGAATGCAAATACAAATGATGATGTCTGTAAcacaacaaaagcaaagaaatgtTGTGTACGATCCTGTAACAGCAGCAGTCTTGAGCATAAACTTTTCCCTTTGCCTATCGCGAATGAGGACATGTGCAACAAATGGTTGCACAACTTGAATTTGTTGGAGAATTGCTCTGCCAGTGAACGTAAACAGTATTATGTGTGCGAGCAACATTTTGAAGGGCACTGTTTCCATAGAATAAGTGGCCGTTTAAAATGGGCTGCTTTGCCTACTTTGAGGTTAGacagaaaaaagaatttatacCTGCTAAGCGATAAGGAACTGCGCATTACACCCACCTTAAAAGATACGAAATTGAAATGCTGCTTTTCAAATTGCCATAAGGACAGTTCATTACAATCATATGACTGGCCTTATAAGGACATTTGCAAGGATATACTTCTacagcaacagaaacaaaatgaTGATGGTAGCAGTGTTAATATAAGAAAGGAAACTGACTGCGTACGTTTATGTGATGAACActtttataaattgtataaacCTAACCAACAGGCCATAGCTGATTGCCATTTGGATGCCAAAGTAAAGAATACGCTCAATTTGATATATGAAGACTTaagcaaacaaatgaaattctATACACGTAAGTGTATAGTGCCTGAATGCACGACCGATTATAGCTTAAAGGAAGACTATAAAACcttaaaactttttagttttcctAAAACCGATTTGGCTAAGAAATGGTGCCATAATATAGGCCTGGACTTTAACTCTTTGAAATCTAAGCCCAACCAAAAAGTCTGTGAATTACACTTTGAGCCATATTGTTTAGCCCGAAGAATGTTGTTTAATTGGTCAATACCCACGTTAAATTTACCTGCCAAACATACCGAAACCGAATCAAatcataaaattatacaaaatgatGCCGAGGACATATTCGCTTATTCGGGACAATGTTGCATAAAAGGTTGTATCAATGAAATGGGTTTggattgtaaaacaaaaacccGCTTTTATAGATTTCCTACACAAACAGACATTTTAGAGCAGTGGTTACAGCTAAGCAAATGtaaagatttcaatttaaatgtcaCACGCATTTGTGGTTTACACTTCCATGCAACAGATTTCCTAAATAAGAAAACTTCTCTTAAAGAAGATGCTGTCCCCAGCATAAATTTATCAACAGAATTAAACAATTCACCTGCCAATTCAATCATAGATGAAAATATACAAGTTAAACAAGAACTAGACAATGCAGAGGAATGGTGCGAACAAGGCAACGAAGAGGACATTAACTTTAGTTATGGAAACATGGCCTTGAAATCGTTTGCTTTGGATAGCAGttttgaaactaaatttaaagACGATCTTAAAACCCAAGATTATCATGCTTTATTAGACATAAAAGAAGAGATCATTGAAATTGAAGAAGACAACAATATTATGTacgaaatcaataaatttgaggAGGCCAATGATTTTGAatacgaaacaaatttcaataatatggTGTATCCCAATGAGGAGCCTGCTGGCTTTGTGATAAGCGATGTCAagtctcaaatttatttatgctgCGTACAAAAATGCTCTAACAGCTCGGAAACAGCAAATGTTAAAATTTACACTGAATTTCCTACGGACTCGGAGATTTTCATTAAATGgtgtttcaatttgaaaattgaTCCACGCAATTACAAGGAAAATCAATATGCCATTTGCCAAAAGCACTTTGAAAGCATATGTTTTACCGATACCCAAACGCTTTACCCGTGGGCAGTGCCTAcgctaaatttgaatttaaatgaaaactctTTTATACACAAAAACGATGTACCCGACTATCTAAAGCCTTGCAATGAACAATGCATTGTTTACGGTTGCATAAACCCCTTAAAGCCGCTGTATAAATTTCCTCAAGAGGTCGAGGTAACACAGAAATGGTTTACAAACCTGAAACTGGATTATACGGACTTTAGAGCACAAAATTATCGCATATGCAGAAGGCATTTCTCGCAACAATGCTTTGTGGAGGCTACTACGGATAAACTTAAAACTGAAGCTTTACCCACTTTGTATTTGGGTCACACGGATAAAATCgtgtatttaaataatagaGAAGAACAGCAGCAATTAGATCATGATGATATAGGGCAAGCAGGTTTGGCTGCTGGTGGTGTAAAtcatggtggtggtggtttagTTGTTGCTAATAATCAGGACAATAGTCGCGGTAGTAGTCAAGGCTCTTTGGCAAGAATAATATCACCGCATGATCTAGAGGATCATGACAGCAGTTATTATGAAGATTTTGAAGAGTATTATGGTCAGGatgattaa
- Protein Sequence
- MSQNNQRKHYHIHAPYQHPHQQLQQQQQSNHHHHLTTSQQQQQQQQQHHHPQWYAHGSMTPQQQLVHQQQQQQQQQQQQQQQQQHYQHGLHLRDSRHIQHPQHHHPHPHAHQQQQQAHHNHTMPHMFTSGGYVGMSAASSVGGGGGASGVGGISSAHSTSTMATPHNIPAAAPASAHYSSTLAGVSGVNVNAIAGVASNANSSSSSSAASSSASSSRHRMFDLEMLHQQQQQHPQHGLAAASHSHSNAMLASAGSASARASYDAYSHSSLYASQHSQRHHSAPASHHRAGVTHSAAHALHHHQSLHPQQLHHQPQPPSLTSHHLQQQQQQQQQQFHQQQHYYHHTAPTPLHRSHPHALPQMMPHIKSEPVEQITITPSIQTEEVIIKAEPMDDVGYHKIAPQIGNNSFHMEDKRKQYEQQQQQQQQQQQQLQQRRSELLLRQQQQQQQQEQQLRQQQLQQQQQQQQQRQEENRHYHHQQQQQQQQQQQQQQQQEEQQLKRNENSHHNEDVVGQQARGTNSENSTTLLPEEQEDIKPTEQQLQEHQHHHQQQQQEQQQISLTNIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYAQHVALSSCNIVEYDFKCSVCPMSFMSNEELLAHEQLHRMNRYFCQKYCGKFYDTINECEQHEYRQHEYEIFKCNICCISVTERDQLFGHLNEHKYQPRFDCCICRLCFQTSLELHDHYMANENFCGKFYDKEAFKKPITSSSTSSYMGKMESPKLEIANSFSLKDMPPANSHHLEPLYTKSKTSMKPPNEPTTPPSSSSFSSAVNDFSLEPQVEVKTEIKVEPDFYPPLDHADYTPYDNDYATPDYSTGSNQNIPFLQDYQDNASNSTNSSYSFNNNDAAPDDEAICCVPKCGVSKLSSPTLQFFSFPRDEKYLSQWLHNLKMTYDPNTNYSMYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPGGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSRHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPYCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPPQPSTSSKAHKFAEMFKQEMGNSSRSSRLYDGVFMSSMSHKFSNSSSSNSSNTLDLGDVCLVPSCKRTRHSDDITLHTIPKREEQLKKWCHNLKMDLRKMHKSVRICSAHFEKYCIGGCMRPFAVPTLELGHEDLNIYRNPDVIKKLNIRETCCVASCKRNRDRDHANLHRFPTHPELLQKWCDNLQKPVPDGTKLFNDAVCEVHFEERCLKNKRLEKWAIPTLNLGWDGAPHHLPSEEEINEYWVKPFAPNNGEEQGECCVASCKRNPQIDDVKLYRPPEDSEQLVKWAHNLQVDANDLPNLKICNLHFEQHCIGKRLLNWAMPTLNLGGKVEHLFENPPPMPNIYKRKEKPARILSSHEGIKWSPRCCLSHCRKMRSVHNVHLFRFPYSNRQTLAKWCHNLQLPLVGSSHRRICSTHFEPSVLTKRCPMTLAVPTLDLNTPPGYKIYQNAARLKQIKLGSQRQCVIESCRKTKLDGISLFRFPNNRSMLYKWRHNIKNWPKGKLTSTFRICTEHFEPHSVGERRLSPGAIPTLKLGHDSTDLYPNETRSFFDLEKCVVSGCDSRKEMEDVRLFRFPRDDDDVLRKWCNNLKMNANDCVGIKICNKHFEDECLGPRLLYKWAVPTLKLGHKEDELVEIIPNPPPEQRTGEFLYKCCVPTCGKSRKYDDAQMNSFPKHLKMFRKWKHNLKLDFLNFKEREKYKICNDHFEPVCVGKTRLNFGALPTLNLGHDDVDDLYQINPDRIRPNLFIRQKDVERLERKRLRETTKEQYECDNPEHDDDDDDENAVDPLSLEPSDIKCCVMDCTAPKSIMREPYELPESLEFRQLWFKEYQKSDDEEQPIETKICGLHFQIYFQQLKTKMLSMLEEGNEVLQTDFNNLQYNYQKSTISLVVNSYQCRVEGCTSNLLNPNIRLYFFPYGKNLVSKWSHNTGIIPDEHRRYMNKVCALHFESYCVTENQRLRSWAIPTLNLPDSENKRVYKNPDLTKLDKRMLGPQISKCAVANCNSDKTNDDESIKLFNFPTDENLLKKWCDNLKMSHRLTPLQKICSLHFEKSCLGSCRIRSWAIPTLNLGHDEAPEHPNKKVTSREVYDASEDTTEVQLKQVKIKRPLDSTKCCIASCRKSRLKHGVRLYCLPSNSKMKRKWMHNLKINHLKNNPKLHSIKVCNHHFHKRCWDGKNLKQWAVPTLHLGHSEAIFDNPRRIHAVPIVRCALSSCKNHKAIKDVQAFVFPKSPELLEKWSKNLKLDLEQCTGKICYEHFEKEVLAEKKLKANAVPTLNLGHEDIIFDNTDLIDKLQRKQTEQLNAKKRRYEDDDDYMDVEYEDRVEEEEEDNDMWEYEEYNEDLEDEEEDEDDEYEECYYDDGEDDDDDDDVDEEEDDEDEEFEENDDEHSISNSITDWSGIKFKELRVSLTPLTPEDLMDLCSRSSYEREFGSITSGSSLRGRRSITPAASLKDLRSETPEQNSRSETPNQKQFNCFREPLIIAAAADQIKSDNFREPNALTPEQKPDSRDSLNEYSIDAIIHSSTPVGKQNNSNEISKAITTPENFEIHETSNNSKTPALDTCNENAKRERFADENTTSSDHIDLEHNASTNLRTDKRLNPISPCCCLKHCGKEKTPEQHLTTYGFPKDPQLLRKWCDNLGLQPEECIGRVCIDHFELRVVGVRRLKLGAVPTLNLGPNCIAKHTNSEETPQKKAIIKEFTETGNMQETDTSSKPLPPYKTTKPGKQSVFRLCCLKHCRRKKFIKPNKKTKRPDLNQTSMECQKNAAVAPNILFKFPSDTKILKKWCKNLRLPEKLCLPSDLEICARHFEAKAIQDGKLHPKAIPTLELSYANRAPIYKNNPKDFEQLPQNAKQETKEKCFLEHCGKTENDDNTFLISFPLNEPRTLRRWCKNLKIDCDKNKLKTLKICNEHFETYVFFKKKHLRVGGLPTLNLGHNGAIVRNCRKLRLKKTNGGAVKEKCCVQQCQETNLKLFSFPRSSDLRKIWCNNLQLDLRQVLINHLKVCARHFNIECFTVGTDHLKLNAVPMLHLGLQSETHMVLEIMPSERKCMVENCQKTPSVDRVKLFNFPQKKDILKKWLFNLNLSPDTYNPNAFICSRHFDKSCIKNGMLHENSIPTHFLQITPKGWFYKNNEELYEMPKKCCVLNCGQTSEEAKHLYRFPKHKEDLEKWLYNLKLQVDENDVKDLRVCDRHFEQNCKISNKDLITQSLPTLNLGHTDTDIYGNNFIKCCLNACNIEGFYFHKLPEDLMLQSYWFQELEMESTYNSSLYICSVHFVAFFERILEKYSAFLKESKEYVKLALTYNELKSLPALQCYRCFIPKCNSGFKLIWKLFKFPKDETLFNKWLHNTGLQIEHSQRPCYRICAQHFEERCLSEKKLHRWSLPTLKLPFNNSLYVNPPEALPSNHENLKHCCVSNCLNEKGPFFKFPIKQLEVKKWIHNLDLGHQQCTLNLRVCYKHFENYCFSKINNKIRSLKSWSVPTLKLKRKSELYLNPADKIAFYVCCINSCRQTLNKTKQIFLYKFPQSNTLRQKWLHNLNLTPQQYKETMRICGVHFEMDCFYKDFKLMRKHSVPTLALATNVKELYRNPVRRPYLKCCVKLCKGPWKNLINFPKHKTLLRKWCHNLNFNKDITLESLREYKICENHFEKQCFNRNGVIRPTAMPTVKLGHNRKLFLNPDFTRIPAITTTTTKKLIKDNEGDLEKSKEMQVPVKEEKLKTNLKKEKEEKKDKEKGEKEIKTKLTTLSHRKLNSVKKTIKTESLPQKLGRRKKLIKTKILTKQSKMKKPSKTLDSICNETKNDELNSDEPIKEQLIASTLPVNIKQETNDEQPQQMEKEHIEPKNINDVPQPANIKHLPTTGNEDDYLESLLEILTETSELEGNQNAKTTTTKPEPLEETTLNIIPANKMEKEKSAVNKDNKSPPAHDHSKSNEENNMLNIITEAPIVKAPKPTQNLNACCVKTCPNYNNPNASVALFKIPKINGLRHHWISNCHLKSGILKPVKVCIEHFESHCLKDNNRLLFGAVPTLKLGVKLDSKEILKTFSYSRCRIESCQRSIYYDKINRIPFPKGLMKTKWCCLLNLNEEEISNKDWICHRHFEKGTLIDCRKLKPGTQPTLLLDIKAKNANTNDDVCNTTKAKKCCVRSCNSSSLEHKLFPLPIANEDMCNKWLHNLNLLENCSASERKQYYVCEQHFEGHCFHRISGRLKWAALPTLRLDRKKNLYLLSDKELRITPTLKDTKLKCCFSNCHKDSSLQSYDWPYKDICKDILLQQQKQNDDGSSVNIRKETDCVRLCDEHFYKLYKPNQQAIADCHLDAKVKNTLNLIYEDLSKQMKFYTRKCIVPECTTDYSLKEDYKTLKLFSFPKTDLAKKWCHNIGLDFNSLKSKPNQKVCELHFEPYCLARRMLFNWSIPTLNLPAKHTETESNHKIIQNDAEDIFAYSGQCCIKGCINEMGLDCKTKTRFYRFPTQTDILEQWLQLSKCKDFNLNVTRICGLHFHATDFLNKKTSLKEDAVPSINLSTELNNSPANSIIDENIQVKQELDNAEEWCEQGNEEDINFSYGNMALKSFALDSSFETKFKDDLKTQDYHALLDIKEEIIEIEEDNNIMYEINKFEEANDFEYETNFNNMVYPNEEPAGFVISDVKSQIYLCCVQKCSNSSETANVKIYTEFPTDSEIFIKWCFNLKIDPRNYKENQYAICQKHFESICFTDTQTLYPWAVPTLNLNLNENSFIHKNDVPDYLKPCNEQCIVYGCINPLKPLYKFPQEVEVTQKWFTNLKLDYTDFRAQNYRICRRHFSQQCFVEATTDKLKTEALPTLYLGHTDKIVYLNNREEQQQLDHDDIGQAGLAAGGVNHGGGGLVVANNQDNSRGSSQGSLARIISPHDLEDHDSSYYEDFEEYYGQDD
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01313848;
- 90% Identity
- iTF_01313848; iTF_01313044;
- 80% Identity
- -