Gvir032013.1
Basic Information
- Insect
- Gymnocheta viridis
- Gene Symbol
- -
- Assembly
- GCA_956483585.1
- Location
- OY101444.1:20753987-20780592[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 33 9.4e-15 1.1e-11 46.5 1.6 1 86 806 878 806 879 0.84 2 33 8.2e-15 9.8e-12 46.7 4.9 1 87 906 975 906 975 0.80 3 33 4.3e-15 5.1e-12 47.6 0.3 1 87 996 1068 996 1068 0.83 4 33 4.6e-14 5.5e-11 44.3 3.2 1 86 1150 1218 1150 1219 0.79 5 33 2.5e-15 3e-12 48.3 6.1 1 87 1243 1315 1243 1315 0.81 6 33 9.2e-12 1.1e-08 36.9 0.9 1 87 1350 1418 1350 1418 0.80 7 33 2.6e-11 3e-08 35.5 2.3 1 85 1459 1527 1459 1534 0.72 8 33 8.1e-15 9.6e-12 46.7 0.5 1 86 1556 1625 1556 1626 0.81 9 33 7.1e-14 8.5e-11 43.6 1.0 1 86 1648 1717 1648 1718 0.79 10 33 3.6e-13 4.3e-10 41.4 2.6 1 86 1746 1817 1746 1818 0.86 11 33 7.3e-08 8.7e-05 24.4 0.3 1 62 1885 1942 1885 1966 0.77 12 33 1.9e-10 2.2e-07 32.7 0.5 1 87 1982 2054 1982 2054 0.80 13 33 1.3e-12 1.6e-09 39.6 2.5 1 87 2086 2157 2086 2157 0.80 14 33 2.2e-13 2.7e-10 42.0 4.6 1 86 2202 2274 2202 2275 0.84 15 33 2.3e-13 2.7e-10 42.0 0.6 1 86 2297 2364 2297 2365 0.80 16 33 8.7e-14 1e-10 43.4 0.2 1 87 2685 2754 2685 2754 0.80 17 33 2.5e-11 3e-08 35.5 1.2 1 86 2812 2901 2812 2902 0.70 18 33 5.5e-13 6.6e-10 40.8 0.8 1 86 2932 3002 2932 3003 0.81 19 33 1.5e-12 1.8e-09 39.4 1.0 1 87 3031 3100 3031 3100 0.82 20 33 9.1e-13 1.1e-09 40.1 2.8 1 87 3120 3190 3120 3190 0.81 21 33 3.1e-13 3.7e-10 41.6 1.5 1 87 3213 3284 3213 3284 0.83 22 33 1.3e-05 0.016 17.1 0.3 1 60 3300 3348 3300 3373 0.76 23 33 1e-10 1.2e-07 33.5 7.0 1 86 3388 3457 3388 3458 0.81 24 33 4.4e-11 5.2e-08 34.7 3.6 1 86 3483 3553 3483 3554 0.81 25 33 1.6e-12 1.9e-09 39.3 1.4 1 86 3574 3646 3574 3647 0.77 26 33 1e-11 1.2e-08 36.8 2.2 1 86 3667 3736 3667 3737 0.80 27 33 1.2e-13 1.4e-10 42.9 0.9 1 87 4041 4118 4041 4118 0.82 28 33 8.4e-07 0.001 21.0 0.9 1 86 4137 4206 4137 4207 0.71 29 33 0.59 7e+02 2.2 0.1 24 58 4285 4335 4259 4361 0.55 30 33 1.1e-12 1.3e-09 39.8 0.4 1 87 4380 4456 4380 4456 0.84 31 33 1.1e-13 1.4e-10 43.0 1.7 1 85 4483 4555 4483 4557 0.81 32 33 3.8e-12 4.6e-09 38.1 3.2 1 87 4689 4762 4689 4762 0.79 33 33 9.2e-12 1.1e-08 36.9 0.7 1 87 4787 4857 4787 4857 0.79
Sequence Information
- Coding Sequence
- atgtCACAAAATAATCAACGAAAACATTATCACATTCATGCTCCCTATCAACACCCCCAACAGCAGCAGCCACagcaacaacaaccacaacaaatTCAACAACATCATCATAGCCACCATTTAACGCCGtcgcagcagcagcaacatcaaCAATGGTATTCGCAGCAACATTATCAACATGGTCTTCATATGAGAGATTCGCGCCATATTCAACATCCCCAACATCATCCCCATCATCATATTGCTCAACAGCAGCAACCACATCACAATCATACAATGTCACCACACATGTTTACAAGTGGTTATGTCGGTATCACAGCGAGTGGGGGTGTTGCAAGTGGTGGTGCCGGTGTTAGCAGTGCCGGAAGTGGGGTTGGTGTAACTAGTTCAGCACATAATTCGGCAGCAATGGCTGCTACATCGCACAACATGCCGGCTTCTTCGTCCTCTGCACATCATTATTCTTCCGCTATGCCGGCTTCTGGTGCGAGTGTTGGTGGTAATGCGGCTAATGCTAATAGTGGTGGTGCTTATGCTGGCCGTAATAGAATCTTTGACCTTGAAATGTTAACAACACAACCACAACAACATTCACAGCATAGTACAGCATCAGCTACACCTTCACATTCTATGTTATCGAGTGGCAGTTCGAGTGGAAGAGCAGGCTTTGATGCATATTCACATAGCTCCTTATATGCACAACAAAATCAACGGCATCTTTTAGCTCCTGCTTCCTCCCACCATCATCTGGCTACTACACATCATTCTACAGCTAACGCCTTGCATCCACATCATCATACCCAGCAACTACATCATCCACACCAGCAGCCACAGTCTGCTCTGCATCATCATCAGCTTCAACATCACCAACAGACACAACATTATTATCACCATACTCAACAAACTTCTCTGCAACGGTCACATACACAAGTCATGCCACCAATGCTGCAGCATGTTAAATCTGAGCCAGTAGAACAAATGCCCACAACACCGTCAATACAAACCGAGGAAGTCATCATTAAATCTGAGCCAGTCGATGAAATTAGTTATCATCACAAAAGTGCGCCACAATTTGAAAACAAACCTTTTCACATCGAAGAAAAACGTAAACAACATGaacagcatcaacaacaacagcagcagcagcaacaacaggaGCAGCATCAGCGTGACCAACATCAACGTgcacaacaccaacaacaattaCATGAGCAACAATTTCATCAACAtcaactacaacaaataaaacaagaacaTTATTATCATCCTCAGAATGAAAATACTAATAATGAAGATGTCTCCCAACAAACGCAACAACGTACAAATTCCGAGAATTCTTCTATAATGCaaccagcagcagcagcagtgaCGGCAGCAGAtgaaaagcaacaacaacaagaacaacaacagcaaccgCAAATATCCttgacaaatataaaaacagaagcaaagcctCTTAACTTTCCTCGTCGCAAATTACAAACAGAACGTTCCTCAACTCTGCCTATATGCCAACGatgtaaacaagtttttttaaaacgtCAAAACTATTCACAACATGTTGCTCTATCCACTTGCAATATTGTGGAGTACGACTTTAAGTGCTCCGTATGTCCCATGTCCTTTATGTCTAATGAGGAACTGCAGATACACGAACAACTCCATCGTTCAAATAggtatttttgccaaaaatattgTGGCAAATTCTACGAAACAATTGATGAATGCGAACAACATGAATATGGCCAGCATGAATatgaaatgtttaaatgcaatATTTGTTGTATAAGTGTTACGCAACGTGATCAATTATTGGAACATCTTAATGACCATAAATATCAGCCACGTTTCGATTGCTGCATCTGCCGTTTATGTTTTCAAACTTCAAGCGAACTACATGATCATTATATGGCTAATGAAGATTTCTGTggtaaattttatgacaaagaagCTTTTAAAAAACCTAATACTTCGTCGTCTGCTGCTTATTTGGGAAAGCCGGAAAGTTCCAATCTGGAAATAGCCAACTCATTTTCGTTAAAAGatataccTCCTGGCAATAGTCATCAGTTAGAAGGTTTGTACCCCAAGCCCTCCAGCTCAAAAACTTCTATGGAACCACCTAGCACACCAACTACAGCGTCGTCTTCATCATTTAGCACGGTAAATGAGTTTGCCGTCTTAGAGCCCCAAATAGAGGTAAAAACGGAAATTAAAGTAGAGCCTGATTTTTATCCCCCGATGGATCAATCTGATTTTGCTAGCTTTGACAGTGATTACGGCACACCCGACTATACATCTAGTTCTAATCAGAGTTTTTCATTTCTACATGATTATCAAGATAATGCTTCCAGTTCTACCAATTCATCGTTTTCCTTGAACAATAACGATGCCATACAAGATGATGCTGCCATTTGTTGTGTTCCTAAATGTGGTGTACGCAAATTTTCATCGCCATCCTTACAATTCTTTGGCTTTCCCAAAGAAGAAAAGTATTTATCGCAATGGCTGCACAATTTGAAAATGGTATACAATCCTAATGTAAATTATTCTATGTATCGTATTTGTAGTTTACACTTTCCTAAACGTTGTATAGCCAAATATTCATTAAGTTACTGGGCTGTACCCACCTTTAATTTAGGTCACGATGATGTTGGGAACTTATATCAAAATAGGGAAAGTTCTGGGGGGTTTCCAGGTGGTGAAATGGCTAAATGCAGTATGCCTGGTTGCCCTTCACAGCGTGGAGAAACCAATGTAAAATTTCATGTATTTCCACGGGACTTAAAGACCTTAATTAAATGGTGTCAGAACTCACGACTACCGGTACATAGTAAAGATAATCGCTTTTTCTGCTCCAAACACTTTGAAGAGAAATGCTTTGGCAAGTTTCGCTTGAAACCTTGGGCCATACCCACACTTAATTTAGGCACGGTCTATGGCAAGATACATGATAATCCCAATATCTATCAAGAggagaaaaaatgttttttgcccTTCTGCCGACGTAGTAGATCGTACGATTGCAATTTGTCTTTATATAGATTTCCAAGAGACGAAACTTTGTTGCGCCGTTGGTGTTATAACTTAAGATTAGATCCCAATATGTACAGAggcaaaaatcataaaatttgctCTTCTCATTTTATTAAAGAGGCATTAGGTTTAAGAAAGCTCAACCCAGGAGCAGTGCCCactttgaatttgggtcataatgATAGAtttaatatatatgaaaatgaaCTATATACACCACCGCCACCACCTCCACCACCACCACAACCTTCTACGTCGTCAAAGGCCCAAAAATATGCCCAACTATTTAAACAAGAAAGGGACAGTTCTTCCGGTTCACGTATCTATGATGGTGTATTCATAAACTCCATGGTGAAAAAATTCTCTTCCGGTTCTTCGAACAGCTCTAATAATCAGGATTTGGGAGATGTCTGCCTTGTGCCATCTTGCAAAAGAACTCGTAATTCCGATGACATTACTCTGCACACCGTGCCCAAACGCCCGGAACAGCTTAAGAAATGGTGtcacaatttaaaaatgaatttagttAAAATGCACAAAAGTGCCAGAGTTTGTAGTGCTCATTTCGAAAAGTACTGCATAGGCGGCTGTATGAGACCTTTTGCCGTACCCACCTTAGAGTTGGGGCATGATGATCCAAATATATTCCGCAATCCCGATGTCATAAAGAAATTGAATATTAGAGAAACTTGTTGCGTACAATCCTGTAAAAGAAACCGTGATCGCGATCATGCCAATTTGCATAGATTTCCCACTCATCCGGAATTGATGCAGAAGTGGTGTGAGAATTTACAAAAACGCATACCGGATGGCACTAAACTTTTCAATGACGCAGTTTGTGAGGTACATTTCGAAGATAGATGTTTGCGCAATAAGCGTTTGGAAAAATGGGCCATACCCACGTTGAATTTGGGCTGGAATGGGGCTCCCCACAGTTTGCCCTCAGAAGAAGAGATCAACGAGAACTGGGTAAAACCTTTTGCACCTAACAATGGAGATGAACAGGGCGAATGCTGTGTGGTCAGCTGCAAACGCAATCCTCAAATCGATGATGTCAAATTATACAGGCCTCCGGAAGATGCAGAACAGTTAGTTAAATGGGCGCATAACTTGCAAGTAGATGTTACGGACTTGcccaatttgaaaatttgtaatttacacTTTGAACAACATTGCATAGGCAAGCGTTTGCTGAATTGGGCTATGCCTACTTTAAATTTGGGCGGAAAAGTAGAGCATCTATTTGAAAATCCTCCACCAATGCCCGCCATTTACAAGAAGAAAATCAAACCTGATAGAATTTTAAACAGTCAAGATGGCATCAAGTGGTCACCAAGGTGCTGTCTGCCGCATTGTCGTAAAATGCGTTCCGCAGACAAGATTCATCTTTTCCGTTTTCCCTACAACAACCGCCAGACTTTGGCGAAATGGTGTCACAATTTGCAATTACCTTTGGTGGGCAGCTCACATCGTCGTATCTGTTCCAGTCATTTCGAGGCATCTGTTTTAACTAAACGTTGTCCCATGACATTGGCAGTGCCCACACTAGACCTGAATGCTCCTCCCGGCTATAAAATCTATCAAAACCCAGCAAggctaaaacaaataaaaataggcGCTCAAAGACACTGTGTAATCGAGTCTTGTCGCAAAACTAAATTAGATGAAGTTATACTATTTCGTTTCCCCAACAATAGATCCATTTTGTATAAATGGCgtcataatattaaaaattggcCCAAGGGCAAATTAAGTTCTCAGATGAGAATTTGTTCCGAACATTTTGAGCCTCATTCAGTTGGCGTAAAGAAATTATCACCCGGCGCCATACCCACTTTGAAGTTGGGACACGAAGCCAAGGATTTGTATCCCAATGAAATAAGATCCTTCTTTGATTTGGAAAAATGTGTAGTTAGTGGCTGTGACTCCCGCAAGGAAATGGAGGACATTAAACTTTTCCGTTTTCCGCGAGATGATGATGAATTGCTTAAGAAATGGTGCAACAATCTCAAAATGAATGCCAATGACTGTGTGGGCATTAAAATAtgcagcaagcattttgaattAGAATGTATAGGTCCCAGGCAGCTATACAAATGGTCGATACCCACTTTAAAATTGGGTCACAAAGAAGACGATATGGTGGAAATAATAGCCAACCCTCCGCCCGAGCAAAGAACCGGAGAATTTCTTTTCAAGTGTTGTGTACCTTCATGTGGCAAAACACGCAAATACGATGATGCACAAATGAACAGTTTTCCCAAACACTTGAAATTGTTCCGCAAATGGACACATAATCTAAAGTTAGATTTTCTTAACTTCaaagaaagagaaaaatataaaatttgcaacGATCATTTCGAGCCAGTTTGTGTGGGAAAGACCCGACTCAATTTCGGTGCTCTGCCCACTTTGAAGTTGGGGCATGACGAGCTAGATGATTTATATCAAATTAATCCGGATAGAATAAGACCAAATTTGTTTATCAAACAAAAAGACGTGGAAAGATTAGAAAGGAGAAGGATATTGAGAGAAGAAAATGCCGAACAATATACTGGCGAAGAGCAGGACGATGATGTGGGTGATCCCTTGGGATTAGAGCCAAGTGACATAAAATGCTGCGTTACAGAATGCACTGCCCCTAAATCAATAATGAGGGAGCCCTATGATTTACCAGAAACTAAACAAATCCGACAGCTGTGGTTAAAAGAATTCGAAAAAACTGATGAAGAAGATTTGCCAACAGAATCTAAAATTTGTGGCTTACACTTccaatcaatatttaaaaaattaaaacaccaaATGCTAGAGATAATAGACGAAAATGATGACTTAAAATCAGATTTCAATAAACTACAATACAACCTTCAAAAGTCCAACATATCTCTGGTTATAAGTAGTTATCAGTGCAGGGTTGAAGATTGCCCTACCAACCTACTTAATTCTTCTATAAgactatatttttttccatatggCAAACAACTGGTAAGCAAATGGTCTCACAACACAGGCATAATACCCGATGAACATCGCAGATACATGAACAAGGTATGTGCCTTGCATTTCGAGTCGTTTTGTATAACAGAAAATCAAAGACTGCGATCATGGGCCATACCTACACTCAACTTACCAGCTGGCGAGGAGAAAGAAAAGCATTTATATAAGAATCCTGATCTAACTAAAATTGACCGAAGAATATTGGGACCTCAAATTTTGAAATGTGCCGTCAACAATTGCAGCTATCCCAAACTGGTAGATGACGAATCCCTCAAACTATTTAACTTTCCCACGGATGACAAGTTGTTGAGAAAATGGTGTGATAACTTGAAAATGTCTCACCATTTTACACCCTTGCTTAAAATCTGTTCTttgcattttgaaaaaatatgctttGGCAGCTGTCGCATACGTTCTTGGGCCATACCCACCTTAAATTTGGGTCATAGCGATGCTCCCGAACATCTAAATAAAACAACTATAAGACAAGAGGTTTATGATGTACCCGAAGATGTGTCTGAAATACAATTGAAACAAGTTAAAATCAAAAAGTCACTAGATAGTACGAAATGTTATATACCCAGCTGTCGCAAAAGCCGATTAAAACATGGAGTGCGTTTTTACAATCTGCCCTCAAATTTAAAGATGAAACGCAAATGGCTGCACAATTTGCAAATCAGGCATTTGAAGTCCAATCAAAAAAtgcataatattaaaatttgcaacttGCACTTTCACAAAAGATGTTTGGAGGGTAAACTTTTGAAACCTTGGGCAGTGCCCACAAGGCATTTGGGTCACAGCGAATCCGTTTATGATAATCCCCGAAAAGTAAGGGCATTGCCGCCATTACGCTGTGCTCTCTCACACTGTAAAAATCATGCAGGATCGAGGGCAGTACGTACTTTTGTATTTCCCAAATCGCCAGAGTTCTTAGAGAAATGGGCGAAAAACTTGAAATTGGAATTGGAAAagtgcaaaggaaaaatatgtCATGAACACTTCGATAAGGAAATTGTGGGTATGAAAAAGTTGCAAAGTGGTGCGGTACCTACTCTCGATTTAGGCCATAGCGATAAGGTTATGTATGATAATACAGAATTAATGgagaaacttaaattaaaacaaattgaaaaagagTTAAACAGAGATTCGTGCAAAATGAATATAATAGAACAAGACGATTTGGATGAGGAATATGAGCCGCACTCAGAGGAGGAAGAGGAGGAGATATGGGAGTATGAAGAATGCGAGGATGAGGAAGACGAGGAGGAGGAAGATGATGAACAAATATGTTATGATGATGAAGATGAAGAAGAGGAGGAGGAGGAGAAAAGGCATGATGAAGATAAGGAGGAGACTCCACAAGATGACGATGAAATCAGCATAACGAATTCAACATCCGACTGGAGTTCTGTTAAGTTTAAGGAACTTAGAGTCTCCATAACTCCCTTGACACCGGAAGATTTAATGGATTTATGTTCACGTTCTTCCTATGAAAGAGAATTTGGGGCTTTAACACCGGCCAACAATTTAAGGGGCCGCAGATCTGTTACACCAGCTTCAAGCTGGAAAGATTCTCGCTCAGAAACTTCTGATCAAAAGTCTAACAGTTTCAACTTTAACTCTAACAGATCAGAAACACCCGATAAAAAAGCATCTAATTATTTTAGAGAACCTCGCTCCGTCTCACCTGaacaaaaaccaaatattaGAACTGCTGATGAAAAATGTAACAGTCCGAAAGATCCGCTTGGTGAAAACCTGGAGGATTTTTGTACCAAAACCCCAAACCAGATAGAAGCACTTGTTTTCAAAGAGGAAACAACGTCTGAATGTGATTTAACTGTTAACAAATTGAAAAGGAGAACTTCTCAAATACCCAACGAAAGTTTCAAAAGGGAATGTTTGGAATTCTCGGAATATGAAATTGTTAATACCATGTTGCCAAATGAAATAGAGCTTACTGGCACTACCAACCTAAGAACAGATAAAGCTCTCAATGCGGTGGCACCCATTTGCTGTTTGAAACACTGTGGCAAGGAAAAGACGCCGGAACAGCATCTAACCACTTACGGGTTTCCCAAAGATCCTCAACTTTTACAAAAATGGTGCGATAACTTGGGCTTACAACCCGAAGAGTGTATTGGACGTGTCTGCATAGACCATTTTGAACTAAGAGTTATCGGCACGCGACGACTCAGATTAGGAGCTGTGCCAACTCTGAACTTAGCTCCAAATCAAGTTGCCAAGCACACTAACATGGAGGATACTCCACAAAAGAAAAGTGTAACCAAGGAGTTCTCCGAAACAGCGAATATGCAAGAGGCAGACTCAAGCTTAGAGCCACCGCCACCTTATAAAACACCCAAACCCAGTAAGCAATCGGTTTTTCGGCTATGTTGCCTCAAACATTGTCGACGCAAGAAACTCTTGAACCTGGACAAGGTAGACAACCAACCGCTGATGGAAAGAATGGTTTGCCAGGAAGAACCCCAGGAAATCTTGTTTAAATTTCCCACTGagcaaaatatgttaaataaatggTATAAAAACTTAAGATTGCCGGAAAATCTAACCGTAACACAGGACTTGCAAATATGCTCCCAACACTTTCAATCGAATGTTATTGAAAATGGCAAATTGCATCCCGAAGCCGTACCCACTTTACAACTAAGTTATGCTAATCTGCCACCTATTTATACAAACTATCAACTTCTAGGCTACAAATCGGAGATGAAGGAAAAGCCCATCCAAAAGTGTTGCCTTCCTCATTGCGGCAATAAAATGTCGGAACATATACACCTGTTCGCGTTCCCTGAAAATCAACCCTGGCTTCTGAGGAAATGGtgtcaaaacttaaaactaaatctCTTACCGGGTCAATATAAAAGTTTGTATATATGCAATGTGCACTTTGAGCCGTATGTGTTCTTTAGAAAAAGATTACGTTCGGGTGCTTTGCCAACACTTGATTTGGGACATACGGATGCAATTATTCGAAATTGTCGCAAATTGCGTTTGCAAACTGAAAATATTAGTACCATTAAGGAGAAATGTTGTATAGCCGATTGCGAGACAACTAAccttaaactttattcatttccCCGTAGCTCCGAGTTAAGGAAAATTTGGTGCAACAATTTGCAAATTGAACCACGCCAGGCTCTCAACAATCATAGTAAACTATGTGCGCACCATTTTACGGTAGATAGTTTCATAGTGGGCACCGACAATCTCAAACTAAATGCTGTACCTGTATTAAACTTGGGAATAAAAAATGAAAGCCATTTATTGATGACAACAAATCCAGCTGAAAGCAAATGTATAGTGGAGAACTGTCAAAAAACACCCAGTGTCGATAAAGTGAAGCTGTTCAATTTTCCCCAAAAGCAGGAGATACTTAAGAAGTGGCTTTTTAACTTGAATTTAGCAGCCGATAACCTTCGAAAGGATGATGTGGTCTGCAGTAAACATTTCGATAAATGTTGCATTAAGAATGgtattttacatgaaaaagccATACCCACCCAGTTTCTAGAATTTTCGCCGAAAGGATGGTTTTACAAAAACAACGAGGATTTATatgaaataccaaaaaaatgctGTGCCCTCAGTTGTCAACAAACTTCGGAAGATGCCAAACATCTGTATAGATTTCCTAAGCACAAAGAGGATTTGGACAAATGGgtgtacaatttaaaattacaagtgGACGAGTCAGATGTTAAGGATTTAAGGGTATGTGATAGACATTTCGAGCCGAGTTGTAAAATTTCCAACAAGGACTTGCTAACCCAGGCCTTGCCCACCCTTAATCTGGGTCATGACGATGCCGACATCTATggcaataactttattaaatgcTGTTTAGATAACTGTTCCATAGAGGGCTTTTACTATCATAAATTGCCCGAGGATTTAATGCTGCagagtttttggtttcaggaACTGGAAATGGAGACAACCTACAACAATTCTTTGTATATATGTTCCGTTCATTTTGTAACATTCTTCGAAAGAACATTGGAAAAGTACAGTGCTTTTCTGAAAGAGTCCAAGGAGTATGTAAAACTATCTGTAACTTATAATGAGATTAAAGCTCTACCTGCCTTGCAATCTTACAAATGTCATATAAGCAAATGTACTTCTGGTTTTAAACTGAtctggaaattatttaaatttccaaaagaTGTTAAATTGTTCAATAAGTGGATGCATAATACGAGTTTACAATTTGAATATGAGCAACGCCATTGTTATCGCATTTGCTCGCAACATTTTGAGGAAAGAtgtttaagtgaaaaaaaattacaccgCTGGTCTCTGCCCACTCTCAAGTTGCCTTTCAACAACAGTTTATATGTCAATCCCCCCGAAGCTTTGCCCTCCAATCACGAAAACCTGAGGCACTGTTGTGTCTCTAATTGCACTACCCTAAAAGGACCATTTTACAAGTTCCCCGTCAGGCAGGTGGAGGTAAAGAAATGGATACATAATTTAGATTTGGGCAACCAACAATGTACGCTTAACTTGCGCGTGTGCTATAAGCATTTCGAGAACTATTGCTTTTCCAAGGCTGTTAACAAAGTTAAACCGTTGATATCGTGGTCAGTGCCAACTCTTAGATTGAAACGAAAAGTTGCTCTTTTCCTCAATCCAGCAGACAAGATTGCCTTCCATGTTTGCTGCATCGAAAGCTGTAGAAAAATTCTCAATAAATCCAAAGGGATCTATCTGTTTAAATTTCCCTTCAGTAACACCTTCAAACAAAGATGGCTGCACAATTTAAACATTGGCCAACAGGATTATAAGGAAACAATGAGAGTTTGTTCGGCTCACTTTGAAATGGAGTGCTTTTACAAGGGCTTTAAATTAATGCGCAAAGATTCGGTACCCACCTTGGCACTATTCAAACCGCCTCCTGATCTCTATACAAATCCTGTGCGTAGGGCTTATTTTAAATGTTGTGTTAAATTGTGTAAAGCACCCTGGGAACAACTTTTAAGTTTCCCTAAGGATAAGATACTTTTGAGAAAGTGGTCTCATAATTTACAGTTggacaaagaaataaaattagaaactcTGAGGGATTGGAAAATATGTAGCCGGCATTTTGAACAACAATGCATAAATTCAAATGGGACAATAAGAAGTGTGGCGGTACCTACTCTTAAACTGGGACACcgcaagaaattgtttctaaatcCTGATTTCGCTTTGAAATCGAAccttaaaagtaaacaaaaaaagttacatGATGAGTCCAGTGCAAAGATTGACGAACATGAAGTAACAAATACTTTGGAAACTAATATAGAACCAGAGATACTGGATGATATTTCGCTAGAAGAtcagaatataaatataaagacGCAAACGGAAATTAAAACCAAATTATCTTCTAAAGTTAAAAGCTTAAAGCCTCGGAAACGTTTAAGGAAACGTAAATGTCTGTTTGGAAAAAAACGGAAGGCTAAAATTGTGGCTAAGAAACTTCTTAACGAAAACGAACAAAACGTTATTAAAGAGAAAGAAAACTCTGCAACGTCACAGCAATTTACAATAGAAGAGAAAAAAGAAAGAGCAAACGAACAAAAATCTCTAACAGAGTTGCACACACATGAGAACTTGGACGAAACCACAAACTCCTTATTTACTGAAGTTGTTAATCCAAACATTTCGGACACTGTAGTGTACGAACAAAAGGTCGAGGAAACTATTAACCTGCAAGAAGATGCTTATCTGGAGAACTTGTTGGAAATCTTAACAGAGAGTTTGCCGGAAAATGACGAACTAAAAGAAACTCCACAACTGTTCAAACAAGAACCCACTGATTCCGATACACTTTTGCAGGTTGCTGGAGAACCGCAGCATGTAGACTTGAAGTATTTTCCAAATGATAGTGACGAAATTGCCACATTTCAAATTACCGAAATAAAACAGGAAGTAGACCAGATGCCAATCGAAGAAGAGTTCCTAGAAGAACGAGCCGACTATGAACCCAGCCAACATAGTGAAGCCGAAGTATCAAAGGAGGAAAAAGCTTTAAGTTTTAAAAGCAAACATCTCAAAAATCTTATCTCTTGCTGTATAAAAACCTGTCGCAATTATCTCAATTACAAACCGGACCTACTCCTTTTCAAGCTACCCGTTGTACGCAAACTGCGTACTCATTGGCTAGAAAATTGTAAACTCAATCAGCGCCAATATTCGGCAAATGGCGTGTTGAAAAAACTTAGAATTTGTGCTGAACACTTTGACAAAAACTGCATTAAAGATGACAGCCGTCTACTGCTGGGCGCAGTGCCGACATTACACCTCGGAAGCAATCTAGACTATAAAGAAAGTTTAATCAAATTTACCTATTTGAGATGCAAGATACAAAGTTGCCAGCGATCTGTCCAACATGATAAGATCCATCGTATACCATTTCCCGAAGGAGAAGAGAAAAGAAACTGGTGTTTGAAAATGAACATTAAGGAAGGAACTGTTACTCCAGATGATTGGATATGTCATAGACATTTCGAAAGAAAATCTATAATAGATGGCCGAAAACCCAAGCCGGGTATGTTGCCCACTTTACTATTGAATAGTCTGgatgaaaaatctttaattcGAAAATCCCAGCCGCATACGCTGGCTACTTTAGTACGCGATAGTGTGACTGTGGCTGAGAATATTAACGATTTAGCTGTGCCAGATTGTAGACCGAAAAATCACAAAGTaatcaaaactaaatgtttATTTCCCTTTTGTAAGGACAACAAGGGACAAGTTTTATACGATTGGcctgataaatttattttcggcAAAATATGGCTGATGGCAAACAAGTTAGGACGACATGCCGAAGATGCTAGTTCCTGGAAGAGAATGTTTGAACAAACTTTAATAAATGAGCAGCCGCCAATTGAAAGTTCGGCGGGCGGAGATTCggaaaaacacattaaattgtGTGATGaacacttttattatttatacaaaacaaacaatGAAGCCATAAACGGCTACGAAGCCTTCGAAGAATACCAGGACTTAAAACACAATGTTCAAGTTACCTTTGACTTCTTaaattctttagaaaaaatctatacaaaaaaatgtgctGTGCCACAATGTAAAACAGATCAAAATATCAAAGACGCCGCagtaaaatcaataaaactttttgacttCCCGAGAAAAGAAATAGCTAAGAAATGGTGCGATAATATTGGCCTAGAATACAGCATCCTCGAAAGAAAGCCATTTATCAAGGTTTGTGAGATTCATTTTGAGGATTATTGTTTACTAAGAAGAAACCTTCTCGATTGGGCTTTACCAACTTTACATTTACCGCTTATGAAAGATGCTCAGGATATTAAGCAAAATGATGCTGTCCAGGTAATTGCCATAAGGGACAAGAGCAAGTGTTGCATTGAAACATGTCCTTCTGTTCGGGACATGGACTCGAATAGCAATTTAAGCTTATACAAATTTCCCAAAGACCCTGTACTGCTTCAGAAATGGCTACAGAATACCAACTGTGAAAAGACATTTGATGCAAATTTAACACGTATATGTGCGTTACATTTTCATGCATCCGATATACTCGATGAGAGCAAATTACACGAACAAGCTATTCCAAAATGTTATTTAGATTTAAGCAATTCAAACTTTTCATCATACCCATCCTGTCTAAATAGTTCGTTTATAGATGAACATATACAAGTTAAACAGGAATTGGATAACAGTGAAGAATGGTGTGTAACCTCCCAGCAGGAAAGTGTTGCACGTACAACTCCAACAAACGAAATGGTATTTGAActtaaattgaaagaaaacaaTAGAGACGCTGATTACAATCAATTGacagaaataaaacaagaaatcaTAGAAGTTCAAGAAGAAACCTCCTTATTTACCATACACAAATTTGAAACCTCCAGTCCACCCGAATTTAAATACCCCTACACCAATTcgaataatagtaataataatcaGCCAGCAGCTTTTGTTATAAGCgatgtcaaatcgcagctctaTTTTTGTTGTGTGCAAAAATGTACCAACAATTCGGCAACACCCGGCATACGTATATTCAATAAGTTTCCCCACGATTcggaaattttcattaaatggtgttttaatttaaaaattgacccTCGCAACTATAAGGAAAACCAATATGCCATTTGTGAACAGCATTTTGAACCTATTTGCTTTACGGGAAATGGCCTACTGCAAAACTGGTCGGTACCAACATTGAATcttaatttaaatgaacaatCTTTTATACATCAAAACGATATACCTGAACATTTGAAACCCTCCAGCGAACAGTGTATTGTATATGGTTGTATAAATCCGTTGAAAccactttttaaatttccccATAATCCTGATATTTCACTCAAATGGTTTTCAAATCTAAAACTAGACTTTACTGACTTTCGAGCCCAGAATTATCGCATTTGTAGGCGACATTTTTCCCCCATATGCTTCGGAATAAACGATTCTAATAAATTGACTAGCGAAGCTGTGCCGACGCAATTTCTTGGTCACACCGATAAAATATGCCATTTTAATAGTGTCGAAGAGCAGCAACTGCAAGCGGATGGTGGGGTTAATAATCAGGATAATAGTCGGGGCAGCAGTCAGGGATCCTTAGTAAGAATAATATCTCCACATAATATAGAAGATCATGATAGtagttattttgaagattttgaagAATATTACGGACAAGATGAATAA
- Protein Sequence
- MSQNNQRKHYHIHAPYQHPQQQQPQQQQPQQIQQHHHSHHLTPSQQQQHQQWYSQQHYQHGLHMRDSRHIQHPQHHPHHHIAQQQQPHHNHTMSPHMFTSGYVGITASGGVASGGAGVSSAGSGVGVTSSAHNSAAMAATSHNMPASSSSAHHYSSAMPASGASVGGNAANANSGGAYAGRNRIFDLEMLTTQPQQHSQHSTASATPSHSMLSSGSSSGRAGFDAYSHSSLYAQQNQRHLLAPASSHHHLATTHHSTANALHPHHHTQQLHHPHQQPQSALHHHQLQHHQQTQHYYHHTQQTSLQRSHTQVMPPMLQHVKSEPVEQMPTTPSIQTEEVIIKSEPVDEISYHHKSAPQFENKPFHIEEKRKQHEQHQQQQQQQQQQEQHQRDQHQRAQHQQQLHEQQFHQHQLQQIKQEHYYHPQNENTNNEDVSQQTQQRTNSENSSIMQPAAAAVTAADEKQQQQEQQQQPQISLTNIKTEAKPLNFPRRKLQTERSSTLPICQRCKQVFLKRQNYSQHVALSTCNIVEYDFKCSVCPMSFMSNEELQIHEQLHRSNRYFCQKYCGKFYETIDECEQHEYGQHEYEMFKCNICCISVTQRDQLLEHLNDHKYQPRFDCCICRLCFQTSSELHDHYMANEDFCGKFYDKEAFKKPNTSSSAAYLGKPESSNLEIANSFSLKDIPPGNSHQLEGLYPKPSSSKTSMEPPSTPTTASSSSFSTVNEFAVLEPQIEVKTEIKVEPDFYPPMDQSDFASFDSDYGTPDYTSSSNQSFSFLHDYQDNASSSTNSSFSLNNNDAIQDDAAICCVPKCGVRKFSSPSLQFFGFPKEEKYLSQWLHNLKMVYNPNVNYSMYRICSLHFPKRCIAKYSLSYWAVPTFNLGHDDVGNLYQNRESSGGFPGGEMAKCSMPGCPSQRGETNVKFHVFPRDLKTLIKWCQNSRLPVHSKDNRFFCSKHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNIYQEEKKCFLPFCRRSRSYDCNLSLYRFPRDETLLRRWCYNLRLDPNMYRGKNHKICSSHFIKEALGLRKLNPGAVPTLNLGHNDRFNIYENELYTPPPPPPPPPQPSTSSKAQKYAQLFKQERDSSSGSRIYDGVFINSMVKKFSSGSSNSSNNQDLGDVCLVPSCKRTRNSDDITLHTVPKRPEQLKKWCHNLKMNLVKMHKSARVCSAHFEKYCIGGCMRPFAVPTLELGHDDPNIFRNPDVIKKLNIRETCCVQSCKRNRDRDHANLHRFPTHPELMQKWCENLQKRIPDGTKLFNDAVCEVHFEDRCLRNKRLEKWAIPTLNLGWNGAPHSLPSEEEINENWVKPFAPNNGDEQGECCVVSCKRNPQIDDVKLYRPPEDAEQLVKWAHNLQVDVTDLPNLKICNLHFEQHCIGKRLLNWAMPTLNLGGKVEHLFENPPPMPAIYKKKIKPDRILNSQDGIKWSPRCCLPHCRKMRSADKIHLFRFPYNNRQTLAKWCHNLQLPLVGSSHRRICSSHFEASVLTKRCPMTLAVPTLDLNAPPGYKIYQNPARLKQIKIGAQRHCVIESCRKTKLDEVILFRFPNNRSILYKWRHNIKNWPKGKLSSQMRICSEHFEPHSVGVKKLSPGAIPTLKLGHEAKDLYPNEIRSFFDLEKCVVSGCDSRKEMEDIKLFRFPRDDDELLKKWCNNLKMNANDCVGIKICSKHFELECIGPRQLYKWSIPTLKLGHKEDDMVEIIANPPPEQRTGEFLFKCCVPSCGKTRKYDDAQMNSFPKHLKLFRKWTHNLKLDFLNFKEREKYKICNDHFEPVCVGKTRLNFGALPTLKLGHDELDDLYQINPDRIRPNLFIKQKDVERLERRRILREENAEQYTGEEQDDDVGDPLGLEPSDIKCCVTECTAPKSIMREPYDLPETKQIRQLWLKEFEKTDEEDLPTESKICGLHFQSIFKKLKHQMLEIIDENDDLKSDFNKLQYNLQKSNISLVISSYQCRVEDCPTNLLNSSIRLYFFPYGKQLVSKWSHNTGIIPDEHRRYMNKVCALHFESFCITENQRLRSWAIPTLNLPAGEEKEKHLYKNPDLTKIDRRILGPQILKCAVNNCSYPKLVDDESLKLFNFPTDDKLLRKWCDNLKMSHHFTPLLKICSLHFEKICFGSCRIRSWAIPTLNLGHSDAPEHLNKTTIRQEVYDVPEDVSEIQLKQVKIKKSLDSTKCYIPSCRKSRLKHGVRFYNLPSNLKMKRKWLHNLQIRHLKSNQKMHNIKICNLHFHKRCLEGKLLKPWAVPTRHLGHSESVYDNPRKVRALPPLRCALSHCKNHAGSRAVRTFVFPKSPEFLEKWAKNLKLELEKCKGKICHEHFDKEIVGMKKLQSGAVPTLDLGHSDKVMYDNTELMEKLKLKQIEKELNRDSCKMNIIEQDDLDEEYEPHSEEEEEEIWEYEECEDEEDEEEEDDEQICYDDEDEEEEEEEKRHDEDKEETPQDDDEISITNSTSDWSSVKFKELRVSITPLTPEDLMDLCSRSSYEREFGALTPANNLRGRRSVTPASSWKDSRSETSDQKSNSFNFNSNRSETPDKKASNYFREPRSVSPEQKPNIRTADEKCNSPKDPLGENLEDFCTKTPNQIEALVFKEETTSECDLTVNKLKRRTSQIPNESFKRECLEFSEYEIVNTMLPNEIELTGTTNLRTDKALNAVAPICCLKHCGKEKTPEQHLTTYGFPKDPQLLQKWCDNLGLQPEECIGRVCIDHFELRVIGTRRLRLGAVPTLNLAPNQVAKHTNMEDTPQKKSVTKEFSETANMQEADSSLEPPPPYKTPKPSKQSVFRLCCLKHCRRKKLLNLDKVDNQPLMERMVCQEEPQEILFKFPTEQNMLNKWYKNLRLPENLTVTQDLQICSQHFQSNVIENGKLHPEAVPTLQLSYANLPPIYTNYQLLGYKSEMKEKPIQKCCLPHCGNKMSEHIHLFAFPENQPWLLRKWCQNLKLNLLPGQYKSLYICNVHFEPYVFFRKRLRSGALPTLDLGHTDAIIRNCRKLRLQTENISTIKEKCCIADCETTNLKLYSFPRSSELRKIWCNNLQIEPRQALNNHSKLCAHHFTVDSFIVGTDNLKLNAVPVLNLGIKNESHLLMTTNPAESKCIVENCQKTPSVDKVKLFNFPQKQEILKKWLFNLNLAADNLRKDDVVCSKHFDKCCIKNGILHEKAIPTQFLEFSPKGWFYKNNEDLYEIPKKCCALSCQQTSEDAKHLYRFPKHKEDLDKWVYNLKLQVDESDVKDLRVCDRHFEPSCKISNKDLLTQALPTLNLGHDDADIYGNNFIKCCLDNCSIEGFYYHKLPEDLMLQSFWFQELEMETTYNNSLYICSVHFVTFFERTLEKYSAFLKESKEYVKLSVTYNEIKALPALQSYKCHISKCTSGFKLIWKLFKFPKDVKLFNKWMHNTSLQFEYEQRHCYRICSQHFEERCLSEKKLHRWSLPTLKLPFNNSLYVNPPEALPSNHENLRHCCVSNCTTLKGPFYKFPVRQVEVKKWIHNLDLGNQQCTLNLRVCYKHFENYCFSKAVNKVKPLISWSVPTLRLKRKVALFLNPADKIAFHVCCIESCRKILNKSKGIYLFKFPFSNTFKQRWLHNLNIGQQDYKETMRVCSAHFEMECFYKGFKLMRKDSVPTLALFKPPPDLYTNPVRRAYFKCCVKLCKAPWEQLLSFPKDKILLRKWSHNLQLDKEIKLETLRDWKICSRHFEQQCINSNGTIRSVAVPTLKLGHRKKLFLNPDFALKSNLKSKQKKLHDESSAKIDEHEVTNTLETNIEPEILDDISLEDQNINIKTQTEIKTKLSSKVKSLKPRKRLRKRKCLFGKKRKAKIVAKKLLNENEQNVIKEKENSATSQQFTIEEKKERANEQKSLTELHTHENLDETTNSLFTEVVNPNISDTVVYEQKVEETINLQEDAYLENLLEILTESLPENDELKETPQLFKQEPTDSDTLLQVAGEPQHVDLKYFPNDSDEIATFQITEIKQEVDQMPIEEEFLEERADYEPSQHSEAEVSKEEKALSFKSKHLKNLISCCIKTCRNYLNYKPDLLLFKLPVVRKLRTHWLENCKLNQRQYSANGVLKKLRICAEHFDKNCIKDDSRLLLGAVPTLHLGSNLDYKESLIKFTYLRCKIQSCQRSVQHDKIHRIPFPEGEEKRNWCLKMNIKEGTVTPDDWICHRHFERKSIIDGRKPKPGMLPTLLLNSLDEKSLIRKSQPHTLATLVRDSVTVAENINDLAVPDCRPKNHKVIKTKCLFPFCKDNKGQVLYDWPDKFIFGKIWLMANKLGRHAEDASSWKRMFEQTLINEQPPIESSAGGDSEKHIKLCDEHFYYLYKTNNEAINGYEAFEEYQDLKHNVQVTFDFLNSLEKIYTKKCAVPQCKTDQNIKDAAVKSIKLFDFPRKEIAKKWCDNIGLEYSILERKPFIKVCEIHFEDYCLLRRNLLDWALPTLHLPLMKDAQDIKQNDAVQVIAIRDKSKCCIETCPSVRDMDSNSNLSLYKFPKDPVLLQKWLQNTNCEKTFDANLTRICALHFHASDILDESKLHEQAIPKCYLDLSNSNFSSYPSCLNSSFIDEHIQVKQELDNSEEWCVTSQQESVARTTPTNEMVFELKLKENNRDADYNQLTEIKQEIIEVQEETSLFTIHKFETSSPPEFKYPYTNSNNSNNNQPAAFVISDVKSQLYFCCVQKCTNNSATPGIRIFNKFPHDSEIFIKWCFNLKIDPRNYKENQYAICEQHFEPICFTGNGLLQNWSVPTLNLNLNEQSFIHQNDIPEHLKPSSEQCIVYGCINPLKPLFKFPHNPDISLKWFSNLKLDFTDFRAQNYRICRRHFSPICFGINDSNKLTSEAVPTQFLGHTDKICHFNSVEEQQLQADGGVNNQDNSRGSSQGSLVRIISPHNIEDHDSSYFEDFEEYYGQDE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00741912;
- 90% Identity
- -
- 80% Identity
- -