Ofri000867.1
Basic Information
- Insect
- Ornithomya fringillina
- Gene Symbol
- -
- Assembly
- GCA_963978525.1
- Location
- OZ021690.1:8399461-8417989[-]
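
The Location field above packs a contig accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it apart follows; the `Location` class and `parse_location` helper are hypothetical names introduced for illustration, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Location:
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# contig, then ':', then start-end, then the strand in square brackets
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Parse a location string such as 'OZ021690.1:8399461-8417989[-]'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OZ021690.1:8399461-8417989[-]")
print(loc.contig, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 18,529 bases on the minus strand of OZ021690.1.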
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 32 | 8.3e-16 | 1.1e-12 | 48.6 | 3.2 | 1 | 86 | 692 | 764 | 692 | 765 | 0.85 |
| 2 | 32 | 1.4e-15 | 1.9e-12 | 47.8 | 6.2 | 1 | 87 | 792 | 861 | 792 | 861 | 0.82 |
| 3 | 32 | 3.2e-16 | 4.4e-13 | 49.9 | 0.7 | 1 | 87 | 883 | 955 | 883 | 955 | 0.85 |
| 4 | 32 | 6e-14 | 8.2e-11 | 42.6 | 2.8 | 1 | 86 | 1045 | 1113 | 1045 | 1114 | 0.78 |
| 5 | 32 | 2e-16 | 2.7e-13 | 50.5 | 5.2 | 1 | 86 | 1138 | 1209 | 1138 | 1210 | 0.80 |
| 6 | 32 | 4.9e-12 | 6.7e-09 | 36.5 | 1.1 | 1 | 87 | 1245 | 1313 | 1245 | 1313 | 0.81 |
| 7 | 32 | 1.3e-11 | 1.8e-08 | 35.1 | 3.7 | 1 | 87 | 1353 | 1422 | 1353 | 1422 | 0.77 |
| 8 | 32 | 4.4e-15 | 6e-12 | 46.2 | 0.2 | 1 | 87 | 1449 | 1519 | 1449 | 1519 | 0.80 |
| 9 | 32 | 5.1e-14 | 7e-11 | 42.8 | 0.3 | 1 | 86 | 1541 | 1610 | 1541 | 1611 | 0.81 |
| 10 | 32 | 4.1e-13 | 5.5e-10 | 39.9 | 1.6 | 1 | 87 | 1640 | 1712 | 1640 | 1712 | 0.85 |
| 11 | 32 | 0.00018 | 0.25 | 12.2 | 0.1 | 1 | 64 | 1782 | 1838 | 1782 | 1858 | 0.70 |
| 12 | 32 | 3.1e-11 | 4.3e-08 | 33.9 | 2.1 | 1 | 87 | 1878 | 1950 | 1878 | 1950 | 0.79 |
| 13 | 32 | 3.6e-15 | 4.9e-12 | 46.5 | 2.0 | 1 | 85 | 1980 | 2049 | 1980 | 2051 | 0.78 |
| 14 | 32 | 1.2e-14 | 1.7e-11 | 44.8 | 4.0 | 1 | 87 | 2097 | 2168 | 2097 | 2168 | 0.80 |
| 15 | 32 | 8.9e-14 | 1.2e-10 | 42.1 | 4.1 | 1 | 87 | 2190 | 2260 | 2190 | 2260 | 0.80 |
| 16 | 32 | 4.6e-15 | 6.3e-12 | 46.2 | 0.4 | 1 | 87 | 2388 | 2457 | 2388 | 2457 | 0.80 |
| 17 | 32 | 7.9e-13 | 1.1e-09 | 39.0 | 4.6 | 1 | 87 | 2514 | 2593 | 2514 | 2593 | 0.80 |
| 18 | 32 | 3.3e-10 | 4.6e-07 | 30.6 | 4.6 | 1 | 87 | 2605 | 2679 | 2605 | 2679 | 0.79 |
| 19 | 32 | 2.2e-11 | 3e-08 | 34.4 | 0.0 | 1 | 86 | 2705 | 2773 | 2705 | 2774 | 0.80 |
| 20 | 32 | 2.1e-13 | 2.9e-10 | 40.9 | 0.4 | 1 | 87 | 2797 | 2867 | 2797 | 2867 | 0.80 |
| 21 | 32 | 1.6e-17 | 2.1e-14 | 54.1 | 0.4 | 1 | 87 | 2886 | 2966 | 2886 | 2966 | 0.81 |
| 22 | 32 | 2.8e-09 | 3.9e-06 | 27.6 | 0.8 | 1 | 61 | 2983 | 3036 | 2983 | 3057 | 0.76 |
| 23 | 32 | 1.1e-11 | 1.4e-08 | 35.4 | 2.3 | 1 | 86 | 3075 | 3144 | 3075 | 3145 | 0.83 |
| 24 | 32 | 3.1e-13 | 4.3e-10 | 40.3 | 3.2 | 1 | 86 | 3170 | 3240 | 3170 | 3241 | 0.84 |
| 25 | 32 | 1.2e-14 | 1.7e-11 | 44.8 | 0.7 | 1 | 86 | 3261 | 3333 | 3261 | 3334 | 0.81 |
| 26 | 32 | 0.00032 | 0.44 | 11.4 | 0.0 | 1 | 69 | 3360 | 3421 | 3360 | 3444 | 0.81 |
| 27 | 32 | 1.9e-14 | 2.5e-11 | 44.2 | 3.9 | 1 | 86 | 3590 | 3660 | 3590 | 3661 | 0.83 |
| 28 | 32 | 1.6e-07 | 0.00021 | 22.0 | 1.2 | 1 | 58 | 3711 | 3758 | 3711 | 3778 | 0.81 |
| 29 | 32 | 6e-14 | 8.2e-11 | 42.6 | 0.7 | 1 | 86 | 3801 | 3879 | 3801 | 3880 | 0.81 |
| 30 | 32 | 8.9e-07 | 0.0012 | 19.6 | 0.1 | 1 | 86 | 3904 | 3968 | 3904 | 3969 | 0.70 |
| 31 | 32 | 1.2e-13 | 1.7e-10 | 41.6 | 2.7 | 1 | 87 | 4279 | 4353 | 4279 | 4353 | 0.80 |
| 32 | 32 | 2.4e-12 | 3.2e-09 | 37.5 | 0.4 | 1 | 86 | 4379 | 4446 | 4379 | 4447 | 0.82 |
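
The C2CH consensus quoted in the Description (Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His) can be read literally as a regular expression. The sketch below does only that: `C2CH` and `find_c2ch` are hypothetical names, the test sequence is synthetic rather than taken from this record, and a literal pattern scan is a much cruder filter than the PF05485 profile HMM behind the hmmscan hits, so its matches should not be expected to coincide with them.

```python
import re

# Literal reading of the consensus from the Description:
# Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His.
# The lookahead lets matches that start at different positions overlap.
C2CH = re.compile(r"(?=(C.{2,4}C.{35,50}C.{2}H))")


def find_c2ch(protein: str) -> list[tuple[int, int]]:
    """Return 1-based, inclusive (start, end) spans of C2CH-like motifs."""
    spans = []
    for match in C2CH.finditer(protein.upper()):
        start = match.start() + 1              # convert to 1-based
        end = match.start() + len(match.group(1))
        spans.append((start, end))
    return spans


# Synthetic sequence for illustration only (not from this record).
toy = "M" + "C" + "AA" + "C" + "A" * 40 + "C" + "AA" + "H" + "K"
print(find_c2ch(toy))
```

Applied to the protein sequence of this record, the same function would list candidate motif spans that could be compared by eye with the `ali coord` columns of the hmmscan output.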
Sequence Information
- Coding Sequence
- atgtcacatcaacatcatcatcaccaccatcaccaccaacaacaccaacaacaatatcaacgcaaacaacagcagcaacaacaacagcaacatccttttcatatatatgctCAACCTCAGacacaacatcaacaacagccACCGCATCAACAATGGTATGCACATATAAATGATatacaaccacaacaacagcagcaacagcagcaacaacatcaacaatatatTGCTGGTCATGTAAGGGATTCCCGCCATTTACATGCGCCACATATATACGGTAGTATGTTGGGGCATAGTAGTGGAGCTTATATGGCTGGTGCTAGTATGCAGAGTATGGGACGGCATGCATATAATATGCCGGGTACTACTGCTACATCTGTGCCATACACATTTCCACACAATCCAATTAGGGGTGACAATAACATTGTGCATACAAGGAACTATGACCTTGAAATGGTAAACAGAGCACACAGCATACCATCAACTACTCATACAATGATTGGCGGTGATAATCATAGAAGCTATGATGCCTATTCGCATAATGCTGCTATATTTacgcaacaacagcaacagcaacaacaacaccaacagcagcaacagcaacagcaagtACGTTTACACCATCATCATGCCGTCCATCCTATACATCATCAATCGCATCAACAGCAACATTCTATACCTGGACATCAGGCaacacatcatcatcatcatcatcagcagcagcagcagcagcagcagcagcaacgtCACCAACAACAtattcaacaacaacttgTACCCAATCCAATGCATAATATGAAAACGAAACCAGTGGAGGAGATTACCATAACACCAACCATACAAATGgatgaaataattatcaaagCTGAACCTCCTGACGAATATATTTACCATAGAAATTTGCagcagcaacatcaacaacaacagcagcagccaTCCTATAGTGTGcttaaatgtataaaacaGGAACCACAGGCACAACCACaactattacaacaacaacaacagcaacaacaacaccatcatcagcagcagcaacagcgaCAGCAGCAACATCAGTTAGCGCAACAACGACAGTCTAAACAATTACAATCTCCACCGCCTCCGCAGCCGCCACCTCCTCCACCACCACAAGCACCACCAGCAGCAACAGTAACATCAATAGCAACTACACCTCCTCCTGAAGCCTCATCACCAATTGACGAGAAAGATATAAAACCTATGAACTTTCCACGGCGTAAAGTGCAAACGGAACGTTCCTCAACGTTGCCGATTTGTCACCGGTGTAAACAAGTATTTCTGAAGCGTCAAAATTACACACAACATGTGGCCCTGTCCACGTGCGATATGGTTGAATATGACTTCAGATGCTCGATCTGTCCAATGTCTTATATGTCCAATGAAGAATTGCGTGCCCATGAACGTTTGCATCGCTTGTATCGGTACTTTTGCATGCTGAATTGTGGTAAACATTTCGAGACAATACAAGAATGTGAACACCACGAATACATGGAACATGAccaatatatgtacaagtgCGGTATGTGTGGTTTGGATTATCCAACACGTGAAGAACTTTTGATTCATCGAAAATGTCATAAATATGCAACACGTTTTGTTTGTACGATATGTCGAGGCTGGTTTAGAAACATGTCAAGATTACACAACCACTATTTGAACAACCCATATAGATGTGGGAAATTTTACAACAAGGACGATTTCAATGCTTGTGCCACTGAAACCAGCATCAAGTTCAGAAAAGTTTCAACGGATACCAATGGCGAACATGATTCAAAACAGAGTACAgatgacCTAAATGCTTCAGATGATATTGTAGAGGAAATGGACACTGCTCCGCCAGATGAGTCATCTTCCGACGAAGAGGTTGAAACTGAAATAAAAGTTGAACCTGATTTTTATCCACCGATGGATCAAACCGATTATCAATCTGAAGAGTTTTCAAccaatcaaaatcttaattttttacatgactTCCAGGAA
AATGCTTCAAATAGCACAAATTCGTCGTATACCATGGGTGGTAGTGAGGCGATTGCGGGAGATCAAGATGTAGTttgttgtgtaaaattttgtggCGTTACAAAGCATGCCAGTCCTTCtttgcaatttttcaatttccctCGTGAAGACAAATACCTGCAACAGTGGTTGCATAACTTAAAAATGCCTTACGACCCGCAGGTTAAGTACACGCAATATCGCATTTGTAGTTTACACTTTCCCAAACGTTGCATAAATCGTTACTCGTTAAGTTATTGGGCAGTGCCCACCTTTAATTTAGGCCACGATGAAGTAGCCAATTTGTATCAAAATCGTGAAATTAGTAACAGTTTATTGAGCAGTGATGCTGCCCGGTGCTGTATGCCGGGCTGTCGTGCTCAACGTGGTCAAACGAATGttaagttttataattttcctaAAGATCTAAAGACTTTGATTAAGTGGTGTCAAAACGCTCGTCTGCCCGTGCACACGAAAGAGTCGAGACATTTCTGTTCGCGTCACTTTGAGGAGAAATGTTTCGGGAAATTCCGTTTGAAGCCTTGGGCTATACCCACGCTAAATTTGGGCACGGTGTATGGTAAGATTCATGACAATCCCAACGTTTCGTATTTGGAAGAGAAGAAATGTTGTTTGACGTTCTGTCGTAAAAGTCGTTCAGACGATTTCAGTTTATCTTTATATCGGTTTCCGCGTGACGAGTCCATGCTGCGTAAATGGTGCTACAATTTACGCTTGCATCCGGACGTTTACAGGGGTAAGAATCACAAAATCTGTTCACATCATTTCATTAAAGAAGCGTTAGGGTTGCGTAAACTGTCGCCGGGTGCGGTGCCCACTTTAAATTTGGGCCATAATGATCGCATAAATATCTACGACAATGAATTGCATCCACCATCCAATGTGGCGTCAACCTCATATAGCGCCAAAGCGGCAGCCGCCTCGGCTCTTACCGCATCTATACGTAAAGCTCAGTTATTAAAATACCGTAATGCTTCGCAATCAGCCAACGCATCAGCATCTTCCATATATGATGAGGTTCTCAGTAACAGTCAAAAGttctcctcctcctcctcttcGGTGGCATCCAATGCCATAGAATTGGGTGACGTTTGTTTAGTGCCCTCCTGCAAGCGTTCCCGGCACACGGAAAATGTCACGCTACACACTGTGCCGCGAAGACCCGAACAATTGGAGAAATGGTGTCACAATctgaaaatgaatttgaatgGTTTGCATAAGAATGCTCGCATTTGTAGTGCACACTTTGAGAATTACTGCATTGGCGGGTGCATGCGGCCATTTGCTGTGCCCACGCTGGAGTTGGGTCACGAGGACTCCGACATTTATCGCAATCCGGatgttattaaaaagttaaacatTCGCGAGACATGTTGCGTACCAAGTTGTAAGCGAAATCGGGACCGGGATCACGCCAATTTGCACCGTTTCCCCACCAATCCAGATTTGTTGCAGAAGTGGTGTGCAAATCTACAAAAGAGCATCCCAGATGGCACCAAGCTGTTCAATGATGCGGTGTGTGAGGTACACTTTGAAGATAAGTGTTTGCGCAACAAACGATTAGAGAAGTGGGCGGTGCCCACTTTGAAATTAGGCTACGAACCAATACCGCATTATTTGCCATCGCAAGAAGAAATTGATGAATATTGGTCCAAACCAATGGCGCCAAATAACGGTGATGAAACGGGTGAATGTTGTGTGGTCACCTGTAAAAGAAATCCACAAATGGATGATGTGAAATTGTATCGGCCACCCGAAGACGCCGAACAGTTGGTCAAGTGGTCGCACAATCTGCAAATTGATGTTACAAAATTGTCCACAATGAAGATATGCAATTTACATTTTGAAACGCATTGTATTGGCAAGCGGTTGTTGAACTGGGCGATGCCCACTCTCAATTTAGCCGCCAAGGTTgaacatttatttgaaaatccTCCACCCACGATTTCGCATTATCG
TCGCCGAGTGAAATTGGGTCTCAAAGCGGAGCATGACTTAATTAAGTGGTCGCCGCGCTGTTGTTTAGCCCATTGTCGCAAAATGCGTAGTCGCGATAATGTACAATTGTATCGGTTCCCTGTTAATTTGAATACCATGAGTAAATGGTGTCACAATGTACAATTGCCGGTGGTTGGCAGTTCACATCGACGCATCTGTTCGGCACACTTTGAATCGAGCGTACTTACCAAACGTTGTCCCATAGCTACAGCTGTACCCACAGTAAATCTGAATACACCGCCTGGCtataaaatttaccaaaactCACCGAAACTGAAACAGCACAAAATATTCAGCCAACGTTGGTGTGTGGTGAGTACTTGTCGCAAGACGCGCAGTGAGGGTGTACAATTGTTTCGTTTCCCTCATAATCgtataattttaacaaaatggcGTTATAATCTGAAGAATCTACCGAAAGGGAAGCTTAGCTCCCAATTTCGCATTTGTTCGTtgcattttgaaaatcattcgATTGGTATGAAGCGACTGTCACCGGGTGCCATACCCACATTAAATCTGGGACATGAGGCAGAGGATATATATCCTAATGAGACGCGTTCCTTTTTCGATCTAGATAAATGTGTGGTGAATGGTTGCTCGTCGAGTAAGGAGACAGAAAATATACGGCTTTTTAAATTCCCCGGAGATGATGAAGAGCTATTGGGCAAGTGGTGTCATAATCTGAAAATGAATCCCAACGATTGCATTGGCATCAAAATATGCAGTTTACACTTCGAAACAGATTGCATGGGTCCCCGACTATTGTACAAATGGTCTATACCTACTCTACAGCTGGGTTATGAAAATGCTGAGGATGCCCCAGAAATTATACCGAATCCACCAATTGAGAAACGCTGCGGTGAGGTGTTATTCAAATGTTGTTTGCCCAGCTGCGGCAAAACGCGCAAATACGATGAGGCTCAAATGAATAGTTTTcctaaaaatgttaaaatgtttCGTCGCTGGCAACATAATCTCAAGTTGGATTTTTTAGATTTCAAAGATcgagaaaaatataaaatctgCAATGATCACTTTGAGGCGATTTGTTTAGGCAAGACGCGACTTAATTTTGGAGCAATACCCACAATAAATTTGGGTCACAATCTCACCAACGACCTGTACAAGGTTAATCCACAAAAAATCTGGCCCAATTTGTTTACTAAACAATCGGAATATGAAAAAGAATTGTACGAGGGAGAAAATAAGCGAACAGATGGGGAATATTCACGCCTAGAAGGCGAAGTaggtgaagaagaagaagtcgTTGTGGAAACCTCTAATATACCAATGAATTTTACATGTTTATTCAACGAGTGCAATGGTCCAAAATCGTTGATGCGAGAACCCTACGATATACCACAAACAGAACAATTAAAAACCCTATGGTGTACCCTGATGAAAGTGGAGCCTAACAGTATTTCAGGGGATAACAAACTGTGTGGATTGCATTTCCAGCAGTTATTCAATGAAACCAGAGAACAAATGTTAGCTTTAAGTACCGAGGACACTGACATGAAACGCGATTTTGAAAAGTTGGATTATGCTTATAAAAAATCAGAAATATCTTTAGTTATCAAAGGATCAAAATGTAGCATACAGGATTGCTACAAGACGTTAGTGGAACCACATGTAAAACTCTATCAATTTCCCTACGGAAAAGAATTGATAGAGAAATGGTCCCATAACACGAACATCAAGCCCGATGAACATCGTcgttatttaacaaaaatatgctGCCTTCATTTCGAAGCGTATTGTTTTACACCCAATCAACGATTACGAACATGGGCTATACCCACACTGAACTTAACCAATGCTCCAACGAAAACTATACATAAAAATCCCGATTTAACCACATTGGATCGTCGCCTTGTAGGACCGCCTATTTTAAAATGCTGTGTTCCAAATTGCACTCAAGAGAATAATGAGACGATTGAGGGTAATAAACTTTTTA
GCTTTCCCATGGATGATAACATACTACAGAAATGgtgtgaaaatttaaaattgtcgcGTGAGCAAACGccaatttttaagatttgtgCTCAACACTTTGAGAAACAGTGCTTTGGACTTAGCCGTTTACGATCAGGTGCTATACCAACTGCAAAATTGGGCCACAATGAGGAACCAATCCATCATAATCAATCAAGCCTAAAGGAGGAGATGTACGAACCAAAAGTTGATCAAAATGTTGGATTGGGATTAAAGCAGGCAAAAATAAAGAAGTCTTTAGATAGCATGAAATGCTTTATACCCTCTTGCCGTCGCAGTCGCCTACAACATGGTGTACGTTTCTTTACCTTTCCCTCCAACCCCGTACTGAGGCATAAATGGTGTCACAACTTGCAAATGCCAGCAAATTTTGGCAAACTGCTAAGTATACGCATTTGTAGCATacattttcacaaaaagtgCTTGGagggtaaaaatttaaaagactGGGCGGTACCCACATTGCATTTGGGACATAATGAATCCATTTATGACAATCCACGGGCAATGCGACGTCACAACATACCAAAATGTATTCTACCCCACTGTGGCGAACAGAGATCTCATGGTAAAGAACTGCGTTTCTTTACATTTCCCAAGGATAatcaaatattgaaaaaatggtGTAAAAACCTGAAGCTCTCGGGGGAACAATGCAAGGGACGACATTTATGCGAGAAACATTTTGAAGCTAAAGTTTTGAGTTATAAAAGATTGAAAACGAGTTGCGTACCAACTTTAAATCTAGGCCATTCTGAGCCTTTGGTATATAACAATGTAGCTTTGTTGGAAGATAGACAGCATTCACTTACCGATGCTAATGCTAATGAAGCTGAAGATATAGATTCCTTGGAGCTAGATTTGGAAGAAGCCGAAGGTAATGACTTTGAAACAGAATCTTTAAGGACTCCAATAAGTTGGAGTAACTTGGAGTCTAGAGAGTTGCGAGTAAAAATCACACCTTTGAAACACGAAGACCTAACAGACATAGCTTCCATATGTTCTTCCATGAGtaaggaaaaagaagaaaatgattCCATTTATAGTGGTTGCGAGTCTCGAGAAGATACAGCCACCTTGTCTGCAAATCGCAAATCGAAGACCGTGAACAGCTTCAATGCCATCTGCTGTTTGAAACATTGTCGTAAAGAAAAAACACCAGAACAACATTTGACCACTTATGGCTTTCCCAAAGATCTGGAGCTTTTACGAAAATGGTGTGACAATTTGGGTTTGGAACTAAACCAGTGCATTGGGCGCGTGTGTGTGGATCATTTTGAGTTGAGAGTAATGGGGAGACGACGTTTAAAACCTGGAGCAGTTCCCACTTTAAATTTAGGTCATGACCGACCCTTAAAACATACCAACGatgcaattaaattaaaaatgaacgaGAAAAGTAAGACTTCCGAACAAACCGAAGATAAATTTTCTAGCCCAGAACCGAAACTGACTCCACCACCCTATAAAACTAGACCCACCAAACAATCGGTTTTTCGGCTATGTTGCCTCAAGCATTGTCGCCGCAAGAAAGACCTTGACACTAGCAATAAAGAAGTGCCCATAGTATTTAAATTCCCTCAAGAACccaaattgttaaaaaagtgGTCTGAGGCTTTACACATCCCTCTACAGCAGTGCACGCGCCCCAATTTGGGTTTATGTGCTGATCACTTTGAAACACattgttttgaaaatgaagcaaaatatcaattaaaagCGAATAGTGTACCAACCATAAATCTGAAGGAAGATTTTcagaaaaaggaagaaataTGTTGCTTGAAACATTGCGCCAGTTCAACAAAATGCCATGATAACGTTTTTTTGCTGTCGTTCCCCTTAAAACCGCAAGTTTTAATACGTAAATGGTGCTACAATACCAGAATCTCTCATAAAGTTAAGGGACTGAAATCCCTAAAGATCTGTAGCCTGCATTTCGAAAAGCAGGTCTTCTTTAGA
GGTTGTTTACTGCGGATTAATGCTGTACCCACAATTAATCTGGGACATACGGGAAAGATTTATAAGAATCCCAAATCGTATCGtataataaatgtacaaaaaccGTTAGAAAAGTGTTGTATAGTTAGTTGCCAACAGGAGAGTGACAAGTTGTATAGTTTCCCGAAAAATTCCGAATTGCGTCGTATTTGGTCGAATAATTCGGGTATTGAGACTCGCTTAGCATTAAAGCAACAATTGAAATTATGCAAGCGACACTTCACCGCGGACAGTTTCATAAGTGGTGGTGATTCTTTGAAATTGGAAGCAGTACCCTTGTTGTATTTGGACGTGGACAAAAGTCAGCATTTGGTATTGGACATGTCTACGATGATACAAAACAATCCTCATTGTTTGATACATAATTGCGGCTGCATACCCAGCGTTGATAAGGTCAAATTGTATCCATTTCCCCAGGAGAAAGAGGTCTTGGAAAAATGGCTTTTCAATCTGCAACTGCCAGAGAACTACGCCCCCGAAAATGCTTACATATGTAGCCGCCACTTTGACAAGGCATGCATACAGCGCGGattattacataaaagtgCTGTACCAACAATATTTTTGGGACATTCTGGCGGATTTTATAGAAACGGTGATGATATATTTAACACACCCTGTGCTGTGCCTCATTGTAAGTATGATCTGAACGAAGAAGACGATGGTGCTCATGATGTGCGTTTAATGTATAAATTTCCGAAAGATTCGCAACGTTTGAAAAAgtggttggataacatacGTATTACGGATGATGTTTATCAAAAGCAAAAGAATCGCCGCATATGTTCAGAACATTTTGAAGAGATCTGTAAGGTGGCCGGTAAGGATACTTTGTTACCGCATTCAGTGCCCACTCTAAATTTAGGCTACAATCCAGCTGAAGTGCCGCATAgaaatcatcatcaaaattGTTGCTTTGATTCATGTAAACTTAAGGACAAATATTCCTCGATGACAATGCATAAATTGCCACAAAATGAGAAGATGCGTGCGTTGTGGTTGGAGGAACTTGATTCGAAAGATGATGCTATATCCAAGCAATTCTTATGTGCGGCACACTTTCTGGCTATTTATGAGAGAGTTAAAGAGAAACACAAAGTTTTTGTTAAGCAATTGAACGAGTACGAAGCTTTATCTAATGTGTACAGGGACCTTAAACAAAGCGATTTACTGCAAAGTTTCAAGTGTTCTATACCGCAATGCTCCACAGGCTTTAAACAGACCATTAAGCTATTCAAATTTCCCATGGATGGTAACCTTTTCAATAAATGGCAACATAATACCGGCTTACAATTTGACATGAGCCAACGCAGCTGCCATTTAATGTGCGCTCTACATTTCGAGCCGCGCTGCCTATGCGAAGTGCAATTGCATCGCTGGGCTGTGCCCACCTTAGGATTACCCACATCTAACAGCTTGTACGTAAATCCACCCGAAGCCTTACCATCGGATCATGAAAATTTGCAACATTGTTGTGTGTCATCGTGCAGCTCAACACGGGGACCCTTCTTTCAATTCCccacaaaacaaattaatctAAAGAGATGGATACATAATCTGGGCTTGGGTACACAACAATGTACCACAAACCTACGTGTCTGTTATAagcattttgaaaaatactgCTTCATGAAACGTGAAGATCAAGCACTGACGTTAAAAATATGGTCCATTCCAACACTAAAACTACCCGCCAACCAGGACTTGTATAAAAATCCTATAGATAAAGTTTGCTATTTTTCGTGCAGCGTACCCGGCTGCAAACAGATACGCAATAGTACCGAAggtatttacttttattgtttccctaaaaataaaactttagaaAGGAACTGGCTTTTAAATACGGGCATAAAGCCGGGGAATTTTCGCGAAGATATGCGCATTTGTAGCTTGCACTTTGAACCAGAGTGCTTCCTAAAAGATTCTATGCAGCTAAGAAAGCATACAGTGCCCACTTT
GAAACTACGTACAGCCaacaatttattacataaaaatccCATACGCAAAAAACTTCTTATCAAGAATACAACCGAAAAGTGTTTGGTAAAGTCTTGTACATATGCGGGGGACACATTATATGATCTGCCAAAAAATATCAGTGAATTGAAAAGCTTAGGCCGTAATTTAGAATTGGAAGACGTGCCATTGGATCAGCTGGAGAGGAAtttgaaaatatgtaaaaaacattacaacgAAGAATTGTTAAAAAGGTCAGACGAAATGAAAGCCAAGCCTTTAACTGCAGCAACAATGGACCCGGAAAGTTTTGCAACTGAACACAAAGAGGAAGAGGAGGAAAACACAAACAGAGAAGCTAATATGGAAAACTTTACAATTGAACAAGTGGATTTGACAAGCTATGAAGAACAAAACTTAGGTAAAGCTGCGAAAACTATAAGAAACTTGGAGGATGGCGTCACTGTAACTACCTTTATTGATCCTTTGAGTGTAAAAGCTGAAAAGTTAGCGGAAAGAAAAGTCATACAAACAAAGAATTTAACTCACAAATACAAGCAATTTAATAACGATGGGGCTGCCACCACATCTCCCATCAAAACGGAAttgataaaagaaaataacaccCTTTGTAGGGTGGTATTGAAAAGGAAACTCTCCCTCTCCACGGAAACATCATTAAAAGTGTGCAAAATCGAAACCAGAATGAACCCCTTCAAAGAGTCCCCAAATGCACTAACAAATGATTCACCCTCCAACCTAACGCAAATATGTTGTGTGAAAAATTGTGGCAGCAAACAGAAAGACTCACCGGTTCAATTTACCGAATTTCCCAAAACAATtgccatttataaaaaatggttGCAAAACTTGCGCATACCGCACTCGCCCACTGTGCGACATTATTATCGCGTCTGTTGGCAACACTTTGAAACCGTTTGTTGTGGAAAGAATGGTTTAAAAATTGACTCAGTGCCAACATTAAAATTAGGACATAACAACACAGACATACATCCCTGTTTAGATGAAAATCTACCAAACACATCTAACCTACCACATCAGGGAGCTTCAATGCTAATGCAAACCCCAGCACTGGCACTGGCACAACCAAAAccgaaatataatataaagaagtGTTCATATCCAGAGTGCaaaagtaataaaacaaaattgtacgATTTGCCGCCATTTCAGAAATTGTGCGAAAAATGGTGGCAGTCCATGCAGCTGTATTCCGAGCATGACAAAAGCCAAGCAAAAGTTTGCGACATACACTTTTACATGTTATACCATCAACATGAAGATATTGTGTATGCCATAAAGCGAGAACAGCCGGGCAAGTATGGGGAATTGAAAGAATTGTACGCAAGTATAGCGGCCAGGGGGAAAGTGATACGTCATAGATGCATTGTACCGAACTGTACAACCGATTATCTAATCAACAGTAGTCTGGATATTAAACTGTACAACTTCCCCTCCGAATACCAGCTGGCGCAGAAATGGTGCAGCAATTGCCAAATCGATTATGAGACCTGCATCAATGGGGATCGTGATCATAACTACAAGGTGTGTGCTTTACACTTTGAAACCTATTGTGTGGGTAATGACTTGAAATTATACAACTGGGCAGTACCGACCCTACGGTTGTTACATGTTGATTATAACCAACTATGTACCAACAATGCGGATGATGTATTTTCCACATCAGGCCGTTGTTGCATCAGGGACTGCATTAATGAAAATGGCTTAAAAACCAACACAAGATTCTATCGACTGCTTGACAAGTGGATACATGGCATTGATGCGAAACTGCTTAATTTGAATGAGTTGCGTATATGTGGTGTACACTTCTCGAAGCGATTCTTTAGAAAAGATAAAAGTTTAACAGCCCAGGCCACACCGTATCTACAAATAAATCCAAACAATGCCAATTTGCATCATCATACGCAAGACACAAGAACAACAGCTGCAAAGGAGGAAGCTGTTGAGCAAGTGGACGATT
CTACAAATGCTGTTGTCACTGTTAAACAAGAAATTGAATACTGGGATGATTGGTACAACAGTCAACCAATAGACAGAGACGAGAGCATTGTCACACCTCAATacgaaattattaccataaagGAGGAAATCATTGATGATACGTATACCAATGACTTTCAAATGAATACGTCGTATGTAAATAccaatgatgataataataataaggatAATAATGATAAGGAGACTAGGCCACAAATTGTCGCCTGTTACTCACAACATCTACCCATCGATGAGACATTTATTAAGCAAGAAACTGATATAGATGAGAGAGATTACAAGATGGAGGAATCTATAATACCTGAACCAATCACTTTTCCATATCATAATAATGATGAGTCCGAAGCGATGCAAGAAAACGATCATCACACAAATAGGAAGAGTATTGTAATGCCTTTAGAAATAATACCAACCATTAGCACAAATGAACTGAATGAGAATAATgcgaatgtaaaaaaaagtaaaacgaTAAATGAGACAAATGCGCAAAGCACAGAATTATGTGAAATCAATTCAAATCAACAATTAGCAGACACAACAATAGCTACAGTAAGCAGCCCTTTAATGTCGATCAACAGCACACTTAACCAGGATGTTAACttcaataaaacaacaacaacaacaacatcatatgAAAATCGTTTCCAtcatattattacaacaatatcattaaatacaaaaaatgttgcTGATGAAAATCATAAAACATCTACAGCCACCTCTTCAATCCGTGTAGTGGCACCGGCCGCCTCCAACCAATCAATGATTAATCCCGTTTTTAAATTACACATTCTAACATGTTGTGTGGCCAGTTGTTTAAATTCCACCCAAACACCGCTAATTAAACTTTACACGGAATTCCCATCCGACTCGGatctatttataaaatggtgtttcaatttgaaaattgATCCGCGTCACTATAGAGAGCACTTGTATGCGGTGTGTAGTGCTCATTTTGATAGTGTTTGCTTTAAAGAGAGCAATCGCTCCTTACAGCCCTGGGCTGTACCAACATTAAATTTAGGTTTGCCCCACAATTCCTTCATACACCAATACGATATGCCGCATAGTTTGAAAGCAACAAATGAACAACAATGCATTGTATGGGGTTGTCATCAATCGCAAACACCTTTCTATCCATTTCCGGCTGATCCCCAGCAGTCCCGTAAATGGTTTACCAATTTACAGTTAGAATATACCGAATTTCGTGCACAAACGTATCGTGTTTGTCGCAAGCATTTTAACAATTCATTAATCGATGAGCATGGCCAGTTAGATAATGAGGCTTTGCCCACCTTAGATTTGAACCATAATAATAGTGATAATAATAGTGTCGGGTGTGCTCAGTCGCATAATGTTGATAGTGAAAATTTTAGGGCAGTACGTTTGGCAGCCGCTTTGGCACCACAAGATTTAGAAGATCACGACAGTAGTTATTATGAGGATTTTGAAGAGTGCCTGCAACACAATGAAcaggaaaattga
- Protein Sequence
- MSHQHHHHHHHHQQHQQQYQRKQQQQQQQQHPFHIYAQPQTQHQQQPPHQQWYAHINDIQPQQQQQQQQQHQQYIAGHVRDSRHLHAPHIYGSMLGHSSGAYMAGASMQSMGRHAYNMPGTTATSVPYTFPHNPIRGDNNIVHTRNYDLEMVNRAHSIPSTTHTMIGGDNHRSYDAYSHNAAIFTQQQQQQQQHQQQQQQQQVRLHHHHAVHPIHHQSHQQQHSIPGHQATHHHHHHQQQQQQQQQQRHQQHIQQQLVPNPMHNMKTKPVEEITITPTIQMDEIIIKAEPPDEYIYHRNLQQQHQQQQQQPSYSVLKCIKQEPQAQPQLLQQQQQQQQHHHQQQQQRQQQHQLAQQRQSKQLQSPPPPQPPPPPPPQAPPAATVTSIATTPPPEASSPIDEKDIKPMNFPRRKVQTERSSTLPICHRCKQVFLKRQNYTQHVALSTCDMVEYDFRCSICPMSYMSNEELRAHERLHRLYRYFCMLNCGKHFETIQECEHHEYMEHDQYMYKCGMCGLDYPTREELLIHRKCHKYATRFVCTICRGWFRNMSRLHNHYLNNPYRCGKFYNKDDFNACATETSIKFRKVSTDTNGEHDSKQSTDDLNASDDIVEEMDTAPPDESSSDEEVETEIKVEPDFYPPMDQTDYQSEEFSTNQNLNFLHDFQENASNSTNSSYTMGGSEAIAGDQDVVCCVKFCGVTKHASPSLQFFNFPREDKYLQQWLHNLKMPYDPQVKYTQYRICSLHFPKRCINRYSLSYWAVPTFNLGHDEVANLYQNREISNSLLSSDAARCCMPGCRAQRGQTNVKFYNFPKDLKTLIKWCQNARLPVHTKESRHFCSRHFEEKCFGKFRLKPWAIPTLNLGTVYGKIHDNPNVSYLEEKKCCLTFCRKSRSDDFSLSLYRFPRDESMLRKWCYNLRLHPDVYRGKNHKICSHHFIKEALGLRKLSPGAVPTLNLGHNDRINIYDNELHPPSNVASTSYSAKAAAASALTASIRKAQLLKYRNASQSANASASSIYDEVLSNSQKFSSSSSSVASNAIELGDVCLVPSCKRSRHTENVTLHTVPRRPEQLEKWCHNLKMNLNGLHKNARICSAHFENYCIGGCMRPFAVPTLELGHEDSDIYRNPDVIKKLNIRETCCVPSCKRNRDRDHANLHRFPTNPDLLQKWCANLQKSIPDGTKLFNDAVCEVHFEDKCLRNKRLEKWAVPTLKLGYEPIPHYLPSQEEIDEYWSKPMAPNNGDETGECCVVTCKRNPQMDDVKLYRPPEDAEQLVKWSHNLQIDVTKLSTMKICNLHFETHCIGKRLLNWAMPTLNLAAKVEHLFENPPPTISHYRRRVKLGLKAEHDLIKWSPRCCLAHCRKMRSRDNVQLYRFPVNLNTMSKWCHNVQLPVVGSSHRRICSAHFESSVLTKRCPIATAVPTVNLNTPPGYKIYQNSPKLKQHKIFSQRWCVVSTCRKTRSEGVQLFRFPHNRIILTKWRYNLKNLPKGKLSSQFRICSLHFENHSIGMKRLSPGAIPTLNLGHEAEDIYPNETRSFFDLDKCVVNGCSSSKETENIRLFKFPGDDEELLGKWCHNLKMNPNDCIGIKICSLHFETDCMGPRLLYKWSIPTLQLGYENAEDAPEIIPNPPIEKRCGEVLFKCCLPSCGKTRKYDEAQMNSFPKNVKMFRRWQHNLKLDFLDFKDREKYKICNDHFEAICLGKTRLNFGAIPTINLGHNLTNDLYKVNPQKIWPNLFTKQSEYEKELYEGENKRTDGEYSRLEGEVGEEEEVVVETSNIPMNFTCLFNECNGPKSLMREPYDIPQTEQLKTLWCTLMKVEPNSISGDNKLCGLHFQQLFNETREQMLALSTEDTDMKRDFEKLDYAYKKSEISLVIKGSKCSIQDCYKTLVEPHVKLYQFPYGKELIEKWSHNTNIKPDEHRRYLTKICCLHFEAYCFTPNQRLRTWAIPTLNLTNAPTKTIHKNPDLTTLDRRLVGPPILKCCVPNCTQENNETIEGNKL
FSFPMDDNILQKWCENLKLSREQTPIFKICAQHFEKQCFGLSRLRSGAIPTAKLGHNEEPIHHNQSSLKEEMYEPKVDQNVGLGLKQAKIKKSLDSMKCFIPSCRRSRLQHGVRFFTFPSNPVLRHKWCHNLQMPANFGKLLSIRICSIHFHKKCLEGKNLKDWAVPTLHLGHNESIYDNPRAMRRHNIPKCILPHCGEQRSHGKELRFFTFPKDNQILKKWCKNLKLSGEQCKGRHLCEKHFEAKVLSYKRLKTSCVPTLNLGHSEPLVYNNVALLEDRQHSLTDANANEAEDIDSLELDLEEAEGNDFETESLRTPISWSNLESRELRVKITPLKHEDLTDIASICSSMSKEKEENDSIYSGCESREDTATLSANRKSKTVNSFNAICCLKHCRKEKTPEQHLTTYGFPKDLELLRKWCDNLGLELNQCIGRVCVDHFELRVMGRRRLKPGAVPTLNLGHDRPLKHTNDAIKLKMNEKSKTSEQTEDKFSSPEPKLTPPPYKTRPTKQSVFRLCCLKHCRRKKDLDTSNKEVPIVFKFPQEPKLLKKWSEALHIPLQQCTRPNLGLCADHFETHCFENEAKYQLKANSVPTINLKEDFQKKEEICCLKHCASSTKCHDNVFLLSFPLKPQVLIRKWCYNTRISHKVKGLKSLKICSLHFEKQVFFRGCLLRINAVPTINLGHTGKIYKNPKSYRIINVQKPLEKCCIVSCQQESDKLYSFPKNSELRRIWSNNSGIETRLALKQQLKLCKRHFTADSFISGGDSLKLEAVPLLYLDVDKSQHLVLDMSTMIQNNPHCLIHNCGCIPSVDKVKLYPFPQEKEVLEKWLFNLQLPENYAPENAYICSRHFDKACIQRGLLHKSAVPTIFLGHSGGFYRNGDDIFNTPCAVPHCKYDLNEEDDGAHDVRLMYKFPKDSQRLKKWLDNIRITDDVYQKQKNRRICSEHFEEICKVAGKDTLLPHSVPTLNLGYNPAEVPHRNHHQNCCFDSCKLKDKYSSMTMHKLPQNEKMRALWLEELDSKDDAISKQFLCAAHFLAIYERVKEKHKVFVKQLNEYEALSNVYRDLKQSDLLQSFKCSIPQCSTGFKQTIKLFKFPMDGNLFNKWQHNTGLQFDMSQRSCHLMCALHFEPRCLCEVQLHRWAVPTLGLPTSNSLYVNPPEALPSDHENLQHCCVSSCSSTRGPFFQFPTKQINLKRWIHNLGLGTQQCTTNLRVCYKHFEKYCFMKREDQALTLKIWSIPTLKLPANQDLYKNPIDKVCYFSCSVPGCKQIRNSTEGIYFYCFPKNKTLERNWLLNTGIKPGNFREDMRICSLHFEPECFLKDSMQLRKHTVPTLKLRTANNLLHKNPIRKKLLIKNTTEKCLVKSCTYAGDTLYDLPKNISELKSLGRNLELEDVPLDQLERNLKICKKHYNEELLKRSDEMKAKPLTAATMDPESFATEHKEEEEENTNREANMENFTIEQVDLTSYEEQNLGKAAKTIRNLEDGVTVTTFIDPLSVKAEKLAERKVIQTKNLTHKYKQFNNDGAATTSPIKTELIKENNTLCRVVLKRKLSLSTETSLKVCKIETRMNPFKESPNALTNDSPSNLTQICCVKNCGSKQKDSPVQFTEFPKTIAIYKKWLQNLRIPHSPTVRHYYRVCWQHFETVCCGKNGLKIDSVPTLKLGHNNTDIHPCLDENLPNTSNLPHQGASMLMQTPALALAQPKPKYNIKKCSYPECKSNKTKLYDLPPFQKLCEKWWQSMQLYSEHDKSQAKVCDIHFYMLYHQHEDIVYAIKREQPGKYGELKELYASIAARGKVIRHRCIVPNCTTDYLINSSLDIKLYNFPSEYQLAQKWCSNCQIDYETCINGDRDHNYKVCALHFETYCVGNDLKLYNWAVPTLRLLHVDYNQLCTNNADDVFSTSGRCCIRDCINENGLKTNTRFYRLLDKWIHGIDAKLLNLNELRICGVHFSKRFFRKDKSLTAQATPYLQINPNNANLHHHTQDTRTTAAKEEAVEQVD
DSTNAVVTVKQEIEYWDDWYNSQPIDRDESIVTPQYEIITIKEEIIDDTYTNDFQMNTSYVNTNDDNNNKDNNDKETRPQIVACYSQHLPIDETFIKQETDIDERDYKMEESIIPEPITFPYHNNDESEAMQENDHHTNRKSIVMPLEIIPTISTNELNENNANVKKSKTINETNAQSTELCEINSNQQLADTTIATVSSPLMSINSTLNQDVNFNKTTTTTTSYENRFHHIITTISLNTKNVADENHKTSTATSSIRVVAPAASNQSMINPVFKLHILTCCVASCLNSTQTPLIKLYTEFPSDSDLFIKWCFNLKIDPRHYREHLYAVCSAHFDSVCFKESNRSLQPWAVPTLNLGLPHNSFIHQYDMPHSLKATNEQQCIVWGCHQSQTPFYPFPADPQQSRKWFTNLQLEYTEFRAQTYRVCRKHFNNSLIDEHGQLDNEALPTLDLNHNNSDNNSVGCAQSHNVDSENFRAVRLAAALAPQDLEDHDSSYYEDFEECLQHNEQEN
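
The coding and protein sequences above can be checked against each other by translating the former. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper. It ignores case and whitespace, so the mixed-case, line-wrapped sequence can be pasted in as is.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()       # drop line breaks, ignore case
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")   # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First four and last five codons of the coding sequence in this record.
print(translate("atgtcacatcaa"))      # MSHQ
print(translate("gaacaggaaaattga"))   # EQEN
```

The two fragments reproduce the start (`MSHQ`) and end (`EQEN`) of the listed protein sequence; comparing `translate(full_cds)` with the full protein string would extend the check to the whole entry.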
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00384614;
- 90% Identity
- iTF_00384614;
- 80% Identity
- -