Dmay011130.1
Basic Information
- Insect
- Drosophila mayri
- Gene Symbol
- -
- Assembly
- GCA_008042485.1
- Location
- VNJN01001702.1:55255-68540[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   29  4.2       9.7e+03   -2.7   1.9   49        60      197       213     181       231     0.56
2   29  2.3e-15   5.2e-12   46.2   4.0   1         86      426       498     426       499     0.85
3   29  8.6e-15   2e-11     44.3   5.0   1         87      526       595     526       595     0.83
4   29  7.9e-16   1.8e-12   47.7   0.2   1         87      617       689     617       689     0.85
5   29  5.4e-16   1.2e-12   48.2   5.3   1         87      787       857     787       857     0.82
6   29  1.9e-15   4.3e-12   46.5   3.6   1         86      881       952     881       953     0.82
7   29  9.9e-13   2.3e-09   37.7   1.5   1         87      988       1056    988       1056    0.81
8   29  7e-11     1.6e-07   31.8   1.4   1         86      1098      1167    1098      1168    0.76
9   29  4.3e-17   9.8e-14   51.7   0.4   1         86      1195      1264    1195      1265    0.82
10  29  1.2e-12   2.6e-09   37.5   1.6   1         85      1286      1354    1286      1356    0.79
11  29  1.2e-14   2.7e-11   43.9   0.6   1         86      1383      1454    1383      1455    0.85
12  29  1.5e-12   3.5e-09   37.1   3.7   1         85      1529      1597    1529      1599    0.82
13  29  1.5e-12   3.4e-09   37.2   0.1   1         86      1622      1690    1622      1691    0.83
14  29  1.8e-13   4.1e-10   40.1   2.2   1         87      1838      1907    1838      1907    0.80
15  29  1.7e-10   3.8e-07   30.6   0.2   1         87      2002      2076    2002      2076    0.79
16  29  0.014     32        5.2    1.3   1         61      2091      2143    2091      2162    0.69
17  29  1.1e-13   2.5e-10   40.8   0.1   1         87      2170      2241    2170      2241    0.79
18  29  7.2e-13   1.6e-09   38.2   0.2   1         87      2293      2363    2293      2363    0.81
19  29  3.5e-12   8e-09     36.0   0.1   1         86      2398      2472    2398      2473    0.80
20  29  9.2e-13   2.1e-09   37.8   0.0   1         86      2483      2556    2483      2557    0.80
21  29  5.4e-11   1.2e-07   32.2   0.0   1         61      2582      2637    2582      2654    0.76
22  29  1e-05     0.024     15.2   0.6   1         58      2683      2733    2683      2753    0.84
23  29  3.5e-11   7.9e-08   32.8   0.8   1         87      2773      2845    2773      2845    0.81
24  29  3.5e-16   7.9e-13   48.8   0.4   1         86      2957      3029    2957      3030    0.81
25  29  2.8e-12   6.3e-09   36.3   3.5   1         86      3090      3160    3090      3161    0.80
26  29  7.9e-14   1.8e-10   41.3   4.6   1         86      3253      3323    3253      3324    0.84
27  29  3.6e-12   8.3e-09   35.9   0.2   1         86      3405      3474    3405      3475    0.85
28  29  3.1e-10   7.1e-07   29.7   0.7   1         58      3501      3549    3501      3560    0.82
29  29  3.4e-10   7.8e-07   29.6   1.1   18        87      3567      3625    3556      3625    0.77
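The C2CH consensus quoted in the description (Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His) can be written as a regular expression. The sketch below is only a loose translation of that spacing rule and is not a substitute for the Pfam profile (PF05485) used by hmmscan; the names `C2CH` and `find_c2ch` are hypothetical and not part of this database or any library.

```python
import re

# Loose translation of the consensus spacing (an assumption, not the Pfam HMM):
# Cys, 2-4 residues, Cys, 35-50 residues, Cys, 2 residues, His
C2CH = re.compile(r"C.{2,4}C.{35,50}C.{2}H")


def find_c2ch(protein):
    """Return 1-based inclusive (start, end) spans of non-overlapping C2CH-like matches."""
    return [(m.start() + 1, m.end()) for m in C2CH.finditer(protein.upper())]


if __name__ == "__main__":
    # Fragment copied from the protein sequence in this record.
    fragment = "CCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHF"
    print(find_c2ch(fragment))
```

Because `finditer` reports non-overlapping matches and the pattern ignores the other conserved residues (for example the AVPTIF box), the spans it returns are at best a rough cross-check against the hmmscan coordinates, not a domain call.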
Sequence Information
- Coding Sequence
- ATGTCACAACACAACCACAATCACGCCCACCCACACTACCAATACCCCataagcaacaacaacagtatGGGCGCCTATGGGGGAGGAGTGGGAGGGGGTGGAGGCTCGCATGGATATTTCGGCGCCGCTGGCGGTGGCCTCAATAGCGAACCCTTGGAGGGGTTCCAGCAGCCGCCCAACCCAATGGCCCCACCCCCGGCCCCagaaatgataataaaatCGGAACCCATTGACGACCTGGCCTACAAGTCAAACTACATAGACGACAATACGCCATTTGCGGACTTCAGCAAGTTTAGCGATTTCAGCGAGGACATGCTGAGTCCCAAAGTGGAGCTGACAGTCAAGGATGAGTCCTTCGTTAGGAACCCCAATAGCTTTTTACGCCGTAAACAACAATCGGATCTGACGACAGCAGAGAGCCTGCCCGTCTGCCAACGATGCAAAGAGGTGTTCTTCAAGAAGCAGACATACCTGCGTCACGTTGCCGAGAGTAGCTGCGGCATCCAGGAGTATGACTTTAAGTGCACCATATGCCCCATGTCCTTCATGAACGCCGAGGAACTACACGAGCATAAGCAACAGCATCGAGCGGACAGATTCTTCTGCCACAAGTACTGCGGAAAACACTTTGGCACGATCACAGAGTGCGAGGCGCATGAGTACATGCAACATGAATACGAAAACATTGTGTGCAACATGTGCTCGGGATCCTTTGCCACGCGGGAACAACTGTATGCTCATTTGCCGCAGCACAAGTTCCAGCAGCGCTTTGACTGCCCCGTATGCCGCCTATGGTACCAAACGGCTCTGGAGCTGCACGAGCATCGGCTGGCTGCACCCTACTTCTGCGGTAAATACTACACGGGCGGACAGTCCCCGTCCCCGTCCTCCcaacagcatcagcaccaGAGTCAGACGAACTACAAGCTGCAGGACTGTCATATGGCCACCATGGAAATGCCAAACGCACCGCTCCTTAAGGCAAACTCGTCCAACTCGCCTGCCttgccagcaacagcagcgctTAACTCACTGTTGCAACAGCGCCAGGCCAATGCCGATGGAGCGGCTATTTTTGCCGCATCTTCGCTGAAGAACGAGGTCGCTGTGAAACTGGAGCGCAGCTACAGTAACTCGACCAACGAATCGTCTTATAGCGTCCAGGAGAGCGGCTACAATAATGTGTATGGCAGCAGTGACAGCTCAGGCCACGGTGCCATCGCCGGACCACAGGCACACTCTTCGACGCTTGACGATTCCGAGGATGCGCTGTGCTGTGTGCCGCTGTGCGGCGTGCGGAAGAGTACGAGTCCCACCTTGCAGTTCTTCACGTTCCCAAAGGATGAAAAATATCTCAACCAGTGGCTGCATAATCTCAAGATGTTCCACATACCTGCTTCCAGTTACGTAAGCTTCCGGATCTGCAGTATGCACTTCCCCAAGCGATGCATCAACCGCTATTCGCTGTGCTACTGGGCGGTGCCGACGTTTAACCTCGGCCACGATGACGTAGCCAATCTCTACCAGAACCGGGAGTTGACCAACACGTTTACCACTGGCGAAGTGGCgcgctgcagcatgcctcactGTACCAGTCAGCGGGGTGAGAGCAACCTCAAGTTTTACAACTTCCCAAAAGATATTAAAAGCCTGATTAAGTGGTGCCAAAACGCCCGTCTTCCGGTGCAAGCAAAGGAGCCGAGACATTTCTGCAGCCGCCACTTTGAGGAGCGGTGCATTGGCAAATTCCGACTGAAACCTTGGGCAGTGCCCACCTTGCACCTGGGCGCCCAGTATGGGAAGATCCACGACAATCCAAAGAATCTATACGTGGAAGAGAAACGCTGTTGCCTCAACTTTTGCCGCCGGAGCCGCTCTTCCGATTTCAATATGTCGCTATATCGATTTCCTAGAGACGAAGTCCTGCTACGTCGCTGGTGCTATAATCTTCGCCTCGATCCGGGAGTGTACCGCGGCAAGAATCACAAAATATGC
AGCGCTCACTTTATAAAAGAGGCGTTGGGTCTTCGGAAACTATCTCCTGGTGCCGTGCCCACACTTCATCTGGGCCACAATGATACCTTCAATATCTACGAGAACGAACTGTGGCCGCCGCCAGCACCGACACCCTCCTCTTGTCATctccaacagcaacagcagtcaTCCCTTCATTCGCTTCAACAGCAAATGCACAGCAAGTCCTACCAGCGCCGTTCAGCAGCATCCACATCGTCATCGGCAAGCTCGGCAGCTTCGCATTACGTGGACCCAGAGATGAGTGCCTCTTAccatctagccatgtccgccTCCGCCGGCGGCTCTGCAATGATAAACGCCAGCGACAGCATGGATGTATGTTGCGTGCCCAGTTGCGAGAGCAAGCGACACAATAGCGAGAACATTACATTCCACACGATTCCGCGACGGCCCGAGCAGATGCGCAAATGGTGTCACAATCTTAAGATTGCCGAGGAcaagatgcacaagggcatgCGAATCTGTAGCCTTCACTTCGAGCCCTACTGCATCGGCGGCTGTATGCGACCGTTTGCTGTGCCCACTCTTCACTTGGGCCACGACGATGAGGATATCCACCGCAATCCGGACGTGATCAAGAAGCTGAACATCCGGGAGACATGCTGTGTGGCTGTGTGCAAGCGGAATAGGGACAGGGATCATGCGAACCTGCATCGTTTCCCTAGCAACGTGGCGTTACTGAAGAAATGGTGCGCCAATTTGCAGCGCAGCGTGCCCGATGGCAGTAAACTTTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGTTGTTTGCGCAACAAAAGGCTCGAGAAGTGGGCAGTGCCTACTCTGATCCTGGGACACGATGACATTGCCTATCCGCTGCCTACGCCAGAGCAAGTAACCGAGTTCTATGCCCGGCCGACGGCTCCCAACAATGGTGAGGAACAGGGCGAGTGCTGTGTGGAGACCTGCAAGAGGAATCCGAGCGTGGACGATATAAAGCTATACCGCCCACCGGAGGAAGCCGCCGTGCTGGCCAAGTGGGCTCACAACCTGCAAACGGAGGCCAACCAACTGACAAGCATGAGGATCTGCAATCTACACTTTGAGGCACATTGCATCGGCAAGAGGATGCGCCATTGGGCCATACCGACTTTAAATCTAGCCGGCAACATTGAGAATCTTTATGAGAATCCAGAGCAATCGCTGCTGTACAGGCGTCGCACTACTCACTTGAAGGCGAAGCTGCCGCAATCCTCCGTGAAACCCACCTGGGTGCCCAGGTGCTGTCTTCCGCACTGTCGCAAGGTCAGAGCTCTGCACAATGTCCAGCTTTATCGGTTCCCCAAACTCAATCGCTCCACATTGGCCAAGTGGGCGCATAATCTCCAGGTTCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCGGCCCATTTCGAGCCACATGTGCTGAGTAAAAAGTGTCCGGTGCCGCTGGCGGTGCCTACGCTTGACCTAAATTCACCACCCGGCTTGAAAATCTACCAGAATCCGGCCAAGCTAAAGGCTAACAAACTGTGCCTGCAGCGGGTTTGCATCGTCGAAAGCTGCCGCAAGACGCGGGCGCAGGGCGTTCAGCTTTTCCGGCTGCCGCACAGCCCCACACAGCTGCGAAAATGGATGCACAACATTAGGACGCGGCCACGAGCAGCTATGCGGGCTCAGTACCGGGTGTGTTCCCGTCACTTTGAGACGCACTCCTTCAATGGCCGAAGACTGAGTGCAGGTGCAATTCCGACTTTAGAACTGGGCCACGATGACGACGATATCTATCCCAATGAAGCGCAGGCATTTGTGGACGAGCATTGTGCTGTCGAAGGCTGCGAGGCATCCAAGGAGCAGCCGGAGGTGCGACTGTTCCGCTTCCCCACCGACGACGACGATATGTTGTGGAAGTGGTGCAACAACCTGAAAATGAATCCTGTGGACTGCATTGGGGTACGCATCTG
CAACAAGCACTTCGAGACCGATTGCATCGGTCCCAAGCATCTGTATAAGTGGGCTATTCCCACGCAGGAGCTGGGCCACGACGACGCCCAGATCGAGCTAATTCCGAATCCCAAGCCAGAGGATAGGTATGTGGATCCAGTCTTCAAGTGCATCGTTCCCACCTGCGGCAAGACACGACGGTTTGACGAGGTGCAAATGAACAGCTTCCCCAAGGACCCGGATCTATTCCAGCGTTGGCGGCACAACCTCCGCCTCGATCATCTCAGTTTCCAGGAGCGTGAGCGCTACAAGATCTGCAACTCACACTTTGAGGAGATTTGCATTGGAAAGACACGGCTAAACATTGGATCCGTTCCAACCTTGGAACTTGGTCATGACGATGAGGAGGATATTTTCCAAGTAAATCCAGCGGAGCTGCAGAGCAATTTATTCGGACGGCAGCGTCGACTGCTGCTCGAGGGATCCGGCGAACAGAGTATCAAGCAAGAGCTGTCCGAGACGGAGGACAGCAACAAAGCGGATGTGACGGCCACAGGCTCTATTTCCAAACAGaTCAAGATCAAGAGATCTTCTTCGGATCTAAAGTGTTGCGTGCACAGTTGTGGAAGAAGTCGCTTGGAACACGGGGCACGCCTGTTTCCCTTTCCTACAGGCAAGCAGCAACACCTAAAGTGGCGTCACAATCTGCACCTGGAACCGGAGGAGGTGGACCGTTCGACGCGCGTTTGCAGTGCTCACTTTAATCGACGTTGCATCGAGGGCAAACAACTGAGGAGCTGGGCGATGCCCACCCAACAGTTGGGACACAACGACCAGCCAATCTACGAGAACCCAAAGAACATACCTGGATTCTTCACACCTACCTGTGCCCTGGGACACTGTCGGAAACGGAGGAGTATTGACAACGATCTGCGTACCTATCGATATCCCAGGAGCGAAGATCTTCTAGAAAAATGGCGAGCTAATCTGCGGCTGGCTCCCGATCAGTGTCGTGGTCGGATCTGTGCCAATCACTTTGAGCCACAGGTGCGCGGCAAGCTAAAGTTGAAGACGGGAGCCGTTCCCACACTACAACTGGGACACGATGAGGATTTAATCTATGACAATGAAGCTATAAAGGCGGGCATGACCGAAGAAGAGGAGGCCATTTCCACAGACTTCCCGcgattaaaaccaaaaaaagagttgttcgaagaggaggaggaggagtgcgAAGGGAATGATGGCGAGCAGCAGCATACAGATGACCTGGATGATAATGCAGATGAAGAAGACAAAGATGATCAGTACTTTGACCCACTTGAGCTAGTTGAGACTTTTGCTGAACATCGTAGTGATGACGAAGCCCAAGACTATGAGGATGAAGAAGACGAAGAACGAGTTGAAGACTCCCCCTCCGGTTATGATGTCAAGGAGGAGATTGAACCGCCTCCAAGCTCTTCACCTTCTCCGCTTCGCCGACGGCATCATGTTCCGCGTCGAGACAAGCCGGCCAACAATGTGACTCCCATTTGCTGCCTGAAGCACTGCAGGAAGGAACGCACTGCCTTCCACCTGCTGAGCACTTTCGGCTTCCCAAAGGATCGCCAGTTGCTGCTGAAATGGTGTGCCAATCTGCATTTAAACCCGGACGACTGCATCGGTAGGGTTTGCATTGAGCACTTTCAGCCGGAGGTACTCGGCACCCGCAAGCTCAAGCAGAATGCAGTGCCCACTCTTAATGTGGGACATGATGAACCGCTCAGGTACTCGTGCCATGGAGTGGACCAGAATCTCGAGGAGCGGGAGCCCCAGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGAAAAAGGAAGCTAACGGAGCCGCCGGATATTCCCCTAGCCAAGAGGAGAGTGCTGGAGATGCCGATGATGAAGCGGGAGTGGGAGATGGAGATGccgatgcagatgcagatggagCAGAAGAAGGAGGCAAAAAAGATGACTCAAACTGAAAGTAAATCAC
TTATATGCTGTATTAGCAGTTGCGGAAGGCAGGAACTTAACCAATTGGTGGCATTTCCTAAAGAGAAGTCCTTGTTAAGAAAGTGGATGCATAATTTAAGGCTGCCCACTGAGATTGAGTCCACTTGCCTAAGCCTGAAAAGAGCTTGTTTGGCGCATTTCGAAACGCAGCTCTTGGAGAATGGAAAGCTCATAAAGGAAGCAGAGGCAGTGGCTGTGCCGACTTTAAACCTGGGCCATAGCAGCTGGAACCTATACAGGAGCAATGGGATCTGCCTAGTGCCAAACTGCGCCTTTAATAACTTCAGAAACATTAGCTTTATTGACCTGCCGGATAACAGTATTATTAGGGACGCTTGCTTCTCCTGCCTGAACCTACCTGAATCCTGTGAGGAGCTGGCAAAGCTATGTTGTATCCACTTTATGGAGGCTTACAAAAAGTTTGATCTTCCTAATGTTCTGCACCCTGAAGTCATGACGATGCTACAAAGTGTTGTGGCCGAGCTGCAATGCGCGGTGCCAGGCTGCAATTTCGAAGATGCTGATCCGGACTTTCAACTAATACAGTTTCCCGATAACAAGGAGGCGCTGTCACAGTGGCTGCACAACACCAAGGTCCCGTATGATCCTTCTAGCCACCACAGTTATCGCATCTGCACGCGTCACTTTAAATCAGAGTATTTAGAGACGAATGGCCCGCTAGAAGGGGCTATACCGACGCTCCATCTAAACCATGAAGATGAGATTCACTTGAATACTAGCTCTTTGCCAGGGGATTCGAATTCTATATTAACTCCACTGCGTATAAAGACGGATCCGGCCTTTTTGGGCAGTCCCTGTGCAAGTGCAAGCCCCAGTCCCCGGGGCAAGATCCGGATTTGCTGTATTCCCTCATGCGGCCAGTTTGGCAGCAGTCAAGTGAGACTGTTTCGTTTTCCCACCGAGGAGCAGGCGTTGCTCCGGTGGCTGGTGAACACACAGCAGCAACCGCGGCTGGTTGATCCCATGGACTTGTATGTGTGCCAGTCGCATTTTGAGCCCGAGGCCATTTATATGAAGCATCTTCGAAACTGGGCTGAGCCCACCTTAAACTTGGGCCACGACGGCCATATAATCCCGAATGCCAAACACAATGGAAACATTTCCGACAGCCAAGATACTGAGCAAGCCATGAGGTTTATTCGCGAGCGATTCTGCTCTGTCCTTTCTTGCTTTCAGGCAGGCGGTCAGGAAGAGGAGGGAGTGAGGCTATTTGATTATCCCGAGGATATGGCGACTACTCGAAAGTGGGCAGCCGCATGCAGACATCGCTCCATGCAGGCCAGGAGCCATGGGTTCAAGGTGTGCCAGTTGCATTTCGCCAAGGAGTGCTTTGATCCAAATTCTGGAGCATTGATTGAGGGCGCTGTGCCCACTTTGGAGTTGAGCAGAGATGAAATGGAGAGGCAATGTCTGGTGGCTGGATGTGTAAAAAATGATGCCACTGGAACCCGCCTTCGCTACTTTAAGATACCAAAAGTGGCTGCTCAATTGGAAGCGTGGAGCAACAACCTTAAAGTCCATCCAACGGATCTCATGCAAGGAGAGCAGCAATACATCTGCGAGAAACATTTTGAGTCGTTCTGCTTTGGAGCCAACAAGGGACTGCGTTCTGGTGCTCTTCCAACCCTCTTGCTAGGCCATGATGAGGAGGTGGATATGCTTCCAAATCCGGAAAGCTTTATCTGCCAGAATAAGGCCGATAAATGCTGCGTACCTGGTTGCGGGCGTGTCTGGCAGGCTGGCGATCGTAAATTTCGTGGATTTCCCAAATTGCTGGCCATGGCCAATAAATGGAGGCATAATCTTCGCTTGGACGAGCCTGTGGAGCAACTCGGCAAGCTGAAGGTCTGCAGTGCACACTTTGAGGCCACCTCACCCAACCTGGGTACAAATGGTCTAAGTGTCTCGATACCAACTTTGGAATTGGGCCACTCTTCTCCGGATATTTTCCCAGCGGAA
ATTAGCTTAAAGTTCCAAAAGCGCTCTGGAATGCCGGCGAAAATTTATTGCTGTTATCCCAAATGCGAGGAAGTCTGTTTATCCAAGAATTTTTCTTACAGCCTTCCCCAGGAGGAGCATCTGAGAAATGCCTGGCTAAGCCATATGGACATAGAAGATCCAAAAGATGAAGAAATCGCACGGGTTTGCCCGCTGCACTATGTCATTCTCTACCAGCACAGTGCCGCACTCTATCCGGAGCTTCATGCTTCAAGACGACAGCTTCTTGACTTCAACTACAAGGAGGCGTGGAACAACAGGCGCGTAAAGATTGTGAGTTGCACGATTAAGGGCTGCGACATGGTTAGGCCTCGAGATGGCGTACCACTGCACGGGATGCCGCAAAGCGATGAAATCCTGCAGATGTGGATAGACAATGGCCAGTTTGAGTTTTTGGAGCAACAGCGGTATATGTTCAAGGTGTGTCACAATCATTTTGAGCCATGCTGTTTCTTCGACGACAGACGTTTGCATTCATGGAGCGTGCCCACTTTGCATCTACCTGGAGATGTAATTCACCAAAATCCCACTCCCGAGCAGTGGCAGAACATGATCAATaagcaagcagcagcaaaaacacaCGCTGAAGAGAACGAGGAGCCAGATCCATATGAGGATGTGGTTAAAACCGAACCCGTTGTAAAGATGGAGCATATCGAATCGGAATATGAAGATGAAAACCCTGAGATGCAGGCCCTAGAGGTCCTCCTAGAAGTTGGTCATGTCGAGCGAATGGAGAGCTATGAGGAAATGGATAAATCACCAGCGATATACGCCGATAGTGCGCCCTTTCGATCCTCACCCATACGCTGCCAATATAATGCTAATCATTGTGCCGTTGAAGGATGCCAAGTGACTGTCGAAGATGTGGACGGCACGATTAAGCTGCATAAATTCCCGGCGTCGCAGGAAGCCGCACAGAAGTGGATGCACAACACCCAAGTTGATATGGATGAAAAGTTCTGGTGGCGCTATCGCATATGCAGTTACCACTTCGATCAAGAATGCTTTCAGAGTGCTAGAATTCGTAAAGGCGCGATGCCCACGCTTTTGCTAGGGCCACGGCGACCGGACGAGGTGTACGATAATGAGTTTTCACTGCCAGAGGCGGAGGAACCTTTTCCAGAGACACCGGAAGAGGAAAGTTCGACTGTTGCGTCCAAAGTTCAAAAGGAGGTAACCAATTTATGCCTGCCGCCACGGGCGCCGCCTCGAAAGTCAAGCAAGTTTTGCCAAATTGATTCCTGCACAAACCACCTGACCACTGAGAACATGACACTTCACAAGTTTCCACACTCGGAGGACATGTGCCTCAAGTGGCAGCACAACACGCAAGTGCCATTTGATCCCTACTACCGCTGGCGTTACCGCATTTGCAGTGCACATTTTCATCCGGTGTGTTTGGTCAACATGCGTCTAGTCCATGGAAGCGTTCCCACTTTAAAGCTGGGTCCCAAGGCACCTTCCGAGCTGTTTGACAACGATTTCGAAGCCATCAACCTAAGATTGGATAAAAGGTTGACAGAGTCCAATGCTAACGTGTATATCAAGCATGAAAAAagggaggaggatgaggattCGATGATGTTCCTTGAGCCCGAGCTTCAGTTACATGAGGACCAAGATGATAAGGTATCAAGCTGGAACAGCAAACTGCAGTTGCCACCTGTGAAGCAAGAGAAAATGATATACAGCCAAATCAAGTCTGGCTACGACAAGTGTTCGCTGGCTCACTGCCAGCGCCAAAGGTCCCAGCATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGTCAGCAGGAGCGGTGGATGCACAACCTACACATCCGCTATGATGATCGGACACCGTGGAAATTCATGATTTGCAGCGTTCACTTCGAGCCGCACTGCGTCAGCCTAAGGAAGCTGCGACCTTGGGCGGTGCCCACACTGGAACTGGGTGACAATGTACCAGAGACAAT
CTTTACGAACGAACAGTGCGAGAAGGAGCTGGTGATCGAGCGCAGTGATCCGGATAGCGACGCGGAAGAAGAAGACGGCTTGCAGgaggacgacgaggatgatgacgacgaAGACGATGTAAGGCCCGATGTTATTGGCATAAAAAGGAGGAAACGTTCCAAAATAGATTCCACCGGCCCTCCTAGCCAGATTCCACCCTGGAAAGTCAAGCAGTGCTGCTTACCCTATTGCCGGGCCTTTCGAGGCGATGGTATCAAGCTGTTTCGGCTTCCGAACAACCGAAACTCCATTAGCAACTGGGAACGAGCCACCGGAATGGTATTCAAGGAGTCGCAACGGAACACTCGCCTGATCTGCAGCCGTCACTTTGAGCCAGAGCTGATTGGAGTCAGGCGTCTAATGCGTAACGCCATTCCCACGAAACACTTGAGCCCTCAATCTGTGGACCAGATCCGTACTAAAAAGGAGAAGAATCCTCCTCCGGCCACTATTATACCCATCTGCTGCATGGCCGATTGCCACTACAACGGAAATGCGAAGCTGCACAAGTTTCCAAGTGATCCCACTCTTCTCAAACAGTGGTGCCAGGCTCTCCGGCTCACGGATACGCAGCGGTATTTGGGCAAGCACATTTGCTCCATGCACCTGCCAATGAACAAGACGCTGAGCTGTGTCATCTGCGGTGGAGACGACGTAGAGCTGCCGATGCTTGGGTTTCCGGAAAACCGCAATCAGCGCGCCAAATGGTGTTACAATCTCAAAATTGAGGCAATACCAAAGTGGGACCACTCAAAGCATATTTGCTGCCGGCACTTTGAGTCCCATTGCTTCGACAAGCCGGGTGAGCTACGTCCAGGAGCGGCTCCCACGCTCCATCTCAATCACGACGACACAAACATATTCTTCAGCGACTATGCCACTGGTCTTCCGTCCTCGCCACTAGGCAATCGAATTAAAGACGAGCCCCTGGAATCGGAGTCCGACGAGACACTGCTGGTGTAG
- Protein Sequence
- MSQHNHNHAHPHYQYPISNNNSMGAYGGGVGGGGGSHGYFGAAGGGLNSEPLEGFQQPPNPMAPPPAPEMIIKSEPIDDLAYKSNYIDDNTPFADFSKFSDFSEDMLSPKVELTVKDESFVRNPNSFLRRKQQSDLTTAESLPVCQRCKEVFFKKQTYLRHVAESSCGIQEYDFKCTICPMSFMNAEELHEHKQQHRADRFFCHKYCGKHFGTITECEAHEYMQHEYENIVCNMCSGSFATREQLYAHLPQHKFQQRFDCPVCRLWYQTALELHEHRLAAPYFCGKYYTGGQSPSPSSQQHQHQSQTNYKLQDCHMATMEMPNAPLLKANSSNSPALPATAALNSLLQQRQANADGAAIFAASSLKNEVAVKLERSYSNSTNESSYSVQESGYNNVYGSSDSSGHGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPAPTPSSCHLQQQQQSSLHSLQQQMHSKSYQRRSAASTSSSASSAASHYVDPEMSASYHLAMSASAGGSAMINASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCANLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDDIAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEEAAVLAKWAHNLQTEANQLTSMRICNLHFEAHCIGKRMRHWAIPTLNLAGNIENLYENPEQSLLYRRRTTHLKAKLPQSSVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKANKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDDDDIYPNEAQAFVDEHCAVEGCEASKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFETDCIGPKHLYKWAIPTQELGHDDAQIELIPNPKPEDRYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRLDHLSFQERERYKICNSHFEEICIGKTRLNIGSVPTLELGHDDEEDIFQVNPAELQSNLFGRQRRLLLEGSGEQSIKQELSETEDSNKADVTATGSISKQIKIKRSSSDLKCCVHSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHNDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLQLGHDEDLIYDNEAIKAGMTEEEEAISTDFPRLKPKKELFEEEEEECEGNDGEQQHTDDLDDNADEEDKDDQYFDPLELVETFAEHRSDDEAQDYEDEEDEERVEDSPSGYDVKEEIEPPPSSSPSPLRRRHHVPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCANLHLNPDDCIGRVCIEHFQPEVLGTRKLKQNAVPTLNVGHDEPLRYSCHGVDQNLEEREPQPQHSVFRLWSLKHCRKRKLTEPPDIPLAKRRVLEMPMMKREWEMEMPMQMQMEQKKEAKKMTQTESK
SLICCISSCGRQELNQLVAFPKEKSLLRKWMHNLRLPTEIESTCLSLKRACLAHFETQLLENGKLIKEAEAVAVPTLNLGHSSWNLYRSNGICLVPNCAFNNFRNISFIDLPDNSIIRDACFSCLNLPESCEELAKLCCIHFMEAYKKFDLPNVLHPEVMTMLQSVVAELQCAVPGCNFEDADPDFQLIQFPDNKEALSQWLHNTKVPYDPSSHHSYRICTRHFKSEYLETNGPLEGAIPTLHLNHEDEIHLNTSSLPGDSNSILTPLRIKTDPAFLGSPCASASPSPRGKIRICCIPSCGQFGSSQVRLFRFPTEEQALLRWLVNTQQQPRLVDPMDLYVCQSHFEPEAIYMKHLRNWAEPTLNLGHDGHIIPNAKHNGNISDSQDTEQAMRFIRERFCSVLSCFQAGGQEEEGVRLFDYPEDMATTRKWAAACRHRSMQARSHGFKVCQLHFAKECFDPNSGALIEGAVPTLELSRDEMERQCLVAGCVKNDATGTRLRYFKIPKVAAQLEAWSNNLKVHPTDLMQGEQQYICEKHFESFCFGANKGLRSGALPTLLLGHDEEVDMLPNPESFICQNKADKCCVPGCGRVWQAGDRKFRGFPKLLAMANKWRHNLRLDEPVEQLGKLKVCSAHFEATSPNLGTNGLSVSIPTLELGHSSPDIFPAEISLKFQKRSGMPAKIYCCYPKCEEVCLSKNFSYSLPQEEHLRNAWLSHMDIEDPKDEEIARVCPLHYVILYQHSAALYPELHASRRQLLDFNYKEAWNNRRVKIVSCTIKGCDMVRPRDGVPLHGMPQSDEILQMWIDNGQFEFLEQQRYMFKVCHNHFEPCCFFDDRRLHSWSVPTLHLPGDVIHQNPTPEQWQNMINKQAAAKTHAEENEEPDPYEDVVKTEPVVKMEHIESEYEDENPEMQALEVLLEVGHVERMESYEEMDKSPAIYADSAPFRSSPIRCQYNANHCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWMHNTQVDMDEKFWWRYRICSYHFDQECFQSARIRKGAMPTLLLGPRRPDEVYDNEFSLPEAEEPFPETPEEESSTVASKVQKEVTNLCLPPRAPPRKSSKFCQIDSCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFEAINLRLDKRLTESNANVYIKHEKREEDEDSMMFLEPELQLHEDQDDKVSSWNSKLQLPPVKQEKMIYSQIKSGYDKCSLAHCQRQRSQHGVHIYKFPRSRRQQERWMHNLHIRYDDRTPWKFMICSVHFEPHCVSLRKLRPWAVPTLELGDNVPETIFTNEQCEKELVIERSDPDSDAEEEDGLQEDDEDDDDEDDVRPDVIGIKRRKRSKIDSTGPPSQIPPWKVKQCCLPYCRAFRGDGIKLFRLPNNRNSISNWERATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTKHLSPQSVDQIRTKKEKNPPPATIIPICCMADCHYNGNAKLHKFPSDPTLLKQWCQALRLTDTQRYLGKHICSMHLPMNKTLSCVICGGDDVELPMLGFPENRNQRAKWCYNLKIEAIPKWDHSKHICCRHFESHCFDKPGELRPGAAPTLHLNHDDTNIFFSDYATGLPSSPLGNRIKDEPLESESDETLLV
Similar Transcription Factors
Sequences clustered by similarity using MMseqs2
- 100% Identity
- iTF_00525910;
- 90% Identity
- iTF_00594581;
- 80% Identity
- -