Daus014104.1
Basic Information
- Insect
- Drosophila austrosaltans
- Gene Symbol
- -
- Assembly
- GCA_035045865.1
- Location
- JAWNOQ010000083.1:4015181-4029649[-]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 32 10 3e+04 -7.5 5.7 21 67 393 444 374 459 0.54 2 32 5.1e-15 1.5e-11 45.6 4.4 1 86 594 666 594 667 0.85 3 32 1.3e-14 3.8e-11 44.3 5.2 1 87 694 763 694 763 0.83 4 32 3.8e-15 1.1e-11 46.0 0.3 1 87 786 858 786 858 0.84 5 32 7.7e-16 2.3e-12 48.2 5.5 1 87 967 1037 967 1037 0.82 6 32 6.1e-15 1.8e-11 45.3 3.2 1 86 1061 1132 1061 1133 0.81 7 32 7e-13 2.1e-09 38.7 0.6 1 87 1168 1236 1168 1236 0.81 8 32 1e-10 3e-07 31.8 1.6 1 86 1279 1348 1279 1349 0.77 9 32 2.5e-16 7.5e-13 49.7 0.4 1 86 1376 1445 1376 1446 0.83 10 32 1.8e-12 5.4e-09 37.4 2.3 1 86 1467 1536 1467 1537 0.81 11 32 9e-15 2.7e-11 44.8 1.6 1 86 1564 1635 1564 1636 0.85 12 32 6.6e-13 2e-09 38.8 2.0 1 85 1716 1784 1716 1786 0.82 13 32 3.7e-12 1.1e-08 36.4 0.1 1 86 1810 1878 1810 1879 0.82 14 32 8.9e-14 2.6e-10 41.6 2.8 1 87 2005 2074 2005 2074 0.80 15 32 4.8e-11 1.4e-07 32.8 0.3 1 86 2157 2223 2157 2224 0.82 16 32 0.035 1e+02 4.4 0.0 1 58 2243 2290 2243 2313 0.73 17 32 1.4e-12 4.1e-09 37.8 0.2 1 86 2320 2389 2320 2390 0.83 18 32 9.4e-14 2.8e-10 41.5 1.2 1 87 2456 2526 2456 2526 0.82 19 32 2.6e-12 7.8e-09 36.9 1.0 1 86 2561 2632 2561 2633 0.80 20 32 3.3e-11 9.9e-08 33.3 0.4 1 87 2645 2718 2645 2718 0.78 21 32 5.8e-13 1.7e-09 39.0 0.2 1 86 2744 2816 2744 2817 0.81 22 32 3.2e-07 0.00094 20.6 0.5 1 58 2853 2904 2853 2921 0.86 23 32 8.2e-13 2.4e-09 38.5 0.1 1 87 2942 3014 2942 3014 0.81 24 32 1.1e-16 3.2e-13 50.9 2.8 1 86 3066 3137 3066 3138 0.83 25 32 5.2e-05 0.15 13.5 0.2 1 58 3169 3218 3169 3237 0.79 26 32 4.2e-13 1.3e-09 39.4 0.3 1 87 3256 3328 3256 3328 0.82 27 32 1.1e-14 3.2e-11 44.5 0.4 1 87 3471 3544 3471 3544 0.83 28 32 2.2e-12 6.4e-09 37.2 2.4 1 86 3609 3679 3609 3680 0.81 29 32 1e-14 3.1e-11 44.6 4.5 1 86 3783 3853 3783 3854 0.85 30 32 7.4e-13 2.2e-09 38.7 0.1 1 86 3934 4003 3934 4004 0.85 31 32 1.4e-11 4.1e-08 34.6 0.5 1 58 4030 4079 4030 4095 0.86 32 32 1.2e-10 3.4e-07 31.6 1.1 18 87 4096 4155 4085 4155 0.77
Sequence Information
- Coding Sequence
- ATGTCACAACATAATCCACATTATCATCCCCACCCCCATCCCCTACActatcagcaacaacagcagcagcagcagctgcatcaCCACCATACCTCtcttcaacagcaacaacataaacaaatacaacacAGCAATTGGTACTCACATGTTGCTTCCACCTCTTCCGCTCCCTACCCTCATCACCCCTCCTCGACCACCTCATCGGTGGCGGCGTCAACTTCAGGCGCTAACAACAATCACATAATGAATGCCTATGGAACACATGGATATTATGGTGCCGCTGGCGGTGGCCTCAATGTCAATGCTGTGGGTGTAGGTGTtgggggtggtggtggtggtgggggaAGTTCAAACAGTTATAACCTTGAGGCGGCCAATACAGTGGCCTATGCCCACAACCAGCTGCTGCagtatcaacaacaacaacagcatcaacaacaacatcagcaacagcatcaacaacaacaacagcagcagcaacagcaacaacaccaacatcaTCTCAATGCAAGATCTTATATGGGAGGTCATCATCATGGTATATATCCCTATATTAAAAGTGAACCCATGGAATATACCCATAACACAATGGCTCCACCTCCAGCACCTACTACAGCAACCACAGAAATGAGAATTAAATCGGAACCCATTGACGAACTGGCCTACAAATCGTCCAATTATATTGATGATAATACTCCATTTGCTGACTTTTCGAAATATAATGAATTTAGTGAGAATATGTTGAGTCCCAAAGTGGAATTAACTGTGAAAAATGAATCACCCTACGGCAAgcATCCTAATAATTATCCACGGCGTAAATTACAAACGGAACGCTCATCGGAAAATTTACCCATATGTCAACGTTGCAAAGAAGTCTTCTTCAAGAAGCAATCGTATCTACGTCATGTGGCCGAAAGTAGTTGTAGCATTCAGGAATATGAATTCAAATGCAACATTTGTCCCATGTCCTTTATGAGTGGCGAAGAATTGCAAAGGCATAAACATCTCCATCGGGCTGATAAATTCTTTTGTCATAAATATTgtggaaaatattttgatacaATTGCCGAATGTGAATCCCATGAATATATGCAACATGAATATGATAGTTTTGTTTGTAATATGTGTTCGTTGACATTTGCCACCAGGGAGCAGCTTTATACCCATTTACCACAACATAAGTTCCAGCAGCGTTACGATTGTCCCATTTGTCGTTTATGGTATCAGACGGCTGTCGAACTCCATGAGCATCGTCTGGCGGCACCTTACTTCTGTGGCAAATATTATaatcagcagcatcatcatcagtcacaacagcagcagcaacatcaccatcagcagcaacagaatcaCCAACAACAGACGCatcaaacaaattataaattgcAGGATTGTCATATGGCTACCATGGAAATGCCCACAGCACCACCGCCATCATCAGCGGTAACACATCACAAGTCTAATGCATCCGGAACATCTTCTACATTACCAGCAACGGCAGCTTTGAGTTCTCTGCTCCAACAACGTCAGGCCAATGCAGATGGTGCGGCcatgtttgctgctgctgcctcctCAACATCCCTCAAAGGGGAAGTCAACGTGAAGTTGGAACGAAGTTATAGCAACTCCACAAGTGACTCTTCTTTTGGTGGAATGCATGAATCCaactataataataataataatgcctATGGCAGTGATAATTCCATTCATGGATCTGGTGCCGTTGGTGGGCCACAAGCTCATTCCTCAACGCTGGATGACTCTGAGGATGCTCTATGCTGTGTGCCCATGTGCGGTGTAAGCAAAAGCACTAGTCCCACACTCCAGTTTTTCACATTCCCCAAAGATGACAAATATCTCCATCAATGGCTACACAATTTAAAGATGTTCCACATACCCGCCTCAAGCTATTCGACATTTCGTATCTGTAGCATGCATTTCCCAAAACGTTGCATCAATCGGTATTCGTTATGCTATTGGGCAGTGCCTACCTTCAATTTGGGACACGATGATGTCGCCAATCTCTATCAGAATCGCGAGCTAACAAATACCTTTACCACCGGCGAGGTCGCACGCTGCAGCATGCCGCACTGTAATAGCCAGCGGGGTGAGAGTAATCTCAAGTTCTATAACTTTCCCAAGgatattaaaagtttaatcAAATGGTGTCAGAATGCTCGGCTGCCTGTTCAGGCCAAGGAGCCCCGACACTTTTGTAGCCGTCACTTTGAGGAGCGTTGCATTGGCAAATTTCGTTTAAAACCCTGGGCAGTGCCCACACTACATCTGGGTGGTGCCCAATATGGGAAAATCCATGATAATCccaaaaatttgtatgtagAGGAGAAGCGCTGTTGTCTTAACTTTTGTCGTCGCAGCCGTTCAACGGATTTCAATATGTCGCTTTATCGTTTCCCAAGGAATGAGGTATTATTACGACGCTGGTGCTATAATCTGAGACTCGATCCGGGTGTATATCGGGGCAAGAATCATAAAATATGCAGTGCACACTTTATTAAAGAGGCATTGGGTTTAAGAAAACTGTCGCCGGGTGCTGTTCCTACACTTCATTTGGGTCACAATGATACCTTTAATATCTATGAAAATGAATTATGGCCACCGCCGACGCCAAGTTCCTCAACGCCACATCATcagcaccatcatcatcagcagcagcagcagcaacatggcCATGGGCATGGTCATgcacagcaacatcatcatcatcacaacAAAGCAGCGTATCATCGTCAAACGGCAGCTTCGACTTCATCATCGGCTAGCTCAACTTCGCACTACGTGGATCCGGATAATATGGGCAGCGGAGCATATCTTGGCATGGGTGGTGCTAACTCCCTTTCTGGTGGAATGAATGTCAGCGATAGCATGGACATTTGCTGTGTACCAAGTTGTGAGAGTAAGCGACATAATAGCGAGAACATCACATTCCATACGATACCCAGAAGGCCCGAGCAGATGAGGAAATGGTgtcacaatttaaaaatacccGAGGATAAAATGCACAAGGGCATGCGGATATGTAGTCTACATTTCGAGCCGTATTGCATTGGCGGCTGCATGCGTCCATTTGCAGTGCCAACTCTTCATCTGGGACATGACGATAAGGATATTCATCGTAATCCGGATGTGATTAAGAAACTTAATATAAGGGAAACTTGTTGTGTGGCAGTCTGTAAAAGGAATCGTGATCGTGATCATGCCAATCTCCATCGGTTCCCTAGCAATGTGGCCCTATTAACGAAATGGTGTGCCAATCTGCAAAGGCCTGTCCCAGATGGCAGTAAACTCTTTAACGATGCCATATGCGAAGTGCATTTCGAAGATCGTTGTTTGCGCAACAAGAGATTGGAGAAATGGGCAGTGCCGACGTTAATGTTGGGTCATGAGGATATTGCGTATCAGTTGCCCACATCCGAGCAAGTGGCAGAGTTCTATGCACGTCCAAATGCACCGAATAATGGCGAGGAGCAGGGAGAATGTTGTGTGGAAAGCTGTAAGCGTAATCCCAGTGTGGATGACATAAAACTATATCGTCCACCCGAAGAGTCAGATATACTGGCCAAATGGGCGCATAATCTTGAACTGGATGTGGCCGAGTTGCCAAATATGAGGATATGCAATCTACATTTCGAATCCCATTGCATTGGTAAACGGATGAGACCGTGGGCCATACCAACATTAAACCTATCTTCTAATATTGAGAATCTGTACGAGAATCCAGAGCACTCAATGTTGTACAAGAGGAGAACGAAGCGAGATCCAAATCGAGACGTATCCCTAGCGGCAACGAAACCAACTTGGGTTCCTAGATGCTGTTTGCCGCATTGTCGCAAGGTCCGAGCTCTGCATAATGTTCAACTCTATCGATTCCCCAAACTGAATCGTTCCACATTGGCCAAATGGGCACACAATCTACAAGTGCCAATGGTGGGCAGTGCCCAACGGAGACTCTGTTCGGCACATTTCGAACCTCATGTATTAAGTAAAAAGTGCCCTGTACCATTGGCTGTACCCACGATCGATTTAAATGCCCCGCCAGGTTACAAAATCTATCAAAATCCAGCCAAACTTAAAGCCAGCAAATTGTGCCTGCAAAGAGTTTGCATTGTGGAGAGTTGCCGTCGCACCAGGGCTCAAGGAGTCCAGCTCTTCCGTTTGCCTCACAGTCCGACGCAGTTAAGGAAATGGATGCACAACATCAAGACACGTCCACGGGCAGCTACAAGATCGCAGTATCGCATCTGTTCGATACACTTTGAATCGCATTCGTTTAATGGCAAAAGATTAAGTGCTGGAGCCATTCCCACCTTGGAATTGGGTCATGACGATGACGACATCTATCCGAATGAGGCACAAGCATTTGTGGATGAGCATTGTGTGGTCGAGAGTTGTGAATCGTCAAAGGATCAACCCGAAGTGCGTTTATTCCGTTTCCCCACCGAAGATGATGATCTTCTGTGGAAATGGTGCAACAATCTCAAAATGAATCCAGTTGATTGTGTAGGAGTGCGTATTTgtaataaacattttgaagCTGATTGCATTGGTCCCAAACACCTATTCAAATGGGCCATACCCACTATGGAGCTGGGACACGATGACAGTGAAATCGAACTGATACCAAATCCCAAGCCTGAAGAGCGATATGTTGATCCAGTTTTTAAGTGTTGTGTACCAACTTGTGGCAAGACCAGGAAATTTGATGAGGTGCAAATGAATAGTTTTCCGAAAGATCCTTTGCTCTTCCAGCGCTGGCGTCACAATCTGCGTTTGGATCACCTGAATTTTAAGGAGCGGGAACGCTACAAGATTTGCAATGATCACTTTGAGGATGTTTGCATTGGCAAAACTCGACTTAATATAGGCTCCATACCCACCCTTCAGTTGGGTCACAATGAGACGGAGGATCTGTATCAAGTCAATCCTGCGGAATTGCAAAGTAATCTCTTTGGCAGACCACGTAGATTACATGGTGGGGTTGACATTAAGCTAGAATATGCGGAGGATTCCGAGGCAGAATCAGGACTGCAGGATGTTAAACCAAATATCTATGAGATGGCCGAAGCCACCGATATAAATATCAGGCAGGTGAAGATTAAGAAATCTCTCGCTGATCTAAAGTGTTGTGTACGCAGCTGTGGTCGTAGTCGCCTGGAGCATGGTGCTCGCCTCTTCCCCTTCCCCAATGGCAAGCAACAGAATCTGAAATGGCGTCACAATCTCCAACTTGAACCGGAAGAAGTGGACAAAATGACACGCGTCTGCAGTGCGCATTTCAATCGGCGTTGCATAGATGGCAAACATCTGCGGGGATGGGCCATACCCACACAACAATTGGGACACCATCATGAACAGCCAATTTATGAAAATCCCAAAAATATTCCAGGCTTCTTTACCCCAACATGTGCCCTAAGCCACTGTAGACAGAGGCGAAGCATTGATAATGATTTGCGCACCTATCGCTATCCGAGAAGTGAGGATCTATTAGAGAAATGGCGTGCCAATTTACGTTTGGCGCCAGATCAATGCCGTGGACGGATTTGTGCTGATCACTTTGAGCCGTTGGTTAGGGGCAAACTGAAATTGAAGACTGGAGCAGTGCCCACTCTGAAATTAGGACATGATGAGGAATTAGTTTACGATAATGAAGCTATCAAAGCTAATCTAGTGGATGAAGAGGATGTCAGTTTGGAATCACCACCGCAAGTAATAACTAAAAAGGAGATTTTGGAAGAGgaagatgatgaagaagaTCTGCAAGAgcatgaggatgatgatgaggaggaggaggaggaagaaaACGATCCACCAGAAGAGGATTCACATTCCGATTATTTCGATCCCCTAGAATTGGTAGAGACATATGCCGATGATCAAGTACCAGAAGATGAATATAGTGCACCCGCTCATCAACTCCCGGCACCACCATCAGTAGCTGCTCCACCTTTTGGCAGGCGTGAAAAGGTGGCGAATAATGTAACACCCATTTGTTGTTTGAAGCATTGTCGAAAGGAACGCACTCCCACCCATCACTTGAGTACTTTTGGCTTTCCCAAAGATCATCAGCTTTTGCTGAAATGGTGTGCCAATCTTCACCTGGAACCCATGGATTGTGTGGGACGTGTTTGCATTGAGCATTTTGAAGCGGAAATGTTAGGAACACGCAAGCTAAAGCAAAATGCTGTTCCCACCATTAATGTGGGACATCAGATGCCTTTACCGTATACCTGCAACGGCCAGGAGCGTAGCGATGAGAAGGAGGATAATTCGGTTTTTCGGCTTTGGAGCCTGAAACATTGTCGCAAGAGGAAACTAATGGAACCACCAGATATTCGCCTAAAAGTGGAGAAGATGGATCCGATGGGTCTAGTGAAAGTGAAGAAggagaaaatggaaatggaggaGGAGAAAgagacaatgatgatgatgactaaACCTAAGAGATGTTGCCTTAACCAATGTGAGCAAACTGCagaattgcagaaatttccaaGAGATTTCAATTTGCTAAGAAAATGGTTGCACAACCTCAAGTTGACCCTTAACGAGGATTTGGATCCCTCACAGCTGCGTTTGTGTCTAAGGCACTTTGAAGGTCATTTGGTACGAAATGGACATCTTTCAAAAGAGGCATTACCCACTCTGGAACTGGGTCATCAGGATAAGAATATTTATAGAACAACTGTAGCAACTTCTGGTGGTTGCTTGGTGGCCAGTTGTCCATGTGCTCGTCTCAATCTCTATCGAAGTTATGCTCTACCCAAGGAGCCCTATATTAAAGAGGCGTGGCTAAACTATCTAAAGCTGCCGGCAATCACCCATGGACAACTCTGTGTAATGCACTATATGCAACTGTACGAGGAGATGCCGTTCAAGGAATTGCGTCATATCTATGAATCCATTGCCAATTCCACACAGGCTCTGAAATTGCGCTGTGCCGTACCCGGCTGTCGATCAAAGTACACGGATAATATACACTTGACCAAGTTGCCGCAAAATCAAAGCTTACTTACCAAATGGTTGCATAACACCATGTTGACCTATGATCCCAGCAAACATTCAATTTATCGCATTTGTTTGCTGCACTTTGAGCCATTCGCATTGGGTCCAGCATGTCCCAAGCCATGGGCAGTACCCACCTTGGAATTAAATTATCAGAATGACATTTATTTGAATCCTTCGAAAGAGGAATTGGCTAACATAACAGACTATCCCCGAATTAGTACTCCGCTGCAAATTAAAACAGAATTTACTTTACCATTGAGAATAAAAACGGAATTAGCCGCCTTAAGCAGTCCCAGTGTTGGTTCCACACCTAGTCCACGGGGCAAGGTCAGAATTTGTTGCATACAATCATGTCTGCAGCAAGCCAATTCCCAGTTACGTCTCTATCGTTTTCCCAATACAGAATCCGCTCTACTCAAGTGGCTGGTCAATACGCAGCAGCAACCACGTCTTGTGGATCCCACACAGTTGTATGTGTGTCAATCCCACTTCGAACTTGAAGCTATCTGTAAGAAACAATTGAGAAGTTGGGCTGTGCCCACATTAAATTTAGGACATGATGGTCATGTCATACCCAATGCCAGGCATAATGGAAATATTGCCGATAGCCAGGAAACGGAACAGGCAATGGAATTTATTAGGGAAAACTATTGTTCCGTGCTAAGTTGCTTTCAGCCAAAGAGTGAGGCTCTGCGTTTGCATCCCTATCCCAAGGATATGCCTACCATACGGAAATGGGCTGCCAATTGTAAGCATCGTTCCATGCAGGCCAGCAGTCATGGATTCCAGGTCTGTCAATTGCATTTTGAAGCAGATTGCTTTCATCCGGATACTGGTGACTTACGTGAGGGATCTGTACCCACTCTGGATCTAACAGTGACTCGGCTAAACAGCGAGTTGCGTTGCCTGGTCACTGGCTGTGTCAAAGATGAAACTCAGCCGCGACGTCGTTACTACAAACTACCTAAGCGACCTGCTTTGCTCAGTGAATGGTGCAGAAATCTCGGTTTAGTTCCTTCTGGACTCCTACATGGTGCTGATCATCACGTTTGCGAACGTCACTTTGAATCTCGTTGCTTCAACATCCACAAACAGTTGCGTTCAGGATCACGTCCGACCCTGAATTTGGGTCACAATGAAAATATTACGTTGCTGCCAAATCCAGAGATATTCTGTGATGAGATTGACGACGTCAGTACTTGCTCTGTGCCAAATTGTGGTCAATCCAAGCTAACGGATGAAACACTTCAACTAAATAGTTTGCCCAGAATGCGTAAGTTGGCGGAGAAATGGTTGCATAATCTGCATCTACCATACACTGGAAAGGAGCAACTGGCCAAGTTTCGTGTCTGCCAGAAACACTTTGATCCATCTTGCTTTGAAAACGGGTTTTTGCGTCAGGGAGCCCTGCCCACCTTGGAGTTGGGTCATGAGTCTGTGGACATTTATCAAACAGATGACCAGAGTGTGGGCAAATACAGAAAGCACCAAAAAGTATTGTCTGGCGTACGTGTATCGGGGCACGACTGTTGTTATCCCCAATGTGTGCAACAGCAAAAGAATTACCAACGAATGGTGTACGACTTGCCCAAAGAGGAGAAGCTGCGTCAGAGATGGCTACAGCATTTGGAAATTGATGaaagagaaagggaaagaCCTTTGATATTATGTCCACTCCATTATATATTCCTATACGATTATAGTGTGAAAAACTTTGAAGAACATGTTCCAAATGATCTGCTGGAAAGCAACTATGAAGATGCAAGAAATGGCTCTAGAATCCGGCTTATCAGTTGTGCTGTGCGAGGATGTGGAACACTTCAGCCACGTGATGGTGGCAGATTGCATGGTCTGCCCACGAATCCAGAGATCTTCCAGATGTGGTTGGATAACACTGAATTGGTTGTATATGAGCCACAGCGTTACATGATCAAAGTCTGTAGCAAACACTTTGAGTCTATATGTTTTACGGATATTCGCAAATTGAAATGCTGGAGTGTGCCCACTCTTCATCTACCCGGTGAGGCAGTGCATCAAAATCCAACCGAAGAGGAATGGTTAAAGATAAACGAAAGAATAGCTGTATCAGCCGCTCAGCCAGGGGAACCCTGTGAGGACAATTCAATGCTGGAACCAGTTGTTATAATGGAAGAAGAGGACTGTGTCTGTTGTGTACCCAATTGTGGACGGTCCAAGCAAATGGATAATTCCATTCAGTTTACAAGCTTCCCCAAGAACAACATGCTGGCCGAGAAATGGATTCTTAATTTTCATCTGAAAGTGACCAAAGATCAGTGGTCCAATCTTCGTGTATGCAATCGGCATTTTGAGACAACTTGTTGGGAAAACGGTCGATTGCGAAGGGGAGCCATGCCGACCCTAGAATTGGGTCATGAGAGCAGTGATATTTATCAAACCGACGAGCTAGATCTCTTCAAGAGTCGCAAGCAAACCAAGAGGACATATGGCCAGGGATGTTGTTTTCCTCAGTGCGTGGaacttttaaagaatttcCAACGTATGGTCTATGATTTGCCAAGAGAAGCTCAACTGCGACAACGCTGGTTACAATATATGGAATTGACGGAATCAGAGCAGCCATTAAAAATGTGCCCACTCCATTATATTATTCTATATGATCACAGTGTAAAAAACTTTGAGGAACATGCTCCGGAAAAGCTGcttgattttaattatgaaaatgcTAGAAATTGTGTGAGAATTCGGATTATTAGCTGTGCGGTGGAAGGATGTAATACACTGCAGCCACGAGACGGAGGTCGCATGCATGGTCTGCCACCAAGATCAGATATACTCCAGATGTGGCTGGACAACACAAGATTAGTCTTCCATGAGCATCAACGTTACATGCTAAAAGTGTGCAGTAAGCATTTTGAGCCAAAATGTTTTACGGATATTCGTAAATTGAAGAGCTGGAGTATTCCGACGCTTCATCTGCCCGATGAGGTTGTGCATCAAAATCTCACCGAAAGAGAATGGCAGCAAATGAATGAGAGACTTGCCGTGCAAAACAATCGGGAAGAGGAAAgttttgatgaaaactcaaTGCTAGAACCGATTGTTATGATGGAGCACGCCGAATCCGAAGCGGAAATGGAGGAGCAGGTCGAAACCATGCCTCAGCAAAAACTAGTGACCCATGATAAATTAAAGCACGAGTCCCAAGATGATAATGgcaataatgatgatgaaatgCAAGCATTGGAAGTACTCCTCGAAGTGGGTCATGTTGAAAAATGTTCCAGTTATGAGAAAATGGACAATAAATCACATTTACCATACTCCGAGACGAGTCCATTGAGTCCTTCGATGGGATCTATGCCACCGGGTCAACGCGGTGGTCATTATAATGCTCGTCACTGCAGTGTCCAGGGCTGTCAGATAACTGCCAATGATGTAGACGGTAATATCAAGCTGCACAAGTTCCCCACCTCCGTGGAGGCCACTGAAAAGTGGATGCATAACACCCAGGTAGATGTGGATGAGAACTATTCCTGGCGGTATCGCATTTGCAGTTACCATTTCGAACAGGAATGCTTCAATGGGGCCCGTATACGGCGGGGATCTATGCCCACATTGCATTTGGGTCCACTTCGACCCAAGGATATCTTTAGGAATGAGTTCCCGcaattggaaatggatgaAACTATGGAAGAATCAATTCCTAAAGTTACTCCCACTGTTGAACAGGAACCTGGGGCTCAGCCTATAAAGAGTAAGGTGACACAACTATGCCTGCCACGTCCTGCTCCGCCTCGAAAATCGAGCAAATTCTGTCAGATTGAAGGCTGTTCGAATCATTTGACCAGCGAGAATATGACTTTGCACAAGTTTCCCCACTCCCTGGATATGTGTGCCCGCTGGCAGCACAATACTCAGGTGCCATTTGATCCAGAGTATCGTTGGCGCTACCGCATCTGTAGTATCCATTTTCATCCAGTCTGTTTGGTCAATATGAGATTATTGCATGGCAGTGTGCCTACTTTAAAACTGGGCCCTAGAGCTCCCGCTCAACTGTTTGACAATGATTTCGATGCCATTAATATGAGATTGGATAAGAGATCACATTTGGAGCAGGGAGGTAGCAAGGTCAAGCAAGAGAGACCCCACCATCAACAGCAATCCGATGAATTCTATTTAGAGccagaaatggaaatggaagtAGATGATGAGGAGCAAGACCCAGATCAATCCCAATCCATGACATCATTTGAAAGCTGGAGACATCAACTTCGCCTACCAACTGTTAAGCAAGACAAGGTCGCCTACAATCCCATCAAATCTGGCTACGATAAATGCTCCCTAACACACTGTCAGCGTCAGAGATCCCTGCATGGCGTCCACATATACAAATTCCCACGATCGAAACGCCATCAGCAGCGATGGATGCACAATTTGCGCATACGTTATGATGAGAAGAAACCATGGAAATACATGATCTGCAGTGTTCACTTTGAACCAAATTGTATACGCCTGAGAAAACTTCGTCCATGGGCTGTGCCCACTTTGGAATTGGGTTCGAATGTGGCAGATCAGATTTACACCAATGAACAGTGCCAGGAAATGGCTTCAGATGTAAGTGAAGAAGAGGAAACCGGACCAGAAGAAAGTGGAcaagaagaagatgatgacgatgaagtAGATGACGATGGAGATACTGGTGCAGAGGCCCACATAAAGCGTGAAAGACGCCCTTGGGGAACGTCCGGAGCCGCCGGTGGTCAAATGGCTCCTTGGAAAGTAAAACAATGTTGTCTGCCCTATTGTCGTCGACCACGAGGGGATGGTATCAAACTATTCCGACTGCCCGGCAATCCTACTTCCATACGTAATTGGGAAAAGGCCACGGGGATGACATTTAAAGCATCGCAACGGAACACACGACTCATTTGTAGTCGTCACTTTGAGCCGGAATTGATGGGGGTACGCCGTTTGATGCGGAATGCCATACCCACCAGACATCTATATCACCAAAGGGACAGCTATAGCCCAGAATTGGTGATACCCACAAACACTCCAACTCCTATTGGTCCCCGTTGCTGCATTCCTGATTGCCCCCCACACGATGGGTCGTCTCAACTTCATCGATTTCCCAGTGatcCACAATTGTTGAAGCAATGGTGTGAATCTCTTAAACTTACGGATTTCCAACGCTATAGTGGACAATACGTTTGCTCTAATCATCTTCCCGCCCAGGATTTAGCATGCATTATCTGTGGCGTGGATGATATACAATTGCCGCTTCTTGATTTTCCCGAGAATCGCAATTATCGGGCTAAATGGTGTTATAATCtcaaaattgaaacaataCCCAAATGGGACAACTCCAAGCATATTTGCTCGAAACACTTTGAATCCTATTGCTTCAGTCAGCAAACCGGTGAACTGCATCCAGAGGCAGCACCTACATTGCATTTAAATCACAATGATACGAATATATTCCTCAACGAGTATGCCATAGAACAGCATTCTTTGATGAGGATTAAAGACGAGCCCTTGGACAACGATGAGATGTTGTTGGCTTAA
- Protein Sequence
- MSQHNPHYHPHPHPLHYQQQQQQQQLHHHHTSLQQQQHKQIQHSNWYSHVASTSSAPYPHHPSSTTSSVAASTSGANNNHIMNAYGTHGYYGAAGGGLNVNAVGVGVGGGGGGGGSSNSYNLEAANTVAYAHNQLLQYQQQQQHQQQHQQQHQQQQQQQQQQQHQHHLNARSYMGGHHHGIYPYIKSEPMEYTHNTMAPPPAPTTATTEMRIKSEPIDELAYKSSNYIDDNTPFADFSKYNEFSENMLSPKVELTVKNESPYGKHPNNYPRRKLQTERSSENLPICQRCKEVFFKKQSYLRHVAESSCSIQEYEFKCNICPMSFMSGEELQRHKHLHRADKFFCHKYCGKYFDTIAECESHEYMQHEYDSFVCNMCSLTFATREQLYTHLPQHKFQQRYDCPICRLWYQTAVELHEHRLAAPYFCGKYYNQQHHHQSQQQQQHHHQQQQNHQQQTHQTNYKLQDCHMATMEMPTAPPPSSAVTHHKSNASGTSSTLPATAALSSLLQQRQANADGAAMFAAAASSTSLKGEVNVKLERSYSNSTSDSSFGGMHESNYNNNNNAYGSDNSIHGSGAVGGPQAHSSTLDDSEDALCCVPMCGVSKSTSPTLQFFTFPKDDKYLHQWLHNLKMFHIPASSYSTFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCNSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSTDFNMSLYRFPRNEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSSSTPHHQHHHHQQQQQQHGHGHGHAQQHHHHHNKAAYHRQTAASTSSSASSTSHYVDPDNMGSGAYLGMGGANSLSGGMNVSDSMDICCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDKDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLMLGHEDIAYQLPTSEQVAEFYARPNAPNNGEEQGECCVESCKRNPSVDDIKLYRPPEESDILAKWAHNLELDVAELPNMRICNLHFESHCIGKRMRPWAIPTLNLSSNIENLYENPEHSMLYKRRTKRDPNRDVSLAATKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTIDLNAPPGYKIYQNPAKLKASKLCLQRVCIVESCRRTRAQGVQLFRLPHSPTQLRKWMHNIKTRPRAATRSQYRICSIHFESHSFNGKRLSAGAIPTLELGHDDDDIYPNEAQAFVDEHCVVESCESSKDQPEVRLFRFPTEDDDLLWKWCNNLKMNPVDCVGVRICNKHFEADCIGPKHLFKWAIPTMELGHDDSEIELIPNPKPEERYVDPVFKCCVPTCGKTRKFDEVQMNSFPKDPLLFQRWRHNLRLDHLNFKERERYKICNDHFEDVCIGKTRLNIGSIPTLQLGHNETEDLYQVNPAELQSNLFGRPRRLHGGVDIKLEYAEDSEAESGLQDVKPNIYEMAEATDINIRQVKIKKSLADLKCCVRSCGRSRLEHGARLFPFPNGKQQNLKWRHNLQLEPEEVDKMTRVCSAHFNRRCIDGKHLRGWAIPTQQLGHHHEQPIYENPKNIPGFFTPTCALSHCRQRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICADHFEPLVRGKLKLKTGAVPTLKLGHDEELVYDNEAIKANLVDEEDVSLESPPQVITKKEILEEEDDEEDLQEHEDDDEEEEEEENDPPEEDSHSDYFDPLELVETYADDQVPEDEYSAPAHQLPAPPSVAAPPFGRREKVANNVTPICCLKHCRKERTPTHHLSTFGFPKDHQLLLKWCANLHLEPMDCVGRVCIEHFEAEMLGTRKLKQNAVPTINVGHQMPLPYTCNGQERSDEKEDNSVFRLWSLKHCRKRKLMEPPDIRLKVEKMDPMGLVKVKKEKMEMEEEKETMMMMTKPKRCCLNQCEQTAELQKFPRDFNLLRKWLHNLKLTLNEDLDPSQLRLCLRHFEGHLVRNGHLSKEALPTLELGHQDKNIYRTTVATSGGCLVASCPCARLNLYRSYALPKEPYIKEAWLNYLKLPAITHGQLCVMHYMQLYEEMPFKELRHIYESIANSTQALKLRCAVPGCRSKYTDNIHLTKLPQNQSLLTKWLHNTMLTYDPSKHSIYRICLLHFEPFALGPACPKPWAVPTLELNYQNDIYLNPSKEELANITDYPRISTPLQIKTEFTLPLRIKTELAALSSPSVGSTPSPRGKVRICCIQSCLQQANSQLRLYRFPNTESALLKWLVNTQQQPRLVDPTQLYVCQSHFELEAICKKQLRSWAVPTLNLGHDGHVIPNARHNGNIADSQETEQAMEFIRENYCSVLSCFQPKSEALRLHPYPKDMPTIRKWAANCKHRSMQASSHGFQVCQLHFEADCFHPDTGDLREGSVPTLDLTVTRLNSELRCLVTGCVKDETQPRRRYYKLPKRPALLSEWCRNLGLVPSGLLHGADHHVCERHFESRCFNIHKQLRSGSRPTLNLGHNENITLLPNPEIFCDEIDDVSTCSVPNCGQSKLTDETLQLNSLPRMRKLAEKWLHNLHLPYTGKEQLAKFRVCQKHFDPSCFENGFLRQGALPTLELGHESVDIYQTDDQSVGKYRKHQKVLSGVRVSGHDCCYPQCVQQQKNYQRMVYDLPKEEKLRQRWLQHLEIDERERERPLILCPLHYIFLYDYSVKNFEEHVPNDLLESNYEDARNGSRIRLISCAVRGCGTLQPRDGGRLHGLPTNPEIFQMWLDNTELVVYEPQRYMIKVCSKHFESICFTDIRKLKCWSVPTLHLPGEAVHQNPTEEEWLKINERIAVSAAQPGEPCEDNSMLEPVVIMEEEDCVCCVPNCGRSKQMDNSIQFTSFPKNNMLAEKWILNFHLKVTKDQWSNLRVCNRHFETTCWENGRLRRGAMPTLELGHESSDIYQTDELDLFKSRKQTKRTYGQGCCFPQCVELLKNFQRMVYDLPREAQLRQRWLQYMELTESEQPLKMCPLHYIILYDHSVKNFEEHAPEKLLDFNYENARNCVRIRIISCAVEGCNTLQPRDGGRMHGLPPRSDILQMWLDNTRLVFHEHQRYMLKVCSKHFEPKCFTDIRKLKSWSIPTLHLPDEVVHQNLTEREWQQMNERLAVQNNREEESFDENSMLEPIVMMEHAESEAEMEEQVETMPQQKLVTHDKLKHESQDDNGNNDDEMQALEVLLEVGHVEKCSSYEKMDNKSHLPYSETSPLSPSMGSMPPGQRGGHYNARHCSVQGCQITANDVDGNIKLHKFPTSVEATEKWMHNTQVDVDENYSWRYRICSYHFEQECFNGARIRRGSMPTLHLGPLRPKDIFRNEFPQLEMDETMEESIPKVTPTVEQEPGAQPIKSKVTQLCLPRPAPPRKSSKFCQIEGCSNHLTSENMTLHKFPHSLDMCARWQHNTQVPFDPEYRWRYRICSIHFHPVCLVNMRLLHGSVPTLKLGPRAPAQLFDNDFDAINMRLDKRSHLEQGGSKVKQERPHHQQQSDEFYLEPEMEMEVDDEEQDPDQSQSMTSFESWRHQLRLPTVKQDKVAYNPIKSGYDKCSLTHCQRQRSLHGVHIYKFPRSKRHQQRWMHNLRIRYDEKKPWKYMICSVHFEPNCIRLRKLRPWAVPTLELGSNVADQIYTNEQCQEMASDVSEEEETGPEESGQEEDDDDEVDDDGDTGAEAHIKRERRPWGTSGAAGGQMAPWKVKQCCLPYCRRPRGDGIKLFRLPGNPTSIRNWEKATGMTFKASQRNTRLICSRHFEPELMGVRRLMRNAIPTRHLYHQRDSYSPELVIPTNTPTPIGPRCCIPDCPPHDGSSQLHRFPSDPQLLKQWCESLKLTDFQRYSGQYVCSNHLPAQDLACIICGVDDIQLPLLDFPENRNYRAKWCYNLKIETIPKWDNSKHICSKHFESYCFSQQTGELHPEAAPTLHLNHNDTNIFLNEYAIEQHSLMRIKDEPLDNDEMLLA
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00604131;
- 90% Identity
- iTF_00577817;
- 80% Identity
- -