Dkan013365.1
Basic Information
- Insect
- Drosophila kanekoi
- Gene Symbol
- -
- Assembly
- GCA_037075305.1
- Location
- JBAMCE010000577.1:7846087-7860149[+]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 29 4.1 5.1e+03 -1.6 2.4 49 64 341 360 320 375 0.57 2 29 7.5e-15 9.4e-12 45.6 4.3 1 86 566 638 566 639 0.85 3 29 4.7e-15 5.8e-12 46.3 4.6 1 87 666 735 666 735 0.83 4 29 1.1e-15 1.4e-12 48.2 0.4 1 87 757 829 757 829 0.85 5 29 1e-15 1.2e-12 48.4 5.7 1 87 926 996 926 996 0.83 6 29 1e-14 1.3e-11 45.1 3.3 1 86 1020 1091 1020 1092 0.82 7 29 7.2e-13 8.9e-10 39.2 1.0 1 87 1127 1195 1127 1195 0.80 8 29 1.4e-10 1.8e-07 31.9 1.2 1 86 1244 1313 1244 1314 0.76 9 29 7.7e-15 9.5e-12 45.6 0.1 1 86 1341 1410 1341 1411 0.81 10 29 9.1e-14 1.1e-10 42.1 0.8 1 86 1432 1501 1432 1502 0.81 11 29 1.2e-14 1.5e-11 44.9 1.8 1 86 1529 1600 1529 1601 0.86 12 29 4.7e-13 5.8e-10 39.8 1.3 1 85 1673 1741 1673 1743 0.82 13 29 1.4e-12 1.8e-09 38.3 0.1 1 86 1766 1834 1766 1835 0.81 14 29 5e-14 6.3e-11 42.9 0.7 1 87 1994 2063 1994 2063 0.80 15 29 1.1e-11 1.4e-08 35.4 0.1 1 62 2118 2177 2118 2194 0.79 16 29 0.003 3.7 8.4 0.1 1 58 2199 2249 2199 2276 0.79 17 29 6.6e-13 8.2e-10 39.4 1.5 1 87 2288 2358 2288 2358 0.86 18 29 4.4e-14 5.5e-11 43.1 1.1 1 86 2417 2486 2417 2487 0.82 19 29 7.8e-13 9.7e-10 39.1 0.5 1 86 2522 2593 2522 2594 0.82 20 29 8.2e-13 1e-09 39.1 2.4 1 87 2604 2675 2604 2675 0.82 21 29 2.1e-12 2.7e-09 37.7 0.0 1 86 2698 2768 2698 2769 0.81 22 29 0.00025 0.31 11.9 0.1 1 58 2800 2850 2800 2872 0.82 23 29 1.1e-14 1.3e-11 45.1 0.3 1 86 2889 2961 2889 2962 0.80 24 29 1.1e-13 1.4e-10 41.8 0.2 1 86 3102 3174 3102 3175 0.84 25 29 4.4e-14 5.5e-11 43.1 1.5 1 86 3240 3310 3240 3311 0.82 26 29 5.7e-14 7.2e-11 42.8 4.5 1 86 3423 3493 3423 3494 0.85 27 29 8.5e-13 1.1e-09 39.0 0.1 1 86 3585 3654 3585 3655 0.84 28 29 8.7e-10 1.1e-06 29.4 2.2 1 58 3672 3720 3672 3736 0.86 29 29 6.6e-07 0.00082 20.1 2.7 19 87 3738 3795 3726 3795 0.75
Sequence Information
- Coding Sequence
- ATGTCACAACACAACAATCAACCGCATtcgcatcagcatcatcactactatcagcagcagcagcaccacctccagcaacaacaacaacaccaccaccagcagcagcagcagcagcaacagcagcagcagcatttgcagcataaacaaatacaacagcagcacagtTGGTACTCACATGTTGCTTCCTACCCGCCCCACCAACCGCACGCCGCCGCTGCCTATGCGGCGCCCtgcaagaataacaacaacaataacaacaacaatattatgaATGCATACGGCACGGGCGCTGCCAGCGCGCACTATTATGGCGCTGCTCCTACTGCTGGGGCTGGGGTGGGCTATAACCTTGAGGCCAATACTGTGGCCTATGCGCACAACCAGCTGCtgcaataccaacaacaacagcagcagcagcaacagctcagtCAACGCTCGTATATGCCGCACGGTTTAATGCATGGCTCGTATCCTTATATCAAGAGCGAGCCATTGGAACTGCCAGATGATAGACAACGccatcaacaacatcaacatcaacagcagcaacaacaacatttccaGAACCCAATGGCCCCGCCACCAGCGCCCCCCGTCAACCGTCACACGCTCGATGCCAGCGGtgaaatgataataaaatcGGAACCCATTGACGAACATGCGTTCAAGTCCAACTATATCGATGacaatataccctttgccGATTTTAGTAAGTTTTCCGAATTTGGCGACGACATGCTAAGTCCCAAGGTTGAGTTAACGGTTAAGGACGAGGCTTATGGCAACCAAAAGaaccCGCTCAGCTATCCGCGACGCAAGCTGCAAAACGAGCGACCTTCGGAGAATCTGCCCATTTGCCAACGCTGCAAGGAGGTCTTCTTCAAGAAGCAGGTCTATCTACGTCATGTGGCcgagagcagctgcagcatacACGAGtatgaatttaaatgcaacatCTGCCCCATGTCCTTCATGGGCGCTGAGGAGCTGCAGAAGCACAAGCAACTGCATCGCGCGGACAAGTTCTTTTGCCACAAATACTGTGGCAAGCACTTCGACAACATCGCCGAATGCGAGTCGCATGAGTACATGCAGCATGAATACGATAGCTTTGTGTGCAATATGTGCTCTGTAACGTTTTCAACTCGGGAACAGCTTTATGCTCATCTGCCGCAGCACAAGTTTCAGCAGCGTTACGATTGCCCTATTTGCCGCTTGTGGTATCAAACGGCACTAGAGTTGCACGAGCATCGACTGGCGGCGCCCTACTTTTGTGGCAAGTATTATCCAGCagcacatcagcagcagcagcaacaacaacatcaacagcagcaacacccgcagcagcagcaaggcaACTACAAACTGCAGGACTGCCACATGGGCACCATAGAAATGACAGCACCGCACCACAAGACAAATGCTTTGCCTGCAACGGCGGCGCTTAGTTCCttgttgcagcagcggcaagcgAATGCAGATGGTGCCGCGCTGTATGCCTCGACCCTGAAGAGCGAGGCTAATGTCAAGTTGGAGCGCAGCTATAGCAACTCCACTAGCGAGTCTGGCTACAGTCTGCACGAGAGTAGTTATAATAATGCCTACGGCAGCGATAATTCGTTGCATGGTGGCGGCGCAGCAATTGGTGGTCCACAGGCACACTCCTCCACGCTGGACGAATCGGAGGATGCGCTGTGCTGTGTGCCGCTGTGCGGTGTGCGCAAAAGCACCAGTCCCACGCTGCAGTTCTTTACGTTTCCCAAGGATGAAAAGTATCTGCATCAGTGGCTGCACAATCTCAAAATGTTCCATATTCCGGCCTCAAGCTATGCCAGCTTTCGTATCTGCAGTATGCATTTTCCTAAGCGATGCATCAATCGTTACTCGTTGTGTTATTGGGCGGTGCCCACATTCAACCTGGGTCACGATGATGTAGCTAATCTGTATCAGAATCGCGAGCTGACTAACACCTTCACCACCGGCGAGGTGGCGCGCTGCAGCATGCCCAACTGCACTAGCCAACGCGGCGAGAGCAATCTcaagttttataattttcccaAGGACATCAAGAGCCTGATCAAGTGGTGCCAGAATGCACGTTTACCCGTCCAGGCTAAGGAGCCGCGTCATTTTTGCAGTCGCCATTTTGAGGAGCGCTGCATTGGCAAGTTCCGGCTGAAGCCGTGGGCTGTACCTACTCTACATCTGGGCGCCCAGTACGGCAAGATTCATGACAATCCCAAAAACCTGTATGTGGAAGAGAAACGCTGCTGCCTTAACTtttgccgtcgcagtcgctctTCGGACTTTAACATGTCATTGTATCGCTTTCCCAGAGATGAAGTACTGTTGCGACGCTGGTGCTATAATCTGCGCCTCGATCCGGCTGTCTATCGCGGCAAGAACCACAAAATTTGCAGCGCTCACTTCATTAAGGAAGCCCTCGGATTGCGCAAACTGTCACCAGGCGCTGTGCCCACACTGCATCTGGGCCACAATGACACCTTCAACATATACGAGAACGAATTATGGCCACCACCGACGCCCTCTACACCCACCCACAATcatcagcagcaattgcagcagcatcagctgcagcagcatcaacagcaactgcagcaacatgtGCATCATAAATATCAGCGTCATTCGGCGGCATCCACATCATCGTCGGCCAGCTCGGCGTCGCACTATGTGGATCCAGAGCTGAGTGCATCCTACATGGGCTTGAGCGCTTCATCCTCTGGCCTGAATGTCAGCGACAGCATGGACGTGTGCTGTGTGCCCAGCTGCGAGAGCAAACGGCACAACAATGAGAACATCACATTCCATACAATACCCAGGCGGCCAGAGCAGATGCGTAAATGGTGCCACAATCTGAAGATACCCGAGGACAAGATGCACAAGGGCATGCGTATATGCAGTCTTCACTTTGAACCCTATTGCATTGGCGGTTGTATGCGGCCGTTTGCGGTGCCTACACTGCATCTGGGCCACGATGACGAAGACATTCATCGTAATCCGGATGTGATTAAGAAGCTGAACATTCGCGAAACCTGTTGCGTTGCCGTTTGCAAGCGCAATCGAGATCGGGATCATGCCAATCTGCATCGTTTCCCCAGCAATGTCGCCTTGCTGACCAAGTGGTGCGCCAATCTGCAGCGACCCGTGCCGGATGGCACCAAACTTTTCAATGATGCCATCTGCGAGGTGCACTTCGAGGATCGCTGTCTGCGCAACAAGCGGCTGGAGAAGTGGGCAGTGCCCACACTTGTGCTAGGCCACGAGAATATTGCCTACCCACTGCCCACGCCCGAGCAGGTGGCCGAGTCCTATGCGCGTCCCAGTGCGCCAAACAATGGCGAGGAGCAGGGTGAATGCTGCGTGGAGACCTGTAAGCGCAATCCTAGCGTAGATGACATCAAGCTCTATCGTCCGCCCGAAGAATCacaggtgcttgccaaatgGGCGCACAATCTGCAGCTGGACATTGCCCAGCTTCCTAATATGCGAATCTGTAATTTACACTTTGAATCCCACTGCATTGGCAAACGCATGCGACCCTGGGCCATACCCACCCTCAATTTGGCCAGAAACATTGAGAATCTCTTTGAGAATCCCGAACACCATATGCTCTACAAGCGTCGCGCGCATCTCAACGCGGACAGAGCCACCGCTCGCAGCGCTGGCGCTGACGGAGCCACCATGAAGGCCTCTTGGGTGCCACGCTGTTGCCTGCCGCACTGCCGCAAGGTGCGCGCATTGCACAATGTCCAGCTGTATCGCTTTCCGAAGGTCAATCGCACAACGTTGGCTAAATGGGCGCATAATCTACAAGTGCCGCTGGTCGGCAGCGCCCAAAGGCGTTTATGCTCCGCCCACTTTGAGCCTAATGTGCTGAGCAAGAAATGCCCGGTGCCCTTGGCGGTGCCCACGCTGGATCTCAATACGCCACCGGGCTAcaagatttaccaaaacccagCCAAGGTGAGGGCTAACAAGCTGTGTTGGCAGCGCGTCTGCATTGTGGAGAGCTGCCGTCGACAGCGGGCACAGGGCGTACAGCTCTTCCGGCTGCCGCACAGTCGCAGCCAGTTGCGCAAGTGGATGCACAATCTTCGCATGCTGCCGAGAGGCGCCATGCGGCAACAGTATCGCATCTGCTCGCTGCACTTTGAGGCGCACTCGTTTAACGGCAAGCGTCTGAGCACAGGCGCTATTCCAACGCTGGAGCTGGGCCATCAGGATGACGATATTTATCCCAATGAGGCGCAGTCCTTTGTCGAGGAACACTGCGCCGTAGAAGGCTGCGATGCGTCCAAGGAGCAGCCAGATGTGCGTCTCTTCCGCTTTCCCAATGACGACGAGGATCTGCTCTGGAAGTGGTGCAACAATCTCAAAATGAATCCCGTTGACTGCTATGGCATGCGCATCTGCAACAGGCACTTCGAACCGGACTGCATTGGGCCCAAACACCTGTACAAGTGGGCCATACCCACTTTGGTTTTGGGGCACGATGATAGCCAGATCGAGCTGATACCCAATCCCAAGCCGGAGGAACGCTATGCTGATCCTGTGTTCAAGTGCTGTGTGCCCACCTGCGGCAAAACGCGTAAATTTGATGAGGCGCAAATGAATAGCTTTCCCAAGGACCCATCGCTCTTCCAGCGCTGGCGCCACAATCTGCGGTTGGAACATCTCAACTTTAAGGAACGCGAGCGCTACAAAATTTGCAATGCGCACTTTGAGGACATTTGCATTGGCAAAACGCGTCTCAATATTGGCTCCATACCCACGCTAGAGCTGGGCCATGAAGTGACCGAAGATCTGTATCAGGTTAATCCCGAGGAGCTGCAGAGCAACTTGTTTGGACGCCCGCGGCGCGTGCATGAGAATCAGCGTCTGAGCATCAAGCAGGAGCTGGATGAGGACATCAAGCCGGACATAAGTATGTCAGAGGCCACGGACACAATCACAACACAGgTGAAGATCAAGAAGTCTGTGTTAGACTTGAAGTGTTGTGTGGCCAGCTGCGGTCGCAGCCGGCTGGAGCATGGTGCTCGCCTGTTTCCCTTTCCCActggcaagcagcagcagaccaaGTGGCGCCACAATCTCCGCCTTAGCGCCGCCGATGTGGACAGAACAACGCGCGTTTGCAGCGCTCACTTCAATCGACGCTGCATCGATGGCAAACAGCTTCGAGGCTGGGCAATTCCCACACAGCAGTTGGGCCACCAGGAACAAAACATATATGAGAATCCAAAGAATATACCGGGCTTCTTTACGCCCACCTGTGCGTTGGCGCACTGTCGTAAGCGACGAAGCATTGACAATGATTTGCGCACCTACCGCTATCCGCGCAACGAGGAGCTGCTCGAGAAATGGCGCGTGAATCTGCGTCTGGCTCCGGATCAGTGTCGAGGACGCATTTGCGCGGATCACTTTGAGCCGATGGTGCGCggcaagctgaagctgaagacGGGCGCGGTGCCCACGCTGAAGCTAGGCCATGATGAGGGCGTAGTCTTTGACAATGAGGCCATTAAAGTTGGAatgcagcaggaggaggaagaggaggaggaggcgggcAGCTTGGAGTCGGAGGGAAAGATAAAAATTGAGAAGCAGGAGAAGGAAACCCTGGAGCCGGAGTTGGAaaatgatgatgaggatgaagatgccgagcagcagcagaaagtgGAGTTTCCTGATGACGAtatggagcaggagcaggaacagAATGAGGAGGAAGAGGAGCTGCAGGATCATGGCTATTTTGATCCCCTAGAGCTAGTGGAAACCTTTGCCGAACAGCACAGCGATGACAATTCCGCTGACAATTATCATCTTgaagctgatgatgatgatgatgaagaagatATACCTGGCAATGatgatgagctgctgctgccagacaCTGTTCCAATACAGTTGCCGCCACGCCGCGAAAAGGCGGTGAACAATGTGACGCCTATTTGTTGCTTGAAACATTGCCGCAAGGAGCGCACCGCAAGTCATCAGCTAAGTACTTTTGGCTTTCCCaaggatcagcagcagctgcttaaatGGAGTGCCAATCTGCAGCTGGATCTCGTCGATTGTGTGGGACGCGTGTGCATCGAGCATTTCGAGGCGGAGATGCTCGGCACTCGTAAGCTAAAGCAGAATGCGGTGCCCACTTTGAATCTGGGACATGCCACGCCGTTGAGCTATAGCTGCAATGGCCAATCATTGAGCATATACGATGCACAGCCGCAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGCCGCAAAAGGAAGCTGCTGACGATGCCTCCGGATCCGGCGATGACTAAACGACGTTGTTGCCTGCCCAGTTGTGGTAAGGAGCCGGAGCTGCATGGCGTTCAATTGAAGCGACTGCCCAAGGAtcgtctgctgctgcgcaaGTGGCTGCACAATCTGAAGCTGCCGCCGCACATGGACACCAAACACTCGTTTCTTTGCGAGGAGCACTTTGAGCCACATGCGACGCTGCCTACCCTGAAGCTGGGCCACGCGGCTAGCAACATTTATCGCAATGGAAGTTCGGCCTTATCCAGTGGCTGCCTAGTGCCCAGCTGTCCGTGTGCACGGCTCAATCTATATCGCTGCTATGCTCTACCCGAGCATCCGCAGGTGCAGCAGGCCTGGCTGAAGTGGCTGCAACTGCCGCCGCCACAGCTGGCTagccttgcccagctctgCGTCATGCATTATATGCAGCTATTTGAGCAGGTGCCGCTACCCGCGGATCTGCCAGAGTCTGTGCAGCGCCAACTGCAGGAAACCTACGAACAAATATCCAGCTCCAGCATGGCCATGAAACTGCGCTGTGCTGTGCCCGGCTGCTACTCCAAATACACGGACAACGTGCGTCTCACCAAGCTGCCCGTGTGCCCGCAAATCTGCGCCCAGTGGGTGCACAATACCAAAATTAAGTACGATCCGGAGCGCCATTACATGTATCGCATCTGCATGCGCCACTTTGAGCCGCAATGCCTGGGTGCAGTACGTCCTAAGCTGTGGGCGGTGCCTACGCTGCATCTTAACCATAGCGATGCggatatatatcaaaataccATGTTGGATAGCTCTGATGCCATGCCGGTGGCCGAGTCTGTACCGCTGACTTTGCCGCTGCGCATCAAGACAGAGCTGCCATTAACACTATCAGTCAGTCCCAGTGCCAGTCCCAGTCCACGCGGCAAACAGCGCACCTGTTGCATTCCCACTTGCGGCCAGCAGGCCAATGCCCTAACGCGTCTGTTTCGCTTTCCCAGCGCCGAGACGGCCCTGCTTAAATGGCTGGTGAacacacaacagcagccacgccTCGTTGATACGCAGAATCTATTTGTATGCCAGCGTCACTTCGCGGCGGAGGCGATTTGCAAGAAGCAGCTACAAAGTTGGGCAGTGCCCACCCTAAGTCTAGGCCATGAGGGCCACATCATACCGAATGCCAAGCACAATGGCAATATTGCCGACAGCCAGGAGAACAAGCAGGCGCTGCAATACATCTGGGCCAATTACTGCTCGGTGCTCACCTGCTTCCAACAGCGCAGCGAGCAGGTTCGTCTCTACGCCTATCCTACAGATCGGCCCACTATACGAAGGTGGGCGGCAAACTGCAAGCATCGCTCCATGCAGGCCAGCAGCGATGGGTTTCAGGTCTGCCAGTTACATTTTACACCCGACTGCTTTGATCCCGATACTGGGGATCTGAAGGAAGACGCGGTGCCCACATTGGAGCTGAGCCGGCCTGTCCATGAGTTACGCTGTTTGGTAAATGGCTGCGTTAGGGAGAAGGATGCAGCGCGATGTCGGTTTTTCAAAGTGCCCAAGCGTGCCTCACAGTTGGAGGATTGGTGTCACAATCTGCGCATCGATGCTGCGTCAATAAGCGGCCAGGAGGTCCACGTATGTGAGCGTCACTTCGAGGCGCACTGTTTCAGTGCGTACAAGCTGCGTCCGGGTGCACGACCTACTCTTCATTTGGGCCACGACGATGAGCTGGATTTGTTGCCCAATCCGGCAAAGTGGGAGGAGGATGTGAATGTATGCTTTGTGCCTAGCTGTGGACGGTCCAAAGATGTGGATAATGTGGAACTATTCGGGCTGCCCAGGATAAGGGGGGTCTTGGAGAAATGGCTGCATAATTTCCGCCTCGACCCGAGCAGGGAGCAGCTGCAAGGCATGCGGATATGCAGCGCACATTTCGAGGCCAGTTGCATAGAGAACGGCCGTCTACACTTAAGTTCGGTGCCCACGCTGCAGCTGGGCCACGATGAGTTGGACAATATTCATCAAAGCGCGGAACTGCCTTCATCGCAGCTTAAAGGCAAACGATTAGCCATGAACTACGACTGCTGCTATCCACAGTGTATGGAGCTGCAGAAGAGCTATCAAAGAATTGCATATGAGCTACCCCAGCAAGAGGCATTGCGTAACTTGTGGATGTCGTATCTGGGTCTGGAGCAGCAAAATCTGCAACCGCTTAAGCTCTGTCCGCTGCACTTGATAATGCTATATGAACACAGTGTCAACCATTTTCCAGAGCATTCATcggaggagcagctgctggacgCCAATTACGAGGCTGCGCGAAATAGCGTGCGCATACGGATTATCAGCTGTGCGGTGCGTGGCTGCAGGACACTCAAACCACGCGACGACTACCGCCTGCACGCCATGCCTACGCGTCGGGATGTACTCCGGATGTGGCTAGACAACATGCAGCTTGTGTTCTACGAGCAGCAGCGTTATATGTACAAGGTATGCAGCAGACACTTTGAGGCCACCTGCGTAACAGAGACTACTCGCCGTCTAAAACCCTGGAGCATGCCGACGTTGGAGTTGCCGGAACGTGACCCAGACGCTCCGCCGTTGCATCAGAATCCCACGGAGGAGGAGTGGCAGCGCATGAATGAGCAAATAGGCAGCAGCGAGGCATTGGCTTTGTTAGAGCCCGCGTTTAAGCTGGAGCCGGAGCCCATTGTCAAGCAGGAGCTGCACTCTATTGTCAAGCTGGAGCCGAAGCCGCAGCCAGAACAGCATGAGGGGGAGGAGTACGAGGCTAACGATCAACAGCAAGCGCTTGAGGTGCTGCTCGAAGTGGGTCACGTTGAGAAGTGCACCACATACGAGCAAATGGACACAAAGCCAATTATAGGCTATGCTGATACCCTGTCACATAATTCACTAGGCCCAACGACAACAGTTGGCAGCGCCTGTATTGTCAACGGTAACGGACTCACCTACAGTGCGCGCCACTGCAGCGTGCGGGGTTGCGATGTGACCTCTCTGGATGTAAATGACAGTCTCAAGCTACACAAGTTTCCCACATCGCTGGATGCCATGGAAAAATGGATGCACAACACCCAGGTGAATGTGGACGTCAACTTTGCGTGGCGGTTTCGCATCtgcagtttgcattttctacCCGAGTGCTTTAATGGTTCGCGTATCAGACGTGGAGCCATGCCCACGCTGCGTCTGGGATCACGCCGCCTAGGGGATATCTATGACAATGAGTTCAATGTGCAGCCCGAGCAGACGAGTGTGGATCAGCCGGTTGAGGCGTCGGCAGACGCTATGGTGCCCACTGAACCGCACGATGGCGCGACGGAGTTTAATATAAATCTGCATTTGCCCTGCCCCGCACCACCGCGCAAGTCCAGCAAATTCTGTCAGATCGATGGCTGCTCTAATCATTTGACCAGCGAAAATCTTACTCTGCACAAGTTTCCACACTCGGCGGACATGTGCGCCAAGTGGCAGCACAATACACAGGTGCCGTTCGACCCGGAGTACCGCTGGCGTTATCGTATCTGCAGCGCACACTTCGAGCCAATCTGCCTGGGAAACATGCGGCTGATGCATGGCAGCGTGCCTACACTGAAACTGGGCGCCCGGGCGCCCAAGCAGCTCTTCGGCAATGACTTTGCGGCGTTAAGCTTGCGTCTAGATAAGGAAAAGCGCAGTGCCGACCAGAGCTTGCCCGTGAAGCTGGAGCAAGTGGAAGATGATCAAGAGCAGTATGATCAGGAGGATCTTAGCATGCTGGTAccagagctgcagctgcacgaGGGCGACGACGAGCATGAAGACAATCAGTTAAATTACACCAACAGTTGGACAGATTCGCAGCAAcaggtgcagctgcagctacgtCTGCCCAGCATTAAACAGGAGAAGGGCACCATCTATAATCCTGTCAAGTCTGGCTATGACAAGTGCTCGCTAGTGCACTGTCAGCGCCAGCGTTCACAGCATGGCGTCCACATCTATAAATTCCCACGCTCACGGCAGCTACAGCATCGATGGATGCACAATTTACGAATCAGATACGACGAGCGGCGACCTTGGAAGACAATGATATGCAGTGTACACTTCGAGCCGCACTGCATACGCCTGCGTAAGCTGCGTCCTTGGGCGGTACCCACACTAGAACTGGGCGACAATGTTCCGCAGGATCTGTACAGGAACgagcaaagccaacaacagtttgtgcagcagcgcagcagcgacGCGGAAGCGGGCAGTGAGGGCGAGGACTATGATGCGGAGCTAGAGGACACTATACTGGAGGAGTACGACGATGAgtatgatgataatgataatgctgaGCAATATCCGGCTGAGCCACACATCAAGCGGGAGTATCGCTCACGCTGCGATCCACAGCCGCCTGGTCAACTGCCACCATGGAAAATCAAGCAATGCTGTTTGCCCTATTGTCGCAGGCCACGCGGCGATGGCATCAAGCTTTTCCGTCTGCCCAACAATATCAGCGCCATACGCAAATGGGAGCAGGCGACGGGTATGCGCTTCTATGAGTCCCAGCGCAATACAAAACTCATCTGCAGTCGTCACTTTGATCCGCAACTTATTGGTGTGCGTCGTCTTATGTCCAATGCGGTACCTACCCGTAATCTGGGCCCAAACAGCGAGGAATCCGAGCTGCCAGCGACCAGTCCACGCTGCTGCATTAAGGATTGCCAACCAGATGCACATGTCAAGCTGCACAAGTTTCCCAGTGATCCCGAGCTGCTGCACCAGTGGTGTCAGGCGCTTAATTTGCGGGATGAGCAGCGCCACGCCGACAAGTACATTTGTGCCGTGCACCTGCCCACCAAAGCGATGAGCTGTCTCATTTGCGGCGTGGAGGATGTACAGTTACCCATGCAGGACTTTCCCGAGCATCGCAATCAGCGAGTGAAATGGTGCTACAATTTGAAAATCGAACCAATAGCCAAGTGGGACAACTCAAAGCACATTTGCTGCAAGCACTTTGAGAGCTATTGCTTCATTAAGCCGGGTCATCTGTTGCCGGACTCTTTGCCCACGCTGCATTTAAAGCACAACGACAGCAATATATTCCTCAACGAATCTGCCATAGAGAGCAGCAGGCTGCTGCACGTCAAGGATGAGCCTATGGAGTGTGAGGATCTGATGCTGTAA
- Protein Sequence
- MSQHNNQPHSHQHHHYYQQQQHHLQQQQQHHHQQQQQQQQQQQHLQHKQIQQQHSWYSHVASYPPHQPHAAAAYAAPCKNNNNNNNNNIMNAYGTGAASAHYYGAAPTAGAGVGYNLEANTVAYAHNQLLQYQQQQQQQQQLSQRSYMPHGLMHGSYPYIKSEPLELPDDRQRHQQHQHQQQQQQHFQNPMAPPPAPPVNRHTLDASGEMIIKSEPIDEHAFKSNYIDDNIPFADFSKFSEFGDDMLSPKVELTVKDEAYGNQKNPLSYPRRKLQNERPSENLPICQRCKEVFFKKQVYLRHVAESSCSIHEYEFKCNICPMSFMGAEELQKHKQLHRADKFFCHKYCGKHFDNIAECESHEYMQHEYDSFVCNMCSVTFSTREQLYAHLPQHKFQQRYDCPICRLWYQTALELHEHRLAAPYFCGKYYPAAHQQQQQQQHQQQQHPQQQQGNYKLQDCHMGTIEMTAPHHKTNALPATAALSSLLQQRQANADGAALYASTLKSEANVKLERSYSNSTSESGYSLHESSYNNAYGSDNSLHGGGAAIGGPQAHSSTLDESEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLHQWLHNLKMFHIPASSYASFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPNCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPAVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPSTPTHNHQQQLQQHQLQQHQQQLQQHVHHKYQRHSAASTSSSASSASHYVDPELSASYMGLSASSSGLNVSDSMDVCCVPSCESKRHNNENITFHTIPRRPEQMRKWCHNLKIPEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLHLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLTKWCANLQRPVPDGTKLFNDAICEVHFEDRCLRNKRLEKWAVPTLVLGHENIAYPLPTPEQVAESYARPSAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEESQVLAKWAHNLQLDIAQLPNMRICNLHFESHCIGKRMRPWAIPTLNLARNIENLFENPEHHMLYKRRAHLNADRATARSAGADGATMKASWVPRCCLPHCRKVRALHNVQLYRFPKVNRTTLAKWAHNLQVPLVGSAQRRLCSAHFEPNVLSKKCPVPLAVPTLDLNTPPGYKIYQNPAKVRANKLCWQRVCIVESCRRQRAQGVQLFRLPHSRSQLRKWMHNLRMLPRGAMRQQYRICSLHFEAHSFNGKRLSTGAIPTLELGHQDDDIYPNEAQSFVEEHCAVEGCDASKEQPDVRLFRFPNDDEDLLWKWCNNLKMNPVDCYGMRICNRHFEPDCIGPKHLYKWAIPTLVLGHDDSQIELIPNPKPEERYADPVFKCCVPTCGKTRKFDEAQMNSFPKDPSLFQRWRHNLRLEHLNFKERERYKICNAHFEDICIGKTRLNIGSIPTLELGHEVTEDLYQVNPEELQSNLFGRPRRVHENQRLSIKQELDEDIKPDISMSEATDTITTQVKIKKSVLDLKCCVASCGRSRLEHGARLFPFPTGKQQQTKWRHNLRLSAADVDRTTRVCSAHFNRRCIDGKQLRGWAIPTQQLGHQEQNIYENPKNIPGFFTPTCALAHCRKRRSIDNDLRTYRYPRNEELLEKWRVNLRLAPDQCRGRICADHFEPMVRGKLKLKTGAVPTLKLGHDEGVVFDNEAIKVGMQQEEEEEEEAGSLESEGKIKIEKQEKETLEPELENDDEDEDAEQQQKVEFPDDDMEQEQEQNEEEEELQDHGYFDPLELVETFAEQHSDDNSADNYHLEADDDDDEEDIPGNDDELLLPDTVPIQLPPRREKAVNNVTPICCLKHCRKERTASHQLSTFGFPKDQQQLLKWSANLQLDLVDCVGRVCIEHFEAEMLGTRKLKQNAVPTLNLGHATPLSYSCNGQSLSIYDAQPQHSVFRLWSLKHCRKRKLLTMPPDPAMTKRRCCLPSCGKEPELHGVQLKRLPKDRLLLRKWLHNLKLPPHMDTKHSFLCEEHFEPHATLPTLKLGHAASNIYRNGSSALSSGCLVPSCPCARLNLYRCYALPEHPQVQQAWLKWLQLPPPQLASLAQLCVMHYMQLFEQVPLPADLPESVQRQLQETYEQISSSSMAMKLRCAVPGCYSKYTDNVRLTKLPVCPQICAQWVHNTKIKYDPERHYMYRICMRHFEPQCLGAVRPKLWAVPTLHLNHSDADIYQNTMLDSSDAMPVAESVPLTLPLRIKTELPLTLSVSPSASPSPRGKQRTCCIPTCGQQANALTRLFRFPSAETALLKWLVNTQQQPRLVDTQNLFVCQRHFAAEAICKKQLQSWAVPTLSLGHEGHIIPNAKHNGNIADSQENKQALQYIWANYCSVLTCFQQRSEQVRLYAYPTDRPTIRRWAANCKHRSMQASSDGFQVCQLHFTPDCFDPDTGDLKEDAVPTLELSRPVHELRCLVNGCVREKDAARCRFFKVPKRASQLEDWCHNLRIDAASISGQEVHVCERHFEAHCFSAYKLRPGARPTLHLGHDDELDLLPNPAKWEEDVNVCFVPSCGRSKDVDNVELFGLPRIRGVLEKWLHNFRLDPSREQLQGMRICSAHFEASCIENGRLHLSSVPTLQLGHDELDNIHQSAELPSSQLKGKRLAMNYDCCYPQCMELQKSYQRIAYELPQQEALRNLWMSYLGLEQQNLQPLKLCPLHLIMLYEHSVNHFPEHSSEEQLLDANYEAARNSVRIRIISCAVRGCRTLKPRDDYRLHAMPTRRDVLRMWLDNMQLVFYEQQRYMYKVCSRHFEATCVTETTRRLKPWSMPTLELPERDPDAPPLHQNPTEEEWQRMNEQIGSSEALALLEPAFKLEPEPIVKQELHSIVKLEPKPQPEQHEGEEYEANDQQQALEVLLEVGHVEKCTTYEQMDTKPIIGYADTLSHNSLGPTTTVGSACIVNGNGLTYSARHCSVRGCDVTSLDVNDSLKLHKFPTSLDAMEKWMHNTQVNVDVNFAWRFRICSLHFLPECFNGSRIRRGAMPTLRLGSRRLGDIYDNEFNVQPEQTSVDQPVEASADAMVPTEPHDGATEFNINLHLPCPAPPRKSSKFCQIDGCSNHLTSENLTLHKFPHSADMCAKWQHNTQVPFDPEYRWRYRICSAHFEPICLGNMRLMHGSVPTLKLGARAPKQLFGNDFAALSLRLDKEKRSADQSLPVKLEQVEDDQEQYDQEDLSMLVPELQLHEGDDEHEDNQLNYTNSWTDSQQQVQLQLRLPSIKQEKGTIYNPVKSGYDKCSLVHCQRQRSQHGVHIYKFPRSRQLQHRWMHNLRIRYDERRPWKTMICSVHFEPHCIRLRKLRPWAVPTLELGDNVPQDLYRNEQSQQQFVQQRSSDAEAGSEGEDYDAELEDTILEEYDDEYDDNDNAEQYPAEPHIKREYRSRCDPQPPGQLPPWKIKQCCLPYCRRPRGDGIKLFRLPNNISAIRKWEQATGMRFYESQRNTKLICSRHFDPQLIGVRRLMSNAVPTRNLGPNSEESELPATSPRCCIKDCQPDAHVKLHKFPSDPELLHQWCQALNLRDEQRHADKYICAVHLPTKAMSCLICGVEDVQLPMQDFPEHRNQRVKWCYNLKIEPIAKWDNSKHICCKHFESYCFIKPGHLLPDSLPTLHLKHNDSNIFLNESAIESSRLLHVKDEPMECEDLML
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00534610;
- 90% Identity
- iTF_00490809;
- 80% Identity
- -