Dbir005075.1
Basic Information
- Insect
- Drosophila birchii
- Gene Symbol
- -
- Assembly
- GCA_008042755.1
- Location
- VNKA01002195.1:113121-126616[-]
Transcription Factor Domain
- TF Family
- THAP
- Domain
- THAP domain
- PFAM
- PF05485
- TF Group
- Zinc-Coordinating Group
- Description
- The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 29 4.9 1.3e+04 -3.0 2.0 49 60 168 181 150 199 0.56 2 29 2.3e-15 6.1e-12 46.2 4.0 1 86 396 468 396 469 0.85 3 29 8.6e-15 2.3e-11 44.4 5.0 1 87 496 565 496 565 0.83 4 29 7.8e-16 2.1e-12 47.7 0.2 1 87 587 659 587 659 0.85 5 29 5.5e-16 1.5e-12 48.2 4.8 1 86 757 826 757 827 0.82 6 29 2.3e-15 6.1e-12 46.2 3.8 1 86 851 922 851 923 0.82 7 29 9.8e-13 2.6e-09 37.8 1.5 1 87 958 1026 958 1026 0.81 8 29 6.9e-11 1.9e-07 31.8 1.4 1 86 1068 1137 1068 1138 0.76 9 29 4.2e-17 1.1e-13 51.7 0.4 1 86 1165 1234 1165 1235 0.82 10 29 1.1e-12 3e-09 37.6 1.4 1 85 1256 1324 1256 1326 0.79 11 29 1.1e-14 3e-11 44.0 0.5 1 86 1353 1424 1353 1425 0.85 12 29 1.5e-12 4.1e-09 37.2 3.7 1 85 1500 1568 1500 1570 0.82 13 29 1.5e-12 4e-09 37.2 0.1 1 86 1593 1661 1593 1662 0.83 14 29 3.8e-13 1e-09 39.1 2.2 1 87 1809 1878 1809 1878 0.80 15 29 2.7e-13 7.4e-10 39.5 0.1 1 87 1973 2047 1973 2047 0.82 16 29 0.00014 0.38 11.6 1.1 1 61 2062 2114 2062 2129 0.73 17 29 2.8e-14 7.5e-11 42.7 0.0 1 86 2141 2211 2141 2212 0.76 18 29 2.6e-13 7e-10 39.6 0.1 1 87 2264 2334 2264 2334 0.81 19 29 1.7e-12 4.7e-09 37.0 0.1 1 86 2369 2443 2369 2444 0.80 20 29 5.3e-13 1.4e-09 38.6 0.0 1 86 2454 2527 2454 2528 0.80 21 29 2.3e-10 6.2e-07 30.2 0.0 1 61 2553 2608 2553 2625 0.77 22 29 2.8e-05 0.075 13.9 0.1 1 58 2650 2700 2650 2717 0.84 23 29 2.1e-11 5.6e-08 33.5 0.8 1 87 2740 2812 2740 2812 0.82 24 29 2.7e-16 7.3e-13 49.2 0.2 1 86 2923 2995 2923 2996 0.81 25 29 2.7e-12 7.4e-09 36.3 3.5 1 86 3059 3129 3059 3130 0.80 26 29 3.2e-14 8.7e-11 42.5 4.2 1 86 3222 3292 3222 3293 0.84 27 29 3.6e-12 9.8e-09 35.9 0.2 1 86 3374 3443 3374 3444 0.85 28 29 4e-10 1.1e-06 29.4 0.4 1 58 3470 3518 3470 3532 0.81 29 29 4e-10 1.1e-06 29.4 1.4 18 87 3536 3594 3524 3594 0.77
Sequence Information
- Coding Sequence
- ATGTCACAACACAACCCCAATCACGCCCACCACCCACACTACCACTACCCTGAACCCTTGGAAGGGTTCCAGCAGCCGCCAAATCCAATGGCCCCACCCCCGGCCCCagaaatgataataaaatCGGAACCCATTGACGACCTGGCCTACAAGTCAAACTACATAGACGACAATACGCCATTTGCGGACTTTAGCAAGTTTAGCGAATTCAGCGAGGACATGCTGAGTCCCAAAGTCGAGCTGACAGTCAAGGATGAGTCCTTCGTTAGGAACCCCAATAGCTTTTTACGCCGTAAGCAACAATCGGATCTGACGACAGCAGAGAGCCTGCCCGTCTGCCAGCGATGCAAAGAGGTGTTCTTCAAGAAGCAGACTTACCTGCGGCACGTCGCCGAGAGCAACTGCGGCATCCAGGAGTACGACTTCAAGTGCACCATATGCCCCATGTCCTTTATGACCGCCGAGGAGCTACACCAGCATAAGCAACAGCATCGAGCGGACAGATTCTTCTGCCACAAGTACTGCGGAAAGCACTTTGGCACGATCACAGAGTGCGAGGCGCATGAGTACATGCAACATGAATACGAAAACATTGTGTGCAACATGTGCTCGGGATCTTTCGCCACGCGGGAACAACTTTATGCTCATTTGCCGCAGCACAAGTTCCAGCAGCGCTTTGACTGCCCCGTATGCCGCCTATGGTACCAAACCGCTGTGGAGCTGCATGAGCACCGCCTGGCTGCACCCTACTTCTGCGGTAAATACTACACGGGCGGACAGTCCCCGTCCCCGTCCTCGTCCtcccaacagcaacagcaccagAGTCAGACGAACTACAAGCTGCAGGACTGTCATATGGCCACCATGGAAATGCCAAACGCACCGCTCCTTAAGGCAAACTCATCCAACTCGCCGGCCTTGCCAGCGACAGCAGCGCTTAACTCCCTGTTGCAACAGCGCCAGGCCAATGCCGATGGAGCAGCTATTTTTGCCGCATCTTCGCTGAAGAACGAGGTCGCTGTGAAGCTGGAGCGCAGCTACAGTAACTCGACCAACGAATCGTCTTATAGCGTCCAGGAGAGCGGCTACAATAATGTGTATGGCAGCAGTGACAGCTCAGTTCACGGTGCCATTGCCGGGCCACAGGCACACTCTTCGACGCTGGACGATTCCGAGGATGCGCTTTGCTGTGTGCCGCTGTGCGGTGTGCGGAAGAGTACGAGTCCCACCTTGCAGTTTTTCACGTTCCCAAAGGACGAAAAATATCTCAACCAGTGGCTGCATAACCTCAAGATGTTCCACATACCCGCTTCCAGCTACGTTAGCTTCCGGATCTGCAGTATGCACTTCCCCAAGCGATGCATCAACCGCTATTCGCTGTGCTACTGGGCGGTGCCGACATTTAACCTCGGCCACGATGACGTAGCCAATCTCTACCAGAATCGGGAGCTGACCAACACGTTTACCACTGGCGAAGTGGCGCGCTGCAGCATGCCACATTGTACCAGCCAGCGGGGTGAGAGCAACCTCAAGTTTTACAATTTCCCAAAGGATATCAAAAGCCTGATTAAGTGGTGCCAAAACGCCCGACTTCCGGTGCAGGCAAAGGAGCCGCGACATTTCTGTAGCCGCCACTTTGAGGAGCGGTGCATTGGCAAGTTTCGACTGAAACCTTGGGCAGTGCCCACCTTACACCTGGGCGCCCAGTACGGCAAGATCCACGACAATCCAAAGAATCTATATGTGGAAGAGAAACGCTGTTGCCTCAACTTTTGTCGCCGGAGCCGCTCTTCCGACTTCAATATGTCGCTATATCGATTTCCTAGAGACGAAGTCCTGCTACGGCGCTGGTGCTACAATCTTCGCCTCGATCCGGGAGTGTATCGCGGCAAGAATCACAAAATATGCAGCGCTCACTTTATAAAAGAGGCGTTGGGTCTTCGGAAACTGTCGCCTGGTGCCGTGCCCACACTTCATCTGGGCCACAATGATACCTTCAACATCTACGAGAACGAACTGTGGCCACCGCCAACTCCGACACCCTCCTCTTGTCATctccaacagcaacagcagtcaTCCCTTCATTCGCTTCAACAGCAGATGCACAGCAAATCCTACCAGCGCCGCTCAGCGGCATCTACATCGTCATCGGCAAGCTCGGCAGCCTCGCATTATGTGGATCCTGAGATGAGCGCCTCTTAccatctagccatgtccgccTCCGCCGGTGGCTCTGCGACGATAAACGCCAGCGACAGCATGGATGTCTGTTGCGTGCCCAGTTGCGAGAGCAAGCGACACAATAGCGAGAACATTACATTCCACACGATTCCGCGACGGCCCGAGCAAATGCGTAAATGGTGTCACAATCTTAAGATTGCCGAGGACAAGATGCACAAGGGCATGCGAATCTGCAGCCTTCACTTCGAGCCCTACTGCATCGGCGGCTGTATGCGTCCGTTTGCTGTGCCCACTCTTCAGTTGGGCCACGACGATGAGGATATCCACCGCAATCCGGACGTGATCAAGAAGCTGAACATCCGGGAGACATGCTGTGTGGCTGTGTGCAAGCGGAATAGGGACAGGGATCATGCGAATCTGCATCGTTTCCCCAGCAATGTGGCTTTGCTGAAAAAGTGGTGCACCAATTTGCAGCGCAGCGTTCCCGATGGCAGTAAACTCTTCAATGATGCCATCTGTGAGGTGCACTTTGAGGATCGTTGCCTGCGCAACAAGAGGCTCGAGAAGTGGGCAGTGCCTACTCTGATCCTGGGACACGATGACATTGCCTATCCGCTGCCCACGCCAGAGCAAGTAACCGAGTTCTATGCCCGGCCCACGGCTCCCAACAATGGTGAGGAACAGGGCGAGTGCTGTGTGGAGACGTGCAAGAGGAATCCGAGCGTGGACGATATAAAGCTATACCGGCCACCGGAGGAGGCCGCCGTGCTGGCCAAGTGGGCGCACAACCTGCAAACGGAGGCCAACCAACTGACAAGCATGAGGATCTGCAATCTACACTTTGAGGCGCATTGCATCGGCAAGAGGATGCGACATTGGGCCATACCGACTTTGAATCTAGCCGGCAACATTGAGAATCTTTATGAGAATCCAGAGCAATCGCTGCTGTACAGGCGTCGCACTACTCACATGAAGGCGAAGCTGACGCAAGCCTCCGTCAAACCCACCTGGGTGCCCAGGTGCTGTCTTCCACACTGTCGCAAAGTCAGAGCCCTGCACAATGTCCAGCTGTATCGCTTCCCCAAGCTCAATCGCTCCACATTGGCCAAGTGGGCGCATAATCTCCAGGTTCCAATGGTGGGCAGTGCCCAGCGCAGGCTATGCTCGGCCCACTTCGAGCCACATGTGCTCAGCAAAAAGTGCCCGGTGCCGCTGGCGGTGCCTACGCTCGACCTAAATTCACCACCCGGCTTGAAAATCTACCAGAATCCGGCCAAGCTAAAGGCCAGCAAACTGTGCCTGCAGCGGGTGTGCATTGTCGAAAGCTGCCGCAAGACGCGGGCGCAGGGCGTTCAGCTTTTCCGGCTGCCGCACAGCCCCACACAGCTGCGAAAGTGGATGCACAACATCAGGACGCGGCCACGAGCAGCTATGCGGGCTCAGTACCGGGTGTGTTCCCGTCACTTCGAGACGCACTCCTTCAATGGCCGAAGACTGAGCGCAGGTGCCATTCCGACTCTAGAGCTGGGCCACGATGGCGACGATATCTATCCGAATGAAGCGCAGGCATTTGTGGACGAGCATTGTGCTGTCGAAGGCTGTGAGGCATCCAAGGAGCAGCCGGAGGTGCGATTGTTCCGCTTCCccaccgacgacgacgataTGTTGTGGAAGTGGTGCAACAACCTCAAAATGAATCCTGTGGACTGCATTGGGGTACGCATCTGCAACAAGCACTTCGAGGCCGATTGCATCGGTCCCAAGCATCTGTACAAGTGGGCTATTCCCACACAGGAGCTGGGCCACGACGATGCGCAGATCGAGCTGATACCGAATCCCAAGCCAGAGGATAGGTATGTGGATCCCGTCTTCAAGTGCATCGTTCCCACCTGCGGCAAGACACGACGGTTTGACGAAGTGCAAATGAACAGCTTCCCCAAGGATCCGGATCTATTCCAGCGATGGCGGCACAATCTGCGCATTGATCATCTCAGTTTCCAGGAGCGTGAGCGCTACAAGATCTGCAACGCACACTTCGAGGAGATTTGTATTGGAAAGACACGGCTAAACATTGGATCCGTTCCAACCTTGGAGCTTGGTCATGACGATGAGGAGGATATTTTCAAAGTTAATCCAGCGGAGCTGCAGAGCAATTTATTCGGGCGGCAGCGTCGACTGCTGCTCGAGGGATCCGGCGAACAGAGTGTCGTCAAGCAAGAGCTATCCGAGACGGAGGACAACAACAAGGCGGATGTGACGGCCACTGGCTCCAATTCCAAGCAGATCAAGATCAAGAGATCTTCTTTGGATCTTAAGTGTTGTGTGCACAGTTGTGGAAGAAGTCGCTTGGAACACGGAGCCCGGCTGTTTCCCTTTCCCACGggcaagcagcagcacctaAAGTGGCGTCACAATCTGCACCTGGAACCGGAGGAGGTGGACCGTTCGACGCGCGTTTGCAGCGCTCACTTTAATCGACGCTGCATCGAGGGCAAACAACTGAGGAGCTGGGCCATGCCCACGCAACAGTTGGGACACAACGACCAGCCGATCTACGAGAACCCAAAGAACATACCGGGATTCTTCACACCTACCTGTGCCCTGGGACACTGTCGCAAGCGGAGGAGTATTGACAACGATCTGCGCACCTATCGGTATCCCAGGAGCGAAGATCTTCTAGAGAAATGGCGAGCTAATCTGCGACTGGCTCCAGATCAGTGTCGTGGTCGAATCTGTGCAAATCATTTCGAACCGCAGGTTCGGGGCAAGCTAAAGCTGAAGACGGGAGCCGTTCCTACACTACAACTGGGACACGATGAGGAATTAATCTATGACAATGAAGCTATTAAGGCAGGCATGACCGAAGAAGAGGAGGCCATAACCACAGACTTTCCGCGattgaaaccaaaaaaagagttgttcgaagaggaggaggaggagtgcgAAGGGAACGATGGCGAGCAGCAGCACGCAGATGACCTGGACGAGAATGCAGATGAAGAAGACAAAGATGATCAGTACTTTGATCCTCTTGAGCTGGTTGAGACTTTTGCCGAACATCGCAGTGATGACGAAGCCCAGGACTATGAGGATGAAGAAGACGAGGGTCGAGTTGAGGACTCCCCCTCCGGTTATGATGTCAAGGAGGAGATAGAACCGCCGCCAAGCTCACCACCCTCTCCGCTTCGACGACGGCACCATGTTCCGCGCCGAGACAAGCCTGCTAACAATGTGACGCCCATTTGCTGCCTGAAGCACTGCAGAAAGGAACGCACTGCCTTCCATCTGTTGAGCACTTTCGGCTTCCCAAAGGAtcgccagctgctgctgaaaTGGTGTGTCAATCTGCATTTAAACCCGGACGACTGCATCGGTAGGGTTTGCATCGAGCACTTTCAGCCGGAGGTACTCGGCACCCGCAAGCTCAAGCAGAACGCAGTGCCCACTCTTAATGTGGGACATGATGAACCGCTTAGGTATTCGTGCCATGGCGTGGACCAGAATCTCGAGGAGCGGGAGCCCCAGCCACAGCATTCGGTTTTTCGGCTTTGGAGCCTGAAACACTGTCGCAAAAGGAAGCCAACGGAGCCGCCGGATATTCCCCCAACCAAGAGGAGAGTGCTGGAGATGCCAATGATGAAGCGGGAGTGGGAGATGGAGATGCcaatgcagatgcagatggagCAGAAGAAGGAGGCAAAGAAGATGACTCAAAATGAAAGTAATTCACTCACATGCTGTATTAGCAGTTGCGGAAACCAGGAAGTTAGCCAATTGCTGGCATTTCCTGAAGAGACATCCTTGTTGAGAAAGTGGATCCATAATTTAAGGCTTTCCAATGAGATTGAGCCCTCTTCTCTAAGCCTGAAAAGAGTTTGCTTGTCGCACTTCGAATCGCAGCTCTTGGAGAATGGAAAGCTCACAAAGgaagcagaggcagaggctgTGCCTACCTTAAACCTGGGCCATAGCAGCTGGAATCTATACAGAAGCAATGGGATTTGCCTAGTGCCTGACTGCACCCATAATACCTTCGGACGCATTAGCTTTATCGACCTGCCGGATAACAGTATTATTAGGAAGGCTTGTTTCTCCTGCTTAAACCTACCTGAATCTTCCGAGGAGCAGGCGAGACTATGTTGTGTCCACTTCATGCAGGCTTACAAAAAGTTTGATCTGCCTAATGTTCTGCACCCTAAAGTCATGATGGCGCTACAAAGTGTTGTGGCCGAGCTGAAATGCGCGGTGCCTGACTGTAATTCCGAAGAAGCTGGTTCTGACTTTCAACTTATCCAGTTTCCCGATGACAAGGAGATGCTGTCACAGTGGCTGCACAACACCAAGGTCCCTTATGATCCTTCTAATCACCAAAGTTATCGCATCTGCACACGTCACTTTGAATCAGAGTATTTGGAGTTGAATGGCCCGCTAAAAGGAGCTCTACCAACGCTACATCTAAACCATGAAGATGAGATTCACTTGAATACCAGCCCTTTGCCAGAGGATCAGAACTCTATATTGACACCACTGCGTATAAAGACGGATCCGGCCTTCTTGGGCAGTCCCTGTGCAAGTGCAAGCCCCAGTCCCCGGGGCAAGATCCGTATGTGCTGCATTCCCACATGTGGACAGTATGGCAGCAGTCAAGTGCGGCTGTTTCGTTTTCCCACCGAGGAGCAGGCGTTGCTTCGGTGGCTGGTGAACACCCAACAGCAGCCGCGACTGGTTGATCCCATGGACTTGTATGTGTGCCAGTCGCATTTTGAGCCCGAGGCCATTTATATGAAGCAACTACGAAACTGGGCTGAGCCCACCTTAAACTTGGGACACGACGGCCATATAATACCGAATGCCAAACataatggaaatatttccGACAGCCAAGATACCGAGCAAGCCATGAGGTTTATTCGCGAACGATTCTGCTCGGTCATTTCTTGCTTTCAGGCTGGCggacaggaggaggagggagtgAGGCTATTTGATTATCCCGAGGATATGGCGACTACTCGAAAATGGGCCGCCGCATGCCGACATCGCTCCATGCAGGCCAGAAGCCATGGGTTCAAGGTGTGCCAATTGCATTTCGCCAAGGAATGCTTTGACCCAAATACCGGGGAATTAATTGAGGGCGCTGTGCCTACACTGGAGTTGAGCAGAGATGAAATGGAGAGGCAATGTCTGGTGGCTGGATGTGTGAAAAATGATGCCAATGGAACCCGCCTTCGATACTTTAAGATACCAAAAGTTGCTGCCCAATTAGAAGCGTGGAGCAACAACCTTAAAGTCCATCCAACGGATCTCATGCAAGGCGAGCAGCAGTACATCTGCGAGAAACACTTTGAGTCGTTCTGTTTCGCGGCCAACAAGGGACTGCGTTCTGGTGCTCTGCCAACCCTCCTCCTAGGCCATGATGAAGAGGTGGATATGCTTCCAAATCCGGAAAGCTTCAACTGCCAGAATAAGGCGGATAAATGCTGCGTACCCGGCTGCGGGCGTGTCTGGCAGGCTGGTGATCGTAAATTTCGTGGATTTCCCAAATTGCTGGCCATGGCCAAGAAATGGAGACATAATCTTCGTTTGGAGGAGCCCGTGGAGCAACTCGGCAAGCTGAAGGTCTGCGGTGCTCACTTTGAGGCCACCTCACCCAGCCTGGGTACAAATGGACTAAGTGTCTCGATACCAACCCTGGAATTGGGCCACTCTTCTCGGGATATTTTCCCAGCGGAGATAAGCTTAAAGTTCCAAAAGCCAGCGAAAACGATTTGCTGTTATCCCAAATGCGAAGAAGCCTGTTTATCCAAGAACTTTTCTTACGGTCTTCCCCAGGAGGAGCATCTGAGAAATGCCTGGCTAAGCTATATGGACATCGAAGACCCGAAAGATGAAGAAATCGCACAGGTGTGCCCGCTGCACTATGTCATCCTCTACCAGCACAGTGCCGCACTCTATCCGGAGCTTCATGCTTCAAGCCGTCGGCTTCTTGACTACAATTACAAGGAGGCGTGGAACAACAGGCGCGTTAAGATTGTGAGTTGCACGATCAAGGGCTGCGACATGATTAAGCCACGAGATGGGATACCACTGCACGGGATGCCGCAAAGCAAGGACATCCTGCAGATGTGGATAGAAAATGGTCAGTTTGAGTTCTTAGAGCAGCAGCGGTATATGTTCAAGGTGTGTCACAATCATTTTGAGCCATGCTGCTACTTCGACGACAGACGTTTGCACTCATGGAGCGTGCCCACTTTGCATCTACCTGGAGATGTAATTCACCAAAATCCCACCGCCGAGCAGTGGCAGAACATGATCAACAAGCAAGCAGCAGCGAAAACTGACCGAGAAGAGAGCGAGGAGCCAGATCCATATGAGGATGTGGTTAAAACCGAACCCATTGTAAAGATGGAGCATATCGAATCGGAATATGAAGATGAAAACAGTGAGATGCAGGCCCTCGAGGTCCTCCTAGAAGTTGGCCATGTCGAGCGAATGGAGAGCTATGAGAAAATGGATAAATCACCAGCGACATACACCGATACACCGTTTCGATCTTCACCCATACGTTACCCATACAATGCTAATCATTGTGCCGTAGAAGGATGCCAGGTGACTGTCGAGGATGTGGACGGCACAATTAAGCTGCATAAATTTCCCGCCTCGCAGGAAGCAGCACAGAAGTGGATGCACAACACCCAAGTTGACATGGACGAAAAGTATTGGTGGCGTTATCGCATTTGCAGCTATCACTTCGAACAAGAATGCTTTCAGGGTGCTAGAATTCGTAAGGGCGCGATGCCCACGCTTTTGCTAGGACCGCGGCGACCGGACGAGGTATACGATAATGAGTTTTCACTACCAGAGGCGGAGGAGCCCTTTCCAGAGCCACCCGAGACTCAACTTGAGGAAAGAACGTCCTTGGCGTCCAGAGTTCAAAAGGAGGTAACCAATTTATGCCTGCCGCCACGGGCGCCGCCTCGAAAGTCCAGCAAGTTTTGCCAAATTGATTCCTGCACAAATCATTTGACCACTGAGAACATGACACTTCACAAGTTTCCACACTCGGAGGACATGTGCCTCAAGTGGCAGCACAACACACAAGTGCCATTTGATCCCTACTACCGCTGGCGCTATCGCATTTGCAGTGCCCATTTTCATCCGGTGTGTTTGGTCAACATGCGTCTAGTCCATGGAAGCGTTCCCACTTTAAAGCTGGGCCCTAAGGCTCCTTCCGAGCTGTTTGACAACGATTTCGAAGCCATTAACCTAAGATTGGACAAAAGGTTGACAGAGTCCAATGCTAATGTGTATATCAAGCATGAAAAAagggaggaggatgaggattCGATGATGTTCCTGGAGCCCGAACTTCAGTTACACGAGGACCAAGACGATAAGGTATCAAGCTGGAGCAGCAAAATGCCATTACCACCTGTGAAGCAAGAGAAGATTATATACAGCCAGATCAAGTCTGGCTACGATAAGTGTTCGCTGGCGCACTGCCAGCGCCAAAGGTCCCAGCATGGCGTCCACATTTATAAGTTTCCCAGATCGAGGCGTCAGCAGGAGCGTTGGATGCACAACCTACGCATCCGCTATGATGATCGGACGCCGTGGAAATTCATGATCTGCAGCGTTCATTTCGAGCCGCATTGCGTCAGCCTAAGGAAGCTGCGACCATGGGCGGTGCCCACACTGGAACTGGGTGACAATGTACCAGAGACAATCTTCACGAACGAACAGTGCGAGAAGGAGCTGGTGACCGATCGCAGTGATCCGGATAGCGACGCCGAGGAAGAAGACGGCTTGcaggaggacgacgacgatgatgaagaCGAAGACGATGTGAAGCCCGATGTTATTGGCATAAAAAGGAGGAAACGTTCCAAAATAGATGCCAACTGCCCTCCCAGCCAGATTCCACCCTGGAAAGTCAAGCAATGCTGCCTCCCCTATTGTCGTGCCTTTCGAGGCGATGGCATCAAGCTGTTTCGGCTTCCGAATAACCGAAACTCCATTAGCAACTGGGAACGGGCCACCGGAATGGTATTCAAGGAGTCGCAACGGAACACTCGCCTGATCTGCAGTCGTCACTTTGAGCCAGAGCTGATTGGCGTCAGGCGTCTAATGCGTAACGCCATTCCCACGAAACACTTGAGCCCCCAAGCTGTGGACCAGACGCGTactaaaaaggaaaagaatcCTCCTCCGGCCACTATTGTACCCATCTGCTGTATGGCGGATTGTCATTACAACGGAAATGTGAAGCTGTACAAGTTTCCAAGTGATCCCACTCTTCTCAAACAGTGGTGCCAGGCTCTCCGTCTCACGGATACGCAGCGGTATTTGGGCAAGCACATTTGCTCCATGCACATGCCAATGAACAAGACGCAGAGCTGTGTCATCTGCGGTGGAGATGACGTAGAGCTGCCAATGCTTGGGTTTCCGGAAAACCGCAATCAGCGCGCCAAATGGTGTTACAATCTTAAAATTGAGGCAATACCAAAGTGGGACCACTCAAAGCATATTTGCTGCCGGCACTTTGAGTCCCATTGCTTTGACAAGCCGGGTGAGCTACGTCCAGGAGCGGCTCCCACGCTCCATCTCAATCACGATGACACAAACATATTCTTCAGCGACTATGCCACTGGTCTTCCGTCCTCGCCACTAGGCAATCGAATTAAAGACGAGCCCCTGGAATCGGAATCCGACGAGACACTGCTGGTGTAG
- Protein Sequence
- MSQHNPNHAHHPHYHYPEPLEGFQQPPNPMAPPPAPEMIIKSEPIDDLAYKSNYIDDNTPFADFSKFSEFSEDMLSPKVELTVKDESFVRNPNSFLRRKQQSDLTTAESLPVCQRCKEVFFKKQTYLRHVAESNCGIQEYDFKCTICPMSFMTAEELHQHKQQHRADRFFCHKYCGKHFGTITECEAHEYMQHEYENIVCNMCSGSFATREQLYAHLPQHKFQQRFDCPVCRLWYQTAVELHEHRLAAPYFCGKYYTGGQSPSPSSSSQQQQHQSQTNYKLQDCHMATMEMPNAPLLKANSSNSPALPATAALNSLLQQRQANADGAAIFAASSLKNEVAVKLERSYSNSTNESSYSVQESGYNNVYGSSDSSVHGAIAGPQAHSSTLDDSEDALCCVPLCGVRKSTSPTLQFFTFPKDEKYLNQWLHNLKMFHIPASSYVSFRICSMHFPKRCINRYSLCYWAVPTFNLGHDDVANLYQNRELTNTFTTGEVARCSMPHCTSQRGESNLKFYNFPKDIKSLIKWCQNARLPVQAKEPRHFCSRHFEERCIGKFRLKPWAVPTLHLGAQYGKIHDNPKNLYVEEKRCCLNFCRRSRSSDFNMSLYRFPRDEVLLRRWCYNLRLDPGVYRGKNHKICSAHFIKEALGLRKLSPGAVPTLHLGHNDTFNIYENELWPPPTPTPSSCHLQQQQQSSLHSLQQQMHSKSYQRRSAASTSSSASSAASHYVDPEMSASYHLAMSASAGGSATINASDSMDVCCVPSCESKRHNSENITFHTIPRRPEQMRKWCHNLKIAEDKMHKGMRICSLHFEPYCIGGCMRPFAVPTLQLGHDDEDIHRNPDVIKKLNIRETCCVAVCKRNRDRDHANLHRFPSNVALLKKWCTNLQRSVPDGSKLFNDAICEVHFEDRCLRNKRLEKWAVPTLILGHDDIAYPLPTPEQVTEFYARPTAPNNGEEQGECCVETCKRNPSVDDIKLYRPPEEAAVLAKWAHNLQTEANQLTSMRICNLHFEAHCIGKRMRHWAIPTLNLAGNIENLYENPEQSLLYRRRTTHMKAKLTQASVKPTWVPRCCLPHCRKVRALHNVQLYRFPKLNRSTLAKWAHNLQVPMVGSAQRRLCSAHFEPHVLSKKCPVPLAVPTLDLNSPPGLKIYQNPAKLKASKLCLQRVCIVESCRKTRAQGVQLFRLPHSPTQLRKWMHNIRTRPRAAMRAQYRVCSRHFETHSFNGRRLSAGAIPTLELGHDGDDIYPNEAQAFVDEHCAVEGCEASKEQPEVRLFRFPTDDDDMLWKWCNNLKMNPVDCIGVRICNKHFEADCIGPKHLYKWAIPTQELGHDDAQIELIPNPKPEDRYVDPVFKCIVPTCGKTRRFDEVQMNSFPKDPDLFQRWRHNLRIDHLSFQERERYKICNAHFEEICIGKTRLNIGSVPTLELGHDDEEDIFKVNPAELQSNLFGRQRRLLLEGSGEQSVVKQELSETEDNNKADVTATGSNSKQIKIKRSSLDLKCCVHSCGRSRLEHGARLFPFPTGKQQHLKWRHNLHLEPEEVDRSTRVCSAHFNRRCIEGKQLRSWAMPTQQLGHNDQPIYENPKNIPGFFTPTCALGHCRKRRSIDNDLRTYRYPRSEDLLEKWRANLRLAPDQCRGRICANHFEPQVRGKLKLKTGAVPTLQLGHDEELIYDNEAIKAGMTEEEEAITTDFPRLKPKKELFEEEEEECEGNDGEQQHADDLDENADEEDKDDQYFDPLELVETFAEHRSDDEAQDYEDEEDEGRVEDSPSGYDVKEEIEPPPSSPPSPLRRRHHVPRRDKPANNVTPICCLKHCRKERTAFHLLSTFGFPKDRQLLLKWCVNLHLNPDDCIGRVCIEHFQPEVLGTRKLKQNAVPTLNVGHDEPLRYSCHGVDQNLEEREPQPQHSVFRLWSLKHCRKRKPTEPPDIPPTKRRVLEMPMMKREWEMEMPMQMQMEQKKEAKKMTQNESNSLTCCISSCGNQEVSQLLAFPEETSLLRKWIHNLRLSNEIEPSSLSLKRVCLSHFESQLLENGKLTKEAEAEAVPTLNLGHSSWNLYRSNGICLVPDCTHNTFGRISFIDLPDNSIIRKACFSCLNLPESSEEQARLCCVHFMQAYKKFDLPNVLHPKVMMALQSVVAELKCAVPDCNSEEAGSDFQLIQFPDDKEMLSQWLHNTKVPYDPSNHQSYRICTRHFESEYLELNGPLKGALPTLHLNHEDEIHLNTSPLPEDQNSILTPLRIKTDPAFLGSPCASASPSPRGKIRMCCIPTCGQYGSSQVRLFRFPTEEQALLRWLVNTQQQPRLVDPMDLYVCQSHFEPEAIYMKQLRNWAEPTLNLGHDGHIIPNAKHNGNISDSQDTEQAMRFIRERFCSVISCFQAGGQEEEGVRLFDYPEDMATTRKWAAACRHRSMQARSHGFKVCQLHFAKECFDPNTGELIEGAVPTLELSRDEMERQCLVAGCVKNDANGTRLRYFKIPKVAAQLEAWSNNLKVHPTDLMQGEQQYICEKHFESFCFAANKGLRSGALPTLLLGHDEEVDMLPNPESFNCQNKADKCCVPGCGRVWQAGDRKFRGFPKLLAMAKKWRHNLRLEEPVEQLGKLKVCGAHFEATSPSLGTNGLSVSIPTLELGHSSRDIFPAEISLKFQKPAKTICCYPKCEEACLSKNFSYGLPQEEHLRNAWLSYMDIEDPKDEEIAQVCPLHYVILYQHSAALYPELHASSRRLLDYNYKEAWNNRRVKIVSCTIKGCDMIKPRDGIPLHGMPQSKDILQMWIENGQFEFLEQQRYMFKVCHNHFEPCCYFDDRRLHSWSVPTLHLPGDVIHQNPTAEQWQNMINKQAAAKTDREESEEPDPYEDVVKTEPIVKMEHIESEYEDENSEMQALEVLLEVGHVERMESYEKMDKSPATYTDTPFRSSPIRYPYNANHCAVEGCQVTVEDVDGTIKLHKFPASQEAAQKWMHNTQVDMDEKYWWRYRICSYHFEQECFQGARIRKGAMPTLLLGPRRPDEVYDNEFSLPEAEEPFPEPPETQLEERTSLASRVQKEVTNLCLPPRAPPRKSSKFCQIDSCTNHLTTENMTLHKFPHSEDMCLKWQHNTQVPFDPYYRWRYRICSAHFHPVCLVNMRLVHGSVPTLKLGPKAPSELFDNDFEAINLRLDKRLTESNANVYIKHEKREEDEDSMMFLEPELQLHEDQDDKVSSWSSKMPLPPVKQEKIIYSQIKSGYDKCSLAHCQRQRSQHGVHIYKFPRSRRQQERWMHNLRIRYDDRTPWKFMICSVHFEPHCVSLRKLRPWAVPTLELGDNVPETIFTNEQCEKELVTDRSDPDSDAEEEDGLQEDDDDDEDEDDVKPDVIGIKRRKRSKIDANCPPSQIPPWKVKQCCLPYCRAFRGDGIKLFRLPNNRNSISNWERATGMVFKESQRNTRLICSRHFEPELIGVRRLMRNAIPTKHLSPQAVDQTRTKKEKNPPPATIVPICCMADCHYNGNVKLYKFPSDPTLLKQWCQALRLTDTQRYLGKHICSMHMPMNKTQSCVICGGDDVELPMLGFPENRNQRAKWCYNLKIEAIPKWDHSKHICCRHFESHCFDKPGELRPGAAPTLHLNHDDTNIFFSDYATGLPSSPLGNRIKDEPLESESDETLLV
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00525910;
- 90% Identity
- iTF_00594581;
- 80% Identity
- -