Btha016917.1
Basic Information
- Insect
- Bhutanitis thaidina
- Gene Symbol
- -
- Assembly
- GCA_029286595.1
- Location
- JAGSMS010000038.1:1896457-1900695[+]
Transcription Factor Domain
- TF Family
- TEA
- Domain
- TEA domain
- PFAM
- PF01285
- TF Group
- Helix-turn-helix
- Description
- The TEA domain is a DNA-binding region of about 66 to 68 amino acids that has been named after the two proteins that originally defined the domain: TEF-1 and AbaA. The TEA domain is located toward the N terminus of eukaryotic transcription factors of the TEA/ATTS family. It shows a three-helix bundle with a homeodomain fold [3, 1]. Two α-helices are nearly anti-parallel and pack on either side of the third one, which is the DNA-recognition helix of the TEA domain. Phosphorylation of one or both of the two conserved serines found on the DNA-binding surface could interfere with DNA-binding activity, by introducing electrostatic repulsion and/or steric hindrance, and help regulate the transcription factor activity of the proteins [2, 1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 19 0.071 7.2e+02 0.8 0.0 37 54 33 50 2 54 0.67 2 19 0.03 3e+02 2.0 0.0 28 54 48 74 24 78 0.64 3 19 0.044 4.5e+02 1.4 0.0 30 54 74 98 59 102 0.72 4 19 0.044 4.5e+02 1.4 0.0 30 54 98 122 83 126 0.72 5 19 0.049 5e+02 1.3 0.0 30 55 122 147 109 151 0.75 6 19 0.012 1.2e+02 3.2 0.5 54 54 194 194 133 246 0.65 7 19 0.034 3.5e+02 1.8 0.0 29 54 241 266 226 270 0.60 8 19 0.05 5.1e+02 1.2 0.0 30 54 266 290 253 294 0.74 9 19 0.047 4.7e+02 1.3 0.0 30 55 290 315 276 319 0.74 10 19 0.048 4.9e+02 1.3 0.0 30 54 314 338 300 342 0.74 11 19 0.043 4.4e+02 1.5 0.0 30 54 338 362 323 367 0.73 12 19 0.044 4.5e+02 1.4 0.0 30 54 362 386 347 390 0.72 13 19 0.044 4.5e+02 1.4 0.0 30 54 386 410 371 414 0.72 14 19 0.044 4.5e+02 1.4 0.0 30 54 410 434 395 438 0.72 15 19 0.021 2.2e+02 2.4 0.1 39 55 443 459 421 486 0.57 16 19 0.02 2e+02 2.5 0.1 38 55 490 507 467 534 0.58 17 19 0.05 5.1e+02 1.2 0.0 30 54 530 554 517 558 0.74 18 19 0.05 5.1e+02 1.2 0.0 30 54 554 578 541 582 0.74 19 19 1.5 1.6e+04 -3.5 0.0 30 41 616 627 606 628 0.69
Sequence Information
- Coding Sequence
- ATGCGAGCGCTGATCGAAGCCCTGCAAGCGCCGAAACCTATCTACCGTGACAATCTACCTACATTAGACCCTGATAAGCAAGATACGGatctgATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGACAATCAGCTGGTTCGAGATTATATCGACCTAGAAAAATTGTACCTCCGCACAAACGTAAAAGTACTGATAGGGTTCCAATGCAGTACCAGGCGTGTGGATAATGCAACTAGTAGTGTCAAATTTACACAAGTTCATAACAACTCATCCAGTGTTGTCCGACAGCATGAAACTTCTGTCACCCGACGAAACCAGCTTCTGCAGTAA
- Protein Sequence
- MRALIEALQAPKPIYRDNLPTLDPDKQDTDLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIDNQLVRDYIDLEKLYLRTNVKVLIGFQCSTRRVDNATSSVKFTQVHNNSSSVVRQHETSVTRRNQLLQ
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -