Basic Information

Gene Symbol
-
Assembly
GCA_018290095.1
Location
CM031382.1:7120817-7123839[-]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 8 0.33 2.9e+03 -1.5 0.2 35 55 218 239 206 242 0.65
2 8 0.94 8.3e+03 -3.0 0.8 22 38 293 309 290 320 0.58
3 8 4.9e-09 4.3e-05 23.6 7.0 11 51 335 375 333 381 0.91
4 8 0.00053 4.7 7.4 3.8 17 42 362 387 362 389 0.95
5 8 0.0015 13 6.0 7.3 18 43 384 409 380 417 0.57
6 8 0.054 4.8e+02 1.0 2.4 22 41 422 441 414 456 0.60
7 8 0.19 1.7e+03 -0.7 2.3 15 44 436 465 431 483 0.73
8 8 4e-05 0.35 11.0 3.1 15 47 513 544 511 551 0.87

Sequence Information

Coding Sequence
ATGGAAATAAActCATCTCGGTTACTTCATCAATTATTCCTGATATTCTTATCAGTCATAGCAACATTTTCTCAATCACAACAACAGTCATTgacattaaaatgtaaatatgaAATCTCAAACTAcaatggaagaaatttttatacttgCTATGCAATTGGACTTGAAAATCatcatgaaaatattgaaattgacaCAATTGAAGGAAgccatttgaaaaattatgatgaaagcAAAGTTGAagctttaataattaaaaatcaaaatgttaGTTTTCTTCCAAATGGatttggtgaaatttttaagaacttAATTCGACTTGAAGTGACAGAATCGCAattaaaggaaataaaaaaatcaagtttcaATGGAATGAATTTAGAAACCTTGAAGATaaggaaaaatttgataaaaaatattgaaaatggaGCATTTGAtagtttggaaaatttaaaagaattgaatttggataaaaatgaaattgaaaattttagcacTGAAGTCTTTACAAAAGTATTGAATTTGGAAAGTTTTTCGattgaaaacaacaaaattaaagaactgaatgaaaatttctgcAAGAATTTGCCTAATTTGCAAGATTTTTATGTTAGAcacaacaaaattgaaaaaatttcatcaaaggtctttgaaaattgtgaaaatttgaaaacactTGACTTTAGTAACAACAAACTTTcccaaatttcatcaaaaattattgaaaatttaaatttttttgatttctcaaATAATTCTTGCATTATTAATGGATTTTACGGCACAAATGCTGAtggaaattttgatgaaaattcaacatttattgaaaaacttgaagaaTTATGTGGACAAGGTCAAAgtagaatatttaaaatcgaaattcaaaaactgggagaaaataatgaaattttagaaaataaaaattctgaagaaaaagaaccaaaaatttacagtaatgaagaaaaattagaaaataatccAACAGAAAACCAAATCAGTTCAGCAACTTCAGATGatgtggaaaattttaaagaagaaattcaagacttgaattcaaaattatcaaatttagaaactgaaaatgaaaatttaaagtcaaatattgaagagaaaaatttaaaaattgaaaatttggagatggaaaatgaaaatttaagaactgAAAAGGAAAACTTAAAGGAacaaatttcaagtttaaaggaagaaaatgataaacttcaaaatgaaattgaagaaactaataaaaattctgatattggaacacaaattgaaaaattggaatttgaaAACTCAAAATTGGAAgcagaaattgaagaatttaaggcacaaaatcaaaaacttacttcagaaattgaaaagaaagaagaagaactaaaaaattcaaaatctttaCTTCAAAGTTGTGAAATCAGTAAAGAAAGCTTAAGATATcagaatgaagaaattaaccaaaatttaagaaatctcacaaatgaaattttaattacaagctcaaattttgacaaatgcaaagaagaaaatacCAAACACCAGTCtgaaactgaaaatttaactagaaaaattccaaatttagAAAGACAAAACGAAGAAATGCAaactgaaattgaaagtttaagaaaaatttccgaTGAAAAAAccagagaaaattttgatatttcaagaaattttagcAATTTAGAAGCTGAATATCTTGGactaagaagagaaaaagaagatttggAATATGATTTGCAAATTTATAGAGAAAAAGTTGTttactttgaaaaaaattatcgtgaaaaacttgaagaaatttcaaaatgtccTGAAGcCCCACAATCACAAGTTTGCTGCAATGaacttcaaatttataaatcagtTCCATTAGAATGCGATTTCGATCACATTGAACTTGAATACAGTTGCAAATCGACATCACTTGTAATCATTCATCGTCATATGTCAATCAGTGAAACTCGAGGCAACCATATCACAAGATTTATCGATAATTCAAATGTCAAAAGACTTCATGTAAttgatgcaatttttaaatttttcacaaatgatatttttaatcattttgagTCAATTCAATCATtggaaattataaattctcaacttttagatttaaatgatgacaaaattaaaaatgaaaatttaaaattcttgagaATTGAAGGAAATTTAGTTGACAAActtgaagatgaaatttttagtggaattaaaaatgttgaaattttaattttatcaagcAACAAAATCAACAATGTTGAACCATTGGCATTCACTGGactttttaatcttaaaaaacttgatctaaaaaataacgaaattgCATCATTGcatcaaaatgtttttgctGATCTTAAAAGTTTGACACAcatttctttatcaaataatCAACTTCAATATTTGCAAGGAAATTTATTGGcaaatcaaaagaatttgttgatggtaaaatttaatgaaaatccaATTATAAGCATTggaaatgaattattaaaatatgctTATAATCTTGAAAATGTTGACTTTGCTAGAACATGTGTTGGTCATTCTACTATTAAAGGAATTGAAGAGACTAAATTGAGAATTGAagttaattgtaaattttga
Protein Sequence
MEINSSRLLHQLFLIFLSVIATFSQSQQQSLTLKCKYEISNYNGRNFYTCYAIGLENHHENIEIDTIEGSHLKNYDESKVEALIIKNQNVSFLPNGFGEIFKNLIRLEVTESQLKEIKKSSFNGMNLETLKIRKNLIKNIENGAFDSLENLKELNLDKNEIENFSTEVFTKVLNLESFSIENNKIKELNENFCKNLPNLQDFYVRHNKIEKISSKVFENCENLKTLDFSNNKLSQISSKIIENLNFFDFSNNSCIINGFYGTNADGNFDENSTFIEKLEELCGQGQSRIFKIEIQKLGENNEILENKNSEEKEPKIYSNEEKLENNPTENQISSATSDDVENFKEEIQDLNSKLSNLETENENLKSNIEEKNLKIENLEMENENLRTEKENLKEQISSLKEENDKLQNEIEETNKNSDIGTQIEKLEFENSKLEAEIEEFKAQNQKLTSEIEKKEEELKNSKSLLQSCEISKESLRYQNEEINQNLRNLTNEILITSSNFDKCKEENTKHQSETENLTRKIPNLERQNEEMQTEIESLRKISDEKTRENFDISRNFSNLEAEYLGLRREKEDLEYDLQIYREKVVYFEKNYREKLEEISKCPEAPQSQVCCNELQIYKSVPLECDFDHIELEYSCKSTSLVIIHRHMSISETRGNHITRFIDNSNVKRLHVIDAIFKFFTNDIFNHFESIQSLEIINSQLLDLNDDKIKNENLKFLRIEGNLVDKLEDEIFSGIKNVEILILSSNKINNVEPLAFTGLFNLKKLDLKNNEIASLHQNVFADLKSLTHISLSNNQLQYLQGNLLANQKNLLMVKFNENPIISIGNELLKYAYNLENVDFARTCVGHSTIKGIEETKLRIEVNCKF

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-