Basic Information

Gene Symbol
-
Assembly
GCA_954870645.1
Location
OX940924.1:27868473-27903820[-]
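The Location field packs contig accession, start, end and strand into one string. A minimal sketch of splitting it apart follows. The names `Locus`, `parse_locus` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CONTIG:START-END[STRAND]".
LOCUS_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into contig, start, end and strand."""
    m = LOCUS_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(m["contig"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(locus: Locus) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("OX940924.1:27868473-27903820[-]")
print(locus.contig, locus.strand, span_length(locus))
```

The span is the genomic footprint of the locus on the contig, so it can be much longer than the coding sequence if the gene contains introns.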

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domains group
Description
These proteins are highly similar in a region of about 50 residues that includes a conserved leucine-zipper domain, most probably involved in homo- or hetero-dimerisation. Members include the Drosophila protein bunched [1] (gene bun, also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  4   0.00013   0.72      11.5   0.3   13        44      18        49      15        69      0.75
2  4   6.4       3.6e+04   -3.5   0.0   34        41      107       114     97        119     0.75
3  4   2.1e-05   0.12      14.0   0.2   14        44      305       335     304       337     0.94
4  4   0.018     1e+02     4.6    0.3   17        34      633       650     628       656     0.87
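The four domain rows above can be read into typed records and filtered by independent E-value. This is a sketch only: `DomainHit`, `parse_hit` and the cutoff `MAX_I_EVALUE = 1.0` are hypothetical choices for illustration, not thresholds used by this database.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    index: int       # "#"  : domain number
    of: int          # "of" : total domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the 13-column domain table."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 columns, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


# Rows copied from the Hmmscan Out table of this record.
ROWS = """\
1 4 0.00013 0.72 11.5 0.3 13 44 18 49 15 69 0.75
2 4 6.4 3.6e+04 -3.5 0.0 34 41 107 114 97 119 0.75
3 4 2.1e-05 0.12 14.0 0.2 14 44 305 335 304 337 0.94
4 4 0.018 1e+02 4.6 0.3 17 34 633 650 628 656 0.87
"""

hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]

MAX_I_EVALUE = 1.0  # hypothetical cutoff, chosen only for illustration
kept = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]

for h in kept:
    print(f"domain {h.index}: protein residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

With this illustrative cutoff, only the hits aligned to protein residues 18-49 and 305-335 remain.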

Sequence Information

Coding Sequence
ATGGCAGCGGTGGCAGACAGCCTCAGCCGGCCTCCTCCCGCATCGACAACATCGGACGAAATGTCCcaaatgcgggaggaggtcgccaagctcacggcggcgatggatGCTCTCCGTAAGGAGAACATCCAGCTGAAAGAGGAGCTGGCCGGACTGCGGAAGGGGGGCCGCCGAGAGGAGGTCCCGCAGGCGCGAACGCAGACGCAGGCGCAGACGCGGCAgaaggagccgccggcgaagcccgcaccaaAAGACGACAGCAACGCTGTCTtaaatctggtgcggcaggagctggcggccttcaaccagcgctttacagcgctggagaaccgcgtcttgcgccccccgctcgcgTCATCCTATgcggccgtagcggcttccGCGCCGCCCAGGCCAGCCCCGGAAAAGCGGGCCGCGGCGAAACCGCCGGCGCCTCCCAAACAGGCCACAGGGAGAGCTgccccggcggcggcggcaccggctggggcgaaggcggccccgcaGAGCCCGGCTgcgaagaaaagaaagaaaggtgttccggcggcggcggcggtgagcgtatgCTCTGGAAGCACGACAAGGAGGGACGAGGGGAGCGCCGCCAGCAGGAGGAAGGAATTCCTCAAGCGCTCCCGGGACGACGTCTCTTCATCCGAGGAGGATGCTGCCTCGAGCCGGAAGATCCCCACcagtgtgcggggtaaaggccgaggcgggacgtgcggTAGAGGGCGGGAAGCCGAGCCCTCCCTAGAAAAATTGGTGGAGGACGCCATCCGCACTATTAAGGCCAAGCCTGGCAGGCCCGGTAAAAAGGCCACAAAGGACTTGGCCATAAGTAAGGCCACGGCCACTATAATGGCGGCGGTGTCGGGCGGCATCGGACGGCCTCCACCCGCATCGAGCCCATCGGACGAAGTGTCCCAGTtgcgggaggaggtcgccaAATTGACGGCAGCGATGGATGTTCTCCGAAAGGAGAACATCCATCTTAAAACGGAGCTGGCCGGGCTAAAAAAGGGGACCCGCGGGGAGGAGATCCCGCAGACGCGGAGgaaggagccgccggcgaagcccgcaccaaCAGATGACAGCAATGTCTTAAGTCTGGTGCGACAGGAGCTGGCGGCTTTCAACCAGCGGTTCGCGGCTCTTGAAAGCCGCATCCTGCGTCCCCCGCTCGCGGCATCATACgcggccgtagcggcttccGCGCCGCCCAGGTCAGCTCCGGAAAAGCGGGCCGCGGCGAAACCGCCGGCGCCTCCCAAACAGGCCACAGGGAGAGctgctccggcggcggcggcaccggctggggcgaaggcggccccgcaGAGCTCGGCTGCCAAAAAGAAGAAAgCCACCAGACCTGTAcgaccctccctttacaaggGTATGTCTGTGTGCGCTCAAATGGTCACCCCCGACCCACATGACGCCCCGTACGAGCATCGGTCGACGGGGGAGTCCATGGGGGAAAAGCTGCTTTCCCCGAAGGAACGCGACGAGGTGAAGCGAAGAATAACCTTCGAGGACGCCACCCCTCGTGCGTATATCGAGGTCTTCACCACGCCAAAGGGGCGCCCGCCTCCCGTGGCCGCTACCGGATTGCTCGGTGGGGCTGTGGCCGAGCGTCGCAGTCGTCTTGAGGAGGCGAGAGATTGCCTACAGAAGGCGAAGCTCAACCTGGGCAACTCTCGCAACCTCAAGACAAACATCAAGAACGCGGTGCTGGAGGCCCTGGAACGGCTCTTCGAGCTGGTCGAGGACGCGGAACTCGAGCGGGCCCCTGATGCGCCAGTGACCGTGCCAGAGCCGAGGACCACCGCATCCACCGCTGTCCATAGTTCCGCACCCGTCTCCGTCGAGATTCTTGCTGGCCAGACTGAGCTCCTCAACACCCTCAAGGAGCATATCTCGGAGTTGAAAGACCACTCCGAGAAGATGGATAGACTCCGAGACAGTCTGGCCAGCGGGGAAAGTGTGGGGGGGGAAACATA
CGCGAGTGTTGCGTCAACGACTAGTCGCCATACCGGGACCCAGGCCCTGGAGCGAAAGACGCTGCACTCCGTGGTTGTCTCCTCCACTGAGGTGATGGATACGGGTGACGAGATCCTGGAAAGGGTCAGGAGAGCGGTGGATGCGAAGGACGGCTGGGTTCGGGTGGAACGTGTGAGAAAGGCCAAGGACCGGAAGATCATTATGGGGTGTGCGACAAAGGAGGAGAGAAAAAAGATCAGGGAGCGACTGCAGGCGGCCGGAGGAaacctcgtcgtcgaggacgTGAAGAATAGGGACCCGCTTCTGATTCTGAGGGATGTACTTTCAGTACACAGCGACGAGGAAGTGCTCGCAGCCCTCAGGAGTCAGAATCGGGAGATCTTCTGCGGCCTCGACGACGAGGAAGGGAGAATGTCTGTCAAATACAGGAAAAAGGTACGAAACCCGCATGTAAACTGCATTGTGCTGACTGTATCCCCCATTATCTGGAGGAGGGCACTCGAAGGAGGCAAGCTCCGAATTGACCTCCAGAGAGTGCGCGTCGAAGATCAGAcgcctctggtgcagtgcaccCGCTGCCTAGCATTCGGCCACGGCAAGCGATTCTGCACGGAGCCAGCCGATCTCTGCAGCCACTGCGGTGGTCCACACCTTAGGTTGGACTGCCCAGTCAAGCAGAGCGGCGGGGCAGCGGTATGCGTGAACTGCACCAGAGCAAAATTGGGGGAGCATCAGCACAGTGCTTTTAGTGACGTTTGCCCGATCCGTAGGAAGTGGGACGCATTCGCGAGGGAAGCAATAGCCTATTGCTAA
Protein Sequence
MAAVADSLSRPPPASTTSDEMSQMREEVAKLTAAMDALRKENIQLKEELAGLRKGGRREEVPQARTQTQAQTRQKEPPAKPAPKDDSNAVLNLVRQELAAFNQRFTALENRVLRPPLASSYAAVAASAPPRPAPEKRAAAKPPAPPKQATGRAAPAAAAPAGAKAAPQSPAAKKRKKGVPAAAAVSVCSGSTTRRDEGSAASRRKEFLKRSRDDVSSSEEDAASSRKIPTSVRGKGRGGTCGRGREAEPSLEKLVEDAIRTIKAKPGRPGKKATKDLAISKATATIMAAVSGGIGRPPPASSPSDEVSQLREEVAKLTAAMDVLRKENIHLKTELAGLKKGTRGEEIPQTRRKEPPAKPAPTDDSNVLSLVRQELAAFNQRFAALESRILRPPLAASYAAVAASAPPRSAPEKRAAAKPPAPPKQATGRAAPAAAAPAGAKAAPQSSAAKKKKATRPVRPSLYKGMSVCAQMVTPDPHDAPYEHRSTGESMGEKLLSPKERDEVKRRITFEDATPRAYIEVFTTPKGRPPPVAATGLLGGAVAERRSRLEEARDCLQKAKLNLGNSRNLKTNIKNAVLEALERLFELVEDAELERAPDAPVTVPEPRTTASTAVHSSAPVSVEILAGQTELLNTLKEHISELKDHSEKMDRLRDSLASGESVGGETYASVASTTSRHTGTQALERKTLHSVVVSSTEVMDTGDEILERVRRAVDAKDGWVRVERVRKAKDRKIIMGCATKEERKKIRERLQAAGGNLVVEDVKNRDPLLILRDVLSVHSDEEVLAALRSQNREIFCGLDDEEGRMSVKYRKKVRNPHVNCIVLTVSPIIWRRALEGGKLRIDLQRVRVEDQTPLVQCTRCLAFGHGKRFCTEPADLCSHCGGPHLRLDCPVKQSGGAAVCVNCTRAKLGEHQHSAFSDVCPIRRKWDAFAREAIAYC
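A quick consistency check between the coding sequence and the protein sequence is to translate the CDS and compare the result. The sketch below assumes the standard genetic code, which this record does not state, and upper-cases the input because the CDS mixes upper- and lower-case letters. The `translate` function and `CODON_TABLE` are hypothetical names written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a CDS in frame 0; unknown codons become 'X'."""
    seq = cds.upper()  # lower-case letters are treated like upper-case ones
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the Coding Sequence in this record.
cds_start = "ATGGCAGCGGTGGCAGACAGCCTCAGCCGG"
cds_end = "GCCTATTGCTAA"

print(translate(cds_start))
print(translate(cds_end))
```

To check the whole record, concatenate every line of the Coding Sequence field before calling `translate`, since the sequence is wrapped across lines here.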

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-