Basic Information

Gene Symbol
-
Assembly
GCA_000349025.1
Location
KB663610.1:29654006-29658698[+]
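
The location string above packs the scaffold accession, the coordinate range and the strand into one field. A minimal sketch of splitting it apart, assuming the layout `contig:start-end[strand]` and 1-based inclusive coordinates (the record does not state the coordinate convention); `Locus`, `parse_location` and `span_length` are hypothetical names used only for illustration:

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: contig:start-end[strand], e.g. KB663610.1:29654006-29658698[+]
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a location string into contig, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(locus: Locus) -> int:
    """Genomic span in bp, assuming 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_location("KB663610.1:29654006-29658698[+]")
print(locus.contig, locus.strand, span_length(locus))
```

Under the inclusive assumption the span is 4693 bp, which is longer than the coding sequence listed further down, so the locus presumably includes non-coding sequence such as introns.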

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domains group
Description
These proteins are highly similar in a region of about 50 residues that includes a conserved leucine-zipper domain, most probably involved in homo- or hetero-dimerisation. Members include the Drosophila protein bunched [1] (gene bun, also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  13   5.8e-06     0.037   13.7   0.6        15      53        28      64        25      68  0.84
 2  13    0.0067        43    3.9   1.6        21      47        72      98        71     107  0.86
 3  13     0.011        69    3.2   0.5        19      44       108     133       104     139  0.85
 4  13     0.015        96    2.8   2.6        18      49       163     198       157     211  0.69
 5  13   0.00018       1.1    9.0   0.1        21      45       217     241       213     245  0.90
 6  13    0.0029        19    5.1   0.4        21      44       257     280       252     291  0.87
 7  13    0.0013       8.3    6.2   0.3        21      45       296     320       292     330  0.87
 8  13    0.0012       7.5    6.3   0.6        22      51       335     362       334     367  0.83
 9  13    0.0002       1.3    8.8   2.0        21      46       372     397       371     408  0.91
10  13    0.0028        18    5.1   0.5        21      46       410     435       407     439  0.89
11  13   6.7e-05      0.43   10.3   0.7        21      46       448     473       445     480  0.93
12  13      0.04   2.6e+02    1.4   3.1        14      37       490     513       487     532  0.83
13  13      0.27   1.8e+03   -1.3   0.3        20      20       542     542       515     578  0.59
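
The per-domain rows above can be read programmatically when ranking or filtering the 13 TSC22 hits. The following is a minimal sketch, assuming each row is whitespace-separated with the 13 columns in the order shown; `DomainHit`, `parse_hit` and the i-Evalue cutoff of 1.0 are illustrative choices, not part of the record or of HMMER itself. Only three of the rows are copied in to keep the example short.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated per-domain row (13 fields)."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Rows 1, 2 and 11 copied from the table above.
ROWS = """
1 13 5.8e-06 0.037 13.7 0.6 15 53 28 64 25 68 0.84
2 13 0.0067 43 3.9 1.6 21 47 72 98 71 107 0.86
11 13 6.7e-05 0.43 10.3 0.7 21 46 448 473 445 480 0.93
"""

hits = [parse_hit(line) for line in ROWS.strip().splitlines()]

# Illustrative cutoff (an assumption): keep domains with independent E-value < 1.
I_EVALUE_CUTOFF = 1.0
kept = [hit.index for hit in hits if hit.i_evalue < I_EVALUE_CUTOFF]
print(kept)
```

Applied to the full table, the same cutoff would retain only domains 1 and 11. The remaining repeats have individually weak independent E-values.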

Sequence Information

Coding Sequence
ATGTCGGACCTGGACGATATGAGCAACTTCGATGCGTGGTTGGAGAAGGACTTTCTTAGCAAAATTAACCGGCTCGAGACGGAGGTGGAAGATCTGAAGGATAAGAAAAGCAATCTCGACAAGGacctgaagaagaaaacgatcgaGTGTGATATACTGCGCGCGAAGGTCGGCAAACTCGAACAGGTCCCGTCGAACAATGCGGCCTCCAACGAAAAGATCGAAAACCTGGAAAAGGAGCTCAAGCGTAAGACGATGGAGCTGGACATACTGCGGTCGAAGCTGTCGCAGGCCGAAACGAACCCGCAAGCCTCGCAGGACCTGACCGAGCGTGTGAAGCAGCTCGAGAAGGAGCTGAAAAAGCGCATCATTGAGAACGGCATACTGCGCGGTAAGCTGACGGAGGTCGACAGTGAGCCACAGACACCGGAAGCGCAGGAGAAGATCGATGAGTTGGTGCGGGGCCTACAGCGTACCGAGAGTGTGGTGCTGCGGGAGAAGGTTAAGGAGCTGGAAGGTGACTCGGAAGCGTCCGAACGGAACGGTGAACTGGAGCGCGAgctcaagaagaaaaacatcgagAGTGATATACTGCGCTCGAAGTTGAAGCAGTTCGAAAACACGCCCTCGAACGCAcaggaagcgaacgaacggaTGCGTGATCTGGAGAAGCAGGTCCGGAAGTACCAGATCGAGGCGGACATTCTGCGCGCGAAGGTGACCCAGCTGGAGAGTGATGCGGCGGGGCGCGATGACGAAGAATCGACTGAGCGCATCGAGGAGCTGGAGAAGGGTCTGAAGAAAAGCACCACCGAGGGTAACATGTTGCGGGCGAAGatgcaaatgtttgagaaGGAATCGACCGCAACACAGGACTCGAACAGTGAGCAGATCGAGGAGCTGCGCAAGGGGCTGAAGAAAAGCACCACCGAGAGCCAAATACTGCGTTCGAAGCTGACCGAGCTGGAGAGCAAACAGGTGCACAGTGGCGCTAGCAGCTACAAGATCGAAACGCTCGAGCAGGagctgaagaagaagaacatcgAGAACGATATCCTGCGATCGAAGGTGGAACAGCTGGAGCGCGTCCCGTTGGCCGACAGTCCGGCGAACGACAAGATCGAGGAGCTGCAGAAGGAGATCCGCAATCTCAAGATCCAGAACGACATACTCAAATCGAAGCTAAACCAGGCCGAGGACGAACCGACGACCACGCAGGAATCGACCGATCGCATCAGCGAGCTGGAGAACGAGCTGAAGAAGCGCACGATCGAGAGCGATATACTGCGCGCGAAGGTGACCCAGCTGGAGTGTGAGCTGTCCCCGACCAAAAACTCGAACGAGCGGATAGAGGATCTGGAAAGCgagctgaagaagaaaacgctCGAGAACGACATCCTGAAGTCGAAGGTGAGCCAGCTGGAGGGTGAGGTGATCGCGCATTCGGGCGCCAGCAAGGCGGACCCGAACGAGGTGAAGAAGCTGAAGGAAAAGCACGAGGAGGTGATGAATAAGGCGAAGGAGCTGCTGTTCGAGCGGACGAAGACGGTCAAGACGCAGGACATGCAGATCAAGGCGCTGCAGAACCAGATCGAAAACATCAAGGAGGTGGTGGCGGTAACGAAGGACATGCTGAACATCCGGAACATGGAGCACGAGCAGTTGCAGACGCGGTTCGAAAACATCGACTGCAAGATGAAGGCGGAACGGGAACGGCAGACGCTGCTCGAGAAGAAGCTGACCGTCTCGCAGAAGATGTACAACGATCTGCGGGACGAGTACACGACACAGCTCGAGCTCTTCAAGGATTCTTCTTCGTCAAATTGCAGGAAATCCAGAAAACCTACGCCGAAAAGATCGAACTCATCAAGGAGGAAATCGAATCCGTCCGGAATGGTGCCAAAAATTGAGTCGTCCTACCGGCAGGACCAGCAACTGCCAACGGTCACGGTGGAAGATCATCTTCGGAGTGAGCTGGGTGGTCGATCCTGCTGTTGTAC
GGTGCAATAA
Protein Sequence
MSDLDDMSNFDAWLEKDFLSKINRLETEVEDLKDKKSNLDKDLKKKTIECDILRAKVGKLEQVPSNNAASNEKIENLEKELKRKTMELDILRSKLSQAETNPQASQDLTERVKQLEKELKKRIIENGILRGKLTEVDSEPQTPEAQEKIDELVRGLQRTESVVLREKVKELEGDSEASERNGELERELKKKNIESDILRSKLKQFENTPSNAQEANERMRDLEKQVRKYQIEADILRAKVTQLESDAAGRDDEESTERIEELEKGLKKSTTEGNMLRAKMQMFEKESTATQDSNSEQIEELRKGLKKSTTESQILRSKLTELESKQVHSGASSYKIETLEQELKKKNIENDILRSKVEQLERVPLADSPANDKIEELQKEIRNLKIQNDILKSKLNQAEDEPTTTQESTDRISELENELKKRTIESDILRAKVTQLECELSPTKNSNERIEDLESELKKKTLENDILKSKVSQLEGEVIAHSGASKADPNEVKKLKEKHEEVMNKAKELLFERTKTVKTQDMQIKALQNQIENIKEVVAVTKDMLNIRNMEHEQLQTRFENIDCKMKAERERQTLLEKKLTVSQKMYNDLRDEYTTQLELFKDSSSSNCRKSRKPTPKRSNSSRRKSNPSGMVPKIESSYRQDQQLPTVTVEDHLRSELGGRSCCCTVQ
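
The protein sequence should correspond to a translation of the coding sequence. A minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) and treating the lowercase stretches in the coding sequence as equivalent to uppercase; `translate` is a hypothetical helper, and only the first 30 and last 21 nucleotides of the coding sequence are used here:

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove whitespace (including line breaks) and normalise case.
    sequence = "".join(cds.split()).upper()
    protein = []
    for position in range(0, len(sequence) - 2, 3):
        # Codons with unexpected characters are reported as X.
        amino_acid = CODON_TABLE.get(sequence[position:position + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 21 nt of the coding sequence above.
print(translate("ATGTCGGACCTGGACGATATGAGCAACTTC"))
print(translate("TGCTGTTGTACGGTGCAATAA"))
```

The two fragments translate to `MSDLDDMSNF` and `CCCTVQ`, matching the start and end of the listed protein. Because whitespace is removed first, a sequence that is wrapped over several lines can be passed in as-is.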

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00094915;
90% Identity
iTF_00094915;
80% Identity
-