Basic Information

Gene Symbol
-
Assembly
GCA_029963845.1
Location
JANEYF010004506.1:49313-58866[+]
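The Location value appears to follow a contig:start-end[strand] layout. A minimal sketch for splitting it into fields, assuming that layout and assuming 1-based inclusive coordinates (the record does not state the convention); `parse_location` and `Location` are hypothetical names used only for illustration:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], strand being + or -
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("JANEYF010004506.1:49313-58866[+]")
# Genomic span of the locus, valid only if coordinates are inclusive
span = loc.end - loc.start + 1
```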

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domains group
Description
These proteins are highly similar in a region of about 50 residues that includes a conserved leucine-zipper domain, most probably involved in homo- or hetero-dimerisation. Members include the Drosophila protein bunched [1] (gene bun, also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 21 0.021 5.5e+02 2.9 1.2 19 43 27 51 21 57 0.70
2 21 0.0004 10 8.4 0.1 16 44 66 94 64 102 0.88
3 21 0.11 2.9e+03 0.6 0.3 32 48 110 125 106 138 0.57
4 21 0.00031 8.1 8.7 0.2 20 46 154 180 151 189 0.87
5 21 0.23 6e+03 -0.5 0.2 19 37 190 208 188 210 0.86
6 21 0.00042 11 8.3 0.8 26 46 220 240 218 253 0.78
7 21 0.17 4.5e+03 -0.1 0.1 25 44 261 280 250 291 0.77
8 21 0.0072 1.9e+02 4.4 0.3 24 44 302 322 295 341 0.73
9 21 0.0047 1.2e+02 5.0 0.3 26 42 357 373 351 377 0.83
10 21 7.3e-05 1.9 10.8 0.6 18 42 391 415 388 419 0.92
11 21 0.00091 24 7.2 0.6 18 48 440 470 433 480 0.84
12 21 0.0016 41 6.5 0.5 20 43 477 500 473 505 0.78
13 21 0.0005 13 8.1 0.1 16 44 515 543 513 550 0.91
14 21 0.036 9.5e+02 2.1 0.3 18 42 562 579 557 586 0.58
15 21 0.0029 76 5.6 0.1 26 44 605 623 595 632 0.82
16 21 0.0025 65 5.8 0.4 13 35 641 663 638 671 0.72
17 21 0.022 5.6e+02 2.8 0.1 24 44 687 707 678 713 0.80
18 21 1.7e-05 0.45 12.8 1.6 17 43 739 765 737 775 0.91
19 21 0.1 2.7e+03 0.7 0.1 32 45 782 795 779 797 0.78
20 21 0.011 2.9e+02 3.8 4.7 15 48 786 822 783 833 0.70
21 21 0.42 1.1e+04 -1.3 1.7 16 36 938 956 904 960 0.73
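Many of the 21 domain hits above have large independent E-values, so it can be useful to parse the rows and filter them. A minimal sketch, assuming each row carries the 13 whitespace-separated fields in the order given by the header; the field names, the `parse_hit` helper and the 1.0 cutoff are illustrative assumptions, not settings taken from the pipeline behind this record. Only three rows are copied here to keep the example short:

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # hit number ("#")
    total: int       # total number of hits ("of")
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]),
        int(fields[1]),
        float(fields[2]),
        float(fields[3]),
        float(fields[4]),
        float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Rows 10, 18 and 21 copied from the table above
rows = """\
10 21 7.3e-05 1.9 10.8 0.6 18 42 391 415 388 419 0.92
18 21 1.7e-05 0.45 12.8 1.6 17 43 739 765 737 775 0.91
21 21 0.42 1.1e+04 -1.3 1.7 16 36 938 956 904 960 0.73
"""

hits = [parse_hit(row) for row in rows.splitlines() if row.strip()]
best = min(hits, key=lambda h: h.i_evalue)  # lowest independent E-value
passing = [h.index for h in hits if h.i_evalue <= 1.0]  # assumed cutoff
```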

Sequence Information

Coding Sequence
atgaaattaacaaaTCGAAAAAAAGACATGGAAAATGCTTCAAATGACTCgcaaaaatattcagaaaaagtCATGCAACTGGAAAAAGAAAATCTCAAATTCCAACAAGATATAAGTAATCTAGAAGCAGAAAATGATAggttaaaaaaagaattggaTGCAGTTCTTAACGACGCTAAAAAGGGAACTTCTGGAGTTTCAGAGATGCAGGACGAATTAGGAGAATTAAAGCAAAGAGTATCATTGTTAGACAGTGAAAATTCTTCGCTGAAGAAAGACCTAGAAGCAGCTTTAATGCACGCTAAAAGTGGCTGTAAAGGAGTAATCCAACTTAAGGACGAAAATGCGaaactaaaacaaattaatgatgACTTGGAGAATCAGCTTGCAAATTTAAGAAAAGAGTTAGAGGTAGCTTTGGTGGAAGCAAAAACGGCCTCTTCAGGTTCAGCgcaatttcaaaatgaaatgtctGATTTAAGAGATAGACTGGCAAAGTTGGAATCAGAGAATGCCAAGTTGAAGGGAGATTTAAATTCTGCATTGGAAGAAGCTAAAAGAcggctcttgaaaatgaaaattcaaaacttaaaaaaagatttagaaaaagCCGTGGAAGAAGCTAAAAGCGCAGGTGCGGCTCAGGACGACAGTAAGCTGAGAGAGAGAATAGAAAAACTTGAGGCAGAGAATGCCCAACTGAAGAAAGATTTAAATTCTGCACTAGACGAACTTAAAAATGCTAGCAGTACAATATCACAATTACAAGGAGATAgtgataaattaaaacaaagattAGCTTTAATGGAAACAGAAAGCTCTAAATTGAAAAAGGATTTAGATGACGCTTTGCAAGATGCAAGAAAATCAGCGTCTGCACTTTCTCAAATGCGTGATGATGAATCTAAACTGAAACAGCGAATTGCCCAGCTTGAAACCGAGAGTTCTAAGCTAAAACAAGATTTGGATAATTCTTTAGCTGGAGCAAAGAAAAATGAAAACCAAAAGCTTAGAGGTGATTTGGATAATTATATCAAAGATACCAAGTCATCAGATGAAGAGAATAGACTTAAGCAGAGAGTTGCTCAACTTGAAGCAGAAAACAGTAAGTTGAAGAAAGATTTGGAAGGTGCAAGAGATGAACATAAGAATTGTGGATTAAATTTATCACAGCTACAAGACGAAATTGGAAAATTGAAACAGAGGCATGCAGAACTTGAAAaggaaaattcaaatttgaggAAAGATTACGAAGGGGCAATGGGTGATGCTAAAAAGAGTGCTGCAGAATTAGCGATGCTTCGAGACAATGAAGCAAAATTGAAACAACAAATAACACAACTTCAAGGTGATAATAACAAACTGAGTAAAGACATTACAGCTTTAAATGCGGAAATAAATAGCAATTCAAGTGGCCTGGcacaaataaaagataatgaagcgaaattaaaacaaagaataGCTCAACTCGAAGACGAAaatgctaaattaaaaaaagcctTAGATTCTGCAATAGACGAATCGAAAAAATCAGGTGCTGACGTATCAAAATTGAAAGATGATGATGCTAAAATGAAACAAAGATTAGCCCAACTGGAAGCAGAAAATGCCAAGTTAAAAAGTAACTTAGACGCAGCTTTAGCTGAAGCTAAAAAGGGGGCAGCTGGAGCAGACGAGGAAGCTAAATTACGACAAAAACTAGCCCAGTTAGAAGcggacaataataaattaaaaactgatttgAACAGAGCTCTAGATGATGCTAAAAAGAGTGTAGCAGATATGTCCAAATTACGAGATGATGATGGaatattaaaacagaaaatggCACAATTGGAAAATGAAAATGCCAAGCTGAAGAAAGATTTAGAGAAAGCTTTGGCTGACGCTCAAAGTGGAAGTAGTGGAGCATCGCAGCTTcgggatgaaaataacaaattaaaacaaagaattGGACAACTCGAAAGTGAAATTTCAAAGCTAAAGATGGACCTAAATTC
TGCTCTAGAGGATGCTAAAAAAGGGGCTTCTGGTGTGTCCCAAATGCAAGATGACGatgcaaaattaaaacaacgactcagccagttagaaaaagataATGCGAAACTAAAAAGTGATCTCGAGTCAGCACTGAAAGAAGCTAAGGGAGGATCTGACACTGCGAGCGATTTAGATTCGTCGTTAAGTGAATCTAAAAGTGGATCTGCTGGATTAGATAAACTAAAAGATGAGAACGCAAATCTAAAACAAAAACTCTCTCAATTGGAAAAGGAAAATAGTAAACTCAAAAAAGATCTCGAAGCAGCACTGAATGATCTAAAGAGTGGATCTGCGGGATCTGACCGTCTCAAAGATGAAAactcaaatttaaaacaaagaatagCTCAGCTTGAAAatcTTGAGGACAAAATTAAACAGCTAGAAGCAGAACTGGCTAAACAAAAGAAGGAATGTGACAATAACATGGCTTCCTTAAAAGAACAATTTGACAGAGATATGAGAAATTTGCTTAAGAAAAACGAAGAGAGCATAGCTAAACTGCAAAAAGCCCATGAGGAACTTTTAAAggaattaaatgaaaaacatgATAATGAGCTAAAAGGCATACAGGACGCATTAAGGAAATCAAAAGAAGAATTGGACAGTACCATAGCGGAAcgaaacaaaatgaaaaatttgttaGACGACCAACGAAAAAAGAGcatcgaaataaaaaataaggttGAGCAAGTGGAATCCGTTATATCAAAAGAGCGAAGTAAAAGTAGAGAACTAAGGAGAAGCAGTTTACTATTGCAAAAACAACTTGAGACCGAAGTGGTTAAAAGCAAGATCTTAGAAGAAGAAATAAAGGAATTGCAAAAAGAAGAGGATAAAGAAGACAAACTTACAATGACAACATTCGAGAATGCAACAGTTGCCTCGAAGTTCCCAGATGTTATCGCAATTACAGCCCCACTGTCTGAAAAGGAGTTAAAGTTCTTTGGAACTCAAAGTTGTCCATGTGATACGAGTCTCGAAACAGTTCTAGATAAACTAATTAACAATGGAATTGAGtctCTTAGCATTGAAGAATTACAAATGCTTCATAAAAAGAATAGTGATGCTACAGCCAATGTCTTAGAGAAACTTGGACAGAGCACTGGTGTTCTAAAACCTAACGAGACCAATAAATTATCGTTAATGAAAAGGATAGCAGCATTAGAGGGCGATTTACTGAAGAAACAAAAACATGCTCAACAAAAAGAGTTGTAA
Protein Sequence
MKLTNRKKDMENASNDSQKYSEKVMQLEKENLKFQQDISNLEAENDRLKKELDAVLNDAKKGTSGVSEMQDELGELKQRVSLLDSENSSLKKDLEAALMHAKSGCKGVIQLKDENAKLKQINDDLENQLANLRKELEVALVEAKTASSGSAQFQNEMSDLRDRLAKLESENAKLKGDLNSALEEAKRRLLKMKIQNLKKDLEKAVEEAKSAGAAQDDSKLRERIEKLEAENAQLKKDLNSALDELKNASSTISQLQGDSDKLKQRLALMETESSKLKKDLDDALQDARKSASALSQMRDDESKLKQRIAQLETESSKLKQDLDNSLAGAKKNENQKLRGDLDNYIKDTKSSDEENRLKQRVAQLEAENSKLKKDLEGARDEHKNCGLNLSQLQDEIGKLKQRHAELEKENSNLRKDYEGAMGDAKKSAAELAMLRDNEAKLKQQITQLQGDNNKLSKDITALNAEINSNSSGLAQIKDNEAKLKQRIAQLEDENAKLKKALDSAIDESKKSGADVSKLKDDDAKMKQRLAQLEAENAKLKSNLDAALAEAKKGAAGADEEAKLRQKLAQLEADNNKLKTDLNRALDDAKKSVADMSKLRDDDGILKQKMAQLENENAKLKKDLEKALADAQSGSSGASQLRDENNKLKQRIGQLESEISKLKMDLNSALEDAKKGASGVSQMQDDDAKLKQRLSQLEKDNAKLKSDLESALKEAKGGSDTASDLDSSLSESKSGSAGLDKLKDENANLKQKLSQLEKENSKLKKDLEAALNDLKSGSAGSDRLKDENSNLKQRIAQLENLEDKIKQLEAELAKQKKECDNNMASLKEQFDRDMRNLLKKNEESIAKLQKAHEELLKELNEKHDNELKGIQDALRKSKEELDSTIAERNKMKNLLDDQRKKSIEIKNKVEQVESVISKERSKSRELRRSSLLLQKQLETEVVKSKILEEEIKELQKEEDKEDKLTMTTFENATVASKFPDVIAITAPLSEKELKFFGTQSCPCDTSLETVLDKLINNGIESLSIEELQMLHKKNSDATANVLEKLGQSTGVLKPNETNKLSLMKRIAALEGDLLKKQKHAQQKEL
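The protein sequence is expected to be the conceptual translation of the coding sequence. A minimal sketch for checking that, assuming the standard genetic code (NCBI translation table 1) and treating lowercase bases the same as uppercase; `translate` is a hypothetical helper, and only the first 27 bases of the coding sequence are used here:

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = cds.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as X
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the coding sequence in this record
prefix = "atgaaattaacaaaTCGAAAAAAAGAC"
peptide = translate(prefix)
```

Comparing `peptide` with the start of the protein sequence is a quick consistency check; the same call can be applied to the full coding sequence once it is loaded as a single string.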

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-