Basic Information

Gene Symbol
-
Assembly
GCA_905404295.1
Location
FR990141.1:1-46547[-]
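
The Location value above appears to follow a `sequence:start-end[strand]` layout. A minimal Python sketch for splitting it into its parts is shown here. It assumes that layout and 1-based inclusive coordinates, neither of which this record states. `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parts of a 'seqid:start-end[strand]' location string (assumed layout)."""
    seqid: str
    start: int
    end: int
    strand: str


# Assumed pattern: sequence accession, colon, start-end, strand in brackets.
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into seqid, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("FR990141.1:1-46547[-]")
# Span length under the 1-based inclusive assumption.
span = loc.end - loc.start + 1
```

Under these assumptions the record sits on the minus strand of FR990141.1; the span is a genomic interval and need not equal the coding sequence length.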

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domains group
Description
These proteins are highly similar in a region of about 50 residues that includes a conserved leucine-zipper domain, most probably involved in homo- or hetero-dimerisation. One member is the Drosophila protein bunched [1] (gene bun; also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  6   0.013     1.1e+02   3.6    0.0   15        37      140       162     137       164     0.90
2  6   0.0014    12        6.7    0.0   14        37      438       461     435       463     0.90
3  6   0.021     1.9e+02   2.9    0.0   14        32      734       752     731       758     0.80
4  6   0.01      88        3.9    0.0   14        37      940       963     937       965     0.89
5  6   0.78      6.8e+03   -2.1   0.1   19        36      1333      1350    1320      1358    0.54
6  6   0.01      88        3.9    0.0   14        37      1671      1694    1668      1696    0.89
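
The six per-domain rows above can be loaded into typed records for sorting or filtering. The following Python sketch assumes each row has exactly 13 whitespace-separated fields in the order given by the header. `DomainHit` and `parse_row` are hypothetical names, and ranking by independent E-value is just one possible choice.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One per-domain hmmscan row (field order assumed from the header)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# Rows copied from the Hmmscan Out table of this record.
RAW_ROWS = """\
1 6 0.013 1.1e+02 3.6 0.0 15 37 140 162 137 164 0.90
2 6 0.0014 12 6.7 0.0 14 37 438 461 435 463 0.90
3 6 0.021 1.9e+02 2.9 0.0 14 32 734 752 731 758 0.80
4 6 0.01 88 3.9 0.0 14 37 940 963 937 965 0.89
5 6 0.78 6.8e+03 -2.1 0.1 19 36 1333 1350 1320 1358 0.54
6 6 0.01 88 3.9 0.0 14 37 1671 1694 1668 1696 0.89
"""


def parse_row(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


hits = [parse_row(line) for line in RAW_ROWS.splitlines() if line.strip()]

# Domain with the lowest independent E-value (i-Evalue).
best = min(hits, key=lambda h: h.i_evalue)
```

Here `best` is domain 2, aligned to protein residues 438-461.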

Sequence Information

Coding Sequence
ATGTCCCTGTGGAAGGGACAAGGGGTGTGCAACCCATTAGGGTTGTTTAGGGTGGGGGCGACGGGCTGTCGGCAACCCGCCGGCCTTCGTCAACACGACCAACGCACACCCATACCGGATACCGATGTCCTGCGAACGGAGGAGGAGACACATGGAGCCGCGGAGGAGTCTGACTCGTCCGTGGCCACCATGGTGTCTGTGTCGTCTGCGGACGATAGCACGGGGGGATTTTGGCGCTCTGCGCGAAAACGCGGGCCCCCAGCACCCCAAAAGGGTGACGAGGAGGAGGCGGAGTCGGCCAAGAGAGACTTGAAAGAGGCCTCCGCCAACCTGGCGGCGATCGTGGCGGAGCTGGCGGAGCGCTCCGCTACCGAGGAGACAGCTCGGCTCCAAAGGGCCAATGCCCGCCTACAGGGCGAGGTCGGCGAGCTTCGTAAGGAGGTCGCAGACCTGCGGGCGAGCATGGCAAGCCTGCCCAAGGAGTCGAGCGGAGCCGTCGGAGATGACGTAGCCTCGCTGACGCGCTCGATAATGAAGCAAGTGGGCGACATGGTCGGCGCTAGGCTCGAGGCGCTCGAGAGGCGGCTCCCTCCCCCGCCGATCGTCCGGCCTTCTTTGGCTGCCGACAGGGCGAGGGATGCGGCTGCGCCCCCCGCCGCCGCTCCTGCGGCCGGTGAGGCCGAGCGAGGCGGCCGGAAGAAAGAAAAGGCGCCTCCCAAGGCTTACGCTGAGTTGGAGGAGACCCCCGAGACGGCTGGCGTGTTTGGTGTTCCGGCACGCCAGGTGAAAGGGCCGGCCCGCGCGGAAAGACAGTTGCGGGCTGGCGAGGTGGTCGCTCCCAGCAGGCTTGGCGCGACAAGCCTGCTTGGGATGCGCAAGGTGGGGACCGCAGAGGAAGGTTTTCCTTCCTCTGCGGCCGAGGCGGAGCCGCTCCGATTCGCGGCCGGAGCGGCATTGGTGGGGCCGCAGGGGAAGAGGAATGCCGGTATTGCGGTGCGCCGCTTTCCCTGTCCTCTGCGGCCGAGGTGTGGTTGGCGGTGGAGCCACAGATCTGATCCACCGCCGGGTGCTGGAGGTCGGCAACCCGCCGGCCTTCGTCAACACGACCAACGCACACCCATACCGGACACCGATAGCATGCGATTCGAGGAGGAGACACATGGAGCCGCGGAGGAGTCTGACTCGTCCGTGGCCACCATGGTGTCTGTGTCGTCTGCGGACGACGCCACGGGGGGATTTTGGCGCTCTGCGCGAAAACGCGGGCCCCCAGCACCCCAAAAGGGTGACGAGGAGGAGGCGGAGTCGGCCAAGGGTGAGGTAGCTGAGCTCCGCAAGGAGATCGCGGACCTGCGGGCGAGCATGGCAAGCCTGCCCAAGGAGTCGAGCGGAGCCGTCGGAGACGACGTAGCCTCGCTGACGCGCTCGATCATGAAGCAAGTGGGCGACATGGTCGGCGCTAGGCTCGAGGCGCTCGAGAGGCGGCTCCCTCCCCAGCCGATCCTCCGGCCGCTTTTGGCCGCGGACAGGGCGAGGGATGCGGCTGCGCCCCCCGCCGGCGCTACTGCGGCCGGTGAGGCCGGGCCAGGAAGCCGGAAAAGGAAGGAGAAGGCGCCTCCTAAGGCTTACGCCGAGGTTCCCCCCCGGGACATGATCGGCGCTAGGCTCGAGGCACTCGAGAGGCGGCTCCCTCCCCAGCCGATCCTCCGGCCGCCTTTGGCCGCCGACACGGCGAGGGATGCGGCTGCGCCCCCCGCCGCTCCTGCGGCCGGTGAGGCCGGGCCAGGCAAAAAGAAGAAGGAAAAGGCGCCTCCTAAGGCTTACGCCGAGGTGGCAGCGCCGAAAGCCGGCGCCCCGAAGCCCCCGACGGCCCCCTCCGAGGAGTGGAGCACGGTCAAGCGGGGCAAGCGGGGAAAGCCGGACAAGAAGAAGGGCAGCGCACCGCCGGACAGGCCGGCAGCGCCCAAGAAGAGGGACCCGGCGGCGAAGCTTCGCCCCCCGCG
CTCGGCTGCGGTTACTATAACCATCCAGCCGGGCGCCACCGAGACACATGGAGCCGCGGAGGAGTCTGACTCGTCCGTGGCCACCATGGTGTCTGTTTCGTCTGCGGACGACGCCACGGGGGGATTTTGGCGCTCTGCGCGAAAACGCGGGCCCCCAGCACCCCAAAAGGGTGACGAGGAGGAGGCGGAGTCGGCCAAGGGTGAGGTCGCTGAGCTCCGCAAGGAGATCGCGGACCTGCGGGCGAGCCTTGTGGGCTTGCCCAAAGAGCCGAGTGGAGCCGTCGGAGCCGTCAACGATGTCGCCTCGCTGACGCGCTCGATCATGAAGGAAGTGGGGAACATGATCGGCGCTAGGCTCGAGGCACTCGAGAGGCGGCTCCCTCCCCAGCCGATCGTCCGGCCGCCTTTGGCTGCCGACAGGGCGAGGGATGCGGCTGCGCCCCCCGCCGCCGCTCCTGCGGCCGGTGAGGCCGGGCCAGGCAGCCGAAAGAAGAAGGAGAAGGCGCCTCCTAAGGCTTACGCCGAGGTGGCAGCGCCGAAAGCCGGCGCCCCGAAGCCCCCGACGGCCCCCTCCGAGGAGTGGAGCACGGTGCTCAAGCGGGGCAAGCGGGACAAAGTAAAGAAGGGCAGCGCACCGCCGGACAGGCCGGCAAAGGAGGAGACACATGGAGCCGCGGAGGAGTCTGACTCGTCCGTGGCCACCATGGTGTCTGTGTCGTCTGCGGACGACGCCACGGGGGGATTTTGGCGCTCTGCGCGAAAACGCGGGCCCCCAGCACCCCAAAAGGGTGACGAGGAGGAGGCGGAGTCGGCCAAGGGCGAGGTCGGCGAGCTTCGTAAGGAGGTCGCAGACCTGCGGGCGAGCATGGCAAGCCTGCCCAAGGAGTCGAGCGGAGCCGTCGGAGATGACGTAGCCTCGCTGACGCGCTCGATAATGAAGCAAGTGGGGGACATGGTCGGCGCCAGGCTCGAGGCGCTCGAGAGGCGGCTCCCTCCCCAGCCGATCGTCCGGCCTTCTTTGGCTGCCGACAGGGCGAGGGATGCGGCTGCGCCCCCCGCCGCCGCTCCTGCGGCCGGTGAGGCCGGGCCAGGCAGCCGAAAGAAGAAGGAGAAGGCGCCTCCTAAGGCTTACGCCGAGGTGGCAGCGCCGAAAGCCGGCGCCCCGAAGCCCCCGACGGCCCCCTCCGAGGAGTGGAGCGTGGTGGTCAAGCGGGGCAAGCGGGACAAAGTGAAGAAGGGCAGCGCACCGCCGGACAAGCCGGCAAAGGTGCCGAAAAAGAAGGACCCGGCGGCGAAGCTTCGCCCCCCGCGCTCGGCTGCGGTTACTATAACCATCCAGCCGGGCGCCACCTTGGAGGAGACCCCCGAGACGGCTGGCGTGTTTGGTGTTCCGGCACGCCAGGTGAAAGGTCGGCAAACCGCCGGCCTTCGTCAACACAACCATAGCGAACGTACAACGGATTCTGATAACTTGCTTACGATGATGGAACAACGCACCGAACAGGGAGGAGAAAAGGGCGAGTGGGTCCAAGCAACCACTAGAAGACAGCGGCAAACCAGGGCCCGGGCAAAGGCATCACACGCCACCGCATCTGACGCAGACGAGCTTTCCGACTGCTCCCAGATGTCGGGATTTTCGGAGTGGTCCACCGCAACGTCGGTTGGTGGCCGTAAAAGGCCGCTACCGGCGAGCGATGGCCCATCCAAGGATGACGATCTGCTGGCCTTTTTGGAACCAGCGGCAAAAACGAGGGCTAAGGCAAAGCTCCCAAGTGCCGAAGAGCTAGGCCGGGAGATGAGTGAGCAGCCCACCTGCGATTTAGGTGCGCAGATTGTGGACAATTTGGAGGTCCTTGAGATGGTTGCGGATAAATCCAAAAACCTAAAAGGAAATTTCGTCCGCGCGATCCGCTTGGCGGTCCGCAAGATCCAGGCAGCTACAACGGAGGTAGTGCAGCGAGCGGCTACTGCACACCTCGAGAGGGACAACACGGCCTTGC
GCATAGAGCTGGCCAGTATGGAAACAAAGATGGAGGCTCTTACGGCGGAACTCAGTGAGCTACGCCGTAGGACTATTACACGGGAAGACTCACGACCGGCACATGAGAATGCGCCGAGACCCGTGAATGAAACGAGACCCGGGAATGAGGAAGACGCCCTCTTGAAAAAGATCGGGGCAATGATGGACAGCAAGTTGGCGTCTTTCAGGGACGAACTCTTCCCGGGTAGAGCAGCTCGGCCTCCCCTCGGGAAAAAGGCCACGCCCGCCACCGTAACGCGGCCCTCAACGGCCGCAGAGCCGGAGCAGCCTCCGACGACGTGGGCCAAAGCAGTGGGGCAAAAGTCGAAGAAGCGGCCCCCCACCCAGATGCCAGCGCCTCGAAGGCAATCCCCGAGTCGCGGGAGAGGGGAAGGAAGTTCCCACCGGCCCTCTGCGGCCGAGGCCCCCCAGGCACCTAAGCCACTTCCGGTTCAGGACAGTCGCCGGGCCATGGGTGTTCAGCGGCGATGGGCACTGGGACGTTTGGACCGGGATATGCTCGTCGCCGCTGCACACGTCGCGGACTGGTCCGGGGCTGCGGTCGGGGAAACGGCGACAGGCCTGGAGGATTCCGCTGCCCGACTCCAAGACGACATGACGGCGATCTGCGACGCGTCGATGCCGCGGGTCAGATCGGGCAGGAGGGCACCGGCATACTGGTGGTCGGACCAAATTGCCCGGCTGCGCGAGGATTGTCTGCGATCCCGGCGCAATTACACCCGCGCCCGCAGACGTCGGCAACCCGCCGGCCTTCGTCAACACGACCAACGCACACCCATACCGGACACCGATAGCATGCGATTTGAGGAGGAGACACATGGAGCCGCGGAGGAGTCTGACTCGTCCGTGGCCACCATGGTGTCTGTGTCGTCTGCGGACGACGCCACGGGGGGATTTTGGCGCTCTGCGCGAAAACGCGGGCCCCCAGCACCCCAAAAGGGTGACGAGGAGGAGGCGGAGTCGGCCAAGGGCGAGGTCGGCGAGCTTCGTAAGGAGGTCGCAGACCTGCGGGCGAGCATGGCAAGCCTGCCCAAGGAGTCGAGCGGAGCCGTCGGAGACGACGTAGCCTCGCTGACGCGCTCGATAATGAAGCAAGTGGGCGACATGGTCGGCGCTAGGCTCGAGGCGCTCGAGAGGCGGCTCCCTCCCCAGCCGATCGTCCGGCCTTCTTTGGCTGCCGACAAGGCGAGGGATGCGGCTGCGCCCCCCGCCGCCGCTCCTGCGGCCGGTGAGGCCGGGCCAGGCAGCCAAAAGAAGAAGGAGAAGGCGCCTCCTAAGGCTTACGCCGAGGTGGCAGCGCCGAAAGCCGGCGCCCCGAAGCCCCCGACGGCCCCCTCCGAGGAGTGGAGCACGGTGCTCAAGCGGGGCAAGCGGGACAAAGTGAAGAAGGGCAGCGCACCGCCGGACAGGCCGGCAAAGGTGCCGAAAAAGAAGGACCCGGCGGCGAAGCTTCGCCCCCCGCGCTCGGCTGCGGTTACTATAACCATCCAGCCGGGCGCCACCGTTAGGTTAGGTTCCCCCCCGGGGGTGCCCGCCTTGTCGTGGCGGGGGGGCTCTAGTAACCGTGGTTTCGGCGCCCACGGTGAAGCTAAGAGGCACCCAGGGCCGGCCCGCGCGGAAAGGTGGTTGCGGGTTGGCGAGGTGGTCGCTCCCAGCAGGCTTGGCGCGACAAGCCTGCTTGGGATGCGCAAGGTGGGGACCGCAGAGGAAGGTTTTCCTTCCTCTGCGGCCGAGGCGGAGCCGCTCCGATTCGCGGCCGGAGCGGCATTGGTGGGGCCGCAGGGGAAGAGGAATGCCGGTATTGCGGTGCGCCGCTTTCCCTGTCCTCTGCGGCCGAGGTGTGGTTGGCGGTGGAGCCACAGATCTGATCCACCGCCGGGTGCTGGAGGTTTTCCGGCACCCATGACCTCCTTGTGGCGGACTCGGGGGGGTCCGCCACTGTACTGGAGGAG
Protein Sequence
MSLWKGQGVCNPLGLFRVGATGCRQPAGLRQHDQRTPIPDTDVLRTEEETHGAAEESDSSVATMVSVSSADDSTGGFWRSARKRGPPAPQKGDEEEAESAKRDLKEASANLAAIVAELAERSATEETARLQRANARLQGEVGELRKEVADLRASMASLPKESSGAVGDDVASLTRSIMKQVGDMVGARLEALERRLPPPPIVRPSLAADRARDAAAPPAAAPAAGEAERGGRKKEKAPPKAYAELEETPETAGVFGVPARQVKGPARAERQLRAGEVVAPSRLGATSLLGMRKVGTAEEGFPSSAAEAEPLRFAAGAALVGPQGKRNAGIAVRRFPCPLRPRCGWRWSHRSDPPPGAGGRQPAGLRQHDQRTPIPDTDSMRFEEETHGAAEESDSSVATMVSVSSADDATGGFWRSARKRGPPAPQKGDEEEAESAKGEVAELRKEIADLRASMASLPKESSGAVGDDVASLTRSIMKQVGDMVGARLEALERRLPPQPILRPLLAADRARDAAAPPAGATAAGEAGPGSRKRKEKAPPKAYAEVPPRDMIGARLEALERRLPPQPILRPPLAADTARDAAAPPAAPAAGEAGPGKKKKEKAPPKAYAEVAAPKAGAPKPPTAPSEEWSTVKRGKRGKPDKKKGSAPPDRPAAPKKRDPAAKLRPPRSAAVTITIQPGATETHGAAEESDSSVATMVSVSSADDATGGFWRSARKRGPPAPQKGDEEEAESAKGEVAELRKEIADLRASLVGLPKEPSGAVGAVNDVASLTRSIMKEVGNMIGARLEALERRLPPQPIVRPPLAADRARDAAAPPAAAPAAGEAGPGSRKKKEKAPPKAYAEVAAPKAGAPKPPTAPSEEWSTVLKRGKRDKVKKGSAPPDRPAKEETHGAAEESDSSVATMVSVSSADDATGGFWRSARKRGPPAPQKGDEEEAESAKGEVGELRKEVADLRASMASLPKESSGAVGDDVASLTRSIMKQVGDMVGARLEALERRLPPQPIVRPSLAADRARDAAAPPAAAPAAGEAGPGSRKKKEKAPPKAYAEVAAPKAGAPKPPTAPSEEWSVVVKRGKRDKVKKGSAPPDKPAKVPKKKDPAAKLRPPRSAAVTITIQPGATLEETPETAGVFGVPARQVKGRQTAGLRQHNHSERTTDSDNLLTMMEQRTEQGGEKGEWVQATTRRQRQTRARAKASHATASDADELSDCSQMSGFSEWSTATSVGGRKRPLPASDGPSKDDDLLAFLEPAAKTRAKAKLPSAEELGREMSEQPTCDLGAQIVDNLEVLEMVADKSKNLKGNFVRAIRLAVRKIQAATTEVVQRAATAHLERDNTALRIELASMETKMEALTAELSELRRRTITREDSRPAHENAPRPVNETRPGNEEDALLKKIGAMMDSKLASFRDELFPGRAARPPLGKKATPATVTRPSTAAEPEQPPTTWAKAVGQKSKKRPPTQMPAPRRQSPSRGRGEGSSHRPSAAEAPQAPKPLPVQDSRRAMGVQRRWALGRLDRDMLVAAAHVADWSGAAVGETATGLEDSAARLQDDMTAICDASMPRVRSGRRAPAYWWSDQIARLREDCLRSRRNYTRARRRRQPAGLRQHDQRTPIPDTDSMRFEEETHGAAEESDSSVATMVSVSSADDATGGFWRSARKRGPPAPQKGDEEEAESAKGEVGELRKEVADLRASMASLPKESSGAVGDDVASLTRSIMKQVGDMVGARLEALERRLPPQPIVRPSLAADKARDAAAPPAAAPAAGEAGPGSQKKKEKAPPKAYAEVAAPKAGAPKPPTAPSEEWSTVLKRGKRDKVKKGSAPPDRPAKVPKKKDPAAKLRPPRSAAVTITIQPGATVRLGSPPGVPALSWRGGSSNRGFGAHGEAKRHPGPARAERWLRVGEVVAPSRLGATSLLGMRKVGTAEEGFPSSAAEAEPLRFAAGAALVGPQGKRNAGIAVRRFPCPLRPRCGWRWSHRSDPPPGAGGFPAPMTSLWRTRGGPPLYWR
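
As a quick consistency check between the two sequences above, the start of the coding sequence can be translated and compared with the start of the protein. This Python sketch assumes the standard genetic code, which this record does not state. `translate` is a hypothetical helper, and only the first 33 nucleotides are used to keep the example short.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: no alternative code applies).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 33 nucleotides of the Coding Sequence field.
cds_prefix = "ATGTCCCTGTGGAAGGGACAAGGGGTGTGCAAC"
protein_prefix = translate(cds_prefix)
```

The result, `MSLWKGQGVCN`, matches the first 11 residues of the Protein Sequence field.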

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01082186;
90% Identity
iTF_01082186;
80% Identity
iTF_01082186;