Basic Information

Gene Symbol
-
Assembly
GCA_964006375.1
Location
OZ023262.1:30884435-30902368[+]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 22 2 9.8e+04 -4.4 0.3 15 25 370 380 367 383 0.59
2 22 0.00021 10 8.7 3.2 14 40 460 486 458 511 0.87
3 22 0.75 3.7e+04 -2.7 1.1 12 33 535 556 515 560 0.65
4 22 0.013 6.2e+02 3.0 3.6 21 52 642 672 630 676 0.77
5 22 0.022 1.1e+03 2.3 0.5 14 29 677 692 672 699 0.86
6 22 0.037 1.8e+03 1.5 4.7 14 44 733 763 730 768 0.87
7 22 0.071 3.5e+03 0.6 0.9 17 42 761 786 760 796 0.80
8 22 0.066 3.2e+03 0.7 4.1 13 42 817 845 814 860 0.80
9 22 0.8 3.9e+04 -2.8 1.9 31 52 965 986 955 991 0.71
10 22 0.13 6.5e+03 -0.2 0.3 14 32 1020 1038 1018 1047 0.72
11 22 0.44 2.2e+04 -1.9 0.8 15 39 1071 1095 1070 1103 0.84
12 22 0.36 1.8e+04 -1.6 0.1 21 34 1137 1150 1114 1152 0.72
13 22 0.53 2.6e+04 -2.2 0.1 13 44 1195 1226 1191 1232 0.79
14 22 0.016 7.9e+02 2.7 2.1 14 55 1235 1276 1229 1278 0.89
15 22 0.58 2.8e+04 -2.3 0.2 31 31 1338 1338 1321 1365 0.53
16 22 3.8e-09 0.00019 23.9 3.3 14 48 1370 1402 1368 1411 0.84
17 22 1.4 6.8e+04 -3.5 0.8 20 37 1421 1438 1414 1449 0.49
18 22 2e-07 0.0096 18.4 5.2 14 53 1464 1501 1462 1505 0.83
19 22 0.33 1.6e+04 -1.5 0.9 11 30 1528 1549 1509 1553 0.58
20 22 8.1e-08 0.004 19.6 4.5 14 48 1558 1592 1556 1601 0.89
21 22 0.0056 2.7e+02 4.2 0.4 15 43 1623 1651 1620 1666 0.87
22 22 1.2 5.8e+04 -3.3 0.2 24 41 1841 1858 1838 1861 0.73

Sequence Information

Coding Sequence
ATGAATCAACGAAAAGCTTGTAACAAATTAGCCTGTGCGAGTATTTATCAAGATGAAAAAGAAGAAATGCAAAGACGTCTGCGGGAATCTCAGCAGAATCTGTCAAACAGAGACATTCTCATCGGAATCTTACAAGGACAGCTGAGACAAATGGAGGATAACTATAAGAGAAGCATAGACCAGGTGCAAGAAATGAGAATGAAATACTTTTCAGCTGTTCAACAATTAGGCAgtatgaaaaaaatgtgtgaaAGGataGAAGATGTCAGTCACGACAATAAACCTTCCTATAAGCCTACCTGTCGAGAActgagtattatgaaaaatgaaCTGGATAACTGGAAGTTACAGTTAAGTAGTGCTTTAAAATATACgtTTGAAGAAATTCTTAAAAAGTTTGGTTTATTAGAAAGCAAAGCTTCTATAACGTACCAAGTCTTCTCAGGTGGCACAACAAACACTTGTCCTGGAGCAACTAAAAAAGAAGTTATAATTTCCCAACCACTGGAAACCCCGCCTTGCAAAAGTCTAGAGAACGGTGACTCACAATGTCTTCTGAATCAAATAAAAAAGATGGAATCTTTAATACAAAATCAGACGTCGGCACTTAACACACTTGTAACTAAGATCACCCAGTATTTAGAATCTCAAACGACGCTCCAGAACCCAACCAAACATACAACAACAAGTTGTGACTGCGCCACTGAGCCATCGTTCGCACCAAAGTATAATCAAACTACAGATAAGACCTTATCTGATACAGAAaatgaaaaattagaaaatacGATCGAGTATTGCGAAAATTTTTATTGCGACATAAGAAAACTCAGCCTAGTTCCTTCGTTTATTGATACGATCACCCAAACTGTGGATGATACATGCGATAATCCTGTTTGCAGTGTTACGAAGCAATCCAAAGGAACTAACAAACCTTCTGACACAGCTTTTGACATTTGTTGCAAGTCTCCGTTACCTACCAAAAATGTTGGCGCACCCGAAAAGAGAAAGGAGAGACTTGAAAGTGTTGTCTCTGACTACCTAAGCGAATCTGAAGACGAAAACCTTTTAGTAGTTGCAAAAGACAGCGTAATCCGATGCATCACTAAGGAAAACGAAGAGCTGAAAGAGAAACTGATGAAATTAGATGATTGCTTAGCAACGTTTGTCAGATCACCTGAAGGAACTGAGTACGATAAAGAAAACGAGGATGACGAAATAGAATCAATTCCAAGAATAGGGATATCATTTGGGGAGCTCCCTAAATATCACGTCGAAAGGAGAGGACAGAGTGATAGCAAAATGAGTCTTATAAAATGTGAAGTAGAAAAGATTGAAGAAGTGGCAAACAAACTTACCGAGTACCTCTCACAACAAACAGAAACAGTTGACACTCTACGGAATGAAATAGTGGAGCTAAAAGTAACTATATCGGAGAATGAACGTGATATTAGAGAGTTAACAGAAAGACTCAAAGAAGCAACAGAAGCTAAGAGTAGACTCGAAAAAGATAACGAAAATGCATTAACACAAAATTGTCTATGCAAGGACGAACTGGTCGCAACAGAAAATGAGAAGAATGCTCTCGAAAAAAGGCTACAAGATGTTACAGACGAAATGGAGGATTACAAACGAGAAATATCAGGACTTTTAAGCTTAAAAGAAAAGATGGAAAATGATTTAGATGATCTGGGCTTGGCTAATGATGATTTGCAAAGAGAATTGGCTGATATTCACAACATAAATTTACAGCAACAAAATGATAATGATCAGCAAGAAGCGACAATTCAAAAATTGAGGTCAGACCTTCAATATTCAAAGGATTCTCAAAATAATCAATTAAGCAAGCTCGAAAATGCAGAAAGAGAATTGGGATCATTAAGAAACACTATGATATCACTACAAGAAGAAAGAACAGGCAACCTGAACATGATTTCTCAACTACAAGAGAAAATTCAGAAACTCAATTCAGAGAATCAGCTTGCAAGTGAATCTCAGATGAATCAACTAAACAAGCTTGAATATGCAGAAAACGAGTTGGAGTCCTTAAAAAATACCATACGAAGTCTTGAAGAAGAAAGGGCAGGCAACCTTCATACAATTTATGAGttacaaaatttaattaatgaGGATAAGTCAAGGTTGGAACAGTGCTTATCCCGTTTGAAGGTGAATTATGATGAAATTGATTCATTGACTCACGAAGTTGAGGCTTTGAAAGAAAAGTGCAAGTGTTTGATAGAAGAGCAAGAAGAAAATACTACAGAAGCCAATAATTTAACTGAGACACTACATAATCAAGAGAAATATTATGCAGAAAAGTTAAAGAATTTAGAAATAGAaaatgtgatgttaaaaaatgaacgaGATAAAATGAAAAATCTTTGCAAATGTGGCGAAGAACAGAATGAAAATTTAATAGCGAAATGTGAGGAAAAAGTAACGATGCTAAATAATGAAATCGAGCAGTACCGTTTGAAAATCCGCAACCTCGAAGGTAGGCTTGAGCAGAATGAAAACTGTGAAAGATTAAGAAACCAAGAGTGTGAAAAACAAAATTGTGAACTTTCCAGTAATTTAAGGCAATACGAAAACCAATGTCAATGTTTACAACTAGAATTGGAAGGTATTATTAGAGAAAACAGTCAATTAATTGAAAAGTATGAAGCGCAAATTAAAGAATGTGAAAAACATAACTGTGAACTTTCTAAGAATCTAAAGCAACACGAAAACCAATGCCAATGTTTACAACTAGAATTGGAAGGTATTATTAGAGAAAAAAGTCAATTAATCGAAATGTATGAAGCGCAAATTAAAGATCTCTCCCTTCAGATAATCACGTGTAAACAAAACTGTGAATTATCTTTCGAAGAATACAAGGAGAGAAACCGTAGGCTAACCTTAGAACAAGAGCAACATCAAAATAAGTGTAATTGTCTAGAAAATCAACTGAAAGAAATGATTAGAGAAAAAGACCAAATAATACAGGAATATGAAGATAAACTCAGCGAGCAAAGTTGTCAAATAGTaacatgtcaaaacatgtcgacGAGAAATCAAGATGAAACAAATAAAATGAGAGAGGAGATCCAAAACCTCAAAACGTCTTTAAATAGAACAGAAATTTATTATCAAAAGAAACTAGAAGATGTAAATCAGGAGATGGAAAGATGTAGAGAGTGTTGCGAAGCCGCAAAACAACAAGACTGCGCTGAAATCGAAAAACTAAAACAAAAGAAAAGAGATACAGAAAAACGCTGCGAAAATCTTTGCCATGAAGTAGAAATTTGCCAGACGGGTATCGAAAAGTTAGAAATGTGTTTATGCCAAACAAAAGACACGTACGAGTCAAAAATTAAGAATATGAATAACGAGATTACCAACTACGAAAAGATGTGCAAATGCTGGCAGGAGGATATTAAGAATCTTTCAGAAAGATTACACAAATCTGAAGAATATTATGAAGATAAGGTTTCCGCACTGACTTGTGAAATTGAGCGATGCGAAAAAAACTGCATTGCGATAGAGTTGGAGAGGCAATTATCAAGTACCAAGACTCAATACGAAAACAGGATAGCCAGCATGCAGGAAGAGATCAAAAGGCTTGAAGGTATGTATAAAAATTTACAGGATAAAGATTTTGAAAGAACCTTAGAATTGGAAGATCTCAGGCAAAAACTTTACGAAACTGAACTTTCTTATGACGAAACCGTTTCCGATCTTCGCAGTGAACTAAGAAATCTTCAAGAGAAAGAGATAGCTTTGTGCCAACAGGATACACAAATTGAATGCGAGTGTGCTATGGAAAACATGAACAAATTCAATTCTGAACTaattgaaacaaacaaaaaactttcTGAGGATCTGAGGAACTGCACCAACATGCTTGCACAGTTGCAGATGTCTCTCACTGAAAAAGAAGATAAAATCAACTTTTACgaaaacagacataataataatgaattgagAATCGCGAATCTTGAAGCAGATCTTAGACTTCTGCAGGAAAAGCTTGATGCGATAACCAGGGAAAATGAGGAAAACCTAAGAAAAATCAAGGGACTTGCAGACTTGGAGAAACAGGTAGCTGCTTGTGAATGCAATTATCAAGAAATAGTGGATAATTTAAAAAAGACAATATTTGAACTTCAAACAATTAACAACCAATTACAAAACGAAAACACAATCCTAAAACAAAAAGAAGACACCATTAATTTATACGAAACCGAGCTTCAAAATTACAATGTGAAGATATCAAATCTTGAGGCAGCTCTAAAAAGTTTAAAAGAGAAGCTAACGCAAAAAACAAGAGAAAACCAAGAAAATATTAATAAACTCATGGAATTCAAGGAATTAGAAAAGACGCTTAGAGCTTGCGAATGTAACTATGAAGAAATTATTGAGAATTTAAAACAGAAAATCTATGAACTGGAAACACTGAATAATAAATTGGAAAATGAAAACAACAGTCTGAAACAAAAAGAAGATACAATTAAAATGTATGAAGCCAGTCTAAAAAATAACAACACTAAAATAAGAAACCTCGAAGCAGATCTTCAACATCTAAAAGAAAAGCTCGAACAGAAAACCAGAGAAaatgaagaaaatttaaaaaaagttgagcACCTCGCAGATTTAGAAGACACAATACGTTCTTGTGATTGCAACTACCAGGAAATTATTGAGGATTTAAAAAAGACAATTTATCAACTAGAAACAATaaatgaaaagttgcaaaatgaAAACAGAACACTAAGGCAAACAATGAGCAATGAAATAAACAACGCCAACAGAACCAATCAAGAAAATCTACAAGAATTATTAGGTTTAGtcgaaaaatataaaaatgctcTCGAACAATGCGAAAAAGAACTAAACGAAATGAGGAACCAAATGGATCACTTACATAAAAACAATGACAAACTTAAGTGTGAATGTGCATCTCTGTCGTCGAATTTAGGATTTATGTCCAAGACGGTTGAAGAATACAAACACGAATGCGAGTGCAAATGCCAAGAAATTGAGACATTAAGAAAGAACGAGAAATTGCTGAAGCAATTGTTAGATACTTGCAACCAATGTTCACCGCAAACTGTTGGATCAAAAACAGAAGCAATGAGGAACATGGAATGCGAATATTATAACATACGATCACAACTCTGCCAAATGGtagaagaaaatgaagaaaagtCTATTATGATAAAGAATTTACAATGTAAACTTGAAGGGCTTTCGAGGGGAAATCCTTGCTCTGGCATGATGGATGACCTAGCCACTGCACAGGATTCTTACGCTAGACTGAAACAGGACCACGATGAACTAAAAGGTCTTTATGAGAAACACTTGGAGATGTATAGGGCAGAGGTTCTTAAAATGAAGAAAGACAGTAAATCTTCTTTATGTTGTAAGCCAAAGCGACCAGAACAAGAAAAGAGGATCAAAGAGCTGGAGCTTGAGCTCATGGAGAAGCACATAGCATTAAAAACGCTCGAACAAAAATGCGATTCAAAGGTATACAGGTTACATGAATTAAAAAAGCATTGTGAGATACTGCtgcaagaaaataaggaaattAAAGATGCCATACAGTGTTTTATTAATAGAATAGATAACAACCTTAAAGAGCTTACAAAGGCgtataatgaaaataataattgtCTTATCAGTGTTATCATCAGAGAAATGCAACAGAATCTGAATCAGGTGAAATCTGCTGATCTTACAAGAAGCAAAGGTTGCGTTAGAAAAATTACGGAATCTTTAAAGTGTCCAGATGACTATGAGTGTGGACCTGGTAGGGATAAAGTTGTTGGGTACCCACTAAAAGCAGGTAACAAACCGTAA
Protein Sequence
MNQRKACNKLACASIYQDEKEEMQRRLRESQQNLSNRDILIGILQGQLRQMEDNYKRSIDQVQEMRMKYFSAVQQLGSMKKMCERIEDVSHDNKPSYKPTCRELSIMKNELDNWKLQLSSALKYTFEEILKKFGLLESKASITYQVFSGGTTNTCPGATKKEVIISQPLETPPCKSLENGDSQCLLNQIKKMESLIQNQTSALNTLVTKITQYLESQTTLQNPTKHTTTSCDCATEPSFAPKYNQTTDKTLSDTENEKLENTIEYCENFYCDIRKLSLVPSFIDTITQTVDDTCDNPVCSVTKQSKGTNKPSDTAFDICCKSPLPTKNVGAPEKRKERLESVVSDYLSESEDENLLVVAKDSVIRCITKENEELKEKLMKLDDCLATFVRSPEGTEYDKENEDDEIESIPRIGISFGELPKYHVERRGQSDSKMSLIKCEVEKIEEVANKLTEYLSQQTETVDTLRNEIVELKVTISENERDIRELTERLKEATEAKSRLEKDNENALTQNCLCKDELVATENEKNALEKRLQDVTDEMEDYKREISGLLSLKEKMENDLDDLGLANDDLQRELADIHNINLQQQNDNDQQEATIQKLRSDLQYSKDSQNNQLSKLENAERELGSLRNTMISLQEERTGNLNMISQLQEKIQKLNSENQLASESQMNQLNKLEYAENELESLKNTIRSLEEERAGNLHTIYELQNLINEDKSRLEQCLSRLKVNYDEIDSLTHEVEALKEKCKCLIEEQEENTTEANNLTETLHNQEKYYAEKLKNLEIENVMLKNERDKMKNLCKCGEEQNENLIAKCEEKVTMLNNEIEQYRLKIRNLEGRLEQNENCERLRNQECEKQNCELSSNLRQYENQCQCLQLELEGIIRENSQLIEKYEAQIKECEKHNCELSKNLKQHENQCQCLQLELEGIIREKSQLIEMYEAQIKDLSLQIITCKQNCELSFEEYKERNRRLTLEQEQHQNKCNCLENQLKEMIREKDQIIQEYEDKLSEQSCQIVTCQNMSTRNQDETNKMREEIQNLKTSLNRTEIYYQKKLEDVNQEMERCRECCEAAKQQDCAEIEKLKQKKRDTEKRCENLCHEVEICQTGIEKLEMCLCQTKDTYESKIKNMNNEITNYEKMCKCWQEDIKNLSERLHKSEEYYEDKVSALTCEIERCEKNCIAIELERQLSSTKTQYENRIASMQEEIKRLEGMYKNLQDKDFERTLELEDLRQKLYETELSYDETVSDLRSELRNLQEKEIALCQQDTQIECECAMENMNKFNSELIETNKKLSEDLRNCTNMLAQLQMSLTEKEDKINFYENRHNNNELRIANLEADLRLLQEKLDAITRENEENLRKIKGLADLEKQVAACECNYQEIVDNLKKTIFELQTINNQLQNENTILKQKEDTINLYETELQNYNVKISNLEAALKSLKEKLTQKTRENQENINKLMEFKELEKTLRACECNYEEIIENLKQKIYELETLNNKLENENNSLKQKEDTIKMYEASLKNNNTKIRNLEADLQHLKEKLEQKTRENEENLKKVEHLADLEDTIRSCDCNYQEIIEDLKKTIYQLETINEKLQNENRTLRQTMSNEINNANRTNQENLQELLGLVEKYKNALEQCEKELNEMRNQMDHLHKNNDKLKCECASLSSNLGFMSKTVEEYKHECECKCQEIETLRKNEKLLKQLLDTCNQCSPQTVGSKTEAMRNMECEYYNIRSQLCQMVEENEEKSIMIKNLQCKLEGLSRGNPCSGMMDDLATAQDSYARLKQDHDELKGLYEKHLEMYRAEVLKMKKDSKSSLCCKPKRPEQEKRIKELELELMEKHIALKTLEQKCDSKVYRLHELKKHCEILLQENKEIKDAIQCFINRIDNNLKELTKAYNENNNCLISVIIREMQQNLNQVKSADLTRSKGCVRKITESLKCPDDYECGPGRDKVVGYPLKAGNKP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-