Aros000113.1
Basic Information
- Insect
- Athalia rosae
- Gene Symbol
- -
- Assembly
- GCA_000344095.2
- Location
- NW:57081-63456[-]
Transcription Factor Domain
- TF Family
- TSC22
- Domain
- TSC22 domain
- PFAM
- PF01166
- TF Group
- Basic Domians group
- Description
- These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 31 2.4 8.2e+03 -3.7 0.1 32 44 70 82 65 87 0.75 2 31 0.011 39 3.8 0.1 26 44 101 119 90 134 0.67 3 31 0.71 2.5e+03 -2.0 0.2 18 34 159 175 156 179 0.81 4 31 0.005 18 4.9 0.9 14 34 220 240 213 261 0.75 5 31 1 3.6e+03 -2.5 0.1 16 33 277 294 276 298 0.55 6 31 0.069 2.4e+02 1.2 0.1 23 41 305 323 301 334 0.86 7 31 0.46 1.6e+03 -1.4 0.2 14 42 376 404 371 415 0.76 8 31 0.00031 1.1 8.7 1.5 12 36 448 472 445 483 0.65 9 31 0.042 1.5e+02 1.9 0.1 23 42 483 502 480 508 0.61 10 31 0.02 69 3.0 3.6 14 48 495 527 493 551 0.84 11 31 0.013 45 3.5 0.1 11 43 576 608 574 612 0.89 12 31 0.00013 0.44 10.0 1.5 12 44 598 630 595 638 0.88 13 31 0.0059 20 4.7 2.5 15 43 615 643 609 650 0.63 14 31 0.00038 1.3 8.5 6.4 11 44 646 679 636 686 0.87 15 31 0.0041 14 5.1 2.4 15 42 671 698 668 718 0.83 16 31 0.013 44 3.6 0.5 18 38 730 750 723 765 0.84 17 31 0.015 53 3.3 0.4 18 52 765 798 758 803 0.71 18 31 0.0091 32 4.0 0.1 13 42 802 831 800 838 0.86 19 31 0.00037 1.3 8.5 1.9 15 45 839 869 835 872 0.89 20 31 0.0056 19 4.7 0.8 22 45 874 897 872 911 0.67 21 31 0.0032 11 5.5 1.0 15 43 923 951 919 964 0.72 22 31 0.0011 3.8 7.0 0.0 15 42 972 999 968 1003 0.91 23 31 0.00064 2.2 7.7 2.1 13 50 1005 1042 1000 1047 0.75 24 31 0.0087 30 4.1 0.3 23 45 1043 1065 1040 1068 0.86 25 31 0.00013 0.44 10.0 1.3 15 43 1070 1098 1067 1107 0.88 26 31 0.0041 15 5.1 2.4 19 44 1095 1120 1094 1138 0.78 27 31 0.00029 1 8.8 0.9 14 42 1139 1167 1130 1169 0.86 28 31 0.0001 0.36 10.3 2.4 13 44 1166 1197 1164 1208 0.88 29 31 0.16 5.5e+02 0.1 0.1 16 43 1197 1224 1196 1232 0.57 30 31 0.0051 18 4.9 0.1 25 44 1376 1395 1372 1400 0.87 31 31 0.065 2.3e+02 1.3 0.6 14 37 1469 1492 1468 1498 0.91
Sequence Information
- Coding Sequence
- atgtccTACTATCGCACGTGCCAATGCGGCTGCACGGACCCACCGGAAATGACAGGGGGTGACCCACCGCACGAGGGATCCTGTGGGTGCAGTTATAACCCCTTTGCCGACGTGGGGCGTGACACCGAGATAACAGATTTATCTTACGCTCTGAGAAAACTGACCTCGATGAAGTGTCAAATGAAAAAGTGGCGCATAGAACGTCTCCAGTTGGAAAGCGAAGTGAGAGCTTTGAAGCAGGTTTTGCAGGAGCACgGGCTTAACTCCGATATGGTGAAACCGGATCCGCTCATGGTTCATCTGCGAGAGGAGAACGGTCGCTTGGAAAACGAGAATGCGGAGCTTCGGGACAAGGTGAAAACCCTTGCCGACACAATCTCGGAATACGAGTACAACGACTCGCCTTGCGAGGCGGTGAAAAAGGTGCGAATTAAAATGAAGATCTTGAAGGAGGAACACGcggcggaaaaaaatcgATTGCGGGAGATCATTTCGGAGCTTAAAATCCGTTTGCAAGAGGCTGAGGGTGACTCTTCGTGTGCAGCGATGAATCGTTTACGAGCGAAGCTCAGGGAGCTGATGAAGGGTGGCAAGGTAGCCGACGAGCGGGTATCTATGGTCGTCCAGAGATCCATAGAGACGCTGGTCGAGCTTACGGATAACGTGGACGAGCTGAAAGCTGAGATAGAGCGTTTAAAAGCGGAGATAAGGAGGCTGAAGGATCTTCTGAAGGAATGCGAAGACCGACGTTTGACTTCCTCGACGGTCGCCGTCGAGACAGCACCCATCGATTTGAAACCGGCCGAGAAACCGTTAGCCGAAATGGACGTGTCCGACCTCTTGCAGAGAATAAAGGATCTCGAAGCGTTGATTGCTATGCTGAGAAAACAACTCGTCGACAAGGACGCGGTCATCAACGATCTGCAAAATCAGTCGTTCGATCTCGCTACCGAGAACAAACGTCTTTCCGTAGATCTCGACCAGATGAACGTCAGCTACAAAGCACTGATGGACGAAGTTAAAGCCATGAAGGAGGAGctcaaaaaaagagatgacaagGTATCCGATCTCTTGCGGAGCTTGCAAGCCTCAGCCATCGAGTTGCTTGGAATGAACAGGCTACAGAGTGAAATGGACACTCTAAAACCACAGTTGTATAGCCTTGAATTGGAGCGGGACCAGCTGGTTTCGGAGCTCGGTAAAGTACGGGGTGTCGTGTCTGAGAGGAATGATcagataattaaaatattggaAGAGAAAGACAAACACGTAAAAGCTCTTGCCAGAACGTCGAATATTATACAGTCAACGGTGGAGCCTTTGATGGAGCAAGAAGCTGCCCTGAAGAGAGAGATCGATGGGCTCAAGGATCGAATAGCTGAACTGGAAAGGGAGCTTGCCGAACtaagaaaaaagctcgcgcAATTGGAATCAGAGAATGCCGAGATACCGGGACTCCTCGAGAAGATCAAGAACCTCGAAGACGAACTGGCCAGACTCAGAGCCCAGCTGGCTGAAGCGAACGATAGAATACGAgagttagaaaaagaaatagcaGAATTGAAGGCTGATAAAGCGCAGTTGGAAAAAGACCTCGCCGAGGCTAGAAAGGAGATGGAGAAAATGAGGGAAGAACTCGCGGCGGAGAGAGCAGCGAAAGAGGCGGCGCTGAAAGAGTTGGGAAATTGTAGGGCGGAAAATGAACGATTGAACAAAGAGTTGAACGCGGCTAAGGCAGAGGCTGATAATCTTCgcggtgaaattgaaagactGAAAAACGCCTTGGATGCGGCGAAGGGTGAAGCTGACAAGCTCAGGAGTGAtatggaaaaactgaaaaatgagcTTGACAAATTGAGAGCTGAGAACGATCAGCTGAAAAATCAGCTGGCGGGACTGACCGCGGAGAACGAGCGACTCAGAGGTGAAATAGACGCACTGAAGGACGAGAGAGACAAGCTACGAAATGAGATCAATGCCCTCAAAGCGGAGAACGATAAATTGCAGGCAGAAGTCAACAAGCTGAAAGCGGAAGTTGAAAGACTCGAGGCTGAAAACGGAAGACTCAAAGCGGAGTTTCAGAAACTCAAAAATGACTACGATGCCCTGAAATCGGAGAATGACGATCTTAAGAAGAGTCTGGCCGACGCCGAAGGAAGGATAAAATCTCTGGAAGCTGAAAAAGCTAATCTCTTGAACAAAATTGCGGAGTTGAAGAATCAGATCGACCGGCTTCAGGGTGAACTCGCTGCAGAAAAAGCCGCAAAAGATGCAGCGTTGCAAGAGCTAGCAGCGATCAAATCCGAGCTCAAAGCTCTGTTGGCAGAAATGGATAAGTTGAAAGCAGAGCGCGATAAACTCAAAGCTGCAGTTGACGATCTCACGAAACAACTTTCTCAGCTAAACGACGACCTGGACCAGCTGAAATCGAAATATGCCGCATTGTTGGCGGAAAATGACAAGCTGAAAGGAGAGGTCGATCGGTTGAAGGGGGAGAATGACAGACTCAAAAATGACCTGGACAAAATCAAAGCAGAGCTTGATAACTTGAAAGCAGAAAACGCCAAGCTCAAAGAAGAGAAcgcgaagttgaaaaaagaccTCAGCGCCGCTGAATCTAAGATAAAAGGtcttgaaaatcaaatcaaggcttgcgaagaagaaaaagccaGATTAAGGAACGAAATCGATGCCCTTAAAGATCAAGTTGACAAGCTTGGCAAAGAGCTAGCGGCAGAGAGGGCTGCGAAAGAAGCAGCTCTGCGGGAGCTGGATGCCCTGAAAAATGAGTTGTCCGCATTGAGAGCAGAGTTGGATAAAGTACGGGGGGAAAATACACGGCTAAAGGGTGAACTGGACAAACTGAAGGCTGAGAACGAGGCTCTCAAAGCTGacaacaataaaatgaaaggcGAGCTCGATCGGCTGAACGCTCAAGTCGCGAAACTATTGGGCGACATCGATGCTCTGAAAGCAGAAAATGCAAAGCTCAAAGGAGATTTGGACAAACTGAATGATGAGATTAAGGCCTTGCGAGCTGAGAACGACAAACTTAAGGCCGAGCTCGAACAGATGAAGGCCGAGAATGCGAAACTGAAGGATCAGCTGGCCAGCGCACAAGCGGAAATGGCGAAGTTGAAAGAAGAGCTGGATAAGCTGAAATCCGAGAACGATGCGCTTCGAGGTGagctttcaaaaatgaaaggagagTTGGACAAGCTAAATGCGGAGATCGCAAAACTTCAAAAAGATCTCGACGCTCTCAAAGCGGAGAACGCGAAGCTCAAAGACGAACTCGACAAACTTTCTGCTGAGAACAAAGAACTGAGATCTGAGAATGCTAAACTCAAAGGAGAGTTGGATAGCCTGAAATCCGAGAACGAAAAGCTGAAGAAAGATCTGGCTGCGGCGATGGCGGAAGTCGCCAAACTCAAAGAAGATCTCGATAAATTGCAGGCTGAAAACGACGCGCTCAAAGCTGAGAATGCTAAAATAAAGAGTGAACTCGACAAGCTCAAATCCGAGAATGCGGAGCTCCAAAAAGCACTCGATTCTCTGAAGGCTGAGAATGCTAGACTGAAATCGGAAGTCGATGATCTTAAAAAAGACAACGAAAAGCTCAAAAATGATCTCCAGAACGCGATTGCAGAAATGGACAAACTGAAAGCGGAATCTAGTGGCTCAAGGCGATCGAGTAAAGCGACCCCGAGGAGTCAGGATCCCTCTAAGCCAAGTCCTGAGGCTTCAACGGACGCTGTCCCTCTTGCTGAAATTGAGCGGCTCAGCCCGGTTCCGAAAACTGAGAAGAGGGTCAGACGTGTCAGTTCGGTTGTAAAAAAGGATCAAGGATCGCAAGGGGCGGGTTGCGGTGATTACGAGAATGCAAACGAACAGCTGAGGAAGAATATGAATATGCAGGACAGAGCCGTGCAACGAATACGAAATTTCGTCAAATACATACTCGGCGAGAGACCGTCACCCCCGGAAATGGCTCAGGAACTGAATCATCGGATGTCATCTgtgatgagaaataaattcgcCGAAGATCTGATGGAATTACTCAGGGAGTCTCAGTTCTTATCGGAAAGTATATTCAACGCTGAAACCGACGTTCAAGGACTGATGAAACTTCTGGACGAGATCAACAGGCTCAGAGATGAAAATACGGCGCTGAAAAATCAAGCTGACGACACTCGCGACATCAACAGTTCCGGCGACGTCTTCGACGCGGAGTCCTGGCTCAGATCGTTGACGTTGACGGAATTGGCGGAGCTTCACGACAGGATTTGTTTAGTGACGTCGTGCATAGTGCAGCAAGACATAAACCCGGAAGATTACGTAGACGGTACCGTGGAAGTCGACGGTGTCTGTCGTCCCTGCGTAAAAATATCCGAAGATCCAGTCGACGAGTACGAGGCATTGAACCGAAGAATAGCGGCTCTCCAGCGTCAGATAAATGAGAAGCAGAATGAAGCGGCTAAAAAAGTGCAGGAAATGCGGGAAGTCATGTGGCGAGAACAGGAGAACTTGATCCGGTTGTCGGACGAGATGAACGCCCAAAAACGTAGACATTTGTCGATGCAATTGAAAATTAGCGCGAATAGCTGCGCCGGTAAGCTCGACCCTTGGGCAATGGCGGAGAGACTCGACGCCCTGAATGACCAAAGACTCGGCCTCGATTTCAAGACCTATTCGGAGATCACTCGCGAAGAAGATATTTGCGACGATAATAACTGCCCGGAGATGAGAAGGGATTCCTCGCCGAAATCGGACCTCGTCAACAATCGGGAagccgaagagaaaaaagacgaagaaaaagaggctTCGGTACTTTCGATGTTCAAGGGAATCGGTACAGTGACTCGTGAAGTCGAATATCCGTTCGATAATTCTACCTGCGTAAAatctttgatgaaaaatcgtaGGACGAACGCTCCTGCGTCTTGTTCACTTCCCGTGAAACATGCGGACGTTCCATGCTGTCGAAATCCTTGTTGTCCATCGTTTGTGAATCAGaacgttcttttttcgaataaaaatgctGCGGCATCGTCCAAAAATGACAACGACGTCGACGATAACGCCGAGATAGAGGTGGCGGTTTGGTGTAAAGGTGTACGCGATCGTAGTTGA
- Protein Sequence
- MSYYRTCQCGCTDPPEMTGGDPPHEGSCGCSYNPFADVGRDTEITDLSYALRKLTSMKCQMKKWRIERLQLESEVRALKQVLQEHGLNSDMVKPDPLMVHLREENGRLENENAELRDKVKTLADTISEYEYNDSPCEAVKKVRIKMKILKEEHAAEKNRLREIISELKIRLQEAEGDSSCAAMNRLRAKLRELMKGGKVADERVSMVVQRSIETLVELTDNVDELKAEIERLKAEIRRLKDLLKECEDRRLTSSTVAVETAPIDLKPAEKPLAEMDVSDLLQRIKDLEALIAMLRKQLVDKDAVINDLQNQSFDLATENKRLSVDLDQMNVSYKALMDEVKAMKEELKKRDDKVSDLLRSLQASAIELLGMNRLQSEMDTLKPQLYSLELERDQLVSELGKVRGVVSERNDQIIKILEEKDKHVKALARTSNIIQSTVEPLMEQEAALKREIDGLKDRIAELERELAELRKKLAQLESENAEIPGLLEKIKNLEDELARLRAQLAEANDRIRELEKEIAELKADKAQLEKDLAEARKEMEKMREELAAERAAKEAALKELGNCRAENERLNKELNAAKAEADNLRGEIERLKNALDAAKGEADKLRSDMEKLKNELDKLRAENDQLKNQLAGLTAENERLRGEIDALKDERDKLRNEINALKAENDKLQAEVNKLKAEVERLEAENGRLKAEFQKLKNDYDALKSENDDLKKSLADAEGRIKSLEAEKANLLNKIAELKNQIDRLQGELAAEKAAKDAALQELAAIKSELKALLAEMDKLKAERDKLKAAVDDLTKQLSQLNDDLDQLKSKYAALLAENDKLKGEVDRLKGENDRLKNDLDKIKAELDNLKAENAKLKEENAKLKKDLSAAESKIKGLENQIKACEEEKARLRNEIDALKDQVDKLGKELAAERAAKEAALRELDALKNELSALRAELDKVRGENTRLKGELDKLKAENEALKADNNKMKGELDRLNAQVAKLLGDIDALKAENAKLKGDLDKLNDEIKALRAENDKLKAELEQMKAENAKLKDQLASAQAEMAKLKEELDKLKSENDALRGELSKMKGELDKLNAEIAKLQKDLDALKAENAKLKDELDKLSAENKELRSENAKLKGELDSLKSENEKLKKDLAAAMAEVAKLKEDLDKLQAENDALKAENAKIKSELDKLKSENAELQKALDSLKAENARLKSEVDDLKKDNEKLKNDLQNAIAEMDKLKAESSGSRRSSKATPRSQDPSKPSPEASTDAVPLAEIERLSPVPKTEKRVRRVSSVVKKDQGSQGAGCGDYENANEQLRKNMNMQDRAVQRIRNFVKYILGERPSPPEMAQELNHRMSSVMRNKFAEDLMELLRESQFLSESIFNAETDVQGLMKLLDEINRLRDENTALKNQADDTRDINSSGDVFDAESWLRSLTLTELAELHDRICLVTSCIVQQDINPEDYVDGTVEVDGVCRPCVKISEDPVDEYEALNRRIAALQRQINEKQNEAAKKVQEMREVMWREQENLIRLSDEMNAQKRRHLSMQLKISANSCAGKLDPWAMAERLDALNDQRLGLDFKTYSEITREEDICDDNNCPEMRRDSSPKSDLVNNREAEEKKDEEKEASVLSMFKGIGTVTREVEYPFDNSTCVKSLMKNRRTNAPASCSLPVKHADVPCCRNPCCPSFVNQNVLFSNKNAAASSKNDNDVDDNAEIEVAVWCKGVRDRS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00174786;
- 90% Identity
- iTF_00173906;
- 80% Identity
- iTF_00175503;