Mann012181.1
Basic Information
- Insect
- Macrophya annulata
- Gene Symbol
- -
- Assembly
- GCA_963924015.1
- Location
- OZ001259.1:23242820-23250199[+]
Transcription Factor Domain
- TF Family
- TSC22
- Domain
- TSC22 domain
- PFAM
- PF01166
- TF Group
- Basic Domians group
- Description
- These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 43 0.49 7e+03 -2.1 3.1 13 54 101 144 96 147 0.60 2 43 0.12 1.7e+03 -0.1 0.1 14 31 137 154 134 166 0.80 3 43 0.03 4.3e+02 1.8 0.4 16 33 222 239 213 247 0.59 4 43 0.41 5.9e+03 -1.8 0.5 19 34 276 291 274 294 0.68 5 43 0.00035 5 8.0 2.2 13 43 445 468 438 480 0.57 6 43 0.064 9.2e+02 0.8 0.4 22 43 478 499 472 506 0.71 7 43 0.068 9.8e+02 0.7 4.3 14 51 512 549 506 555 0.88 8 43 0.86 1.2e+04 -2.8 2.1 29 41 555 567 532 581 0.52 9 43 0.0028 39 5.1 3.3 11 41 579 609 577 614 0.87 10 43 0.0093 1.3e+02 3.5 0.7 20 42 616 638 610 640 0.72 11 43 0.00041 5.8 7.8 5.7 14 44 638 668 630 683 0.75 12 43 0.0044 63 4.5 1.2 13 44 686 717 681 723 0.89 13 43 0.0044 63 4.5 4.1 17 42 725 750 721 756 0.86 14 43 0.0035 50 4.8 1.9 17 43 753 779 752 782 0.78 15 43 1.9e-05 0.27 12.1 2.2 11 45 782 816 779 824 0.89 16 43 0.0024 34 5.3 1.6 19 42 825 848 818 854 0.83 17 43 0.003 42 5.0 2.7 18 41 859 882 849 892 0.51 18 43 0.00026 3.8 8.4 5.2 15 45 891 921 888 934 0.90 19 43 0.0043 61 4.5 0.4 19 36 937 954 935 964 0.87 20 43 0.026 3.6e+02 2.0 0.1 18 44 971 997 964 1009 0.71 21 43 0.011 1.6e+02 3.2 0.1 17 42 991 1016 985 1029 0.78 22 43 0.07 1e+03 0.6 1.4 16 37 1011 1032 996 1046 0.63 23 43 0.0039 55 4.7 4.4 15 42 1045 1072 1034 1076 0.58 24 43 4.6e-05 0.65 10.8 5.3 13 43 1071 1101 1069 1107 0.92 25 43 0.35 4.9e+03 -1.6 0.1 22 40 1108 1126 1106 1130 0.81 26 43 0.027 3.8e+02 2.0 0.3 15 31 1129 1145 1126 1159 0.87 27 43 0.003 43 5.0 2.5 12 44 1168 1200 1162 1205 0.62 28 43 1.4e-05 0.2 12.5 2.3 14 43 1205 1234 1203 1243 0.88 29 43 0.051 7.3e+02 1.1 0.4 18 42 1244 1268 1238 1272 0.71 30 43 0.00085 12 6.8 1.4 11 42 1272 1303 1267 1305 0.92 31 43 1.4e-05 0.2 12.5 2.8 15 43 1304 1332 1301 1341 0.88 32 43 0.016 2.3e+02 2.7 0.8 19 42 1329 1352 1328 1370 0.53 33 43 0.00016 2.3 9.1 0.5 7 42 1366 1401 1363 1403 0.90 34 43 1.1e-05 0.16 12.8 3.7 14 43 1401 1430 1398 1439 0.88 35 43 0.046 6.6e+02 1.2 0.8 19 36 1441 1458 1428 1467 0.44 36 43 0.00015 2.2 9.2 0.8 11 44 1468 1501 1466 1510 0.88 37 43 0.00014 2 9.3 2.1 13 49 1512 1546 1509 1556 0.88 38 43 0.0077 1.1e+02 3.7 0.4 13 42 1526 1555 1526 1565 0.82 39 43 0.00035 5 8.0 0.4 11 42 1566 1597 1560 1601 0.91 40 43 5.7e-05 0.82 10.5 1.3 15 44 1598 1627 1595 1660 0.83 41 43 0.00018 2.6 8.9 2.2 11 43 1664 1696 1658 1709 0.87 42 43 0.21 3e+03 -0.9 0.1 15 43 1696 1724 1692 1740 0.76 43 43 0.93 1.3e+04 -3.0 0.1 17 37 1995 2015 1994 2018 0.88
Sequence Information
- Coding Sequence
- ATGGCCAATCGTACGTGTCAATGCGGATGCACAGACCCGCCGGAAATGTCGGCCGCAGATCCTCCCCATGAGGGTTCGTGCGGTTGTAGTTACAACCCTTTTGCGGATCAAGGAAGAGAGGGTGAAATCACGGATTTATCATTTGCACTGCGTAAATTAACTCTGATGAAATGCCAGATGAAAAAATGGAGAATAGAACGGCTTCAGTTGGAGAGCGAGACCAGAGGATTGAAACAAGTGCTTCAGGCCCACGGTTTGAACGATGATATTGTCAGACCAGATCCTCTGCTTGCTCATCTTCGTGAGCATAATGAGAGATTGGAGAACGAGAAAGCGGAGCTTGAGGAGAGCGTGAAAAGCCTTTCCGAAACCGTATCGGATTATGAGAATCAGGAATCCTCGTCATCTGATGCTGTGAATAAATTACGTGAAAAAATTCGGACAATAAAGGAAACAAACGCCATTGAAAAAAGGAGATTAAGAGACCTCATATCTGGGCTGAAGATAAGGCTTCAAGAAGCCGAGAACGAGTCGTCATGCGCTGCCTTAAACCGACTGCGAGCAAAACTTAGAGAAATGATGAAAGGCGGTCAAGAAGCTGACCAAAGAGTTTCTATGGTGGTTCAGCGGTCCATAGAAACTTTAACTGAATTAACAGGGAACGTCGACGATCTTAAGGCCGAGATCGAGAGACTTAGAGCCGAGATAAAGAGGCTGAAGGATTTGCTGAAGACATGTGAAGACCGAAAGGACGTCGGAGTTGAGACGATTGTCGTGGACGTTAAACCGGTAGAAAAACCGCTTGACCAAATGGACGACTCGGAATTGTTGAACAGGATCAAGGAACTCGAAGCGTTGATAGCTCAGCTGAgaaaacaattaattgataaagatGCTACGATAAATGGCCTTCATAATCAATTGTTCGAGGCTGGTACAGACAACAAACGTTTGTCCACGGATTTGGACCAGATGAAGGTCAGCTACAACGCCGTTATGCAGGAGGTCAAGGCTATGAAGGATGAGCTGAAGAAGAGGGACGCCAAGGTATCTGAGCTTCTCAAAGACTTGAAAGCATCCGCGATCGATATGCTGGGATTAAACAGACTGCAAAGTGAAATGGACGCAATCAAGCCCCAGATGTACAATCTCGAGGTAGAACGCGGCCAGCTATTATCTGAGCTCGGTAGAGTGCGGGGCGTTGTATCGGAGCGGAATGATCAGATCATTAAAATACTCGAGGATAGGGACAAGCACGTTAAAGCACTGGCCAAGGCATCAAGCGTAATGCAGGCGACGATTGAACCgatgaaagagaaagaaacggTTTTAAAAAATGAGGTTCAAGGATTGAAAGACCGGATAGCAGAGCTTGAACTAGAGTTATCCGAGCTCAGGAAAAAGCTAGCTCAATTGGAAAAGGAAAATTCTAAGATACCTGGACTAGTGGATAAGATTAAGGAGCTCGAAGACGAACTAGAAAAGGTCAGATCTCAATTGGCCGAGGCAAACTCCAGAGTGATTGAGCTTGAAAAAGAAGTAGCTCGGTTCAAATCTGAGAAAAAAGAACTAGAAAATGAGCTTGGCGAGGCGAGGAAGGAGATTGAAAAGCTGAAGGAGGAGCTTGCTAAGGAGAAAGCTGCGAAAGATACTGCTCTAAAAGATCTTGAGGATTGTAGAGCTGAGAACGAGAAGCTCAGAGCAGAAAATGATCGTTTAATTAATGAGCTAAACGCGGCTAAGACTGAGAGTGATAAGCTCAGAAATGATTTGGAGAAACTGAAAgaagaaatggaaaatttaaaaGCTGAGAATGATCGGTTGACAGATCTATTGAACGTGGCCAAAAATGAGATCGACAAGCTCAGAGGTGAGCTCGAGAATCTCAAGGCAGAAAATGACAAGCTCAAAAATGATGTAGAGAAACTTAAAGGAGATATAGATAACCTTAAGACAGAAAATAATAGCCTCAAAGAAGAGCTCGACAAACTTAGAAACCAGCTCGCTGGACTTGAGGATGAAATCGAGAAACTAAAGAACGCTTTAGCGGTAACTAAAGCGGAAGCTGAGAAGCTCAAACAAGATTTAGATAACACGAAAAATGAGCTTGATAAATTAAGAGCTGAGAATGATCAGGTCAAAAGTCAGTTAGCCGATGCCAAAGCTGATAACGAGAGGCTTAAAATTGAACTCGAAAATCTTAAGggagaaatagagaaattaaaacaagaaaacagcGAACTTAAGACAGATCGAGACAAGCTTCAAGCAGAGGTCgaaaaacttggaaaaaaGCTCGATGAGCTTGAATCTGAGaatgaaaaactgaagaaCGCTTTAACAGCGGCTAAAGCTGAGGTTGAAAAGCTCAAAAATGATTTAGACAAAATGAAGTCCGATATCGAGAAGCTCAAGGCAGAAAATGATCAGCTGAATAATGAGCTAGCCAATGCTAAGGCAGAGAACTCGAGACTCGGAAAGGAGCTTGATGAGTTGAAGGGAGAAATGGCGAAGCTgaaagaagagaataaaaaccTTAAGACAGAAAAAGATGGGCTAGAAGCAGAGATCAAAAAACTTAGAGAAGAATTAGACAGTCTTAAGGGTGAGAATGAGAGACTTAAAGGTGAGAATGAGGGACTTAAAGGTGAGAATGAGAGACTTAAAGGTGAGAATGAAAAACTTAAGACTGAGCTTGACAAACTTGAAAGCGAGAACGAGGCCCTGAAATCAGAAAACGAGAAGTTGAAAAAGAGCTTGAGTGATATGGAAGGAAAGCTAAAAGAGCTCGAAGCCTCCAACGCTGATCTCGCAAATAAAATAGCAGAGCTGAAGAATCAAATTGATAAACTTCAGACCGAGTTGGCGTCCGAAAAAGCTGCCAAAGAAGCAGCTGTTAAAGAATTGGCGGCTATTAAGGCCGAGCTAAAAGCTCTGCTGGCGGAAATGGACAAACTTAAGACGGACTGCGACAGCCTAAAAGGACAAGTCGACGATCTCACTAAACAACTATCGGATTTGAAGAACGATTTTGATCAGCTCAAATACAAGTATGCTGAATTGCAAAAAGAGAGGGATAACCTCACGGCTGAACTTGATAAGTTTAGGGAAGAAAACgacaagatgaaaaatgaccTAGACAAGCTCAAGGCAGAGCTTGACACGTTAAAAACAGAGAATGGTAAGCTCAAGGAGGAAAATGGTAAGCTCAAGGAGGAGAATAGTAAGCTCAAGGAGGAAATTGGTAAGCTGAAAGAAGAGAATGGTAAGCTTAAAGAAGAGAACGAGAAGCTAAGGAAAGACTTAGGCGATGCTGAAGCCCGGATAAAAGCCCTTGAGGATGAAGTAAAGGCATGCGAGGAGGAAAAAACAAGACTGCTAAAGGAGATTGAGGGCCTCAAAAATGGGATTGAAGAACTCAAAAAGGAGCTTGCTGCAGAAAAAGCTGCGAAAGAGGCAGCCATGAAGGAATTATCAGCACTTAAAACTGAACTAGCTGCACTGAGAGCAGAGCTGGATAAAGTGAGGGCAGAGAACGCCAAACTAAAGAGTGAACTCGATAAACTGAAAGCAGAAAACGACGAGCTCAAAAATGAGAATAGTAAAATGAAGGGAGAAATTGATAAGCTCAAAGCAGAGGTTAcgaaattagaaaatgatCTAAGCACCTTGAGGGCAGAGAACGCGAAACTCAAAGAAGATCTGGACAAGCTAAATGCAGAAAATAATGCTCTGAGATCTGATAATGACAAGTTGAAAGGAGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGAACGATCTAGCTGCGGCTAAAGAAGAAGCTGCAAAGCTCAAAAATGACTTGGATAAACTAAGATCTGAACGGGATGCGTTGCAAgcagagaataataaaatgaagggTGAGCTTGAAAAGCTGAAAtcagatattataaaattacaagATGACTTGAACACTTTGAAGGCAGAGAATACGAAGCTCAAAGAAGACCTTGACAAACTAAACGCAGAGAACAAAGCTCTGAGATCTGATAATGACAAGTTGAAAGGGGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGCACGATCTAGCTGCGGCTAAAGAAGAAGCCGCAAAACTCAAAAATGATTTAGACAAACTAAAATCCGACTTGGATACGATGCAAgcagagaataataaaatgaaaggtGAGCTTGAAAAGCTGAAATCAAATATTgcaaaattagaaaatgacTTGAACACTCTGAAGGCAGAGAATGCAAAGCTTAAAGAAGACCTTGACAAACTAAACGCAGAGAACAAAGCTCTGAGATCTGATAATGACAAGTTGAAAGGAGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGAACGATCTAGCTGCGGCTAAAGAAGAAGCAGCAAAGCTCAAAAATGACTTGGATAAACTAAAATCCGACTTGGATGCGCTGCAAGCAGAGAACAATAAAATGAAGGATGACTTGAACACTCTGAAGGCAGAGAATGCGAAGCTTAAAGAAGACGTTGACAAACTAAATGCAGAGAATGCAAAGCTTAAAGAAGACGTTGACAAACTAAATGCAGAAAATAAGTCATTGAAAGCTGATAATGACAAGCTGAAGGGAGAATTGGATCAGCTCAAATCTGAGAAcggaaaattgaagaaagatCTAGCTGCGGCTAAAGAAGAATCTGCAAAGCTCAAAAATGACTTAGACAAACTAAAATCCGACTTGGATGCCCTGCAAgcagagaataataaaatcaaggGTGAGCTTGGAAGGTTAAAGTCAGATATTGCAGGATTACAAAATGACCTAAACACCTTGAAAGCAGAGAATGCGAAGCTCAAAGAAGACGTTGACAAACTAAATGCAGAAAATAAGTCATTAAAAGCTGCTAATGACAAGTTGAAAGGAGAACTGGATCAGCAGAAATCTGAGAACGCGAAATTGAAGACCGATTTAGCTGCGGCTAAAGACGAAGCTGCAAAGCTCAAAAATGACTTGAATGAACTAAAATCTGAACGGGATGCGCTGCAAGCAGAAAATAACAAGATTAAGGGAGAGCTTGAAAGGGTGAAATCCGAGAATGCAAAACTACAGAAAGATCTCGATGTCTTGAAGGGAGAGAACGGCAAGCGAAAGTCTGAGGTTGACAAACTGAAAATCGATaatgaaaagttgaataaGGATCTTCAGAAGACAGACTCGGGGTTAGACAAGTATAAAGCTGAGGCCGATAATCAGAAGAAATTAACCAAGAAACTAAAAGatACACCGAGGAAACTCGATACAATGAGGCCAAGTTACACTGTCGAGAAAGAACTGCAGATTAAAGTACCCACCAGCGTTGTTTCATCgggtaaaattattaaaagtgATCAACGACCCTCGGTTATAAAAAAGGACCAAGGTTCACAGGGCGCGGGTTGTGGCGATTACGAAAATGCAAACGAACAGCTGATGAGAAACATGAATATGCAAGAGAGGGCCGTACAGCGTATAcgagaatttataaaattcgtaCTCGGTGAGAGAAGTTCACCACCTGAAATGGCGCAGGAATTGATCCATCGCATGTCTTCAGTAACGCGGAATAATTTCCCCGAAGATATAATGCGAATTTTGAAAGAGTCACAGTTCTTATCGGAATCGATATTCAACGCCGAAACTGAAGTTCAACGTCTGGTCAAACCCGTCGATGAATTAAACAATCTGAATGATGAAGATATCCAGGACATCGATGGACTCGGTGATGCCTTCGATGCCGAGTCGTGgctaaaaACATTGACGTTAACCGAATTGGCCGAACTCCACGACAGGATATGCCTCGTAACTTCGTGCATGGTTCAGCAAGACATAAACCCCGAGGATTACGTAGACGGTATTGAAACCGATGGAATTTGCCATCCTTGTAGTACCGTAACGGAGGAATTTACCGATGATTCAATACCCGATTACGAAGCTTTGAACAGAAGAATCGGAGCTCTTCAAATGCAGATAAACAAGAAACAGGACGAGGCTgctaaaaaagttaaaaaaatgcgCCAATCCATGTGGCGTGAACAAGAAAACCTAATCAAACTATCCGAAGAAATGAACGCCCAAAAACGAAGGAATTTATCGCTGAAGATCAAGATCAATGAAAACATCGAAGTGGATGgggaagaagataaaaataaaaatgaaaaggcgGTTTTATGCGGTGGAAAATTTTCCCCAAAGATAGGGCAGAGGAAGGACGATATGTTTGGTGAGAAAAATGGCAATTCCGGACGTTTTATTGGTGCCAATTCGAACGTCGGGGTGAATTGGGTCCCTTGCTTTAACCCAAAGGAAAATGAGCCCGATTCAGGATCGTCTTTAACCTCGGTTAGGGTTGGGAGACAAAAACCACCTCCGTGTGCGGCTCCGGTTAGACACGCGGATGTACCTTGCTGTGTTAAGCCCTGCTGTCCCTCGGTTGTAAATCGGGATACGCTATTTAGTAAAAAGAGGTAA
- Protein Sequence
- MANRTCQCGCTDPPEMSAADPPHEGSCGCSYNPFADQGREGEITDLSFALRKLTLMKCQMKKWRIERLQLESETRGLKQVLQAHGLNDDIVRPDPLLAHLREHNERLENEKAELEESVKSLSETVSDYENQESSSSDAVNKLREKIRTIKETNAIEKRRLRDLISGLKIRLQEAENESSCAALNRLRAKLREMMKGGQEADQRVSMVVQRSIETLTELTGNVDDLKAEIERLRAEIKRLKDLLKTCEDRKDVGVETIVVDVKPVEKPLDQMDDSELLNRIKELEALIAQLRKQLIDKDATINGLHNQLFEAGTDNKRLSTDLDQMKVSYNAVMQEVKAMKDELKKRDAKVSELLKDLKASAIDMLGLNRLQSEMDAIKPQMYNLEVERGQLLSELGRVRGVVSERNDQIIKILEDRDKHVKALAKASSVMQATIEPMKEKETVLKNEVQGLKDRIAELELELSELRKKLAQLEKENSKIPGLVDKIKELEDELEKVRSQLAEANSRVIELEKEVARFKSEKKELENELGEARKEIEKLKEELAKEKAAKDTALKDLEDCRAENEKLRAENDRLINELNAAKTESDKLRNDLEKLKEEMENLKAENDRLTDLLNVAKNEIDKLRGELENLKAENDKLKNDVEKLKGDIDNLKTENNSLKEELDKLRNQLAGLEDEIEKLKNALAVTKAEAEKLKQDLDNTKNELDKLRAENDQVKSQLADAKADNERLKIELENLKGEIEKLKQENSELKTDRDKLQAEVEKLGKKLDELESENEKLKNALTAAKAEVEKLKNDLDKMKSDIEKLKAENDQLNNELANAKAENSRLGKELDELKGEMAKLKEENKNLKTEKDGLEAEIKKLREELDSLKGENERLKGENEGLKGENERLKGENEKLKTELDKLESENEALKSENEKLKKSLSDMEGKLKELEASNADLANKIAELKNQIDKLQTELASEKAAKEAAVKELAAIKAELKALLAEMDKLKTDCDSLKGQVDDLTKQLSDLKNDFDQLKYKYAELQKERDNLTAELDKFREENDKMKNDLDKLKAELDTLKTENGKLKEENGKLKEENSKLKEEIGKLKEENGKLKEENEKLRKDLGDAEARIKALEDEVKACEEEKTRLLKEIEGLKNGIEELKKELAAEKAAKEAAMKELSALKTELAALRAELDKVRAENAKLKSELDKLKAENDELKNENSKMKGEIDKLKAEVTKLENDLSTLRAENAKLKEDLDKLNAENNALRSDNDKLKGELDQLKSENAKLKNDLAAAKEEAAKLKNDLDKLRSERDALQAENNKMKGELEKLKSDIIKLQDDLNTLKAENTKLKEDLDKLNAENKALRSDNDKLKGELDQLKSENAKLKHDLAAAKEEAAKLKNDLDKLKSDLDTMQAENNKMKGELEKLKSNIAKLENDLNTLKAENAKLKEDLDKLNAENKALRSDNDKLKGELDQLKSENAKLKNDLAAAKEEAAKLKNDLDKLKSDLDALQAENNKMKDDLNTLKAENAKLKEDVDKLNAENAKLKEDVDKLNAENKSLKADNDKLKGELDQLKSENGKLKKDLAAAKEESAKLKNDLDKLKSDLDALQAENNKIKGELGRLKSDIAGLQNDLNTLKAENAKLKEDVDKLNAENKSLKAANDKLKGELDQQKSENAKLKTDLAAAKDEAAKLKNDLNELKSERDALQAENNKIKGELERVKSENAKLQKDLDVLKGENGKRKSEVDKLKIDNEKLNKDLQKTDSGLDKYKAEADNQKKLTKKLKDTPRKLDTMRPSYTVEKELQIKVPTSVVSSGKIIKSDQRPSVIKKDQGSQGAGCGDYENANEQLMRNMNMQERAVQRIREFIKFVLGERSSPPEMAQELIHRMSSVTRNNFPEDIMRILKESQFLSESIFNAETEVQRLVKPVDELNNLNDEDIQDIDGLGDAFDAESWLKTLTLTELAELHDRICLVTSCMVQQDINPEDYVDGIETDGICHPCSTVTEEFTDDSIPDYEALNRRIGALQMQINKKQDEAAKKVKKMRQSMWREQENLIKLSEEMNAQKRRNLSLKIKINENIEVDGEEDKNKNEKAVLCGGKFSPKIGQRKDDMFGEKNGNSGRFIGANSNVGVNWVPCFNPKENEPDSGSSLTSVRVGRQKPPPCAAPVRHADVPCCVKPCCPSVVNRDTLFSKKR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00938403;
- 90% Identity
- iTF_00939256;
- 80% Identity
- iTF_00939256;