Acor009009.1
Basic Information
- Insect
- Athalia cordata
- Gene Symbol
- -
- Assembly
- GCA_963932425.1
- Location
- OZ010626.1:31513073-31522448[+]
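The location string above packs the sequence accession, coordinates and strand into one field. A minimal sketch of how it could be split into parts follows. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc: str) -> dict:
    """Split 'SEQID:START-END[STRAND]' into its components."""
    m = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("OZ010626.1:31513073-31522448[+]")
print(loc)
```

Under that assumption the locus spans 9376 bp on the forward strand.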
Transcription Factor Domain
- TF Family
- TSC22
- Domain
- TSC22 domain
- PFAM
- PF01166
- TF Group
- Basic Domains group
- Description
- These proteins are highly similar in a region of about 50 residues that includes a conserved leucine-zipper domain, most probably involved in homo- or hetero-dimerisation. Members include the Drosophila protein bunched [1] (gene bun, also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
1   32       1.1   6.4e+03   -2.6   0.1        32      45        70      83        58      88  0.81
2   32      0.11   6.5e+02    0.6   0.1        26      44       101     119        98     134  0.59
3   32      0.65   3.9e+03   -1.9   0.2        18      34       159     175       155     179  0.81
4   32    0.0042        25    5.1   0.6        14      34       220     240       215     261  0.76
5   32      0.47   2.9e+03   -1.5   0.3        15      34       276     295       273     297  0.79
6   32    0.0055        33    4.7   0.2        22      41       304     323       301     336  0.86
7   32   0.00035       2.1    8.6   2.4        11      35       447     471       446     483  0.67
8   32    0.0079        48    4.2   1.3        14      46       495     525       492     539  0.84
9   32   0.00095       5.8    7.2   0.3        13      43       578     608       574     615  0.89
10  32   0.00053       3.2    8.0   0.6        12      44       598     630       596     638  0.88
11  32    0.0045        28    5.0   1.3        13      43       613     643       609     646  0.83
12  32   0.00048       2.9    8.1   5.4        13      44       648     679       640     685  0.85
13  32   0.00094       5.7    7.2   3.6        15      45       671     699       668     711  0.85
14  32    0.0073        44    4.4   0.8        18      44       688     714       683     725  0.90
15  32     0.086   5.3e+02    0.9   0.2        14      37       705     728       701     732  0.79
16  32     0.008        49    4.2   0.8        18      37       730     749       725     763  0.85
17  32     0.013        78    3.6   0.5        18      52       765     798       758     806  0.70
18  32     0.015        88    3.4   0.2        14      42       803     831       799     839  0.82
19  32   0.00023       1.4    9.2   2.2        15      45       839     869       835     872  0.90
20  32    0.0053        32    4.8   0.6        22      45       874     897       870     911  0.72
21  32    0.0045        27    5.0   1.1        15      43       923     951       919     961  0.89
22  32    0.0034        21    5.4   0.5        13      45       935     967       932     971  0.57
23  32    0.0012       7.4    6.8   0.0        15      42       972     999       969    1003  0.91
24  32     0.003        18    5.6   1.2        13      44      1005    1036      1000    1043  0.74
25  32    0.0031        19    5.5   0.3        20      45      1040    1065      1037    1068  0.87
26  32   0.00024       1.4    9.1   0.7        15      43      1070    1098      1067    1101  0.91
27  32    0.0057        35    4.7   3.1        19      44      1095    1120      1094    1136  0.78
28  32   0.00031       1.9    8.8   1.4        15      42      1140    1167      1130    1169  0.84
29  32   0.00017         1    9.6   3.0        13      44      1166    1197      1163    1208  0.88
30  32     0.037   2.3e+02    2.1   0.2        14      45      1195    1226      1193    1229  0.72
31  32    0.0064        39    4.5   0.1        25      44      1376    1395      1372    1398  0.88
32  32      0.03   1.8e+02    2.4   0.5        14      39      1469    1494      1468    1506  0.90
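The per-domain hits listed above vary widely in independent E-value, so a reader may want to screen them before interpreting the domain assignment. The sketch below parses a few rows copied from this record and keeps those under a cutoff. The `DomainHit` type, the `parse_hit` helper and the cutoff of 10 are illustrative assumptions, not part of the database or of HMMER.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int

def parse_hit(row: str) -> DomainHit:
    # Column order in the record: #, of, c-Evalue, i-Evalue, score, bias,
    # hmm from, hmm to, ali from, ali to, env from, env to, acc
    f = row.split()
    return DomainHit(int(f[0]), float(f[3]), float(f[4]), int(f[8]), int(f[9]))

# Four rows copied verbatim from the hmmscan output of this record.
ROWS = [
    "1 32 1.1 6.4e+03 -2.6 0.1 32 45 70 83 58 88 0.81",
    "7 32 0.00035 2.1 8.6 2.4 11 35 447 471 446 483 0.67",
    "19 32 0.00023 1.4 9.2 2.2 15 45 839 869 835 872 0.90",
    "29 32 0.00017 1 9.6 3.0 13 44 1166 1197 1163 1208 0.88",
]

MAX_I_EVALUE = 10.0  # illustrative cutoff, not taken from the record

hits = [parse_hit(r) for r in ROWS]
kept = [h for h in hits if h.i_evalue <= MAX_I_EVALUE]
best = min(hits, key=lambda h: h.i_evalue)

for h in kept:
    print(h.index, h.i_evalue, h.score, f"{h.ali_from}-{h.ali_to}")
```

Among the sampled rows, domain 1 is dropped by this cutoff and domain 29 has the lowest independent E-value.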
Sequence Information
- Coding Sequence
- atggCCTACTATCGCACGTGCCAATGTGGCTGCACAGACCCACCGGAAATGACAGGGGGTGACCCACCACAAGAGGGATCCTGTGGGTGCAGTTATAACCCCTTTGCCGACGTGGGTCGTGACACCGAGATAACGGATTTATCTTACGCTCTGAGAAAATTGACCTCGATGAAGTGCCAAATGAAAAAGTGGCGCATAGAACGTCTGCAGCTAGAAAGTGAAGCAAGAGCTTTGAAGCAGGTTTTGCAGGAGCACgGACTGAATGCCGACATAGTGAAACCGGATCCGCTCGTGGTTCATTTGCGAAAAGAGAACGGGCGTTTGGAAAACGAGAATGCAGAGCTTCAGGACAAGGTAAAAACCCTTGCTGACACGATCTCAGAATACGAGTACAAGGACTCGCCTTGTGAGGCGGTAAAAAAGGTGCGAATTAAAATGAAGACCTTGAAAGAGGAACACGCAGCGGAAAAAAATcgATTGCGCGAGATCATTTCGGAGCTTAAAATTCGATTGCAAGAAGCTGAGGGCGACTCTTCATGTGCAGCGATGAATCGTCTACGAGCTAAGCTCAGAGAGCTGATGAAGGGTGGTAAGGTAGCTGATGAGCGAGTATCGATGGTCGTCCAGAGATCCATAGAGACGTTGGTTGACCTTACGGATAACGTGGATGAGCTGAAAGCTGAGATAGAGCGTTTAAAAGCGGAGATAAGGAGGCTGAAGGACCTTCTGAAGGAATGCGAAGACCGACGTTTGACTTCTTCGGATGTTGCCGTCGAGACAGCACCCATTGATTTTAGACCAGCTGAAAAACCATTGGTCGAAATGGACGTATCTGACCTGTTGCAGAGAATAAAAGATCTCGAAGCGTTGATTGCTGAGCTGAGAAAACAACTTGTCGACAAGGATGCGGCCATAAATGACCTCCAAAATCAGTCGTTCAGGCTTGAAACTGAGAACAAACGCCTTTCCGTAGATTTGGATCAGATGAACGTGAGCTATAAAGCACTGATGGATGAAGTTAAAGCCATGAAGGAGGAGCTCAAAAAAAGGGATGAGAAGGTATCCGAACTCTTGCGGAGCTTGCAAGCCTCAGCCATCGAGTTGCTTGGAATGAACAGGCTACAGAGTGAAATGGATGCTATAAAACCACAGATGTACAGCCTTGAATTAGAGCGGGACCAGCTGGTTTCGGAGCTCGGTAAAGTACGAGGTGTTGTATCTGAGAGAAATGAtcagataattaaaatactgGAAGAAAAGGACAAACATGTAAAGGCTCTTGCCAGAACATCGAATGTTATACAGTCAACCGTAGAGCCTTTGTTGGAGCAAGAAGCTGCCCTGAAGAAAGAGATCGATGCGCTTAAGGATCGAATAGCTGAATTGGAAAGGGAGCTTGCTGAATTAAGAAAGAAGCTCGCGCAACTGGAATCAGAGAACGCTGGGATACCAGGACTTCTCGAGAAGATCAAGAACCTCGAAGACGAACTGGCTAAACTCAAAGCCCAGCTAGCTGAAGCAAACGATAGAATACGTGagttagaaaaagaaatagtagGATTGAAAGCTGATAAAGCGCAGTTAGAAAAGGACCTTGCGGAAGCTAAAAAGGAGAtagagaaaatgaaggaagaGCTAGCTGCAGAGAGAGCAGCGAAAGAGGCGGCACTGAAAGAGTTAGGAAATTGTAGGGCGGAAAATGAACGATTGAATCAAGAGCTGAACGCGGCTAAGGTAGAGGTTGATAATCTTCGAGGTGAAATTGAAAGACTGAAAAACGCCTTGGATGCGGCGAAAGGTGAAGCTGACAAGCTCAGAAGTGATatggaaaagatgaaaaatgagcTTGATAAATTGAGAGCTGAAAATGATCAGCTGAAGAATCAGTTGGCGGGACTGACCGCGGAGAATGAGCGACTCAAAGGTGAAATTGACAAACTGAAGGATGAGAGAGATAAGCTACGAAATGAGATCAATGCCCTCAAAGCAGAGAACGAT
AAATTACAAGCAGAAGTCAATAAGCTAAAAGCAGAAGTTGAAAATCTTGAGGCTGAAAACGGAAGACTCAAAGCGGAGTTACAAAAACTTAAAACTGACTATGACACCCTGAAATCAGAGAATGACAATCTAAAGAAGAGCCTGGCAGACGCAGAAGGAAGGATAAAATCTCTGGAAGCTGAAAAAGGTAatcttttgaataaaatcgCGGAGTTGAAGAATCAGATCGACCAACTTCAGGGTGAACTCGCTGCAGAAAAAGCCGCAAAAGAGGCGGCGTTGCAAGAGCTGGCAGCGATCAAGTCTGAGCTCAAGGCTCTGTTGGCAGAAATGGATAAATTAAAAGCAGAGCGTGATAAACTTAAAGCTGCAGTTGACGATCTCACCAAACAACTTTCTCAGCTAAATAACGACCTTGACCAACTCAAATCAAAGTATGCCGCGTTGTTGGCGGAAAATGACAAGTTGAAAGGAGAGGTCGATCGGTTGAAGGGGGAGAATGACAAGCTTAAAAATGATCTAGACAAAATTAAAGCAGAGCTCGATAACTTAAAAGCAGAAAACGCTAAGCTCAAAGAAGAAAACGCAAACTTGAAAAAAGACCTCAGCGATGCTGAAGCTAAGATTAAAGGCcttgaaaatcaaatcaagGCTTGCGAAGAAGAGAAAGCCAGATTACGGAAAGAAGTCGATGCCCTTAAAGACCAAGTTGACAAGCTTGGCAAAGAGCTAGCAGCAGAACGAGCTGCGAAAGAAGCAGCTCTGCGGGAGCTAGATGCCCTGAAAAATGAGTTAGTCGCATTAAGAGCAGAGCTGGATAAAGTACGGGGGGAAAATTCAAGGCTAAAGGGTGAGCTGGACAAACTGAAGGCTGAGAACGAGGCTCTCAAAGCTGAGAACAGTAAAATGAAAGGGGAGCTCGATAGGCTGAACGCTCAAGTCGCGAAACTATTGGGTGACATCGATGCTTTGAAAGCAGAAAATGCAAAACTCAAAGGAGATTTGGATAGACTGAATGATGAGATTAAGGCCTTGCGAGCTGAGAACGACAAACTCAAGGCTGAGCTCGATCAGATGAAGGATGAGAATGCGAAATTGAAAGACCAGCTGGCCAGCGTTAAAGCGGAAATGGCGAAGTTGAAAGAAGAGCTGGATAAACTGAAATCTGAGAACGATGCGCTACGAGGTGagctttcaaaaatgaaaggagAGTTGGATAAGCTGAATGCAGAGATCGCGAAACTCCAAAGAGATCTTGACACTCTCAAAGCAGAGAACGCGAAGCTCAAAGACGAACTTGATAAACTTGCTGCTGAGAACAAAGAACTGAGATCTGAAAATGCTAAACTCAAAGGAGAGTTGGATAACCTGAAATCCGAAAacgaaaagttgaagaaagaTCTGGCTGCAGCGATAGCGGAGGTCGCCAAACTAAAAGAAGATCTCAATAAACTGCAGGCTGAAAACGATGCACTCAAAGCTGAGAAtgctaaaataaaaagtgaactCGACAAGCTGAAATCTGAGAACGCGGAGCTGAAAAAAGCACTTGACTCTCTGGAGGCAGAGAATGCTAGATTGAAATCGGAAGTCGATGATCTTAAAAAAGACAACGAAAAGCTCAAAAATGATCTCCAGAAAGCGATTGCAGAAATGGACAAACTAAAAGCGGAATCTAGTGATACGAGGCGACCAAGTAAAGCGACCCCGAGGAGTCAGGATCCCTCTAAGCCAAGAACTGAGGCTTCAACGGAGGCTGTCCCTCTTGTTGAAATTGAGCGGCTCAGTCCCGTTCCGAAAACTGAGAAGAGGGTTAGACGTGGCAGTTCGGTTGTAAAAAAGGATCAAGAATCGCAAGGGGAGGGTTGCGGTGATTACGAGAATGCAAACGAACAGCTGAGGAAGAATATGAATATGCAGGACAGAGCTGTGCAACGAATACGAAATTTCATCAAGTACATACTTGGCGAGAGACCGTCACCTCCGGAAATGGCGCAGGA
ATTAGATCATCGGATGTCATCTgtgatgagaaataaattcgcTGAAGATCTGATGGAATTACTTAAGGAGTCTCAGTTCTTATCGGAAAGCATCTTTAACGCCGAAAACGACGTTCAAGGACTGATTAAACTTCTGGACGAAATCAACAGGCTCCGAGATGAAAATAGGGCCCTAAAAAATCAAGCTGACGATATTCGCGATATGGACAGTTTCGGCGACGTCTTCGACGCAGAGTCCTGGCTCAGATCGTTGACGTTGACGGAATTGGCGGAACTTCACGACAGGATTTGTTTAGTAACGTCGTGCATAGTGCAGCAAGATATAAACCCCGAAGATTACGTAGACGGTTCCGTCGAAGTCGACGGAGTCTGCCGTCCGTGTGTAGAAATATCTGAAGATCCGGTCGACGAATACGAGGCATTAAACCGAAGAATAGCAGCTCTTCAGCGTCAGATAAACGAGAAGCAAAATGAAGCATCTCAAAAAGTGCAGCAAATGCGTGAAGTTATGTGGCGAGAACAGGAGAATTTAATCCGTTTGTCAGATGAGATGAACAGCCAAAAACGTAGAAATTTATCAATGCAATTAAAAATCGGCGCGAGTAATTGCGCCGGTAAATTAGACCCTTGGGCAATGGCCGACAGACTCGACGCTCTGAATGACCAAAGACTCGGCCtcgaattcaaaaattattcagagatTACTCACAAAGAAGATATTTGCGACGATAATAATTGCTCAGTAATAAGAAGGGATTCTTCACCGAAATCGGACCTCATCAACAATCAggaagctgaagaaaaaaaagacgaagaaaaagaagcttcGggATTGATTTACTACTTGGAAGGAGCCGAAGTCGTCTACTTGGATCCCGAATATTTCGATGATGAATCCTCCGAGATTGaagttatatatttgaatagcACGTTCAGCGGACTTTCGACCAATTTTACTATAATGAAACAAATCCCCGATTCAGTTATGGGTCACTTTAATCTATATTTAAAATCTATGGGAGATTACACAGTTTCCGGTGGTATATCCTTAACTATGCCCTTGTGCGAAATGACAAGCGAGCCCATCTTGATGGGAAAAATTTTGAGCCTGCTGGGAATTAATGACGAAAGCTGCCCTCCACCACCGGGAGTATACGGAATGCCTTTTTGGGCCCCAACGGTCGATTTATTGCCCGATTCGATGCCGGGAAACGACTATAAAGTTTCTTTCACTGCGGACTACGACGATGACAAGATTCTGGTAGACTTAGCTGTATACGTTCAAgtgttttga
- Protein Sequence
- MAYYRTCQCGCTDPPEMTGGDPPQEGSCGCSYNPFADVGRDTEITDLSYALRKLTSMKCQMKKWRIERLQLESEARALKQVLQEHGLNADIVKPDPLVVHLRKENGRLENENAELQDKVKTLADTISEYEYKDSPCEAVKKVRIKMKTLKEEHAAEKNRLREIISELKIRLQEAEGDSSCAAMNRLRAKLRELMKGGKVADERVSMVVQRSIETLVDLTDNVDELKAEIERLKAEIRRLKDLLKECEDRRLTSSDVAVETAPIDFRPAEKPLVEMDVSDLLQRIKDLEALIAELRKQLVDKDAAINDLQNQSFRLETENKRLSVDLDQMNVSYKALMDEVKAMKEELKKRDEKVSELLRSLQASAIELLGMNRLQSEMDAIKPQMYSLELERDQLVSELGKVRGVVSERNDQIIKILEEKDKHVKALARTSNVIQSTVEPLLEQEAALKKEIDALKDRIAELERELAELRKKLAQLESENAGIPGLLEKIKNLEDELAKLKAQLAEANDRIRELEKEIVGLKADKAQLEKDLAEAKKEIEKMKEELAAERAAKEAALKELGNCRAENERLNQELNAAKVEVDNLRGEIERLKNALDAAKGEADKLRSDMEKMKNELDKLRAENDQLKNQLAGLTAENERLKGEIDKLKDERDKLRNEINALKAENDKLQAEVNKLKAEVENLEAENGRLKAELQKLKTDYDTLKSENDNLKKSLADAEGRIKSLEAEKGNLLNKIAELKNQIDQLQGELAAEKAAKEAALQELAAIKSELKALLAEMDKLKAERDKLKAAVDDLTKQLSQLNNDLDQLKSKYAALLAENDKLKGEVDRLKGENDKLKNDLDKIKAELDNLKAENAKLKEENANLKKDLSDAEAKIKGLENQIKACEEEKARLRKEVDALKDQVDKLGKELAAERAAKEAALRELDALKNELVALRAELDKVRGENSRLKGELDKLKAENEALKAENSKMKGELDRLNAQVAKLLGDIDALKAENAKLKGDLDRLNDEIKALRAENDKLKAELDQMKDENAKLKDQLASVKAEMAKLKEELDKLKSENDALRGELSKMKGELDKLNAEIAKLQRDLDTLKAENAKLKDELDKLAAENKELRSENAKLKGELDNLKSENEKLKKDLAAAIAEVAKLKEDLNKLQAENDALKAENAKIKSELDKLKSENAELKKALDSLEAENARLKSEVDDLKKDNEKLKNDLQKAIAEMDKLKAESSDTRRPSKATPRSQDPSKPRTEASTEAVPLVEIERLSPVPKTEKRVRRGSSVVKKDQESQGEGCGDYENANEQLRKNMNMQDRAVQRIRNFIKYILGERPSPPEMAQELDHRMSSVMRNKFAEDLMELLKESQFLSESIFNAENDVQGLIKLLDEINRLRDENRALKNQADDIRDMDSFGDVFDAESWLRSLTLTELAELHDRICLVTSCIVQQDINPEDYVDGSVEVDGVCRPCVEISEDPVDEYEALNRRIAALQRQINEKQNEASQKVQQMREVMWREQENLIRLSDEMNSQKRRNLSMQLKIGASNCAGKLDPWAMADRLDALNDQRLGLEFKNYSEITHKEDICDDNNCSVIRRDSSPKSDLINNQEAEEKKDEEKEASGLIYYLEGAEVVYLDPEYFDDESSEIEVIYLNSTFSGLSTNFTIMKQIPDSVMGHFNLYLKSMGDYTVSGGISLTMPLCEMTSEPILMGKILSLLGINDESCPPPPGVYGMPFWAPTVDLLPDSMPGNDYKVSFTADYDDDKILVDLAVYVQVF
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00175527; iTF_00174761; iTF_00175503; iTF_00173906;
- 90% Identity
- iTF_00173906;
- 80% Identity
- iTF_00174761;