Acor009009.1
Basic Information
- Insect
- Athalia cordata
- Gene Symbol
- -
- Assembly
- GCA_963932425.1
- Location
- OZ010626.1:31513073-31522448[+]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 31 3 2.1e+03 -1.2 0.9 47 63 67 83 53 85 0.54 2 31 0.00082 0.58 10.2 0.2 34 63 98 127 96 129 0.84 3 31 0.00065 0.46 10.6 1.0 35 64 216 245 213 246 0.91 4 31 3.8 2.7e+03 -1.5 0.2 29 61 279 297 272 300 0.53 5 31 0.3 2.1e+02 2.0 2.6 35 64 306 335 301 368 0.77 6 31 0.73 5.1e+02 0.8 0.1 27 61 371 405 368 409 0.78 7 31 0.0045 3.1 7.9 5.7 30 64 441 475 425 476 0.75 8 31 0.056 39 4.4 7.9 28 63 467 505 456 507 0.76 9 31 0.00048 0.34 11.0 4.8 25 63 509 547 506 548 0.94 10 31 0.039 28 4.9 1.9 31 64 557 590 550 591 0.83 11 31 0.028 20 5.3 2.2 25 61 565 601 562 605 0.80 12 31 7.5e-07 0.00053 20.0 5.7 21 64 617 660 616 661 0.95 13 31 1.3e-05 0.0094 16.0 6.8 21 61 645 685 644 686 0.94 14 31 3.4e-05 0.024 14.7 8.6 24 62 662 700 660 716 0.71 15 31 0.00029 0.2 11.7 0.9 31 62 718 749 711 752 0.91 16 31 0.0023 1.6 8.8 4.2 28 63 771 806 753 808 0.75 17 31 9.6e-06 0.0067 16.4 0.7 28 63 806 841 803 843 0.93 18 31 0.0011 0.8 9.8 3.5 30 57 843 870 840 875 0.85 19 31 1.6e-07 0.00011 22.1 2.5 25 63 873 911 871 913 0.93 20 31 0.34 2.4e+02 1.8 0.4 39 63 922 946 914 948 0.76 21 31 0.00024 0.17 11.9 6.8 24 64 942 982 928 983 0.90 22 31 0.00099 0.69 10.0 1.3 22 64 975 1017 974 1018 0.89 23 31 0.0001 0.071 13.1 5.2 27 62 1001 1036 988 1039 0.75 24 31 0.00053 0.37 10.8 5.6 23 64 1032 1073 1030 1081 0.73 25 31 0.00022 0.16 12.0 6.8 22 64 1073 1115 1072 1116 0.90 26 31 6.1e-05 0.043 13.8 8.9 24 65 1089 1130 1088 1130 0.94 27 31 4e-05 0.028 14.4 5.5 27 64 1141 1178 1135 1179 0.90 28 31 2.1e-06 0.0015 18.5 6.7 21 64 1170 1213 1170 1214 0.95 29 31 8.3 5.8e+03 -2.6 0.0 36 57 1291 1312 1288 1316 0.76 30 31 0.16 1.1e+02 2.9 0.2 43 61 1376 1394 1345 1400 0.79 31 31 1.8 1.3e+03 -0.5 0.8 36 57 1466 1487 1464 1501 0.74
Sequence Information
- Coding Sequence
- atggCCTACTATCGCACGTGCCAATGTGGCTGCACAGACCCACCGGAAATGACAGGGGGTGACCCACCACAAGAGGGATCCTGTGGGTGCAGTTATAACCCCTTTGCCGACGTGGGTCGTGACACCGAGATAACGGATTTATCTTACGCTCTGAGAAAATTGACCTCGATGAAGTGCCAAATGAAAAAGTGGCGCATAGAACGTCTGCAGCTAGAAAGTGAAGCAAGAGCTTTGAAGCAGGTTTTGCAGGAGCACgGACTGAATGCCGACATAGTGAAACCGGATCCGCTCGTGGTTCATTTGCGAAAAGAGAACGGGCGTTTGGAAAACGAGAATGCAGAGCTTCAGGACAAGGTAAAAACCCTTGCTGACACGATCTCAGAATACGAGTACAAGGACTCGCCTTGTGAGGCGGTAAAAAAGGTGCGAATTAAAATGAAGACCTTGAAAGAGGAACACGCAGCGGAAAAAAATcgATTGCGCGAGATCATTTCGGAGCTTAAAATTCGATTGCAAGAAGCTGAGGGCGACTCTTCATGTGCAGCGATGAATCGTCTACGAGCTAAGCTCAGAGAGCTGATGAAGGGTGGTAAGGTAGCTGATGAGCGAGTATCGATGGTCGTCCAGAGATCCATAGAGACGTTGGTTGACCTTACGGATAACGTGGATGAGCTGAAAGCTGAGATAGAGCGTTTAAAAGCGGAGATAAGGAGGCTGAAGGACCTTCTGAAGGAATGCGAAGACCGACGTTTGACTTCTTCGGATGTTGCCGTCGAGACAGCACCCATTGATTTTAGACCAGCTGAAAAACCATTGGTCGAAATGGACGTATCTGACCTGTTGCAGAGAATAAAAGATCTCGAAGCGTTGATTGCTGAGCTGAGAAAACAACTTGTCGACAAGGATGCGGCCATAAATGACCTCCAAAATCAGTCGTTCAGGCTTGAAACTGAGAACAAACGCCTTTCCGTAGATTTGGATCAGATGAACGTGAGCTATAAAGCACTGATGGATGAAGTTAAAGCCATGAAGGAGGAGCTCAAAAAAAGGGATGAGAAGGTATCCGAACTCTTGCGGAGCTTGCAAGCCTCAGCCATCGAGTTGCTTGGAATGAACAGGCTACAGAGTGAAATGGATGCTATAAAACCACAGATGTACAGCCTTGAATTAGAGCGGGACCAGCTGGTTTCGGAGCTCGGTAAAGTACGAGGTGTTGTATCTGAGAGAAATGAtcagataattaaaatactgGAAGAAAAGGACAAACATGTAAAGGCTCTTGCCAGAACATCGAATGTTATACAGTCAACCGTAGAGCCTTTGTTGGAGCAAGAAGCTGCCCTGAAGAAAGAGATCGATGCGCTTAAGGATCGAATAGCTGAATTGGAAAGGGAGCTTGCTGAATTAAGAAAGAAGCTCGCGCAACTGGAATCAGAGAACGCTGGGATACCAGGACTTCTCGAGAAGATCAAGAACCTCGAAGACGAACTGGCTAAACTCAAAGCCCAGCTAGCTGAAGCAAACGATAGAATACGTGagttagaaaaagaaatagtagGATTGAAAGCTGATAAAGCGCAGTTAGAAAAGGACCTTGCGGAAGCTAAAAAGGAGAtagagaaaatgaaggaagaGCTAGCTGCAGAGAGAGCAGCGAAAGAGGCGGCACTGAAAGAGTTAGGAAATTGTAGGGCGGAAAATGAACGATTGAATCAAGAGCTGAACGCGGCTAAGGTAGAGGTTGATAATCTTCGAGGTGAAATTGAAAGACTGAAAAACGCCTTGGATGCGGCGAAAGGTGAAGCTGACAAGCTCAGAAGTGATatggaaaagatgaaaaatgagcTTGATAAATTGAGAGCTGAAAATGATCAGCTGAAGAATCAGTTGGCGGGACTGACCGCGGAGAATGAGCGACTCAAAGGTGAAATTGACAAACTGAAGGATGAGAGAGATAAGCTACGAAATGAGATCAATGCCCTCAAAGCAGAGAACGATAAATTACAAGCAGAAGTCAATAAGCTAAAAGCAGAAGTTGAAAATCTTGAGGCTGAAAACGGAAGACTCAAAGCGGAGTTACAAAAACTTAAAACTGACTATGACACCCTGAAATCAGAGAATGACAATCTAAAGAAGAGCCTGGCAGACGCAGAAGGAAGGATAAAATCTCTGGAAGCTGAAAAAGGTAatcttttgaataaaatcgCGGAGTTGAAGAATCAGATCGACCAACTTCAGGGTGAACTCGCTGCAGAAAAAGCCGCAAAAGAGGCGGCGTTGCAAGAGCTGGCAGCGATCAAGTCTGAGCTCAAGGCTCTGTTGGCAGAAATGGATAAATTAAAAGCAGAGCGTGATAAACTTAAAGCTGCAGTTGACGATCTCACCAAACAACTTTCTCAGCTAAATAACGACCTTGACCAACTCAAATCAAAGTATGCCGCGTTGTTGGCGGAAAATGACAAGTTGAAAGGAGAGGTCGATCGGTTGAAGGGGGAGAATGACAAGCTTAAAAATGATCTAGACAAAATTAAAGCAGAGCTCGATAACTTAAAAGCAGAAAACGCTAAGCTCAAAGAAGAAAACGCAAACTTGAAAAAAGACCTCAGCGATGCTGAAGCTAAGATTAAAGGCcttgaaaatcaaatcaagGCTTGCGAAGAAGAGAAAGCCAGATTACGGAAAGAAGTCGATGCCCTTAAAGACCAAGTTGACAAGCTTGGCAAAGAGCTAGCAGCAGAACGAGCTGCGAAAGAAGCAGCTCTGCGGGAGCTAGATGCCCTGAAAAATGAGTTAGTCGCATTAAGAGCAGAGCTGGATAAAGTACGGGGGGAAAATTCAAGGCTAAAGGGTGAGCTGGACAAACTGAAGGCTGAGAACGAGGCTCTCAAAGCTGAGAACAGTAAAATGAAAGGGGAGCTCGATAGGCTGAACGCTCAAGTCGCGAAACTATTGGGTGACATCGATGCTTTGAAAGCAGAAAATGCAAAACTCAAAGGAGATTTGGATAGACTGAATGATGAGATTAAGGCCTTGCGAGCTGAGAACGACAAACTCAAGGCTGAGCTCGATCAGATGAAGGATGAGAATGCGAAATTGAAAGACCAGCTGGCCAGCGTTAAAGCGGAAATGGCGAAGTTGAAAGAAGAGCTGGATAAACTGAAATCTGAGAACGATGCGCTACGAGGTGagctttcaaaaatgaaaggagAGTTGGATAAGCTGAATGCAGAGATCGCGAAACTCCAAAGAGATCTTGACACTCTCAAAGCAGAGAACGCGAAGCTCAAAGACGAACTTGATAAACTTGCTGCTGAGAACAAAGAACTGAGATCTGAAAATGCTAAACTCAAAGGAGAGTTGGATAACCTGAAATCCGAAAacgaaaagttgaagaaagaTCTGGCTGCAGCGATAGCGGAGGTCGCCAAACTAAAAGAAGATCTCAATAAACTGCAGGCTGAAAACGATGCACTCAAAGCTGAGAAtgctaaaataaaaagtgaactCGACAAGCTGAAATCTGAGAACGCGGAGCTGAAAAAAGCACTTGACTCTCTGGAGGCAGAGAATGCTAGATTGAAATCGGAAGTCGATGATCTTAAAAAAGACAACGAAAAGCTCAAAAATGATCTCCAGAAAGCGATTGCAGAAATGGACAAACTAAAAGCGGAATCTAGTGATACGAGGCGACCAAGTAAAGCGACCCCGAGGAGTCAGGATCCCTCTAAGCCAAGAACTGAGGCTTCAACGGAGGCTGTCCCTCTTGTTGAAATTGAGCGGCTCAGTCCCGTTCCGAAAACTGAGAAGAGGGTTAGACGTGGCAGTTCGGTTGTAAAAAAGGATCAAGAATCGCAAGGGGAGGGTTGCGGTGATTACGAGAATGCAAACGAACAGCTGAGGAAGAATATGAATATGCAGGACAGAGCTGTGCAACGAATACGAAATTTCATCAAGTACATACTTGGCGAGAGACCGTCACCTCCGGAAATGGCGCAGGAATTAGATCATCGGATGTCATCTgtgatgagaaataaattcgcTGAAGATCTGATGGAATTACTTAAGGAGTCTCAGTTCTTATCGGAAAGCATCTTTAACGCCGAAAACGACGTTCAAGGACTGATTAAACTTCTGGACGAAATCAACAGGCTCCGAGATGAAAATAGGGCCCTAAAAAATCAAGCTGACGATATTCGCGATATGGACAGTTTCGGCGACGTCTTCGACGCAGAGTCCTGGCTCAGATCGTTGACGTTGACGGAATTGGCGGAACTTCACGACAGGATTTGTTTAGTAACGTCGTGCATAGTGCAGCAAGATATAAACCCCGAAGATTACGTAGACGGTTCCGTCGAAGTCGACGGAGTCTGCCGTCCGTGTGTAGAAATATCTGAAGATCCGGTCGACGAATACGAGGCATTAAACCGAAGAATAGCAGCTCTTCAGCGTCAGATAAACGAGAAGCAAAATGAAGCATCTCAAAAAGTGCAGCAAATGCGTGAAGTTATGTGGCGAGAACAGGAGAATTTAATCCGTTTGTCAGATGAGATGAACAGCCAAAAACGTAGAAATTTATCAATGCAATTAAAAATCGGCGCGAGTAATTGCGCCGGTAAATTAGACCCTTGGGCAATGGCCGACAGACTCGACGCTCTGAATGACCAAAGACTCGGCCtcgaattcaaaaattattcagagatTACTCACAAAGAAGATATTTGCGACGATAATAATTGCTCAGTAATAAGAAGGGATTCTTCACCGAAATCGGACCTCATCAACAATCAggaagctgaagaaaaaaaagacgaagaaaaagaagcttcGggATTGATTTACTACTTGGAAGGAGCCGAAGTCGTCTACTTGGATCCCGAATATTTCGATGATGAATCCTCCGAGATTGaagttatatatttgaatagcACGTTCAGCGGACTTTCGACCAATTTTACTATAATGAAACAAATCCCCGATTCAGTTATGGGTCACTTTAATCTATATTTAAAATCTATGGGAGATTACACAGTTTCCGGTGGTATATCCTTAACTATGCCCTTGTGCGAAATGACAAGCGAGCCCATCTTGATGGGAAAAATTTTGAGCCTGCTGGGAATTAATGACGAAAGCTGCCCTCCACCACCGGGAGTATACGGAATGCCTTTTTGGGCCCCAACGGTCGATTTATTGCCCGATTCGATGCCGGGAAACGACTATAAAGTTTCTTTCACTGCGGACTACGACGATGACAAGATTCTGGTAGACTTAGCTGTATACGTTCAAgtgttttga
- Protein Sequence
- MAYYRTCQCGCTDPPEMTGGDPPQEGSCGCSYNPFADVGRDTEITDLSYALRKLTSMKCQMKKWRIERLQLESEARALKQVLQEHGLNADIVKPDPLVVHLRKENGRLENENAELQDKVKTLADTISEYEYKDSPCEAVKKVRIKMKTLKEEHAAEKNRLREIISELKIRLQEAEGDSSCAAMNRLRAKLRELMKGGKVADERVSMVVQRSIETLVDLTDNVDELKAEIERLKAEIRRLKDLLKECEDRRLTSSDVAVETAPIDFRPAEKPLVEMDVSDLLQRIKDLEALIAELRKQLVDKDAAINDLQNQSFRLETENKRLSVDLDQMNVSYKALMDEVKAMKEELKKRDEKVSELLRSLQASAIELLGMNRLQSEMDAIKPQMYSLELERDQLVSELGKVRGVVSERNDQIIKILEEKDKHVKALARTSNVIQSTVEPLLEQEAALKKEIDALKDRIAELERELAELRKKLAQLESENAGIPGLLEKIKNLEDELAKLKAQLAEANDRIRELEKEIVGLKADKAQLEKDLAEAKKEIEKMKEELAAERAAKEAALKELGNCRAENERLNQELNAAKVEVDNLRGEIERLKNALDAAKGEADKLRSDMEKMKNELDKLRAENDQLKNQLAGLTAENERLKGEIDKLKDERDKLRNEINALKAENDKLQAEVNKLKAEVENLEAENGRLKAELQKLKTDYDTLKSENDNLKKSLADAEGRIKSLEAEKGNLLNKIAELKNQIDQLQGELAAEKAAKEAALQELAAIKSELKALLAEMDKLKAERDKLKAAVDDLTKQLSQLNNDLDQLKSKYAALLAENDKLKGEVDRLKGENDKLKNDLDKIKAELDNLKAENAKLKEENANLKKDLSDAEAKIKGLENQIKACEEEKARLRKEVDALKDQVDKLGKELAAERAAKEAALRELDALKNELVALRAELDKVRGENSRLKGELDKLKAENEALKAENSKMKGELDRLNAQVAKLLGDIDALKAENAKLKGDLDRLNDEIKALRAENDKLKAELDQMKDENAKLKDQLASVKAEMAKLKEELDKLKSENDALRGELSKMKGELDKLNAEIAKLQRDLDTLKAENAKLKDELDKLAAENKELRSENAKLKGELDNLKSENEKLKKDLAAAIAEVAKLKEDLNKLQAENDALKAENAKIKSELDKLKSENAELKKALDSLEAENARLKSEVDDLKKDNEKLKNDLQKAIAEMDKLKAESSDTRRPSKATPRSQDPSKPRTEASTEAVPLVEIERLSPVPKTEKRVRRGSSVVKKDQESQGEGCGDYENANEQLRKNMNMQDRAVQRIRNFIKYILGERPSPPEMAQELDHRMSSVMRNKFAEDLMELLKESQFLSESIFNAENDVQGLIKLLDEINRLRDENRALKNQADDIRDMDSFGDVFDAESWLRSLTLTELAELHDRICLVTSCIVQQDINPEDYVDGSVEVDGVCRPCVEISEDPVDEYEALNRRIAALQRQINEKQNEASQKVQQMREVMWREQENLIRLSDEMNSQKRRNLSMQLKIGASNCAGKLDPWAMADRLDALNDQRLGLEFKNYSEITHKEDICDDNNCSVIRRDSSPKSDLINNQEAEEKKDEEKEASGLIYYLEGAEVVYLDPEYFDDESSEIEVIYLNSTFSGLSTNFTIMKQIPDSVMGHFNLYLKSMGDYTVSGGISLTMPLCEMTSEPILMGKILSLLGINDESCPPPPGVYGMPFWAPTVDLLPDSMPGNDYKVSFTADYDDDKILVDLAVYVQVF
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00174786;
- 90% Identity
- iTF_00173906;
- 80% Identity
- iTF_00174786;