Basic Information

Insect
Athalia rosae
Gene Symbol
-
Assembly
GCA_000344095.2
Location
NW:57081-63456[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 30 0.67 3.1e+02 0.7 0.9 47 64 67 84 52 85 0.64
2 30 0.00089 0.41 10.0 0.2 35 63 99 127 98 129 0.82
3 30 0.00058 0.26 10.6 1.1 35 64 216 245 212 246 0.90
4 30 0.31 1.4e+02 1.8 0.2 36 64 307 335 302 357 0.79
5 30 0.059 27 4.1 0.3 28 58 372 402 368 409 0.60
6 30 0.0055 2.5 7.4 3.5 32 64 443 475 425 476 0.69
7 30 0.19 85 2.5 9.8 28 62 467 504 455 507 0.54
8 30 0.00043 0.2 10.9 7.4 25 63 509 547 506 548 0.94
9 30 0.067 31 3.9 1.8 31 64 557 590 550 591 0.82
10 30 0.02 9.2 5.6 2.8 25 61 565 601 563 605 0.87
11 30 0.001 0.47 9.7 5.2 31 63 599 631 587 633 0.68
12 30 2.1e-07 9.8e-05 21.5 3.8 21 64 617 660 617 661 0.95
13 30 1e-06 0.00046 19.4 5.8 24 64 662 702 659 710 0.68
14 30 0.0004 0.18 11.1 1.1 31 62 718 749 716 752 0.90
15 30 0.0087 4 6.8 6.9 20 65 756 801 744 807 0.66
16 30 1.1e-05 0.0051 16.0 0.5 28 63 806 841 802 843 0.92
17 30 0.00083 0.38 10.0 4.1 30 60 843 873 840 876 0.84
18 30 3.6e-07 0.00016 20.8 1.8 26 63 874 911 870 913 0.93
19 30 0.032 15 5.0 5.9 23 61 927 965 914 969 0.63
20 30 0.0043 2 7.8 7.6 11 64 928 982 926 983 0.83
21 30 0.0023 1 8.6 0.7 24 64 963 1003 961 1004 0.85
22 30 0.0002 0.092 12.0 7.4 27 62 1001 1036 988 1039 0.84
23 30 0.00021 0.097 11.9 4.8 22 64 1031 1073 1031 1074 0.88
24 30 0.00019 0.085 12.1 5.5 22 64 1066 1108 1065 1109 0.84
25 30 1.1e-05 0.0052 16.0 8.5 23 65 1088 1130 1086 1137 0.93
26 30 1.8e-05 0.008 15.4 5.0 27 64 1141 1178 1136 1179 0.90
27 30 1.8e-06 0.00084 18.5 6.4 21 63 1170 1212 1170 1214 0.95
28 30 6.7 3e+03 -2.5 0.0 36 58 1291 1313 1289 1317 0.77
29 30 0.089 41 3.5 0.1 43 61 1376 1394 1345 1403 0.69
30 30 1.6 7.4e+02 -0.5 0.6 36 56 1466 1486 1464 1498 0.81

Sequence Information

Coding Sequence
atgtccTACTATCGCACGTGCCAATGCGGCTGCACGGACCCACCGGAAATGACAGGGGGTGACCCACCGCACGAGGGATCCTGTGGGTGCAGTTATAACCCCTTTGCCGACGTGGGGCGTGACACCGAGATAACAGATTTATCTTACGCTCTGAGAAAACTGACCTCGATGAAGTGTCAAATGAAAAAGTGGCGCATAGAACGTCTCCAGTTGGAAAGCGAAGTGAGAGCTTTGAAGCAGGTTTTGCAGGAGCACgGGCTTAACTCCGATATGGTGAAACCGGATCCGCTCATGGTTCATCTGCGAGAGGAGAACGGTCGCTTGGAAAACGAGAATGCGGAGCTTCGGGACAAGGTGAAAACCCTTGCCGACACAATCTCGGAATACGAGTACAACGACTCGCCTTGCGAGGCGGTGAAAAAGGTGCGAATTAAAATGAAGATCTTGAAGGAGGAACACGcggcggaaaaaaatcgATTGCGGGAGATCATTTCGGAGCTTAAAATCCGTTTGCAAGAGGCTGAGGGTGACTCTTCGTGTGCAGCGATGAATCGTTTACGAGCGAAGCTCAGGGAGCTGATGAAGGGTGGCAAGGTAGCCGACGAGCGGGTATCTATGGTCGTCCAGAGATCCATAGAGACGCTGGTCGAGCTTACGGATAACGTGGACGAGCTGAAAGCTGAGATAGAGCGTTTAAAAGCGGAGATAAGGAGGCTGAAGGATCTTCTGAAGGAATGCGAAGACCGACGTTTGACTTCCTCGACGGTCGCCGTCGAGACAGCACCCATCGATTTGAAACCGGCCGAGAAACCGTTAGCCGAAATGGACGTGTCCGACCTCTTGCAGAGAATAAAGGATCTCGAAGCGTTGATTGCTATGCTGAGAAAACAACTCGTCGACAAGGACGCGGTCATCAACGATCTGCAAAATCAGTCGTTCGATCTCGCTACCGAGAACAAACGTCTTTCCGTAGATCTCGACCAGATGAACGTCAGCTACAAAGCACTGATGGACGAAGTTAAAGCCATGAAGGAGGAGctcaaaaaaagagatgacaagGTATCCGATCTCTTGCGGAGCTTGCAAGCCTCAGCCATCGAGTTGCTTGGAATGAACAGGCTACAGAGTGAAATGGACACTCTAAAACCACAGTTGTATAGCCTTGAATTGGAGCGGGACCAGCTGGTTTCGGAGCTCGGTAAAGTACGGGGTGTCGTGTCTGAGAGGAATGATcagataattaaaatattggaAGAGAAAGACAAACACGTAAAAGCTCTTGCCAGAACGTCGAATATTATACAGTCAACGGTGGAGCCTTTGATGGAGCAAGAAGCTGCCCTGAAGAGAGAGATCGATGGGCTCAAGGATCGAATAGCTGAACTGGAAAGGGAGCTTGCCGAACtaagaaaaaagctcgcgcAATTGGAATCAGAGAATGCCGAGATACCGGGACTCCTCGAGAAGATCAAGAACCTCGAAGACGAACTGGCCAGACTCAGAGCCCAGCTGGCTGAAGCGAACGATAGAATACGAgagttagaaaaagaaatagcaGAATTGAAGGCTGATAAAGCGCAGTTGGAAAAAGACCTCGCCGAGGCTAGAAAGGAGATGGAGAAAATGAGGGAAGAACTCGCGGCGGAGAGAGCAGCGAAAGAGGCGGCGCTGAAAGAGTTGGGAAATTGTAGGGCGGAAAATGAACGATTGAACAAAGAGTTGAACGCGGCTAAGGCAGAGGCTGATAATCTTCgcggtgaaattgaaagactGAAAAACGCCTTGGATGCGGCGAAGGGTGAAGCTGACAAGCTCAGGAGTGAtatggaaaaactgaaaaatgagcTTGACAAATTGAGAGCTGAGAACGATCAGCTGAAAAATCAGCTGGCGGGACTGACCGCGGAGAACGAGCGACTCAGAGGTGAAATAGACGCACTGAAGGACGAGAGAGACAAGCTACGAAATGAGATCAATGCCCTCAAAGCGGAGAACGATAAATTGCAGGCAGAAGTCAACAAGCTGAAAGCGGAAGTTGAAAGACTCGAGGCTGAAAACGGAAGACTCAAAGCGGAGTTTCAGAAACTCAAAAATGACTACGATGCCCTGAAATCGGAGAATGACGATCTTAAGAAGAGTCTGGCCGACGCCGAAGGAAGGATAAAATCTCTGGAAGCTGAAAAAGCTAATCTCTTGAACAAAATTGCGGAGTTGAAGAATCAGATCGACCGGCTTCAGGGTGAACTCGCTGCAGAAAAAGCCGCAAAAGATGCAGCGTTGCAAGAGCTAGCAGCGATCAAATCCGAGCTCAAAGCTCTGTTGGCAGAAATGGATAAGTTGAAAGCAGAGCGCGATAAACTCAAAGCTGCAGTTGACGATCTCACGAAACAACTTTCTCAGCTAAACGACGACCTGGACCAGCTGAAATCGAAATATGCCGCATTGTTGGCGGAAAATGACAAGCTGAAAGGAGAGGTCGATCGGTTGAAGGGGGAGAATGACAGACTCAAAAATGACCTGGACAAAATCAAAGCAGAGCTTGATAACTTGAAAGCAGAAAACGCCAAGCTCAAAGAAGAGAAcgcgaagttgaaaaaagaccTCAGCGCCGCTGAATCTAAGATAAAAGGtcttgaaaatcaaatcaaggcttgcgaagaagaaaaagccaGATTAAGGAACGAAATCGATGCCCTTAAAGATCAAGTTGACAAGCTTGGCAAAGAGCTAGCGGCAGAGAGGGCTGCGAAAGAAGCAGCTCTGCGGGAGCTGGATGCCCTGAAAAATGAGTTGTCCGCATTGAGAGCAGAGTTGGATAAAGTACGGGGGGAAAATACACGGCTAAAGGGTGAACTGGACAAACTGAAGGCTGAGAACGAGGCTCTCAAAGCTGacaacaataaaatgaaaggcGAGCTCGATCGGCTGAACGCTCAAGTCGCGAAACTATTGGGCGACATCGATGCTCTGAAAGCAGAAAATGCAAAGCTCAAAGGAGATTTGGACAAACTGAATGATGAGATTAAGGCCTTGCGAGCTGAGAACGACAAACTTAAGGCCGAGCTCGAACAGATGAAGGCCGAGAATGCGAAACTGAAGGATCAGCTGGCCAGCGCACAAGCGGAAATGGCGAAGTTGAAAGAAGAGCTGGATAAGCTGAAATCCGAGAACGATGCGCTTCGAGGTGagctttcaaaaatgaaaggagagTTGGACAAGCTAAATGCGGAGATCGCAAAACTTCAAAAAGATCTCGACGCTCTCAAAGCGGAGAACGCGAAGCTCAAAGACGAACTCGACAAACTTTCTGCTGAGAACAAAGAACTGAGATCTGAGAATGCTAAACTCAAAGGAGAGTTGGATAGCCTGAAATCCGAGAACGAAAAGCTGAAGAAAGATCTGGCTGCGGCGATGGCGGAAGTCGCCAAACTCAAAGAAGATCTCGATAAATTGCAGGCTGAAAACGACGCGCTCAAAGCTGAGAATGCTAAAATAAAGAGTGAACTCGACAAGCTCAAATCCGAGAATGCGGAGCTCCAAAAAGCACTCGATTCTCTGAAGGCTGAGAATGCTAGACTGAAATCGGAAGTCGATGATCTTAAAAAAGACAACGAAAAGCTCAAAAATGATCTCCAGAACGCGATTGCAGAAATGGACAAACTGAAAGCGGAATCTAGTGGCTCAAGGCGATCGAGTAAAGCGACCCCGAGGAGTCAGGATCCCTCTAAGCCAAGTCCTGAGGCTTCAACGGACGCTGTCCCTCTTGCTGAAATTGAGCGGCTCAGCCCGGTTCCGAAAACTGAGAAGAGGGTCAGACGTGTCAGTTCGGTTGTAAAAAAGGATCAAGGATCGCAAGGGGCGGGTTGCGGTGATTACGAGAATGCAAACGAACAGCTGAGGAAGAATATGAATATGCAGGACAGAGCCGTGCAACGAATACGAAATTTCGTCAAATACATACTCGGCGAGAGACCGTCACCCCCGGAAATGGCTCAGGAACTGAATCATCGGATGTCATCTgtgatgagaaataaattcgcCGAAGATCTGATGGAATTACTCAGGGAGTCTCAGTTCTTATCGGAAAGTATATTCAACGCTGAAACCGACGTTCAAGGACTGATGAAACTTCTGGACGAGATCAACAGGCTCAGAGATGAAAATACGGCGCTGAAAAATCAAGCTGACGACACTCGCGACATCAACAGTTCCGGCGACGTCTTCGACGCGGAGTCCTGGCTCAGATCGTTGACGTTGACGGAATTGGCGGAGCTTCACGACAGGATTTGTTTAGTGACGTCGTGCATAGTGCAGCAAGACATAAACCCGGAAGATTACGTAGACGGTACCGTGGAAGTCGACGGTGTCTGTCGTCCCTGCGTAAAAATATCCGAAGATCCAGTCGACGAGTACGAGGCATTGAACCGAAGAATAGCGGCTCTCCAGCGTCAGATAAATGAGAAGCAGAATGAAGCGGCTAAAAAAGTGCAGGAAATGCGGGAAGTCATGTGGCGAGAACAGGAGAACTTGATCCGGTTGTCGGACGAGATGAACGCCCAAAAACGTAGACATTTGTCGATGCAATTGAAAATTAGCGCGAATAGCTGCGCCGGTAAGCTCGACCCTTGGGCAATGGCGGAGAGACTCGACGCCCTGAATGACCAAAGACTCGGCCTCGATTTCAAGACCTATTCGGAGATCACTCGCGAAGAAGATATTTGCGACGATAATAACTGCCCGGAGATGAGAAGGGATTCCTCGCCGAAATCGGACCTCGTCAACAATCGGGAagccgaagagaaaaaagacgaagaaaaagaggctTCGGTACTTTCGATGTTCAAGGGAATCGGTACAGTGACTCGTGAAGTCGAATATCCGTTCGATAATTCTACCTGCGTAAAatctttgatgaaaaatcgtaGGACGAACGCTCCTGCGTCTTGTTCACTTCCCGTGAAACATGCGGACGTTCCATGCTGTCGAAATCCTTGTTGTCCATCGTTTGTGAATCAGaacgttcttttttcgaataaaaatgctGCGGCATCGTCCAAAAATGACAACGACGTCGACGATAACGCCGAGATAGAGGTGGCGGTTTGGTGTAAAGGTGTACGCGATCGTAGTTGA
Protein Sequence
MSYYRTCQCGCTDPPEMTGGDPPHEGSCGCSYNPFADVGRDTEITDLSYALRKLTSMKCQMKKWRIERLQLESEVRALKQVLQEHGLNSDMVKPDPLMVHLREENGRLENENAELRDKVKTLADTISEYEYNDSPCEAVKKVRIKMKILKEEHAAEKNRLREIISELKIRLQEAEGDSSCAAMNRLRAKLRELMKGGKVADERVSMVVQRSIETLVELTDNVDELKAEIERLKAEIRRLKDLLKECEDRRLTSSTVAVETAPIDLKPAEKPLAEMDVSDLLQRIKDLEALIAMLRKQLVDKDAVINDLQNQSFDLATENKRLSVDLDQMNVSYKALMDEVKAMKEELKKRDDKVSDLLRSLQASAIELLGMNRLQSEMDTLKPQLYSLELERDQLVSELGKVRGVVSERNDQIIKILEEKDKHVKALARTSNIIQSTVEPLMEQEAALKREIDGLKDRIAELERELAELRKKLAQLESENAEIPGLLEKIKNLEDELARLRAQLAEANDRIRELEKEIAELKADKAQLEKDLAEARKEMEKMREELAAERAAKEAALKELGNCRAENERLNKELNAAKAEADNLRGEIERLKNALDAAKGEADKLRSDMEKLKNELDKLRAENDQLKNQLAGLTAENERLRGEIDALKDERDKLRNEINALKAENDKLQAEVNKLKAEVERLEAENGRLKAEFQKLKNDYDALKSENDDLKKSLADAEGRIKSLEAEKANLLNKIAELKNQIDRLQGELAAEKAAKDAALQELAAIKSELKALLAEMDKLKAERDKLKAAVDDLTKQLSQLNDDLDQLKSKYAALLAENDKLKGEVDRLKGENDRLKNDLDKIKAELDNLKAENAKLKEENAKLKKDLSAAESKIKGLENQIKACEEEKARLRNEIDALKDQVDKLGKELAAERAAKEAALRELDALKNELSALRAELDKVRGENTRLKGELDKLKAENEALKADNNKMKGELDRLNAQVAKLLGDIDALKAENAKLKGDLDKLNDEIKALRAENDKLKAELEQMKAENAKLKDQLASAQAEMAKLKEELDKLKSENDALRGELSKMKGELDKLNAEIAKLQKDLDALKAENAKLKDELDKLSAENKELRSENAKLKGELDSLKSENEKLKKDLAAAMAEVAKLKEDLDKLQAENDALKAENAKIKSELDKLKSENAELQKALDSLKAENARLKSEVDDLKKDNEKLKNDLQNAIAEMDKLKAESSGSRRSSKATPRSQDPSKPSPEASTDAVPLAEIERLSPVPKTEKRVRRVSSVVKKDQGSQGAGCGDYENANEQLRKNMNMQDRAVQRIRNFVKYILGERPSPPEMAQELNHRMSSVMRNKFAEDLMELLRESQFLSESIFNAETDVQGLMKLLDEINRLRDENTALKNQADDTRDINSSGDVFDAESWLRSLTLTELAELHDRICLVTSCIVQQDINPEDYVDGTVEVDGVCRPCVKISEDPVDEYEALNRRIAALQRQINEKQNEAAKKVQEMREVMWREQENLIRLSDEMNAQKRRHLSMQLKISANSCAGKLDPWAMAERLDALNDQRLGLDFKTYSEITREEDICDDNNCPEMRRDSSPKSDLVNNREAEEKKDEEKEASVLSMFKGIGTVTREVEYPFDNSTCVKSLMKNRRTNAPASCSLPVKHADVPCCRNPCCPSFVNQNVLFSNKNAAASSKNDNDVDDNAEIEVAVWCKGVRDRS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00174786;
90% Identity
iTF_00173906;
80% Identity
iTF_00175527;