Mgen002082.1
Basic Information
- Insect
- Megalopta genalis
- Gene Symbol
- -
- Assembly
- GCA_011865705.1
- Location
- NW:2875545-2900462[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 41 1.5 6.1e+02 -0.4 0.1 26 50 59 83 55 90 0.67 2 41 1.4 5.8e+02 -0.3 0.8 33 55 104 126 101 129 0.66 3 41 0.96 3.9e+02 0.2 0.6 38 60 154 176 144 180 0.81 4 41 0.018 7.2 5.8 3.1 30 61 218 249 212 252 0.76 5 41 1 4.1e+02 0.2 2.9 28 63 275 310 273 312 0.89 6 41 0.0045 1.8 7.7 1.1 24 46 364 386 359 397 0.76 7 41 0.085 34 3.6 2.1 25 62 400 441 398 444 0.83 8 41 0.93 3.8e+02 0.3 2.7 33 62 450 479 440 482 0.80 9 41 0.0015 0.6 9.3 1.3 24 63 483 522 481 524 0.92 10 41 0.0019 0.78 8.9 5.3 14 57 536 579 529 584 0.92 11 41 0.0015 0.62 9.2 1.1 22 62 579 619 578 622 0.88 12 41 5.6e-06 0.0023 17.0 2.5 24 63 637 676 636 678 0.95 13 41 0.16 64 2.8 6.0 24 62 665 703 664 720 0.89 14 41 0.00049 0.2 10.8 2.9 22 62 712 752 698 762 0.85 15 41 0.039 16 4.7 0.5 35 63 767 795 756 799 0.48 16 41 0.00063 0.26 10.4 4.8 24 60 791 827 789 832 0.79 17 41 0.029 12 5.1 3.1 31 62 840 871 826 878 0.61 18 41 0.0079 3.2 6.9 4.2 20 61 864 905 864 909 0.67 19 41 3.4e-05 0.014 14.5 7.9 21 63 907 949 900 951 0.90 20 41 0.0028 1.1 8.4 6.0 27 60 955 988 952 990 0.91 21 41 1.3e-05 0.0054 15.8 7.8 22 63 985 1026 984 1028 0.94 22 41 0.099 40 3.4 0.4 30 62 1021 1053 1020 1055 0.71 23 41 0.049 20 4.4 2.7 27 62 1074 1109 1064 1112 0.62 24 41 0.00052 0.21 10.7 0.7 32 62 1114 1144 1107 1146 0.89 25 41 0.00022 0.091 11.9 4.7 24 63 1176 1215 1175 1217 0.92 26 41 0.0034 1.4 8.1 3.6 24 59 1218 1253 1215 1259 0.81 27 41 0.12 49 3.1 1.4 27 65 1249 1287 1246 1291 0.83 28 41 0.011 4.6 6.4 7.4 20 59 1298 1337 1286 1340 0.80 29 41 0.00022 0.089 11.9 4.5 24 64 1358 1398 1357 1399 0.92 30 41 0.017 6.7 5.9 5.6 26 60 1409 1443 1406 1448 0.80 31 41 0.021 8.6 5.5 8.6 24 65 1459 1500 1445 1500 0.90 32 41 0.001 0.42 9.7 3.1 29 63 1492 1526 1487 1528 0.93 33 41 0.067 27 3.9 2.5 31 57 1529 1555 1526 1566 0.73 34 41 8.3 3.4e+03 -2.8 14.8 33 58 1569 1594 1535 1601 0.68 35 41 0.001 0.43 9.7 5.3 21 64 1578 1621 1577 1622 0.94 36 41 0.19 79 2.5 1.9 25 62 1617 1654 1613 1657 0.81 37 41 0.024 9.7 5.4 0.1 24 65 1658 1699 1655 1699 0.89 38 41 0.032 13 5.0 0.2 24 60 1672 1708 1668 1712 0.82 39 41 0.00066 0.27 10.4 3.4 29 58 1716 1745 1704 1752 0.62 40 41 0.097 40 3.4 2.1 29 50 1751 1772 1741 1786 0.54 41 41 8.7 3.6e+03 -2.8 2.6 36 52 1971 1987 1950 1998 0.54
Sequence Information
- Coding Sequence
- ATGGAGAATCGAGGTTGTCCTTGCGGATGCGAGTCATCGTATTCGTCGGAATCAATTAATCCGCCGAACGAGCCACGTTGTAGCTGCAATTACAATCCCTTCAGCGACAATTCGAAAGAATCGGAGATTTACGATCTGCCGTTTGCCCTGAGGAAACTCGCCGTAATGAAGTGTCAGATGAAGAAATGGCGAATGGAACGACTTCAGTTCGAGAGCGAAAATAGGTCTTTGAAACAAGCCCTGCAGTCATTCGGTGTAAATGTGGATGAGATATTGAAGCCTGATCCGCTGCTCGTGCAGTCTCGGGAAGAAATCGAATGGCTGCAAAATGCAAACGCGGCGCTCGAAGATAAAGTGAGGGATCTGGAAGAAACCCTCGCCGAACGAGATTACGGCGACGATCCTTGTGCATCGGTACACTATATCAGAGAGAAGATGAGAGTTCTACGGGAGCGCTTTGCACTTGAAAAGAAAGAATTAAGGGACACGATATCGCACTTGAAATTAAAACTCGCGCAGACCGAGGAGGACGTGAGCTGTCCTGCGATATATCGCCTGAGGGCGAAGCTCCGTGAACTGATGAAAGGAGGTCAAACAGCGGACCAGCAGGTCTCGAAAGTCGTGGAAAGATCGATCGAAACGCTGGTGGATCTCTCGAAGAGCTGCGACGACCTGCGTCTGGAAAATGAACAGCTCGTGGCACAGCTGGACGAATTACGTCGTCAATTGGCGGAGCTTGAGAAAAAGCGACCGAAAGAAGAGATACCGATGGAGATGCTGAGAACGGCCGAAACAACGACGGTTCCTGAATATATAGACGTTTCGGAGCTATTACAGAAGCTACGGGACTGCGAGCTCGCTGTTTCCGAATTGAAACAGCAACTTGACGAGAAAGACAAGATTATCGAGGGACTGAAGAAAGAACTCGAAGGCAGGCTGGATGAGCAGGCTCTGTTGGCTGAGATTGCGGCTATGAAAATAGAACTCGAGAAAAGAGATGACAAGGTGAGAGATCTCCTAAACGAAATGAGACAATCGGAGATAGACCTGTTGGGCCTGGGCAACCTACGATCGGAGCTGGACACCTTGAAGCCTCGTATATCCGAACTCGAAGACGAGCGGGACTCGCTGACAGACGAGGTGGCGAAATTGAGAAAACTATTAGAGGAAAGGAACGACCAAATAATCGAGATCCTGGAATACAAGAACAAGCTGGAGCAGGAGATGGCAGAAAAGGAACAAGAGGCCGAGCGGATTATCGACGGCTTGAAAAAGGAGATCGACGACCTGATGTCGCAGATAGCGAATCTGCGGGGCGAGGTCGACGAGCGTAACAAGCGAGTCGCGGAGCTGGAGAAGTGTTGCGCGGATAGAGACGAGCTTTCGAAGAAATTACAGCTCGCGGAGGACGAGCTGGGAACACTGCGAAACGAACTCGCGTCCGCGAAAGCTGCGATTGCGGATCTCCGGGGAGAGGTAGACGCCCTGCAGAATGACAAGGACAAACTGTTGAAACAGTTGGACGAAACGAAGGAGCGGGTTAACGCGTTGTCTGATCAATTGGCGGACGAGATGGCGGCCAACGAGGCTTTGCAGAAAGAGCTCGAAGCTGCCTGGAACGCGACGGAGGCACTGCGAGAAGAGAATTCGGATCTGAAAGGTCGGCTGGACATGGCGGAAGAGGAGAACGATAGATTAAGGGAGGAGAATAGTGACTACAAAGCGGAGCTTGATAAATCGAATGAAGCAAACCGCGAGTTGCAGAATAGCTTGGACGCGGCGCAATCGGAGAACGACGCGTTGAAGCAAGACATGTCGGCGTTGCGGAGCGACCTTGATGACGCGAGGAAAGAGGTTGACAGGCTGAGAGGCGATGGGGACGCATTGAAAGCTGCAGAGGCATACGCAAAGACGAAGATCGATGAATTTGAGTCGCGGATGGAGGATTTGAAAGCGGAGAAGGACCGTCTGGCCAGCGAGAACGCCGACGTGAAAGCCAAGAATTTGGAATTGGAACAAAAATTGGACGACGCGTTGAAAGACCTGGGAAAAATGAAGGCAGAGAATGCCGATCTGTTGGTAGAACTGGAGCGTCTGAAGGGAGAATTAGCGAGGGCCAAGAGCACGATCGATCAGCTGAAAGCGGAGATGGATTCTCTGAGGGACGGCCTGGACAAGTGCGTGGATGAAATGGAGAAACTAAAAGCCGAAAATGCCGAGCTTAAGGCGAAGAATGAGGCTCTCGGAGCGGGGCTTGATCAATGCAGGGCCGATGGGGACTCGTTGCGGGACGATTTGCAGCGGGCCGAGGCGGAATTGGACGGTGTGACCAACGAGCTGAATAAGCTGAAAGACGAGAACGGAGCGCTTAAACAAAATTTGGATCGACTGGCTGGTGAAAAGGACGCGTTACTGGCTGAGGTGGAGCGGCTGAAGAGGGAAGCGGAGGAGCTTCGAGCGGAAATAGATAGCGGAAAGCAGAGAGAAGCCGCGTTGAACGGCGAACTCGATCGCATTAAGCAGGAGAAGGACGCGTCGGCGGCGGAATTAGACAAGCTGAGAGATGAACGTTCTGCGTTGCGGAACGAGAGGGATCGGTTTAAGAAACAATTGGACGACGCGAACGATGAGATCGAGAGGCTGAGGGACGAGTTGGAACGGTTgaggaacgagaacgagaagcTGAGAACAGATGGGGAAGCGTGCAGAGATGATTACGATAAGCTGAAAGCCGAAGTCGAGGAACTaaggaaagagaaagattcGCTGAACGACGAGACCGAGCGACTCGAACGAGAGAAGGACGCGTTGGCAGCAGAATTAGGTAAATTGAAGGATGAACACTCTGCTTTGCAGAGGGGGAAGGATCAGCTGAAGAAACAATTGGACGATACGAACAATGAGAACGATAAGTTAAAGGACGAGTTGGCCCGGTTGAAGGAAGAGCTGGCTCGCTCGAAGGACGAGAACGAGGGGCTGAAGAAAGATGCAGACTCGTGCGAACAGGAGCGCGAGAAGTTGAAAGCCGAACTCGAGCAACTGAGGAAGGAAAGGGACGCGTTGAACGACGAATTAAACGGGCTGCGCGACGAAAATGGTAGGCTACGGAACGATCTGAATACTTTGGAAGGCGAAGCGAACGATTTAAAGACGAAGCTTGACGCAGAAAAGAAGACCAACGAAATTATGAGAAACGACGCGATGATGCTAAACAGTAAGGTGCAAGATTTAAGCGAAGCACTCGACGAGGCCAGGGCGAAGAATGCCGCGTTGACGGAGGAGAACGAAGGGCTACAATCGAAGTTGCGGAACGTGCAAGACGAATTGAACAGCGCGAGAGCAGAGTGCGCCGATTTGAGAGCGGAGATCGACTCTTTAAAGAAGGAGATTGCTAATCTGAAGGCGGAGATCGCTAAATTGAAGGAGGATCTCGAATACTGGAAATTAGAGCATTGTAAGATCAAGATGGAGATGGACAAACTCGAAGCTGACTTGGAGAAGGCGTTGATAGACTTGAACGAATGCAAGGCTGCGAAAGAGGCGTGCCAGGCAGAGTGTAATCGCCTGCGAGCCGAGAAAGCCAATTTGGAGAAGAAGATCGCGGATTTAACCGCGGAGATCGAGCAGCTAAAGAAAGCTCTTGAAGCAGAGAAAGCGGCTACAAGCAAATGCGAGGCGGAGCTCGCGGCCTTGAGCAGCGAGCTGGACGCATTGAAGAGGGAGTTGGAAAATTTGAGGGACGAGAACCGAAGGTGCGAAGGAGAAGCGGACGATCTTCGGAAACAGATCGCAGGGCTGAACGGCGAACTGGACAGGCGCAGGGAAGAGCTCGCCGCGTTGAGGAACGCCAACGAGAAGCTGAAGGCCGACGTGGCTGCGTCGAACAACGAGAGAAACAGACTGCAGGAGGAACGTGACAAACTGAAGGCCGAGGCAGATGCTGTGAAGGAGGAGAACGCCGCGCTTCATCGGGATCGAGACAAGCTGGCGAACGAAGCTACCAGGCTGAGGGGCGAGGGTGACGGACAGCGAGGGGAGCTCGATAAGCTCAGGTCCGACTTGGCTTCGCAGAGGGCTGCCGTCGACAAGTTGCAGACCGACCTGAAGGACTGCCAGGCCGAGAACGGCAAACTGCAACAGCAGCTGAACGAATTAAAGAGCGATTTGGATAGATCGAAAGACGAGAGGAGCAAGCTGGACAAGTCACTGGCGGACGCGGAGGCTGAAGCTAACGCTTTGGAGGCCGAACTCAGGGATCTGCGGGCCGAGAAGCGAGAACTGCTCGATGAACTTGAGCGGCTGCGCGGCGAGTTGGTTCGCCTTATGAACGAGACTGATAAACATAAAGCAGCCACGAAAGAGCTGGTCGCCTTGAAGGAGGAGTTGGACGCTTTAAGAGCGGCGCTGGACAAGGCCCGCGCCGAAAACGAAGAATTGATCAGGGAAAACGAGAAACTGAGGTCTGACGCGGCGAAATTAAATCAGCAGCTGCGGGCGTTACGGGACGAGAACGAGGCGACGAAGAGGGAGAACGCGGCGATGTCGTCGGAATTAGCCGCGACAAAGGCCAAATTAAAAGATGCTGAGGACCGGCTGAACGATCTGCGCGCCGAGAACGATGCCCTGAGGGAAAGGGTCGCCGACCTCGAAGGCGTGGCGAAGGAGCTCGAAACGGCGGTGAAACAGTTGGAGGATCGCAGAACGGAGCTGGAGAGATTAAAGGCGgaattgcaaaaattgaaagACGAGAACGGGAATTTGGAGAACGAGCTCGAAGACGCCCGGGACGAATCGAAGAGGCTTAAGGAAGTGAACGACAAGCTGCGAACGGATCTTGACGCTTTGAAGGGGGCTCGTGATAAGTTGAGGGAAGATTTGGAGAATCTGAAGGCGATCAATGAGAAATTAAAGGACGAATTGGGCGACTCGAAGACGCTGAACGACAAGCTGAAGGACGATTTGAAAAATCAGAAGGCGATTAACGATAAGCTGTCGGATGATTTGGACAGGTCGAATGCCGCGAACGATAAGTTACGGAATGACCTCGACGCGCTCCGGGCAGCCAATGATCGTTTGAGAAATGATTATACGAAGTTGTCGGCCGAAGCTGATAAATTGAAGAGCGACAGCGAAAATTCTGCGGCGGCGTTGAAGAATGAATCGGAGAGGTTGAGAAAGGATAATGACGATTTAAACAACGAGATTGAAAGGTTGAGGGAACAATTGAGCGACTGCCAGGATGTAGTTGCAAGGTTGAAAGGAGAATTGAAGGAGTGCCAGGATGAAAATGATAGGATGCGCAAAGAATTACGGATCTCGAAGAACGCAAACAGGAAACTAAGAGCGTCCTCAGAGCAAGACACGGCGATCTTGAAAAGCCTTGCAGACAATTGCGGCGATTTCATTAGGGCGAACGATTTGTTAACGAGAAAGCTTGACAAGCAAAACGAAGGCGTGCAACGTGTTCGTgattacataacatttttggAAGGTAAGGTTGAAGGTGAACCCAAGATGGCAGACGAGATGGACGACGACCTATATATTGATCCGGAAATGAAAAAGGAGATAGCAGACTTATTGAAAAAGTCGAAGGATTTGTCCGACAATATTTATAACACTGAACAAGAGATTCAGAATATTGCAGGCATTCTGAAGCAACTTCAGGATGCGGAGAAACCAACAGGAGCGTTCGATCCCGATTCCTGGCTTAACtCGTTGACATTGACGCAACTGGCAGAGCTGCACGACAAGATCTGCCTATTGACCTCGGACATGGTGAATCAGGATAAACCGAGAGCAGTTAGCTGCGAGCCGGGAAGTCCACTAAGAGCggattacaatattttaaatcagCGGATAGCCGCTCTGCAGAGACAGATAGCGGAGAAGCAAATGGAGACGGGATCGAAGCTGCAAGAGCTGAGGCGGGCTCTTCGGCAGGAGCAGGCTAACCTAATTCGAATCAACGACGAAATGAATTTAGAGAGAAGACGTAATTTGACCCTTCAGTCCAGAATGGATGATGCATCTTAA
- Protein Sequence
- MENRGCPCGCESSYSSESINPPNEPRCSCNYNPFSDNSKESEIYDLPFALRKLAVMKCQMKKWRMERLQFESENRSLKQALQSFGVNVDEILKPDPLLVQSREEIEWLQNANAALEDKVRDLEETLAERDYGDDPCASVHYIREKMRVLRERFALEKKELRDTISHLKLKLAQTEEDVSCPAIYRLRAKLRELMKGGQTADQQVSKVVERSIETLVDLSKSCDDLRLENEQLVAQLDELRRQLAELEKKRPKEEIPMEMLRTAETTTVPEYIDVSELLQKLRDCELAVSELKQQLDEKDKIIEGLKKELEGRLDEQALLAEIAAMKIELEKRDDKVRDLLNEMRQSEIDLLGLGNLRSELDTLKPRISELEDERDSLTDEVAKLRKLLEERNDQIIEILEYKNKLEQEMAEKEQEAERIIDGLKKEIDDLMSQIANLRGEVDERNKRVAELEKCCADRDELSKKLQLAEDELGTLRNELASAKAAIADLRGEVDALQNDKDKLLKQLDETKERVNALSDQLADEMAANEALQKELEAAWNATEALREENSDLKGRLDMAEEENDRLREENSDYKAELDKSNEANRELQNSLDAAQSENDALKQDMSALRSDLDDARKEVDRLRGDGDALKAAEAYAKTKIDEFESRMEDLKAEKDRLASENADVKAKNLELEQKLDDALKDLGKMKAENADLLVELERLKGELARAKSTIDQLKAEMDSLRDGLDKCVDEMEKLKAENAELKAKNEALGAGLDQCRADGDSLRDDLQRAEAELDGVTNELNKLKDENGALKQNLDRLAGEKDALLAEVERLKREAEELRAEIDSGKQREAALNGELDRIKQEKDASAAELDKLRDERSALRNERDRFKKQLDDANDEIERLRDELERLRNENEKLRTDGEACRDDYDKLKAEVEELRKEKDSLNDETERLEREKDALAAELGKLKDEHSALQRGKDQLKKQLDDTNNENDKLKDELARLKEELARSKDENEGLKKDADSCEQEREKLKAELEQLRKERDALNDELNGLRDENGRLRNDLNTLEGEANDLKTKLDAEKKTNEIMRNDAMMLNSKVQDLSEALDEARAKNAALTEENEGLQSKLRNVQDELNSARAECADLRAEIDSLKKEIANLKAEIAKLKEDLEYWKLEHCKIKMEMDKLEADLEKALIDLNECKAAKEACQAECNRLRAEKANLEKKIADLTAEIEQLKKALEAEKAATSKCEAELAALSSELDALKRELENLRDENRRCEGEADDLRKQIAGLNGELDRRREELAALRNANEKLKADVAASNNERNRLQEERDKLKAEADAVKEENAALHRDRDKLANEATRLRGEGDGQRGELDKLRSDLASQRAAVDKLQTDLKDCQAENGKLQQQLNELKSDLDRSKDERSKLDKSLADAEAEANALEAELRDLRAEKRELLDELERLRGELVRLMNETDKHKAATKELVALKEELDALRAALDKARAENEELIRENEKLRSDAAKLNQQLRALRDENEATKRENAAMSSELAATKAKLKDAEDRLNDLRAENDALRERVADLEGVAKELETAVKQLEDRRTELERLKAELQKLKDENGNLENELEDARDESKRLKEVNDKLRTDLDALKGARDKLREDLENLKAINEKLKDELGDSKTLNDKLKDDLKNQKAINDKLSDDLDRSNAANDKLRNDLDALRAANDRLRNDYTKLSAEADKLKSDSENSAAALKNESERLRKDNDDLNNEIERLREQLSDCQDVVARLKGELKECQDENDRMRKELRISKNANRKLRASSEQDTAILKSLADNCGDFIRANDLLTRKLDKQNEGVQRVRDYITFLEGKVEGEPKMADEMDDDLYIDPEMKKEIADLLKKSKDLSDNIYNTEQEIQNIAGILKQLQDAEKPTGAFDPDSWLNSLTLTQLAELHDKICLLTSDMVNQDKPRAVSCEPGSPLRADYNILNQRIAALQRQIAEKQMETGSKLQELRRALRQEQANLIRINDEMNLERRRNLTLQSRMDDAS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00183702; iTF_00183040;
- 90% Identity
- -
- 80% Identity
- -