Mwil008451.1
Basic Information
- Insect
- Megachile willughbiella
- Gene Symbol
- -
- Assembly
- GCA_945859635.1
- Location
- CAMAOE010000041.1:94432-113598[+]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 47 8.9e-05 0.084 13.3 1.4 30 58 38 66 25 70 0.89 2 47 9 8.5e+03 -2.8 0.1 28 47 93 112 91 116 0.53 3 47 2.7 2.6e+03 -1.1 1.7 37 62 136 161 132 171 0.51 4 47 0.0024 2.3 8.7 8.3 24 60 175 211 168 215 0.79 5 47 0.0075 7.1 7.1 10.1 27 63 213 249 210 250 0.94 6 47 0.022 21 5.6 9.4 24 64 252 292 251 299 0.80 7 47 0.043 40 4.7 5.5 34 61 300 327 298 331 0.86 8 47 0.7 6.6e+02 0.8 10.0 25 57 347 379 336 387 0.71 9 47 0.039 37 4.8 5.8 25 63 382 420 380 422 0.91 10 47 0.043 41 4.7 5.5 21 63 399 441 399 443 0.89 11 47 0.001 0.98 9.9 5.3 21 56 462 497 461 501 0.92 12 47 0.00043 0.41 11.1 2.2 32 61 501 530 497 534 0.87 13 47 7.5e-05 0.071 13.5 0.9 25 63 515 553 513 554 0.93 14 47 3e-05 0.028 14.8 1.7 25 59 543 577 531 583 0.51 15 47 0.0001 0.099 13.1 4.8 33 63 572 602 570 610 0.55 16 47 7.9e-05 0.075 13.4 11.1 24 65 605 646 604 646 0.95 17 47 0.017 16 6.0 1.8 37 62 639 664 637 666 0.73 18 47 9e-06 0.0086 16.5 5.5 26 64 670 708 667 709 0.93 19 47 0.0012 1.2 9.6 11.3 24 61 703 740 703 743 0.91 20 47 0.0043 4.1 7.9 2.2 37 62 737 762 735 763 0.91 21 47 5.8e-07 0.00055 20.3 6.1 26 64 768 806 765 807 0.93 22 47 0.0025 2.3 8.7 5.6 35 64 805 834 803 835 0.88 23 47 0.031 30 5.1 4.1 33 62 831 860 830 862 0.83 24 47 0.0016 1.5 9.3 3.4 26 60 866 900 863 903 0.77 25 47 0.00093 0.88 10.0 6.3 28 63 917 959 909 961 0.68 26 47 0.0027 2.5 8.5 2.4 27 64 979 1016 975 1023 0.80 27 47 0.084 80 3.7 5.2 31 57 1032 1058 1016 1065 0.61 28 47 0.011 11 6.5 7.6 27 63 1077 1113 1066 1115 0.66 29 47 0.021 20 5.7 0.8 24 59 1109 1144 1106 1150 0.73 30 47 0.011 11 6.5 4.7 24 52 1154 1182 1146 1195 0.66 31 47 0.087 83 3.7 3.5 24 57 1203 1236 1200 1243 0.82 32 47 0.078 74 3.8 0.3 32 57 1228 1253 1227 1259 0.85 33 47 0.034 32 5.0 5.3 27 62 1258 1293 1252 1296 0.77 34 47 0.19 1.8e+02 2.6 1.2 28 64 1287 1323 1284 1324 0.90 35 47 0.12 1.1e+02 3.3 12.0 24 65 1311 1352 1294 1352 0.90 36 47 0.00011 0.11 13.0 12.8 22 63 1337 1378 1336 1386 0.93 37 47 0.076 72 3.9 4.6 28 60 1385 1417 1378 1422 0.68 38 47 0.045 43 4.6 6.7 31 64 1409 1442 1406 1444 0.61 39 47 5.5e-05 0.052 13.9 10.8 24 63 1430 1469 1427 1471 0.92 40 47 0.084 80 3.7 3.3 30 58 1471 1499 1467 1513 0.63 41 47 0.033 31 5.0 2.4 31 65 1500 1534 1495 1534 0.89 42 47 2.6e-05 0.024 15.0 8.1 24 63 1521 1560 1515 1562 0.87 43 47 0.1 98 3.5 3.3 33 58 1565 1590 1560 1599 0.50 44 47 0.0086 8.1 6.9 4.7 31 63 1591 1623 1584 1625 0.82 45 47 0.011 10 6.6 5.7 28 56 1630 1658 1625 1664 0.75 46 47 0.00083 0.79 10.2 2.9 34 65 1668 1699 1663 1699 0.91 47 47 0.026 24 5.4 3.5 25 46 1694 1715 1693 1718 0.68
Sequence Information
- Coding Sequence
- ATGATATCACAATTGAAATTAAAACTGGCGGAAACCGAGGAGGACGTCAGCTGTCCTGCCATATATCGTTTAAGGGCAAAGCTTCGCGAATTGATGAAAGGTGGTCGGGATTTGCCGAAGAACTGCGATGATCTGCAAGTTGAAAACGATCGTCTTTTAGCTGAAATAGCTGAGTTACGACGCCAGTTGGCCAATCTTGACGAAAGGGAAATAACCGAGCGGATGTTGCCAGCGAAATCGGTCGAAACAACCACTGTGCCGGAATACATAGATGTATCTGAGTTGTTGCGAAAGCTCAAAGATTGTGAAAATACTGTATTCGGTTTGACACAACAGCTAGCAGAAAAAGACGATCTCATTGATTCATTGAATAAGGGACTTGAAGGCACGATCAGTCAAAAAGATTTATTGGATGAGATCGCGGCACTGAAAGCGGAACTTCAAGAAAAAGATGACAAGATTCAGGAACTATTAAACGAACTAAGACAAGCAGAAATAAATTTGCTCGAGTTAAATAATTTGAAATCACAACTGGATGACCTGAAATCAGAATTAGAGGATTTAGAATCGGAGAGGAACCAACTGTTGGAAGAGCTCGCTAAATTACAAAACGAACTTGCGCACTGTAACGCAATAAAGGAAAGTTTAGAAAAGCAGCTAGAAGTTTTAAGGAATGACAACGAAAAATTATTGAAAGAGCTGGATAATGCGAAGGAACAACTCCTGGCACTGACTAATCAGTTGGAAGAGGAAAAGGCAGCTAGAAATGCGTTAGAAGAAAATTTGAAAAATTGCCAAGATGAACTCGAAAGGTTACAAAAAGATAATACGAATCTGAGGGATCAGCTGGAGGCTGCAAAAGAAGAAAATAATAAACTGCGTGAAGATGTTGAAGCGGCGAAGAAGCTGGCTGAAGAGAACGAGAGGTTGAAAGCGGATCTGGAGAAGATGAAGAAAGAGAACGAGGAATTGATGAATTTGAACAACGTCTTAAAGAGCGATTACGACAACATGAAGCAAGCATTGGATAACCTAGAAGCAGAAATTAACAGACTGCAGAACGAATTGAATAAGGCTGAAGAAGAACGCAAAGCGTTGCTGGACGAGAACAGCAACATTAAGAAGCAACTTGAAGAAGCAATGGCGAGGAACGAGAGTCTACAAGCTGAATTGGATAATGTCGGTGAACAACTCAACAAACTGAAATTAGAGAAGGATAAACTGCAAGAGGCTCTCAACGACATGAAGCTCGAGAACGATGCGTTGAAACAAGATGTGCGGGCTTTGCAAAGCGACCTTGATCATGCGAGAAAGGAAGCGGAAGACCTAAGAGGCGCTGGAGACGCATTAAGAGCCGCGGACAAAGACAAACTGTCCGAGCTTCAAAAGCTCAAAGATGAATTGGACAATTTGACAACCGAAAAGGATCGCTTAACGAACGAAAATATCGATTTGAAGGCCAGAAATGCGGAGCTCGAGAAAAAACTCAAGGATGCAATGGAACAGGTGGAACAAATGAAATCGGAGAATGCCGATTTACTGGCCGAGATCGATCGACTAAAAAAGGAGCTCGACAAAGCTGCGAACGAAGTCGATCGATTGAAATCTGAAATAGGTTCTTTGAAGGACGCTGTCGACAAGTGTATGGACGAATTGGAGAAATTGCAAACTGAAAATGGCGATCTTAAATCAGAGAATGAAGCTGTTAAAAGTGAAATTGAGAAGTGCAAAGCTGAGAGGGACGCTTTGAAACGGGAAAATTCTACTTTGCAAAACGAGATTGACGAGTTAAGGAAACAACTGAACGATTGTAAAACAGAGAACGAAAACCTGAAGGCGCAGAAAAATCAATTGGAAGCTGAAAATAATAAGTTGAGAGAAGAGTTGAACGCTTGCAAACAAGAAAATGAGGCGATGAAGGCTGAATGTGAAAAATTACGAGAACAGGTACAATCGTTGAGTGACGAATTGAGTAAGCTACGGAGTCAGCTGGATATTGCAGAACGTAAAATTCAGGAACTCGAGCCTCTGGTCGATCGTTCGCAAAAGGAAAATGATAAATTGCAAAACGAGATTGACGAGTTGAGAAAACAACTGAACGATTGTAAAACGGAGAACGAAAACTTAAAGGCGCAGAAAAATCAATTGGAAGCTGAAAATAATAAGTTGAGAGAAGAGTTGAACGCTTGTAAACAAGAAAATGAGGCGATGAAGGCTGAAAGTGAAAAATTACGAGAACAGGTACAATCGTTGAATGACGAATTGGGTAAGCTAAGGAGCCAGCTGGATATTGCAGAACGTAAAATTCAGGAACTCGAGCCTCTGGTCGATCGTTTGCAAAAGGAAAATGACAAATTGCAAAACGAGATTGACGAGTTGAGAAAACAACTGAACGATTGTAAAACAGAGAACGAAAACTTAAAGGCGCAGAAAAATCAATTGGAAGCTGAAATTGATAAGTTGAGAGAAGAGTTGAACGCTTGCAAACAAGAAAATGAGGCGATGAAGGCTGAAGGTGAAAAATTACGAGAACAGGTACAATCGTTGAATGACGAATTGAGTAAGCTAAGGAGTCAGCTGGATATTGCAGAACGTAAAATTCAGGAACTCGAGCCTCTGGTCGATCGTTTGCAAAAAGAAAACGATAAATTGCAAAATGATCTGAAAGCGTTAGAGGATGATGCAAGAAACTTAAGATCAAGGCTAGATGGCGGAATGAGTGACAATGAAAGAATGCGAAACGACATGGCGATGTTAGAAAATCAAGTAGGAGATTTGAATGAGAAATTAAAGGCAGCTAAAGCAGAAAATGACGCTTTGCAGCAAGAGAATCAAACGCTACGAGCAAAACTGTTAGAAATGGATGACGAATTGTCTCAAGCGAAAGCAGAATGCGCGGATTTGAAGGCGGAAATTGCTGATTTAAATAATTTAATTTCCGAATTACGAGCAAAAATTGCTAAATTGGAAGAGGATGTAGAACATTGGAAACTGGAGAATTGTAAGCTGCAGATGGAAATAGATAAATTAAAAGCGGACCTCGAGAAAGCATTAAAGGATTTATCCGAATGCCAGGCGCTGAAAAAAGCACAAGAAGCAGAGTTGAACCGGCTGCAGAACGAAAAAGCTGAGTTGAATAAACAAATCGCCGGTCTAACTGCGCAGATAGAACAACAGAAAAAAGCTGCCGAATTAGAAAAGTCCGCCAAGGACGAAAGTGAGGCAAAACTCAAAGCTTTGCGGGAAGAGCTGGACGCATTGAAGAAGGAGTTAGAGAAACTTCGAATGGAGAATAACGATTACAAGAACGAAATAGATAATTTGAAAAGACAGCTTTCTACGTTAAACGGTCAGTTAGATTCGTGCAAAGAAGAGATCGCTGCGTTGAGAGCCACAAATGATGCGTTGAAGACTGAATTAAACGCATTGAGTGGTCTAAAAGACGAATACGCTCAACTGAAAGCTAAAGTGAACAGTTTGGAAAATGAAATCGCGAGTCTTCAAGAAAACGCAAAGAATTTGGAACAGGAACGCAACAAACTTAGAGGAGAGGGTGACGGACAGAGAATTGAAATTGATAAACTGAAATCAGACTTGGATGCCGAAAAAGCAGCCGCAGGGAAACTGAAGTTAGATTTGGAGAATTGTCAAGCAGAAAACGACAGATTACGGGCACAGTTGAAAGATTTAGAAAAATGTAAAAGCGAGATTGATCGGTTAAATGCCGAAATTGGTGAACTGAAGAAGGCATTAGCGGCCGCTGAAGGTAAAGCGAAGTCGTTGGAAGATCAACTCTCGAACCTCAAAGATGAAAAGCAACAGTTGATTAATGAACTCAACAATCTTCGCGGAGATCAGAGCAATCTTAGAAATGAAATAGAAAAACAGACAGCCGCAAAGGACAAGGCGTTAAAGGAGTTGGCTGACGTTAAAGAGGAGCTGAATGCTCTGAAGTCGACGTTGGATAAAATGCGCAATGAAAATGAAACGTTACTGAACGAGAATGAAAAATTGAAGTCGAAATTGGCAGAATTAAATGGACAATTGGAAGCGTTAAGAAATGAGAACGAAAAGTTAAAGAAAGAAAATGAGAACTTAAAGAACGAGATTGCAAAATTGACTTCGGAATTAGCTACGATGACAAATAAATTAAAAGAAGCGGAGGATCAGTTGAACGCGCTCAAGAGTGAGAACGATACTCTGAAGAATACGATAGCTAAACAAGAGAAAACAATAAAAGAGCTCGAAGCAGCAAAAATACAATTAGAGCAGGAGATCAACGAGTTGAAGCCGAAATTGGCAGAATTAAATAAACAGTTGGAAACGTTAAGAAACGAGAACGAAAAATTAAAGAAAGAAAATGAGGTTTTAAAGAACCAGACTGCAAAATTGACTTCGGAATTAGATGCGATGGCAAATAAATTAAAAAGCGCAGAGGATCAGTTGAACGCGCTAAAGAATGAGAATGATAGTCTGAAGAATACGATAGCTAAGCAAGAGAAAACAATAAAAGACCTCGAAGCAGCAAAAATACAATTAGAGCAGGCTATCAACGAGTTGAAGCCGAAATTGGCAGAATTAAATAAACAGTTGGAAACGTTAAGAAACGAGAACGATAAATTAAAGAAAGAAAATGAGGGTTTAAAGAACCAGACTGCAAAATTGACTTCGGAATTAGATGCAATGGCAAATAAATTAAAAAAGGCGGAGGATCAGTTGAACGCGCTAAAGAATGAGAACGATACTCTGAAGAATACGATAGCTAAACAAGAGAAAGCAATAAAAGAGCTCGAAGCAGCAAAAACACAATTAGAGCAGGCTATGAAGGAGTTGAAGTCGGAAAATGAAGGACTGAAAGGCAAGCTAGAAGACGCGCAAAACGCAGCGAATAAATTGAAAAACGATTTGGAGAAGCTGAAAACAGACAACGCGAAATTGCAAAATGAATTAGGTAAATTAAAAGAGGAGAAGGAAAAGTCTGATGCAGCGGCGAAAGGTGATGCAGACAGAATAAAAAAAGAAAATGAGAAATTGAGAGCTGAAAACGCGAAATTGATGGACGAGTTGAACACTTGTCGAGCAGAAAACGAAGAGTTACGTAAACAATTGGAAAAGTTACAAGGAGAAAATGATAAATTGAAAAGAGCTGCAGTGATCAAGGAGTGTGTGACATTTAGTGTGTTATCGAGAGACAGAACTTGTCTAAAAAAGGCAAATCTTCAAAGAGGAAGATCAGCAGATAGCAGAGTGTTGATTGGACATCAACACACATCAGTAGAGGACAACGTGCAACGTCCAACAGTAGAGGACAACGTGCAGCGTCCAGCAGTAGAGGACAGCGTACACCGTTCAACAGTTGAAGACATCGTGCAGCATCCAACAGAAGAGGATTGTTTGCAACAGTCATCAGCATAG
- Protein Sequence
- MISQLKLKLAETEEDVSCPAIYRLRAKLRELMKGGRDLPKNCDDLQVENDRLLAEIAELRRQLANLDEREITERMLPAKSVETTTVPEYIDVSELLRKLKDCENTVFGLTQQLAEKDDLIDSLNKGLEGTISQKDLLDEIAALKAELQEKDDKIQELLNELRQAEINLLELNNLKSQLDDLKSELEDLESERNQLLEELAKLQNELAHCNAIKESLEKQLEVLRNDNEKLLKELDNAKEQLLALTNQLEEEKAARNALEENLKNCQDELERLQKDNTNLRDQLEAAKEENNKLREDVEAAKKLAEENERLKADLEKMKKENEELMNLNNVLKSDYDNMKQALDNLEAEINRLQNELNKAEEERKALLDENSNIKKQLEEAMARNESLQAELDNVGEQLNKLKLEKDKLQEALNDMKLENDALKQDVRALQSDLDHARKEAEDLRGAGDALRAADKDKLSELQKLKDELDNLTTEKDRLTNENIDLKARNAELEKKLKDAMEQVEQMKSENADLLAEIDRLKKELDKAANEVDRLKSEIGSLKDAVDKCMDELEKLQTENGDLKSENEAVKSEIEKCKAERDALKRENSTLQNEIDELRKQLNDCKTENENLKAQKNQLEAENNKLREELNACKQENEAMKAECEKLREQVQSLSDELSKLRSQLDIAERKIQELEPLVDRSQKENDKLQNEIDELRKQLNDCKTENENLKAQKNQLEAENNKLREELNACKQENEAMKAESEKLREQVQSLNDELGKLRSQLDIAERKIQELEPLVDRLQKENDKLQNEIDELRKQLNDCKTENENLKAQKNQLEAEIDKLREELNACKQENEAMKAEGEKLREQVQSLNDELSKLRSQLDIAERKIQELEPLVDRLQKENDKLQNDLKALEDDARNLRSRLDGGMSDNERMRNDMAMLENQVGDLNEKLKAAKAENDALQQENQTLRAKLLEMDDELSQAKAECADLKAEIADLNNLISELRAKIAKLEEDVEHWKLENCKLQMEIDKLKADLEKALKDLSECQALKKAQEAELNRLQNEKAELNKQIAGLTAQIEQQKKAAELEKSAKDESEAKLKALREELDALKKELEKLRMENNDYKNEIDNLKRQLSTLNGQLDSCKEEIAALRATNDALKTELNALSGLKDEYAQLKAKVNSLENEIASLQENAKNLEQERNKLRGEGDGQRIEIDKLKSDLDAEKAAAGKLKLDLENCQAENDRLRAQLKDLEKCKSEIDRLNAEIGELKKALAAAEGKAKSLEDQLSNLKDEKQQLINELNNLRGDQSNLRNEIEKQTAAKDKALKELADVKEELNALKSTLDKMRNENETLLNENEKLKSKLAELNGQLEALRNENEKLKKENENLKNEIAKLTSELATMTNKLKEAEDQLNALKSENDTLKNTIAKQEKTIKELEAAKIQLEQEINELKPKLAELNKQLETLRNENEKLKKENEVLKNQTAKLTSELDAMANKLKSAEDQLNALKNENDSLKNTIAKQEKTIKDLEAAKIQLEQAINELKPKLAELNKQLETLRNENDKLKKENEGLKNQTAKLTSELDAMANKLKKAEDQLNALKNENDTLKNTIAKQEKAIKELEAAKTQLEQAMKELKSENEGLKGKLEDAQNAANKLKNDLEKLKTDNAKLQNELGKLKEEKEKSDAAAKGDADRIKKENEKLRAENAKLMDELNTCRAENEELRKQLEKLQGENDKLKRAAVIKECVTFSVLSRDRTCLKKANLQRGRSADSRVLIGHQHTSVEDNVQRPTVEDNVQRPAVEDSVHRSTVEDIVQHPTEEDCLQQSSA
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00961571;
- 90% Identity
- -
- 80% Identity
- -