Apur001983.1
Basic Information
- Insect
- Augochlora pura
- Gene Symbol
- -
- Assembly
- GCA_028453695.1
- Location
- CM052100.1:15856107-15867284[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 44 2.2 1.3e+03 -0.7 0.1 26 50 59 83 55 85 0.67 2 44 1.7 1e+03 -0.4 0.8 33 55 104 126 101 129 0.66 3 44 5.2 3.1e+03 -1.9 0.4 38 60 154 176 145 179 0.81 4 44 0.019 12 5.8 4.6 30 60 218 248 211 250 0.84 5 44 0.56 3.4e+02 1.2 2.8 28 63 275 310 273 312 0.90 6 44 0.021 13 5.7 0.5 21 46 361 386 361 397 0.86 7 44 0.14 86 3.1 2.0 25 61 400 440 398 444 0.86 8 44 0.47 2.8e+02 1.4 3.2 33 62 450 479 443 482 0.90 9 44 6.5e-05 0.039 13.8 3.1 24 62 483 521 480 524 0.92 10 44 0.002 1.2 9.0 3.1 24 64 511 551 510 552 0.94 11 44 0.019 12 5.8 1.4 27 64 549 586 546 587 0.84 12 44 0.027 17 5.4 3.9 22 63 586 627 585 629 0.87 13 44 8.4e-06 0.0051 16.6 3.4 24 63 644 683 643 685 0.95 14 44 0.023 14 5.6 6.7 24 63 672 711 671 734 0.89 15 44 0.00014 0.082 12.7 2.6 33 60 737 764 728 769 0.93 16 44 0.074 45 4.0 4.2 32 61 785 814 770 815 0.81 17 44 0.008 4.9 7.1 7.8 22 62 789 829 788 846 0.70 18 44 0.003 1.9 8.4 3.4 24 57 847 880 843 886 0.71 19 44 0.32 2e+02 1.9 11.9 25 64 883 922 864 923 0.65 20 44 0.00031 0.19 11.6 8.7 21 63 914 956 909 958 0.91 21 44 0.0064 3.9 7.4 4.5 26 59 961 994 957 998 0.90 22 44 4.9e-05 0.03 14.1 4.4 22 61 1006 1045 998 1049 0.85 23 44 6.4e-05 0.039 13.8 4.3 27 63 1025 1061 1023 1063 0.80 24 44 0.046 28 4.6 8.8 26 56 1052 1082 1043 1086 0.84 25 44 0.00064 0.39 10.6 7.2 22 65 1069 1112 1068 1112 0.94 26 44 0.041 25 4.8 5.7 22 62 1111 1151 1110 1166 0.86 27 44 0.038 23 4.9 5.4 32 62 1184 1214 1168 1217 0.64 28 44 0.018 11 5.9 1.7 31 60 1211 1240 1203 1246 0.66 29 44 0.00064 0.39 10.6 5.5 25 63 1275 1313 1272 1315 0.81 30 44 0.017 10 6.1 3.4 36 63 1321 1348 1316 1350 0.79 31 44 0.00027 0.16 11.8 0.2 32 65 1352 1385 1349 1385 0.92 32 44 8.2e-05 0.05 13.4 6.0 22 63 1384 1425 1380 1427 0.88 33 44 0.0022 1.3 8.9 0.7 20 59 1403 1442 1403 1452 0.76 34 44 3.1e-05 0.019 14.8 5.4 24 63 1456 1495 1455 1497 0.92 35 44 4.9e-05 0.029 14.2 6.4 25 65 1506 1546 1504 1546 0.90 36 44 0.045 27 4.7 7.9 24 65 1561 1602 1544 1602 0.88 37 44 1.9e-05 0.012 15.5 4.6 28 63 1593 1628 1587 1630 0.93 38 44 0.013 7.6 6.4 0.6 31 60 1631 1660 1629 1668 0.75 39 44 0.017 10 6.0 9.6 32 62 1670 1700 1660 1703 0.75 40 44 0.026 16 5.4 1.8 32 65 1698 1731 1694 1731 0.91 41 44 1 6.2e+02 0.3 4.8 22 63 1730 1771 1729 1772 0.88 42 44 0.011 6.7 6.6 2.3 24 65 1760 1801 1757 1801 0.90 43 44 7.1e-05 0.043 13.6 5.2 20 63 1809 1852 1803 1854 0.85 44 44 0.00036 0.22 11.4 3.0 27 61 1851 1885 1848 1888 0.90
Sequence Information
- Coding Sequence
- ATGGAGAACCAGGGCTGCCCTTGCGGATGCGAGTCATCGTATTCGTCCGAATCAATTAATCCGCCGAATGAGCCGCGTTGTAGCTGCAATTACGACCCCTTCAGCGACAATTCGAAAGAATCGGAGATTTACGATCTCTCGTTTGCCCTGAGGAAACTCGCCGTAATGAAGTGTCAGATGAAGAAATGGCGAATGGAACGACTTCAGTTCGAAAGCGAAAATAGGTCTTTGAAACAAGCCCTGCAGTCATTCGGTGTAAATGCGGATGAGATATTGAAGCCTGATCCGCTGCTCGTGCAGTCCCGGGAAGAAATCGAATGGCTGCAAAATGCAAACGCGGCGCTCGAAGATAAAGTGAGGGATCTGGAAGAAACCCTCGCCGAACGAGATTACGGGGACGATCCCTGTGCATCGGTACACTATATCAGGGAGAAGATGAGAGTTCTACGGGAGCGGTTCGCCCTCGAAAAGAAAGAATTAAGGGACGTGATATCGCACTTGAAATTAAAACTCGCGCAGACCGAGGAGGACGTGAGCTGTCCAGCGATATATCGCCTGAGGGCGAAGCTCCGCGAACTGATGAAAGGAGGTCAAACGGCGGACCAGCAAGTCTCGAAGGTCGTGGAGAGATCGATCGAGACCCTGGTGGATCTCTCGAAGAGCTGCGACGACCTGCGTCTGGAAAATGAACAGCTTCTGGCGCAGCTGGACGAATTACGTCGTCAATTGGCGGAGCTCGAGAAAAAGCAGCCGAGGGAAGAACCGCCGACGGAGATGCTGAGAACGGCCGAAACAACGACGGTTCCTGAATACATAGACGTCTCCGAGCTGCTGCAGAAGCTACGGGACTGCGAGCACGCTGTTTCCGAATTGAAACAGCAGCTTGACGAGAAAGACAAGATTATCGAGGCGCTGAGGAAAGAATTAGAAGGCAGGGTGGACGAGAAGGATCTGTTGGCCGAGATCGCGGCTATGAAGGCGGAACTCGGCAAAAGGGATGACAAGATGAGGGACCTCCTGAACGAAATGAGACAATCGGAGATAGACCTGTTGGGTCTAAGCAAACTACGATCGGAGCTGGACATGCTGAAGCCTCGCATATCCGAACTCGAGGTCGAGAGGGACTCGCTGATAGACGAGGTGGCGAAATTGCGAAAGTTGCTGGAAGAGAGGAACGATCAAATAATCGAGATCCTGGAGTACAAGAACAAGCTGGAGCAGGAGATGGCGGAGAAGGAGCAAGAGGCGGCGCGGGTGATCGACGGCTTGAGAAAGGAGATCGACGACCTGATGTCGCAACTAGGGAATCTTCAGGGCGAGGTCGACGAGCGTAACAAGCGGATCGCGGAGCTGGAGAAGTGCTGCGCGGAAAGGGATCAGCTTGCGAAGAAATTGCAGCTAGCCGAGGACGAGTTGGGGACGTTGCGAAACGAACTCGCGTCCGCGAAGGCGGCGATAGCGGACCTCCAGGCGGAGGTGGACGCCCTGCAGAATGACAAggagaaactgttgaaggagttggacgagacgagggagcgggttaacgcgttgactggccaattggcggacgagaGGGCGGCCAAGGAAGCTTTGCAGAGAGAGCTGGAAGCCGCGAGGAACGAGATCGAGGGACTGCGGAACGAGAACTCGGATCTGAAAGATCGGCTGGACGCGGCGGACGAGGAGAATAATAAATTAAGGAGGGAGAACGACGGCTACAAAGCGGAGCTAAGCAACATGGCCGCGGAGCTGGACAAACTGAACAAGGCGAACCAAGATCTGCAGAAGAGCTTGGAGGCAGCGCAGCTGGAGAACGACGCGCTGAAACTAGACGTGTCGGCGTTGAGGAACGACCTTGACAACGCGAGAAACGAGGCGGAGAGGCTGAGAGCCGACGGGGACGCGTTGAAGGCCGCGGAGGGAGGCGCGAAGGCTAGAATCGACGAGCTGGAGTCGCGGCTGGATGATCTGCAAGTGGAGAAGGACCGTCTGGCCGGCGAGAACGCTGAAATGAAAGCCAAGAATTTGGAGTTGGAACGCAAATTGGATGACGCGATCAGAGAGCTGGATAAAATGAAAGCAGAGAATGCCGATCTGTTGGCAGAGCTGGAACGGCTGAAGGGAGAATTGGCGAAGGCCAAGAGCACGATCGATCAGCTGAAAGCGGAGATGGACTCTTTGAGGGACGGGCTGGACAAGTGCGTGGACGAGATGGAGAAACTAAAAGCCGAAAATGCCGAGCTTAAGGCGAGGGACCAGGCTCTCAGAGCGGAGCTTGATCAATGCAAGGCGGACGGGGACTCGTTGCGGAACGATCTGCGGCGGGTCAACGCGGAATTGGACGACGCGACCAACGAGCTGAATAAGCTGAAGGAGGAGAACGGGGCGCTTAAACAAAATTTGGATCGACTGGCTGGTGAAAAGGACGCGCTCCAGGCCGAGCTGGACCGTCTGAGGGACGAGGCGGAGAAGCTCCGCGAGGAAATAGATGCCGGGAAACGGAGAGAGGCCGCGTTGAGCGACGAACTCGATCGCATCCGGCGGGAAAAGGACGCGTTGGCGGCGGAATTAGACAAGCTGAAAGATGAACGTTCCGCGTTGCAGAACGAGAGGGATCGGTTCAAGAAACAATTGGACGACGCGAACGACGAGATCGAGAGGCTGAAGGAAGAGTTGGAGCGGTTGAGGGACGAGAACGAGAGGCTGAGGACGGACGGGGAAGTGTGCAGAGATGATTACGATAAGCTGAAAGCCGAACTCGAGCAACTGAGGAAAGAGAAAGATTCGTTGAACGACGAGCTCGAACGGCTTAAGCGGGAGAAGGACGCGTTGGCGGCGGAATTAGGTAAATTGAAGGATGAACACTCTGCTTTGCAGAAGGAGTGGGATCCGCTGAAGAAACAATTGGACGATGCGAACGACGAGATCGAGAGGCTGCGGAAAGAGTTGGAGCGGTTGAACAACGAGAACGAGAGGCTGAGAACGGACGGGGAAGTGTGCAGAGATGATTACGAGAAGCTGAAGGCCGAATTCGAGCGACTGAGAAAAGAGAAAGATTCGTTGGACGACGAGGTCGACCGACTTAAAAGGGAGAAGGACGCGTTGACGGCGGAATTAGGTAAATTGAAGGATGAACACGCCGCTTTGCAGAGGGAAAGGGATCAGCTGAAGAAACAATTGGACGACGCGAACAATGAGAACGAGAAGTTGAAGGACGAGTTGGCCCAGCTGAAGGACGAGCTGGCTCGCTCGCAGGCCGAGAACGATAGGCTGAAGAGCGACGCGGACGCGTGCAAACGGGAATGCGACGAGTTGAAAGCCGAACTCGAACGATTGCGAAAGGAAAGAGACAGCTTGAATGACGAATTGAACAAGCTCCGTGACGAGCATAGTAGACTACAAAACGCTCTGAACGCTTTGCAAGCGGAAGCGGATGATTTGAGAGCGAGACTTGACGCGGAAAGGAAGACCAACGAAATCATGAAAAACGACTTGATGATTTTAAACAGCAAGGTGCAGGAGCTGAGCAAAGCGCTCGACGAGGCCAGGGCGAAGAATGCGGCTTTGACCGAGGAGAATCAAGGACTGCAATCAAAGTTGCGGGACGCGCAAGAAGAATTGAATAGCACGAGAGCGGAGCTCGCGGATTTGAGAGCGGAGATCGACTCTTTGAACAAGGAGATTGCTAATTTGAGGGCGCAGATCGCTAAATTGGAGGAGGATCTCGAATATTGGAAATTAGAGCACTGTAAGATCAAGATGGAGGTGGACAAACTGAAGGCTGACCTGGAGAATGCGTTGAACGCCTTGAACGAATGCAAGGCTGCGAAGGAGGCATGCCAGGCACTGTGTAATCGGCTGCAAGCGGAGAAGGCCGAACTGGAGAGGAAGATCGCGGATTTAACTGCGGAGGTCGAGCAGCTGAAGAAAGCTGTTGAGGCAGAGAAAGCGGGTAGAAGCAAGTGCGAGGCTGAGCTCGCAGCCCTGAGGAACGAGTTGGACGCATTGAAGAAGGAGTTGGAGAATTTGAGGGATGAGAACCGGCGGTGCAAAGGGGAGGTGGACGATCTACGGAAAGAGATCGCGGGGCTGAAAAGCGAGGTGGACAGGTGCAAGGAGGAAGTCGCCGCGTTGCGGGACGCGAACGAGAAGCTGAAGGCCGACGTGGCCGCGTTGAACAACGAGAAGAACAGGCTGCAAGAGGAACGTGACAAACTGAAGGCCGAGGCGGACGCTTTGAGGACGGAGAACGACGCGCTCCACCAGGATCGGGACAAGTTGGCGAACGAAGCTAACAGGCTGAGGGGCGAGGGCGACGGTCAGCGGGGCGAGCTCGATAAGCTCAGGTCCGACTTGGCTTCGCAGAGGGCCGCCGTCGACAAGCTGCAGGCCGACCTGAAGGACTGCCAGGCCGAGAACGGGAAACTGCAGCAGCAGCTCAACGAATTAAAGAACGATTTGGATAGATTGAGAGACGAGAAGAGCAGGCTGGACAAGTCGTTGGCGGACGCCGAGGCCAGGGCTAAAGCTTTGGAAGCCCAGCTAGGGGATCTGCAGGCCGAGAAGCAAGAGCTGCTCAATGAACTTCAGCGGCTGCGCGACGAGCTCTCTCGCCTTATGAACGAGATTGATAAACATAAAGCTGCCGCGGAGGCAGCCGGGAAAGAGCTGGCCGCCTTGAAGGAGGAGTTGAACGCTTTGAGGGCGGCGCTGGACAAAGCCCGTGCCGAAAACGAGGCATTGATCAAGGAAAACGAGAAACTGGGGTCTGACGTGGCGAAATTAAATCAGCAGCTGCAGGCGTTACAGGCCGAGAACGAGGCGATGAAGAAGGAGAACGCGGCGATGTCGTCGGAATTAGCCGCGACAAAGGCCAAATTAAAAGATGCTGAGGACCGGCTGAACGATCTGCGCGCTGAGAACGACGCCCTCAAGGCGAGGGTCGCCGACCTCGACGCTGTGACGAAGGAGCTCGAAACGGTGAAGAAACAGTTGGCGGATTGCAGGTCGGAGCTGGATAGATTAAAGGCGGAATTGCAAAAATTGAAAGAGGAGAACGAGAATTTGCGAAACGAGCTCGACGAGGCTCGAAACGAGGCGAAGCGGCTTAAGGAAGCGAACGACAAGCTGCAAACGGATCTTGACGCTTTGAAGGGGGCCCACGATAAGTTGAAGGACGATTTGGAGAAGCTGAAGGCGATCAATGAGAAATTGAAGGACGAGCTGGCCGATTCGCAGGCGTTGAATGATAAACTGAAAGAGGATTTGAAAAATCAGAAGGCGATCAATGACAAGCTGAAAGACGATTTGGAGAAGTCGAATGCCGCGAATGAGAAGCTGAAGAAAGACCTCGACGCGCTGCAGGCAGCCAATGATAATTTGAAGAATGATTATTCAAAGTTGTTGGCCGAGGCTGAGAAATTGAGGGATGACAACAAGAACTCCGCGACGGCGCTGAAGGATCAATTGGACAGGTTGAAGAAGGAAAACGATGATTTAAGCAATGAAATTGAAAGATTGAGGGAACAATTGAGTCACTGCCAGGACGAAGTTGCAAGGTTGAAACGGGAATTGAAAGACTGCCAGGATGAGAATGATAGGTTGCGCAAAGAATTAGGGCTATCGAGGAGCGCGAACAAGAAACTAAGAGCATCCTCGGAGCAAGACACGATCACCTTGAGGAGCCTTACAGACGATTGCGGCGATTTCATTAGGGCAAACGATTTGTTAACGAAAAAGCTCGACAAGCAAAACGAAGGTGTGCAACGTGTTCGCGATTACATAACATTTTTGGAAGGCAAGGTCGAAGATGAGCCCAAGATGGCGGACGAAATGGACGACGACCCGAGCATTGATCCGGAAATGAAAAAGGAAATAGCAGACTTATTGAAAAAGTCGAAGGATTTGTCTGACAATATTTATAACACTGAGAAAGAAATTCAGAATATCGCTGGCATCCTGAAGCAACTTCAGGAGAAACCAACAGGAGCGTTCGATCCCGATTCCTGGCTTAACTCGTTGACGTTGACGCAACTGGCGCAGCTGCACGACAAGATCTGTCTATTGACCTCGGACATGGTGAATCAGGACGAGCCGAGGGCAGTTAGCTGCGAGCCGGGTAGTCCACTAAGAGCGGATTACAATATTTTAAATCAGCGGATAGCCGCTCTGCAGAGACAGATAGCGGAGAAGCAAATGGAGACGGGATCGAAGCTGCAGGAGCTGAGGCGGGCTCTTCGGCAGGAGCAGGCTAACCTAATTCGAATCAACGACGAAATGAATTTAGAGAGAAGACGTAATTTGACCCTTCAGTCCAGAATGGATGATGCATCTTAA
- Protein Sequence
- MENQGCPCGCESSYSSESINPPNEPRCSCNYDPFSDNSKESEIYDLSFALRKLAVMKCQMKKWRMERLQFESENRSLKQALQSFGVNADEILKPDPLLVQSREEIEWLQNANAALEDKVRDLEETLAERDYGDDPCASVHYIREKMRVLRERFALEKKELRDVISHLKLKLAQTEEDVSCPAIYRLRAKLRELMKGGQTADQQVSKVVERSIETLVDLSKSCDDLRLENEQLLAQLDELRRQLAELEKKQPREEPPTEMLRTAETTTVPEYIDVSELLQKLRDCEHAVSELKQQLDEKDKIIEALRKELEGRVDEKDLLAEIAAMKAELGKRDDKMRDLLNEMRQSEIDLLGLSKLRSELDMLKPRISELEVERDSLIDEVAKLRKLLEERNDQIIEILEYKNKLEQEMAEKEQEAARVIDGLRKEIDDLMSQLGNLQGEVDERNKRIAELEKCCAERDQLAKKLQLAEDELGTLRNELASAKAAIADLQAEVDALQNDKEKLLKELDETRERVNALTGQLADERAAKEALQRELEAARNEIEGLRNENSDLKDRLDAADEENNKLRRENDGYKAELSNMAAELDKLNKANQDLQKSLEAAQLENDALKLDVSALRNDLDNARNEAERLRADGDALKAAEGGAKARIDELESRLDDLQVEKDRLAGENAEMKAKNLELERKLDDAIRELDKMKAENADLLAELERLKGELAKAKSTIDQLKAEMDSLRDGLDKCVDEMEKLKAENAELKARDQALRAELDQCKADGDSLRNDLRRVNAELDDATNELNKLKEENGALKQNLDRLAGEKDALQAELDRLRDEAEKLREEIDAGKRREAALSDELDRIRREKDALAAELDKLKDERSALQNERDRFKKQLDDANDEIERLKEELERLRDENERLRTDGEVCRDDYDKLKAELEQLRKEKDSLNDELERLKREKDALAAELGKLKDEHSALQKEWDPLKKQLDDANDEIERLRKELERLNNENERLRTDGEVCRDDYEKLKAEFERLRKEKDSLDDEVDRLKREKDALTAELGKLKDEHAALQRERDQLKKQLDDANNENEKLKDELAQLKDELARSQAENDRLKSDADACKRECDELKAELERLRKERDSLNDELNKLRDEHSRLQNALNALQAEADDLRARLDAERKTNEIMKNDLMILNSKVQELSKALDEARAKNAALTEENQGLQSKLRDAQEELNSTRAELADLRAEIDSLNKEIANLRAQIAKLEEDLEYWKLEHCKIKMEVDKLKADLENALNALNECKAAKEACQALCNRLQAEKAELERKIADLTAEVEQLKKAVEAEKAGRSKCEAELAALRNELDALKKELENLRDENRRCKGEVDDLRKEIAGLKSEVDRCKEEVAALRDANEKLKADVAALNNEKNRLQEERDKLKAEADALRTENDALHQDRDKLANEANRLRGEGDGQRGELDKLRSDLASQRAAVDKLQADLKDCQAENGKLQQQLNELKNDLDRLRDEKSRLDKSLADAEARAKALEAQLGDLQAEKQELLNELQRLRDELSRLMNEIDKHKAAAEAAGKELAALKEELNALRAALDKARAENEALIKENEKLGSDVAKLNQQLQALQAENEAMKKENAAMSSELAATKAKLKDAEDRLNDLRAENDALKARVADLDAVTKELETVKKQLADCRSELDRLKAELQKLKEENENLRNELDEARNEAKRLKEANDKLQTDLDALKGAHDKLKDDLEKLKAINEKLKDELADSQALNDKLKEDLKNQKAINDKLKDDLEKSNAANEKLKKDLDALQAANDNLKNDYSKLLAEAEKLRDDNKNSATALKDQLDRLKKENDDLSNEIERLREQLSHCQDEVARLKRELKDCQDENDRLRKELGLSRSANKKLRASSEQDTITLRSLTDDCGDFIRANDLLTKKLDKQNEGVQRVRDYITFLEGKVEDEPKMADEMDDDPSIDPEMKKEIADLLKKSKDLSDNIYNTEKEIQNIAGILKQLQEKPTGAFDPDSWLNSLTLTQLAQLHDKICLLTSDMVNQDEPRAVSCEPGSPLRADYNILNQRIAALQRQIAEKQMETGSKLQELRRALRQEQANLIRINDEMNLERRRNLTLQSRMDDAS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00965817;
- 90% Identity
- -
- 80% Identity
- -