Npin027152.1
Basic Information
- Insect
- Neodiprion pinetum
- Gene Symbol
- -
- Assembly
- GCA_021155775.2
- Location
- CM037746.1:2947536-2953097[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 25 2.1 2.6e+03 -0.6 0.9 46 64 66 84 52 85 0.58 2 25 0.00027 0.34 11.8 1.6 35 64 99 128 98 129 0.89 3 25 0.00079 0.98 10.3 1.3 28 59 216 247 209 252 0.67 4 25 1.8 2.2e+03 -0.4 0.2 41 62 277 298 268 300 0.69 5 25 0.87 1.1e+03 0.6 1.4 29 62 373 406 371 409 0.79 6 25 0.00023 0.28 12.1 4.2 25 60 439 474 437 479 0.84 7 25 0.058 73 4.4 0.3 25 45 477 497 474 504 0.86 8 25 4.7e-05 0.059 14.3 7.3 26 63 506 543 501 544 0.94 9 25 0.023 29 5.7 2.6 32 64 554 586 546 608 0.89 10 25 1.7e-07 0.00021 22.1 3.2 23 65 615 657 613 657 0.94 11 25 2e-05 0.025 15.4 3.4 21 62 648 689 648 692 0.84 12 25 0.01 13 6.8 4.3 25 65 673 713 672 720 0.49 13 25 0.038 47 5.0 7.7 24 64 693 733 684 734 0.82 14 25 0.0091 11 6.9 2.4 30 63 720 753 718 755 0.89 15 25 0.00024 0.31 12.0 3.1 26 64 779 817 759 818 0.86 16 25 3e-05 0.037 14.9 6.9 28 64 809 845 805 852 0.94 17 25 0.022 28 5.7 4.6 32 64 848 880 844 881 0.83 18 25 0.0016 2 9.3 1.4 32 62 876 906 874 908 0.49 19 25 9.8e-07 0.0012 19.7 3.1 25 63 897 935 895 937 0.93 20 25 0.77 9.6e+02 0.8 2.5 25 65 939 979 935 983 0.72 21 25 0.051 64 4.5 8.3 28 58 970 1000 952 1013 0.60 22 25 0.0017 2.2 9.2 5.5 24 62 1008 1046 1005 1053 0.72 23 25 9.1e-06 0.011 16.5 5.0 24 65 1050 1091 1047 1091 0.94 24 25 0.0013 1.6 9.7 3.6 23 57 1098 1132 1095 1139 0.66 25 25 0.099 1.2e+02 3.6 1.3 35 56 1290 1311 1261 1315 0.73
Sequence Information
- Coding Sequence
- ATGATGGCGGGTCGCACATGTCAATGCGGATGCACTGACCCACCGAAAATGACGGGAGGTGATCCTCCGAACGAAGGATCCTGCGGGTGCAGCTACAATCCGCTGGGAGAGGGTGGAAGGGATGCGGAAATAACAGACCTATCTTACGCCCTGCGGAAACTGACCTCGATGAAATGCCAGATGAAGAAATGGAGGATGGAACGTCTGCAGCTTGAGAGTGAGGCGAGGGCTTTGAAGCAGGTGCTGCAGGCCCACGGTCTTAACGACGACATCGTGAGGCCCGATCCACTGCTTGCTCATCTTCGAGAGGAGAATGCCAGGCTGGAAAACGAAAACGAAGAACTTCAGGACAAGGTTAAAGGACTCGAGGACACCATAACCGAGTACGAGTATGTCGAGTCACCGTGCGAACTGGTCAGCAAACTTCGCGAAAAGATGAGGAACATGAAGGAGGCTCATGCTGGTGAAAAACGAAGATTGAGAGAGTTAATTTCCGGGCTGAAGATCCGGCTCCAGGAGGCGGAGGCCGAGTCATCTTGTGCAGCATTGAATCGTCTGCGAGCAAAGCTTCGCGAGCTAACTGAAGGCGGACAGGAGGCAGACCAACGGGTTTCGAAAGTGGTTCAGCGTTCGATAGAAACGTTGGTCGAGCTGACGGATAACGTTGATGACCTAAAGGCGGAGATCGAGAGACTTCGTGCGGAGATCAAGCGTCTAAAGGACCTGCTGGACGCCTGTGAAGAGCGGCGAAGAACGGCGACTGATGTCGCTGTCGAAACAACCCTCCCGGAGGTGAAACCACCAGAAAAACCGCTGGTGGAAATGGACGTTTCAGATCTACTCAACAGGATCAAGGAGCTCGAAGCACTCATAGCTCAGCTGAGGAAGCAGCTCGTGGACAAGGATGCCGTCATCAATGACCTCCAAAATAAACTGTTCAACGTCACATCGGACAATAAGAGACTCAGCACTGACCTAGATCAAATGATGGTCAGCTACAGAGCCGTCATGGACGAGGTAAAAGCTATGAAGGATGAGCTCAAAAAGAGGGACGTGAAGGTTTCGGATCTCTTGCGTGAGCTCCAAGCGTCGGCGATCGATATGCTGGGATTGAACAGGCTGCAGAGTGAGATTGAATCGGTCAAGCCACAATTGTACAACCTTGAACTGGAGAGAGAACAGCTGTTGTCAGAGCTCGGTAAGGTTCGGGGAGTAGTTTCGGAGAGGAACGATCAGATAATTAAGATCCTGGAGGAAAGAGACAAGCATGCTCGAGCTCTGGGCAAGGTCGCGAGCACGATACAGGAAACGGCCGAACGGGAGGAGGCGCTGAAACGCGAGATTGATCGGCTGAAGGATCAAATAGCTGAGCTTGAGAAGGAGATAGCTGAGCTTAAGAAAAAGGTGGCTGAGCTCGCGGCGGGGAACGAAAAAATTCCTggacttgagaaaaaaattaaggagcTCGAAGACGAGCTAGCGAAGCTCAGAGGCGATCTTGCCGCTGCTGACACGAAGATGAACGACCTTGAGAAAGAAATAGCCGATCTGAAGGCAGAAAAAGATGAATTAGCAAGAGAGCTAGCAAAGGCAAAGGAGCAGGTGGAGAAGCTGAAAGAGGAGCTTGCTGCTGAGAGATCTGCAAAAGAAGCGGCCATGAAAGAACTCGAGGTCTGTAGAGCTGAGAACGAGAAACTGAGAGGAGACAACGAGCGAATGAGCAACGAACTCAACGCGGCAAAGGGAGAAATTGAAAGACTGAAAAATGAGCTTGACAAGGTTAATGGCGAGTTGGATAAATCGAGAGCCGAAAACAGTGAGCTCAAGGACCTGCTCGCTGCAGCTAAGGCGGAAATCGACAAGCTCAGAAGCGAGGTCGAGGGGTGCAAGGCTGAGAATGCCAAGCTTAAAGGTGAGATTGTGCGATTAAATGAGGAAGTACAAAAGTTGAAGGCAGAGAATAGCGAGCtcaagaaagagagagacacgCTGCAGGCTGAAGTGGGAAAGCTGAAAGAAAAGATCGACGGAATGCAAGGTGAAATTGATAAGCTGAAGAACGATCTGGCCGCATCTAAAAGCGAGATGGAAAAGCTCAAGAATGATTTGGACGCTTTGAAATCGGAGAACGAGAAGCTCAAGAACAGTTTACGCGAAGCCGAGGCGAAGATAAAGGCATTGGAAGCGGAGAACTCAGATCTTGCTAATAAATTAGCCGATCTGAAGATCAAGAtagaaaatcttgaaaaacagCTTGCAGATGAAAAAGCCGCGAAAGAAGCGGCGCTAAAGGAATTGGCAGCGTTAAAGTCGGACCTCAAAGCGTTACTCGGAGAGATGGACAAGCTCAAAGCCGAGCGGGACAAACTGAAAGGGGAAGTGGATGACCTGACGAAGCGGATGGCAGACTTGACCAATGAGCTAAATCAGCTGAAATCAAAGTGTGCTGCCCTCGCGGCAGAGAACGAGAAGTTGAAAGCAGAAGTCAACGGTCTTAAAACAGAGAATGAGAGGCTGAAGAACGACCTGGAGAAGGTCAAGGCTGACCTTGAGGCAGCGAAATCAGAGAACGCGAAGTTAAAAGCGGAAAATGAGAAGCTGAAGAAAGATTTAATTGATGCTGAGGCAAAGGTCAAGGCGCTCGAGGATAAGGTCAAGGCGCTCGAAGACAAGGTGAAGACGCTCGAAGACAAGGTGAAAACACTTGAAGACAAGGTCAAGGCATGTGAGGACGAAAAGGCGAAGCTTCGTCAGGAGATCGAGGGGCTCAAAAGTCAGATTGACAAACTCAACAGTGAGCTCGCAGCAGAGAAAGCGGCGAAAGAGGCGGCTTTGAAAGAACTGGCAGCGACCAAGGCCGAGCTAGCTGCGCTCAGAACAGAGCTGGACAAAGTGAGAGCCGAGTACGCGAGACTGAATGGTGAGCTCGAAAAGTTGAAGTCGGAGAATGAGAAGATGAAGGGGGAGCTTGACCGACTCAAAGCGGAAAACGCGAAGCTACAAGGCGACCTAGACGCCCTAAAGGCGGAGAATTCGAAGCTCAAAGGAGATCTGGATAAATTGAATTCCGAACTGAGCGCGTTGCGAGCTGAGAACGATAAGTTGAAGgccgaaaattcgaaattaaaGGATGATTTGGCGGCCGCAAAAGAGGAAGCTGCGCGTCTCAAGAGTGATCTGGAAAAACTAAAATCCGAGAATGACGCCCTGAGAGCTGAGAATGACAAGGTGAAGGGAGAACTTGAGGGGCTCAAAGCAGAGCTTAACAAACTACGCGGGGATTTAGACGCCATGAAGGACGAGAATGCGAGGCTCAGGTCTGAGGTTGACAAACTAAAAAGCGATAATGAGAATCTAAAGAACGAGCTTGCGAAGGCCAACGCCGAATTGGAAAATTCTAAGAAAGCAGTTGACAAACCAAAAGCTGCTGTGGCTGCCACTACTGGTCCTCTTCCTGAGAAAGTTCGTCGATCTATTCCAATGGAAGTCCCACCCACTGCTGCTAAGCCTGCAGTGAAGAAGGAACCGCCCCAAATGCCCTCAAAAACTCCAAAAGTTGCACGGCGAGCTTCGGTTGCAAAGAGAGACCAAGGATCCCAAGGCGAGGGCTGCGGTGATTACGTAAGCGCGAATGAACAGCTCAGGAAGAACATAAACAATCAAGACAGAGCTGTTCAGCGGATACGAAACTTCGTGAAGTACGTGCTGGGAGAGAGAGAATCACCTCCGGAGATGGCTGACTCACAAACTCACCGTATGTCATCAGTGATGAGGAACAAATTTGCGGAAGACATAATGGAGTTGCTTAAGGAGTCGCAGTATCTCTCAGAGAGTATATTCAACGCTGAGGCGGACGTTCAGCGGCTTCTGAAGATCCTCGAAGAGCTAGAAAAGTTGAGAAATGAGAACGCTGCGCTCAGAGACAAACTCGAAGTAGCGGAGGAGCCCGTAGGCTTCGGGGACGCTTTCGATGCCGAATCGTGGCTGAAGACGCTGACGTTGACCGAGCTCGCCGAGCTCCACGACAGGATCTGCCTCGTGACGTCGAGCATGGTTAAGCAGGACATAAACCCGGAGGACTACGTTGACGACTCTACTTCACCCGACGGAGTCTGCAGGCCCTGTTCAGGTCTGCAGGACGCGGACGATGAGATGATCGGCGGGGACTACGAAGCTCTGAATAAGAGGATAGCAGCGCTTCAAATGCAGATTAACGAGAAGCAGAACGAGGCAGCAGCCAAGGTTCAAGAGATGCGGAAGGCCATGTGGCGAGAGCAGGACCGGCTGATACGCCTATCGGAGGAGATGAATGCCCAGAAAAGGAACAACCTGACGATGAAGATGAAGATCGGTGGTGAGCTCGATCCTTTCGGGGTTCCCGAAGGAGTCGCCGTCCTTTGCGGTGGGAGTCGGAGCTCTGGGGAGCGTGACGATTATCCGGGGACGAAACGGAACCCGCGATTCTCTGCCATCGGAGAGAAAGATGACGGAAAGTGGAAATCTGGCTGCGTTAGGTCGAAGAGAAAAAGCTCTCTCCAAGCTCCGGCACCTTGCGCTCTTCCTGTTAAACACGCAGACGTCCCGTGCTGTCAGAAACTCTGCTGTCCGTCTACCCTAAATCCGAAAACTCTTTTCCCGAAGAAGAGGAAGCAGGATGATGACGATAATATGCAGGAGGTCGCGGTTCTTTGTGGGGAAAATCGACGGGCCACGAATACTTAG
- Protein Sequence
- MMAGRTCQCGCTDPPKMTGGDPPNEGSCGCSYNPLGEGGRDAEITDLSYALRKLTSMKCQMKKWRMERLQLESEARALKQVLQAHGLNDDIVRPDPLLAHLREENARLENENEELQDKVKGLEDTITEYEYVESPCELVSKLREKMRNMKEAHAGEKRRLRELISGLKIRLQEAEAESSCAALNRLRAKLRELTEGGQEADQRVSKVVQRSIETLVELTDNVDDLKAEIERLRAEIKRLKDLLDACEERRRTATDVAVETTLPEVKPPEKPLVEMDVSDLLNRIKELEALIAQLRKQLVDKDAVINDLQNKLFNVTSDNKRLSTDLDQMMVSYRAVMDEVKAMKDELKKRDVKVSDLLRELQASAIDMLGLNRLQSEIESVKPQLYNLELEREQLLSELGKVRGVVSERNDQIIKILEERDKHARALGKVASTIQETAEREEALKREIDRLKDQIAELEKEIAELKKKVAELAAGNEKIPGLEKKIKELEDELAKLRGDLAAADTKMNDLEKEIADLKAEKDELARELAKAKEQVEKLKEELAAERSAKEAAMKELEVCRAENEKLRGDNERMSNELNAAKGEIERLKNELDKVNGELDKSRAENSELKDLLAAAKAEIDKLRSEVEGCKAENAKLKGEIVRLNEEVQKLKAENSELKKERDTLQAEVGKLKEKIDGMQGEIDKLKNDLAASKSEMEKLKNDLDALKSENEKLKNSLREAEAKIKALEAENSDLANKLADLKIKIENLEKQLADEKAAKEAALKELAALKSDLKALLGEMDKLKAERDKLKGEVDDLTKRMADLTNELNQLKSKCAALAAENEKLKAEVNGLKTENERLKNDLEKVKADLEAAKSENAKLKAENEKLKKDLIDAEAKVKALEDKVKALEDKVKTLEDKVKTLEDKVKACEDEKAKLRQEIEGLKSQIDKLNSELAAEKAAKEAALKELAATKAELAALRTELDKVRAEYARLNGELEKLKSENEKMKGELDRLKAENAKLQGDLDALKAENSKLKGDLDKLNSELSALRAENDKLKAENSKLKDDLAAAKEEAARLKSDLEKLKSENDALRAENDKVKGELEGLKAELNKLRGDLDAMKDENARLRSEVDKLKSDNENLKNELAKANAELENSKKAVDKPKAAVAATTGPLPEKVRRSIPMEVPPTAAKPAVKKEPPQMPSKTPKVARRASVAKRDQGSQGEGCGDYVSANEQLRKNINNQDRAVQRIRNFVKYVLGERESPPEMADSQTHRMSSVMRNKFAEDIMELLKESQYLSESIFNAEADVQRLLKILEELEKLRNENAALRDKLEVAEEPVGFGDAFDAESWLKTLTLTELAELHDRICLVTSSMVKQDINPEDYVDDSTSPDGVCRPCSGLQDADDEMIGGDYEALNKRIAALQMQINEKQNEAAAKVQEMRKAMWREQDRLIRLSEEMNAQKRNNLTMKMKIGGELDPFGVPEGVAVLCGGSRSSGERDDYPGTKRNPRFSAIGEKDDGKWKSGCVRSKRKSSLQAPAPCALPVKHADVPCCQKLCCPSTLNPKTLFPKKRKQDDDDNMQEVAVLCGENRRATNT
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01047992;
- 90% Identity
- iTF_01047992;
- 80% Identity
- -