Nlec010032.1
Basic Information
- Insect
- Neodiprion lecontei
- Gene Symbol
- -
- Assembly
- GCA_001263575.1
- Location
- NW:19941-25523[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domains group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
- Hmmscan Out
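| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 28 | 1.9 | 8.3e+02 | -0.6 | 0.9 | 46 | 64 | 66 | 84 | 52 | 85 | 0.58 |
| 2 | 28 | 0.00024 | 0.11 | 11.9 | 1.6 | 35 | 64 | 99 | 128 | 98 | 129 | 0.89 |
| 3 | 28 | 0.00069 | 0.31 | 10.4 | 1.3 | 28 | 59 | 216 | 247 | 209 | 252 | 0.67 |
| 4 | 28 | 1.6 | 7e+02 | -0.4 | 0.2 | 41 | 62 | 277 | 298 | 268 | 300 | 0.69 |
| 5 | 28 | 6 | 2.7e+03 | -2.2 | 0.0 | 25 | 47 | 303 | 325 | 301 | 330 | 0.82 |
| 6 | 28 | 0.77 | 3.4e+02 | 0.6 | 1.4 | 29 | 62 | 373 | 406 | 371 | 409 | 0.79 |
| 7 | 28 | 0.00026 | 0.12 | 11.7 | 5.0 | 28 | 63 | 442 | 477 | 434 | 479 | 0.55 |
| 8 | 28 | 0.078 | 35 | 3.8 | 1.0 | 25 | 45 | 477 | 497 | 474 | 504 | 0.85 |
| 9 | 28 | 2.9e-05 | 0.013 | 14.8 | 6.4 | 25 | 63 | 505 | 543 | 501 | 544 | 0.94 |
| 10 | 28 | 0.0093 | 4.2 | 6.7 | 1.5 | 32 | 64 | 554 | 586 | 546 | 587 | 0.86 |
| 11 | 28 | 0.77 | 3.5e+02 | 0.6 | 0.1 | 47 | 63 | 583 | 599 | 581 | 612 | 0.67 |
| 12 | 28 | 1.4e-07 | 6.2e-05 | 22.2 | 3.1 | 23 | 65 | 615 | 657 | 613 | 657 | 0.94 |
| 13 | 28 | 7.7e-06 | 0.0035 | 16.6 | 3.6 | 21 | 62 | 648 | 689 | 648 | 692 | 0.91 |
| 14 | 28 | 0.0039 | 1.7 | 8.0 | 5.9 | 22 | 65 | 670 | 713 | 670 | 720 | 0.71 |
| 15 | 28 | 0.035 | 16 | 4.9 | 7.8 | 24 | 64 | 693 | 733 | 684 | 734 | 0.82 |
| 16 | 28 | 0.0074 | 3.3 | 7.1 | 2.4 | 30 | 63 | 720 | 753 | 716 | 755 | 0.90 |
| 17 | 28 | 0.0005 | 0.23 | 10.8 | 2.5 | 26 | 61 | 779 | 814 | 756 | 815 | 0.73 |
| 18 | 28 | 1.1e-05 | 0.0049 | 16.1 | 5.4 | 28 | 64 | 809 | 845 | 807 | 846 | 0.94 |
| 19 | 28 | 0.021 | 9.4 | 5.6 | 3.9 | 32 | 61 | 848 | 877 | 844 | 880 | 0.81 |
| 20 | 28 | 4.2e-07 | 0.00019 | 20.7 | 2.4 | 25 | 63 | 876 | 914 | 874 | 916 | 0.93 |
| 21 | 28 | 0.089 | 40 | 3.6 | 6.1 | 27 | 62 | 934 | 969 | 916 | 970 | 0.83 |
| 22 | 28 | 0.0041 | 1.8 | 7.9 | 7.8 | 22 | 63 | 957 | 998 | 942 | 1000 | 0.74 |
| 23 | 28 | 0.0025 | 1.1 | 8.6 | 5.7 | 26 | 62 | 989 | 1025 | 984 | 1034 | 0.60 |
| 24 | 28 | 7.2e-06 | 0.0032 | 16.7 | 4.9 | 24 | 65 | 1029 | 1070 | 1026 | 1070 | 0.94 |
| 25 | 28 | 0.00092 | 0.41 | 10.0 | 4.1 | 28 | 63 | 1068 | 1103 | 1064 | 1105 | 0.85 |
| 26 | 28 | 0.00094 | 0.42 | 9.9 | 3.3 | 23 | 57 | 1077 | 1111 | 1074 | 1118 | 0.66 |
| 27 | 28 | 8.9 | 4e+03 | -2.8 | 0.0 | 39 | 59 | 1188 | 1208 | 1184 | 1211 | 0.79 |
| 28 | 28 | 0.054 | 24 | 4.3 | 1.3 | 34 | 57 | 1268 | 1291 | 1243 | 1295 | 0.77 |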
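The per-domain hmmscan hits above can be screened by their independent E-value (i-Evalue). The following is a minimal Python sketch of that filtering step, not the pipeline used to build this record. The column names, the helper functions `parse_rows` and `significant`, and the 0.01 cutoff are illustrative assumptions; the three data rows are copied from the hits above (domains 9, 12 and 27).

```python
# Column names are assumed labels for the 13 fields of each hit row above.
COLUMNS = ["idx", "of", "c_evalue", "i_evalue", "score", "bias",
           "hmm_from", "hmm_to", "ali_from", "ali_to",
           "env_from", "env_to", "acc"]

# Three rows copied from the hmmscan output above (domains 9, 12 and 27).
ROWS = """\
9 28 2.9e-05 0.013 14.8 6.4 25 63 505 543 501 544 0.94
12 28 1.4e-07 6.2e-05 22.2 3.1 23 65 615 657 613 657 0.94
27 28 8.9 4e+03 -2.8 0.0 39 59 1188 1208 1184 1211 0.79
"""


def parse_rows(text):
    """Turn whitespace-separated rows into dicts of floats keyed by column."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != len(COLUMNS):
            continue  # skip blank or malformed lines
        hits.append(dict(zip(COLUMNS, map(float, fields))))
    return hits


def significant(hits, max_i_evalue=0.01):
    """Keep domains whose i-Evalue is at or below the (assumed) cutoff."""
    return [h for h in hits if h["i_evalue"] <= max_i_evalue]


hits = parse_rows(ROWS)
kept = significant(hits)
print([int(h["idx"]) for h in kept])
```

Passing a looser cutoff, for example `significant(hits, max_i_evalue=0.05)`, also retains domain 9; the appropriate threshold depends on the analysis and is not specified by this record.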
Sequence Information
- Coding Sequence
- ATGATGGCGGGTCGCACATGTCAATGCGGATGCACTGACCCACCGAAAATGACGGGAGGTGATCCTCCGAACGAAGGATCCTGCGGGTGTAGCTACAATCCGCTGGGAGAGGGCGGGAGGGATGCGGAAATAACGGACCTATCTTACGCCCTGCGGAAACTGACCTCGATGAAATGCCAGATGAAGAAATGGAGGATGGAACGTCTGCAGCTTGAGAGTGAGGCGAGGGCTTTGAAGCAGGTGCTGCAGGCCCACGGTCTTAACGACGACATCGTGAGGCCTGATCCACTGCTTGCTCATCTTCGAGAGGAGAATGCCAGGCtggaaaacgaaaacgaagaaCTTCAGGACAAGGTTAAAGGACTCGAGGACACCATAACCGAGTACGAGTATGTCGAGTCACCGTGCGAACTGGTCAGCAAACTTCGCGAAAAGATGAGGAACATGAAGGAGGCTCATGCTGGTGAAAAACGAAGATTGAGAGAGTTAATTTCCGGGCTGAAGATCCGGCTCCAGGAGGCGGAGGCCGAGTCATCTTGTGCAGCATTGAATCGTCTGCGAGCAAAGCTTCGCGAGCTAACTGAAGGCGGACAGGAGGCAGACCAACGGGTTTCGAAAGTGGTTCAGCGTTCGATAGAAACGTTGGTCGAGCTGACGGATAACGTTGATGACCTAAAGGCGGAGATCGAGAGACTTCGTGCGGAGATCAAGCGTCTAAAGGACCTGCTGGACGCCTGTGAAGAGCGGCGAAGAACGGCGACTGATGTCGCTGTCGAAACAACCCTCCCGGAGGTGAAACCACCAGAAAAACCGCTGGTGGAAATGGACGTTTCAGATCTACTCAACAGGATCAAGGAGCTCGAAGCACTCATAGCTCAGCTGAGGAAGCAGCTCGTGGACAAGGATGCCGCCATCAATGACCTCCAAAACAAACTGTTCAACGTCACATCGGACAATAAGAGACTCAGCACTGACCTAGATCAAATGATGGTCAGCTACAGAGCCGTCATGGACGAGGTAAAAGCTATGAAGGATGAGCTCAAAAAGAGGGACGTGAAGGTTTCGGATCTCTTGCGTGAGCTCCAAGCGTCGGCGATCGATATGCTGGGATTGAACAGGCTGCAGAGTGAGATTGAATCGGTCAAGCCACAATTGTACAACCTTGAACTGGAGAGAGAACAGCTGTTGTCAGAGCTCGGTAAGGTTCGGGGAGTAGTTTCGGAGAGGAACGATCAGATAATTAAGATCCTGGAGGAAAGAGACAAGCATGCTCGAGCTCTGGGCAAAGTCGCGAGCACGATACAGGAAACGGCCGAACGGGAGGAGGCGCTGAAACGCGAGATTGATCGGCTGAAGGATCGAATAGCTGAGCTTGAGAAGGAGATAGCTGAGCTTAAGAAAAAGGTGGCTGAGCTCGCGGcggagaacgaaaaaattcctggacttgagaaaaaaattaaggagcTCGAAGACGAGCTAGCGAAGCTCAGAGGCGATCTTGCCGCTGCTAACACGAAGATGAACGACCTTGAGAAAGAAATAGCCGATCTAAAGGCAGAAAAAGATGCATTAGCAAGAGAGCTAGCAAAGGCAAAGGAGCAGGTGGAGAAGCTGAAAGAGGAGCTTGCTGCTGAGAGATCTGCAAAAGAAGCGGCCATGAAAGAACTCGAGGTCTGTAGAGCTGAGAACGAGAAACTGAGAGGAGACAACGAGCGAATGAGCAACGAACTCAACGCGGCAAAGGGCGAAATTGAAAGACTGAAAAATGAGCTTGACAAGGTTAAGGGCGAGTTGGATAAATCGAGAGCCGAAAACAGTGAGCTCAAGGACCTGCTCGCTGCAGCTAAGGCGGAAATCGACAAGCTCAGAAGCGAGGTCGAGGGGTGCAAAGCTGAGAATGCCAAGCTTAAAGGTGAGATTGTGCGATTAAATGAGGAAGTACAAAAGTTGAAGGCAGAGAATAGCGAGCtcaagaaagagagagacacgcTGCAGGCT
GAAGTGGGAAAGCTGAAAGAAAAGATCGACGGAATGCAAGCTGAAATTGATAAGCTGAAGAACGATCTGGCCGCATCTAAAAGCGAGATGGAAAAGCTCAAGAATGATTTGGACGCTTTGAAATCGGAGAACGAGAAGCTCAAGAACAGTTTACGCGAAGCCGAGGCGAAGATAAAGGCATTGGAAGCGGAGAACTCAGATCTTGCTAATAAATTAGCCGATCTGAAGATCAAGAtagaaaatcttgaaaaacagCTTGCAGATGAAAAAGCCGCGAAAGAAGCGGCGCTAAAGGAATTGGCAGCGTTAAAGTCGGACCTCAAAGCGTTACTCGGAGAGATGGACAAGCTCAAAGCCGAGCGGGACAAACTGAAAGGGGAAGTGGATGACCTGACGAAGCGGATGGCAGACTTGACCAATGAGCTAAATCAGCTGAAATCAAAGTGTGCTGCCCTCGCGGCAGAGAACGAGAAGTTGAAAGCAGAAGTCAACGGTCTTAAAACAGAGAATGAGAGGCTGAAGAACGACCTGGAGAAGGTCAAGGCTGACCTTGAGGCAGCGAAATCAGAGAACGCGAAGTTAAAAGCGGAAAATGAGAAGCTGAAGAAAGATTTGATTGATGCTGAGGCAAAGGTCAAGGCGCTCGAGGATAAGGTCAAGGCATGTGAGGACGAAAAGGCGAAGCTTCGTCAGGAGATCGAGGGGCTCAAAAGTCAGATTGACAAACTCAACAGTGAGCTCGCAGCAGAGAAAGCGGCGAAAGAGGCGGCTTTGAAAGAACTGGCAGCGACCAAGGCCGAGCTAGCTGCGCTCAGAACAGAGCTGGACAAAGTGAGAGCCGAGAACGCGAGACTGAATGGTGAGCTCGAAAAGTTGAAGTCGGAGAATGAGAAGATGAAGGGGGAGCTTGACCGACTCAAAGCGGAAAACGCGAAGCTACAAGGTGACCTGGACGCCCTAAGGGCGGAGAATTCGAAGCTCAAAGGAGATCTGGATAAATTGAATTCCGAACTGAGCGCGTTGCGAGCTGAGAACGATAAGTTGAAGgccgaaaattcgaaattgaagGATGATTTGGCGGCCGCAAAAGAGGAAGCTGCGCGTCTCAAGAGTGATctggaaaaactaaaatccGAGAATGACGCCCTGAGAGCTGAGAACGACAAGGTGAAGGGAGAACTTGAGGGGCTCAAAGCAGAGCTTAACAAACTACGCGGGGATTTAGACGCCATGAAGGACGAGAATGCGAGGCTCAGGTCTGAGGTTGACAAACTAAAAAGCGATAATGAGAATCTAAAGAACGAGCTTGCGAAGGCCAACGCCGAGTTGGAAAATTCTAAGAAAGCAGTTGACAAACCAAAAGCTGCTGTGGCTGCCACTACTGGTCCTCTTCCTGAGAAAGTTCGTCGATCTATTCCAATGGAAGTCCCACCCACTGCTGCCAAGCCTGCAGTGAAGAAGGAACCGCCCCGAATGCCCTCAAAAACTCCAAAAGTTGAACGGCGAGCTTCGGTTGCAAAGAGAGACCAAGGATCCCAAGGCGAGGGCTGCGGTGATTACGTAAGCGCGAATGAACAGCTCAGGAAGAACATAAACAATCAAGACAGAGCTGTTCAGCGGATACGAAACTTCGTGAAGTACGTGCtgggagagagagaatcaCCTCCGGAGATGGCTGACTCACAAACTCACCGTATGTCATCAGTGATGAGGAACAAATTTGCGGAAGACATAATGGAGTTGCTTAAGGAGTCGCAGTATCTCTCAGAGAGTATATTCAACGCTGAGGCGGACGTTCAGCGGCTTCTGAAGATCCTCGAAGAGCtagaaaagttgagaaatgAGAACGCTGCGCTCAGAGACAAACTCGAAGTGACGGAGGAGCCCGTAGGCTTCGGGGACGCTTTCGACGCCGAATCGTGGCTGAAGACGCTGACGTTGACCGAGCTCGCCGAGCTCCACGACAGGATCTGCCTCGTGACGTCGAGCATGGTTAAGCA
GGACATAAACCCGGAGGACTACGTTGACGACTCTACTTCACCCGACGGAGTCTGCAAGCCCTGTTCAGGTCTGCAGGACGCGGACGATGAGATGATCGGCGGGGACTACGAAGCTCTGAATAAGAGGATAGCAGCGCTTCAAATGCAGATTAACGAGAAGCAGAACGAGGCAGCAGCCAAGGTTCAAGAGATGCGGAAGGCCATGTGGCGAGAGCAGGACCGGCTGATACGCCTATCGGAGGAGATGAATGCCCAGAAAAGGAACAACCTGacgatgaagatgaagatcGGTGGTGAGCTCGATCCTTTCGGGGTTCCCGAAGGAGTCGCCGTCCTTTGCGGTGGGAGTCGGAGCTCTGGGGAGCTTGACGATTATCCGGGGACGAAATGGAACCCGCGATTCTCTGCCATCGGAGAGAAAGATGACGGAAAGTGGAAATCTGGCTGCGTTAGGTCGAAGAGAAAAAGCTCTCTCCAAGCTCCGGCACCTTGCGCTCTTCCTGTTAAACACGCAGACGTCCCGTGCTGTCAGAAACTCTGCTGTCCGTCTACCCTAAATCCGAAAACTCTTTTCCCGAAGAAGAGGAAGCAGGATGATGACGATAATATGCAGGAGGTCGCGGTTCTTTGTGGGGAAAATCGACGGGCCACGAATACTTATgatagataa
- Protein Sequence
- MMAGRTCQCGCTDPPKMTGGDPPNEGSCGCSYNPLGEGGRDAEITDLSYALRKLTSMKCQMKKWRMERLQLESEARALKQVLQAHGLNDDIVRPDPLLAHLREENARLENENEELQDKVKGLEDTITEYEYVESPCELVSKLREKMRNMKEAHAGEKRRLRELISGLKIRLQEAEAESSCAALNRLRAKLRELTEGGQEADQRVSKVVQRSIETLVELTDNVDDLKAEIERLRAEIKRLKDLLDACEERRRTATDVAVETTLPEVKPPEKPLVEMDVSDLLNRIKELEALIAQLRKQLVDKDAAINDLQNKLFNVTSDNKRLSTDLDQMMVSYRAVMDEVKAMKDELKKRDVKVSDLLRELQASAIDMLGLNRLQSEIESVKPQLYNLELEREQLLSELGKVRGVVSERNDQIIKILEERDKHARALGKVASTIQETAEREEALKREIDRLKDRIAELEKEIAELKKKVAELAAENEKIPGLEKKIKELEDELAKLRGDLAAANTKMNDLEKEIADLKAEKDALARELAKAKEQVEKLKEELAAERSAKEAAMKELEVCRAENEKLRGDNERMSNELNAAKGEIERLKNELDKVKGELDKSRAENSELKDLLAAAKAEIDKLRSEVEGCKAENAKLKGEIVRLNEEVQKLKAENSELKKERDTLQAEVGKLKEKIDGMQAEIDKLKNDLAASKSEMEKLKNDLDALKSENEKLKNSLREAEAKIKALEAENSDLANKLADLKIKIENLEKQLADEKAAKEAALKELAALKSDLKALLGEMDKLKAERDKLKGEVDDLTKRMADLTNELNQLKSKCAALAAENEKLKAEVNGLKTENERLKNDLEKVKADLEAAKSENAKLKAENEKLKKDLIDAEAKVKALEDKVKACEDEKAKLRQEIEGLKSQIDKLNSELAAEKAAKEAALKELAATKAELAALRTELDKVRAENARLNGELEKLKSENEKMKGELDRLKAENAKLQGDLDALRAENSKLKGDLDKLNSELSALRAENDKLKAENSKLKDDLAAAKEEAARLKSDLEKLKSENDALRAENDKVKGELEGLKAELNKLRGDLDAMKDENARLRSEVDKLKSDNENLKNELAKANAELENSKKAVDKPKAAVAATTGPLPEKVRRSIPMEVPPTAAKPAVKKEPPRMPSKTPKVERRASVAKRDQGSQGEGCGDYVSANEQLRKNINNQDRAVQRIRNFVKYVLGERESPPEMADSQTHRMSSVMRNKFAEDIMELLKESQYLSESIFNAEADVQRLLKILEELEKLRNENAALRDKLEVTEEPVGFGDAFDAESWLKTLTLTELAELHDRICLVTSSMVKQDINPEDYVDDSTSPDGVCKPCSGLQDADDEMIGGDYEALNKRIAALQMQINEKQNEAAAKVQEMRKAMWREQDRLIRLSEEMNAQKRNNLTMKMKIGGELDPFGVPEGVAVLCGGSRSSGELDDYPGTKWNPRFSAIGEKDDGKWKSGCVRSKRKSSLQAPAPCALPVKHADVPCCQKLCCPSTLNPKTLFPKKRKQDDDDNMQEVAVLCGENRRATNTYDR
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01048633;
- 90% Identity
- iTF_01048633;
- 80% Identity
- -