Npin029534.1
Basic Information
- Insect
- Neodiprion pinetum
- Gene Symbol
- -
- Assembly
- GCA_021155775.2
- Location
- CM037746.1:19673282-19678588[+]
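The Location value packs a sequence accession, a start, an end and a strand into one string. A minimal sketch of splitting it apart follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid:start-end[strand]' string into its four parts."""
    m = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("CM037746.1:19673282-19678588[+]")

# Genomic span of the locus; assumes 1-based, inclusive coordinates.
span = end - start + 1
print(seqid, start, end, strand, span)
```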
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domains group
- Description
- bZIP proteins form homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  19    0.0015       1.9    9.4   0.0        37      62       291     316       287     318  0.90
 2  19    0.0021       2.6    9.0   5.4        25      64       585     624       584     625  0.88
 3  19    0.0062       7.8    7.5   8.6        27      63       615     651       614     660  0.74
 4  19    0.0043       5.4    8.0   3.0        29      61       666     698       662     702  0.78
 5  19    0.0039       4.9    8.1   2.4        30      61       709     740       704     744  0.77
 6  19    0.0058       7.3    7.6   3.5        30      61       751     782       746     788  0.72
 7  19    0.0043       5.4    8.0   3.0        29      61       802     834       798     838  0.78
 8  19    0.0043       5.4    8.0   3.0        29      61       844     876       840     880  0.78
 9  19     0.002       2.5    9.1   2.5        25      58       889     922       882     929  0.61
10  19      0.15   1.9e+02    3.1   0.7        25      57       931     963       930     971  0.89
11  19    0.0055       6.9    7.6   6.1        23      63      1027    1067      1025    1069  0.87
12  19     0.029        36    5.3   2.8        25      55      1057    1087      1056    1089  0.83
13  19      0.22   2.7e+02    2.5   0.1        33      54      1111    1132      1109    1136  0.66
14  19      0.34   4.3e+02    1.9   0.1        33      54      1157    1178      1155    1183  0.59
15  19      0.37   4.6e+02    1.8   0.1        33      54      1203    1224      1201    1227  0.56
16  19      0.24     3e+02    2.4   0.0        33      55      1249    1271      1247    1274  0.79
17  19      0.12   1.5e+02    3.3   0.2        32      54      1294    1316      1290    1320  0.76
18  19      0.35   4.3e+02    1.9   0.1        33      54      1341    1362      1339    1366  0.57
19  19      0.14   1.8e+02    3.1   0.3        32      54      1386    1408      1382    1411  0.75
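The hmmscan output above lists 19 per-domain hits, each with 13 whitespace-separated fields. As a rough sketch, the rows can be parsed and filtered by independent E-value as shown here. The names `DomainHit` and `parse_rows` are hypothetical, the cutoff of 10.0 is an arbitrary illustration rather than a threshold used by the database, and only four rows from the record are included for brevity.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # hit number (the "#" column)
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein

# Four rows copied from the record; the remaining rows parse the same way.
ROWS = """\
1 19 0.0015 1.9 9.4 0.0 37 62 291 316 287 318 0.90
2 19 0.0021 2.6 9.0 5.4 25 64 585 624 584 625 0.88
10 19 0.15 1.9e+02 3.1 0.7 25 57 931 963 930 971 0.89
13 19 0.22 2.7e+02 2.5 0.1 33 54 1111 1132 1109 1136 0.66
"""

def parse_rows(text: str) -> list[DomainHit]:
    """Parse whitespace-separated domain rows with 13 fields each."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue  # skip blank or malformed lines
        hits.append(DomainHit(int(f[0]), float(f[2]), float(f[3]),
                              float(f[4]), int(f[8]), int(f[9])))
    return hits

hits = parse_rows(ROWS)

# Illustrative cutoff only (an assumption, not a database setting).
kept = [h for h in hits if h.i_evalue <= 10.0]
for h in kept:
    print(h.index, h.i_evalue, h.ali_from, h.ali_to)
```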
Sequence Information
- Coding Sequence
- ATGGCGCCAGCAGCGGTGGCTGTTTTCGTAATATCGCTGTTCGTAGAATCGTTCTCAGCACCGAACGGGTGTATCAGATGTGTGACATCAGATACTACCGGTGGCTACCAGACGCAGAGTGGTTGGGTAAATAGCAACAACTTATCGCAGAGGTCAGCAAATTTGGAAGATTTGACACAACAAGTAGAGGGTGAACTCGGTGGACCCCACAATCAGTTAGCCTTTGATAATACCAGGCCTGGAAACTGGAGGGACGTGAAGCAGTATCGAACAGCTGACGGTCATGGTAGGGTGTACGAAGAGCAAGGTCAACAAGTTCAAGGCCCAGTCCGAGTgagatattacaaaaaaaatttcacctcgaGCTACAGCAGCGGAAACCCAGGTGGTTTTGAAGTACCGTCTTTATCTGGATTTGATTCTGACAACATTCGATACGGCAGTCAATCGGCAAACCAGGGCAGTTTTGGCCAGGAACAGAACTCTGCTTACGATCAATCTGCAATCCGTGAAAATTCATACGCTTCGCAGGGTTCTTTACATTCGGCAAATCAATTCAGCAGCCAGGATGAAAGACTAGGAAGTCAACAGCGACGCGTAACGAGCAGTCAGTTTGGTGAAAATACATCCGGATTCAACAGTCGATCGACGCAGCAAGGATTGTCTGAACAGGAATCACTGAGGCCAGGAAATTGGACCACAGCTAACACTTATAAAACTGACGGAGGTAACGGCAGGGTATACGAAGAACGAGGACAAGTTGTTACAGGACCGCAAAAAATTCGTTTCTACAGGAAAAATTATACCTCGACTTACAGCTCGGATGGCGGGATTCCAAATTTGATTTCAGGGACTGATGGAGCTACAACCTTTGAAAGAGAAGTACAGCAACTGCAGAAACAGATTGATTCTATGGGACGAGAGGTTCATCAAACTGGTCAACATTTCTCAAATAGCGATTACGCGCAGCAGACTTCTAGAAATTACGGACAGTCAAGTCCGACTGTAGACCGTACAAACTTCAGACATGTGTCAACGACAGGTAACTACGGGTCACAGAACTATGATAACTTGGGATTAAATTCTCAACAAATACGGCACGGTGCTTATGACAGCGAAAATCAACGTGTATATCAGACTGGGAGCAGTATTAATAGGCAAATAGAAAGTGGAAACCAAAGGGGTCAAACTTATGGCCACATGACCGGCAGGTACGACAATACCAATGGACATGGTGCTGGGGGAGTACAAAGACCGATTTACCCAGAATCAAATACTCGGCAAACTCAGCGTCAGTATCTTTCTCAGTCTGAGCAGGAAGAGTATATACGCAATCACGGGTCTCAAAGTTGGAACCAAGATCAAGTGTCCACTGGTAATTTGAATCGTGAGCAGCAAATTCATCATTCTTCATCACATAACGCAGGAACAGGACCAGGCCGCATTCAGCATTATGATCAATTTCAAACTACCTCTTCGTCTTCGTCCCAGTATCCGAATATTGACAGACGAAACCTCCAAGCTGATAGTAATAGAGAAACACAACAGACGCATGGTGTGAGCCAGGATAAAACCGTCGGTAATAATTATCACAGGAATTACAGAATTCAATCTGGAAGACTGAGTACACAGGGTATTGATTTAGGGCGACTGGCACACGGACCTGATTGTGTAGATACTGCAAGTGGACATACCTCGTATGAGCAATCCCAATATCATACACAATATAGACGGAATACTGAAGATTTTGATCAACAAACCCAACATATCAACCAGCACACGGAAGATCTTACCCAGCAAACAGAAGATCTTACCCAACAAATAGAAGATCTTACCCAACAAACACAGGATCTTACCCAGCAAACGGAGGATCTTACCCAGCAAACACAGGATCTTACTCAGCAAACAGAGGATTTAACTCAGCAAACACAAGATCTTACCCAACAGACGGAGGATTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGAC
CTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATTTAACCCAGCAAACACAAGATCTTACCCAACAGACGGAGGATTTTAGTCAACAATCGCAGGGTGTTATTGAGCAATCGGGATACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATCTGACCCAACAGACGGAGGACTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATCTGACCCAACAGACGGAGGACTTTAGTCAACAATCACAGGGTACGGAGGATTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATCTGACCCAACAGACGGAGGACTTTAGTCAACAATCACAGGGTGTTATTGAGCAATCGGGAGACCTTACTCAGCAAACACAGGATCTTACCCAACAAACGGAGGATCTTACCCAGCAAACACAGGATTTGACCCAACAGACGGAGGACTTTAGTCAACAATCGCAGGGTGTTATTGAGCAATCGGGAGACCTTACCCAGCAAACAGAGGATCTTGCCCAGCAAACACAGGATCTTACTCAACAAACAGAGGATCTTACCCAGCAAACACAAGACTTTACCCAACAGACGGACGACTTTACGCAACAATCACAGGATTTCACACAGCAAACGGAAGAACTCCCTCAACAAACACAAGGATATATTCAACAATCGCAAGATCTGACGCAACAGATAGAAGATCTTCCTCAACAAACGGAAGGATTTATTGCAGAATCAGGAAACCTTGGCCAGCAAGCTGAGGATCTTACCCAACAAACCGTAGATGTGACACAAAGTCTTCAACCAGAATCTGGTCACTCCATAGATGTAGGGGAGCCAAACTATGTGCCGCATGTATCAAATTTTAAAGATGtgcatgaattatttttcgatcatGAAAAAGAAGACAATAGACGTCAACAGATAGAGAAGATCTCTCCACAAGAGGAAGATCTCACCCAACAAACACAAGATCTTAACCAAGAAACAGAGGACCTTACCCAGCAAACACAGGACCTTACCCAGCAGACGGAGGACTTTAATCAACAATCGCAGGATCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCATTTTCAAGTTGGAGATTCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACAGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGAATAATTGGGGGCTCGATCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACGGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGATTAATTGGGGGCTTGATCGTTTTCAAGTTGGAGATCAACAGGCCGAAGACCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGAATAATTGGGGGCTCGA
TCGTTTTCAAGTTGGAGATCCACAGGCCGAAGATCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTTGATCGTTTTCAAGTTGGAGATCAACAGGCCGAAGACCTGAACCAAAGAACACAGGGTCTCACTCAAGAAACGGAAGACCTCTCACAGCAAATTACTGGTGGAAATTCAGACTTTGGACAGCAGACTAATTGGGGGCTCGATCGTTTTCAAGTTGGAGGTCAACAGGCCGAAGATCTGAACCAACAAGCAACAAGTAACTCAGGAATTTACAATggaaatttaaatcaattcgGACAAGATCAAGTATTGGACTACCCAGCGCAAGTAGGAATACAGCCTGCACCAAAACCAAAACCAAAACGTCCAAAACACAGAACAATCCACATACAAGAATTCTCCACAGAACAGGAACAGCCATCAACGGCTAATATACAAAATGGTGCACCAGAAATCATTGTGACTTCCCCATCAAACAGAGGAGATCAACCCAGTCATGAAACAATTTCCAATACAGAATTATACAGTAGTGAACAGTCACATCGAGTTCAGCCGACTAAAACTGGGGGCCGTCGGAGGAATAGGTATTACGGCATTCAACATAAACCTCAAGTTAACCAAGGACAACATGGGCAGGATTCGCAAGTAGAGCATGTTCCAAACGAGGTAACGACCAGCCAAGAGCCGACTCTAAGATACACCCATGTAAATCAACTTGACAAACAGAATTCAGGCGATTCAAATGTCAATCCAAAAAATGTACAACCGATTCCTCAGATTGGAACCAGAATTCTAGAAGCATACGGAGCAAATGGACCATACAACAGTGATCATGAACCAGATTTATTTAATACTGTTAAACCAAATCCAAGTGCAACATTACCACCTGTCTATGGCGACAAGGAGCCTTTCGAAATTATTTGGTCCTACGCagttccaaaaatttttaccaatacCGCCGCTCCAACTACATCCACAGAGCCCACAACTACATCCACAGAGCCCACTACTATGACCACCGAGGTCTCAATTCCAACAACCACTGAGCCTCCAATTGCTACATCCACAACTGTCCCTCCGCCAACAACAACTGCTGCTCCCTCATTATGGCGCAGATTTAGAAACAGAGTTAGTAACACAATAGACAAAGCCAGAGAACGCGCAGCGAGTATCTTTGGTTAA
- Protein Sequence
- MAPAAVAVFVISLFVESFSAPNGCIRCVTSDTTGGYQTQSGWVNSNNLSQRSANLEDLTQQVEGELGGPHNQLAFDNTRPGNWRDVKQYRTADGHGRVYEEQGQQVQGPVRVRYYKKNFTSSYSSGNPGGFEVPSLSGFDSDNIRYGSQSANQGSFGQEQNSAYDQSAIRENSYASQGSLHSANQFSSQDERLGSQQRRVTSSQFGENTSGFNSRSTQQGLSEQESLRPGNWTTANTYKTDGGNGRVYEERGQVVTGPQKIRFYRKNYTSTYSSDGGIPNLISGTDGATTFEREVQQLQKQIDSMGREVHQTGQHFSNSDYAQQTSRNYGQSSPTVDRTNFRHVSTTGNYGSQNYDNLGLNSQQIRHGAYDSENQRVYQTGSSINRQIESGNQRGQTYGHMTGRYDNTNGHGAGGVQRPIYPESNTRQTQRQYLSQSEQEEYIRNHGSQSWNQDQVSTGNLNREQQIHHSSSHNAGTGPGRIQHYDQFQTTSSSSSQYPNIDRRNLQADSNRETQQTHGVSQDKTVGNNYHRNYRIQSGRLSTQGIDLGRLAHGPDCVDTASGHTSYEQSQYHTQYRRNTEDFDQQTQHINQHTEDLTQQTEDLTQQIEDLTQQTQDLTQQTEDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGYLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTQDLTQQTEDLTQQTQDLTQQTEDFSQQSQGVIEQSGDLTQQTEDLAQQTQDLTQQTEDLTQQTQDFTQQTDDFTQQSQDFTQQTEELPQQTQGYIQQSQDLTQQIEDLPQQTEGFIAESGNLGQQAEDLTQQTVDVTQSLQPESGHSIDVGEPNYVPHVSNFKDVHELFFDHEKEDNRRQQIEKISPQEEDLTQQTQDLNQETEDLTQQTQDLTQQTEDFNQQSQDLTQETEDLSQQITGGNSDFGQQTNWGLDHFQVGDSQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQNNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSRQITGGNSDFGQQINWGLDRFQVGDQQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQNNWGLDRFQVGDPQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGDQQAEDLNQRTQGLTQETEDLSQQITGGNSDFGQQTNWGLDRFQVGGQQAEDLNQQATSNSGIYNGNLNQFGQDQVLDYPAQVGIQPAPKPKPKRPKHRTIHIQEFSTEQEQPSTANIQNGAPEIIVTSPSNRGDQPSHETISNTELYSSEQSHRVQPTKTGGRRRNRYYGIQHKPQVNQGQHGQDSQVEHVPNEVTTSQEPTLRYTHVNQLDKQNSGDSNVNPKNVQPIPQIGTRILEAYGANGPYNSDHEPDLFNTVKPNPSATLPPVYGDKEPFEIIWSYAVPKIFTNTAAPTTSTEPTTTSTEPTTMTTEVSIPTTTEPPIATSTTVPPPTTTAAPSLWRRFRNRVSNTIDKARERAASIFG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -