Pate007706.1
Basic Information
- Insect
- Phymatocera aterrima
- Gene Symbol
- -
- Assembly
- GCA_963170745.1
- Location
- OY720660.1:6766228-6771606[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 16 5 4.7e+03 -1.9 0.0 38 62 271 295 267 297 0.85 2 16 0.039 36 4.9 2.6 25 59 633 667 631 670 0.69 3 16 0.0041 3.8 8.0 6.5 25 61 681 717 680 721 0.87 4 16 0.0023 2.1 8.8 7.7 25 61 736 772 734 776 0.87 5 16 0.0023 2.1 8.8 7.7 25 61 791 827 789 831 0.87 6 16 0.0023 2.1 8.8 7.7 25 61 846 882 844 886 0.87 7 16 0.0023 2.1 8.8 7.7 25 61 901 937 899 941 0.87 8 16 0.0023 2.1 8.8 7.7 25 61 956 992 954 996 0.87 9 16 0.0041 3.8 8.0 6.5 25 61 1011 1047 1010 1051 0.87 10 16 0.0023 2.1 8.8 7.7 25 61 1066 1102 1064 1106 0.87 11 16 0.0023 2.1 8.8 7.7 25 61 1121 1157 1119 1161 0.87 12 16 0.0028 2.6 8.5 6.6 26 61 1177 1212 1175 1218 0.81 13 16 0.0028 2.6 8.5 7.2 25 62 1231 1268 1230 1271 0.90 14 16 0.015 14 6.2 5.2 26 61 1294 1329 1289 1336 0.52 15 16 0.76 7.1e+02 0.7 1.0 32 63 1348 1379 1343 1383 0.72 16 16 0.013 12 6.4 2.1 34 61 1399 1426 1395 1435 0.48
Sequence Information
- Coding Sequence
- ATGGCTCCCACCGCAGTGGCAATTTTCGTTATCTCTATATTCACACAAGCGTTAGCAGCTCCGAATCGCTGTGTTACTTGCCAAGTTGGATCAACCGAATGGGTAAACACAGACAACCTATCGCAGAGGTCAGCGAACTTAGAAGACTTAACACGACAAGTTGAAAGTGAGTTAGGAAGATCTCCAAATCAATTAGCTTTCGATGACAGAAGACCTGGAAATTGGACAGATGTAAATCGGTACAGAACAGCTGATGGTCATGGAAAGGTATATGAAGAACACGGCCAACGTGTAGATGGTTCAAAACGGATTAGATTCTTCAGAAAAAACTTCACATCCAGCTACAGTAGTGGAAACTTAGGTGGATTGGGAGAATCTGATTTCGGAGGCTTTGATTCTTCTAGTAGACAAGGAGCCAGCCGCCTGACAAGCCACGGATCTTttaatcagaatcaaaattcagCCTATGACCAATCTGCTATTCGTCGAAATTCTGATGGATACAATGAAGCTTccaaaattcatgaaaatcgtGAAAGTAACGGTCGGTTTACAGAAAATACTCTTGCACATAATGGGCAATTGTCACAGCGAGGATCAATTGCTTGGGATCGAACAAGACCAGGAAATTGGAGTACTCATAATTCTTATAGCACTGATGAAGGTAATGGCAGAGTTTATGAAGAACGAGGACAGTATATATCAGGGCCTGGTCAAGTTCGgttctataaaaaaaattatactacGAGTTACACTTCACATGGAGCTATTCCAAATATAAATCTAGAAACAGATGGGGCAACGAGTTTCGAGGGAGAAGTACAGGGCTTACAGAGACGTTTTGACAGCCAGGGGAGAGAGATTCATCAAACTTCCCAAGGTTTGACAAGTGGTGGATACAGTCAGCACAATCATGGATATCAGACACAGCCTGGCCAGACTCAAGTTCAAACAAATTACAGATATGTAGTACAGCCCGGTAGCCATGAATCCCAAAATCAAAATACCTTGAATGCAAATTCTCGACGAACATACCAGCACACTGATAACTTTAGAAATCAACATGTGTCTCAGTTTGATAGTAGTTCTGGTAGAGACTGGGAACCAGATAATTCAAGAAATCCAAATGTTGGCTACACCACGGGTACTCATAGCACTAGTCATGAATTCGATACACAAAGGACACAAGAAGTATCTGTGGAAAATCCTCAAAGGCGAAGACCTTTCCACTCTGGATCACAGACTATGCAAGCTTATTATGATAGTCTTTCACAGTCACAACAAGCAGACTATAGACGTCGATATGGAACCCAAGACCTGAGGCAACGTGGTACATCTACCAGTGATAGCAATGGATTTAATACTCAGGAAACGCAGGAAATATTTGTGGAACACCCTGAAAGAGCCGGACGTCCTCACTCTGGATCACAAACTCTGCAAGCCTATTATGACAGTCTTTCACGGTCACAGCAAGCAGAGTATAGACGTATTTATGGAATGCAGGGTTTGACCCAAAGTGAAGTTTCTACTGGTGATGCTGACCGTCAACAAGGATATCATCACACTGGTTCATATGGCAGTGGTAGACCATCAGGTCAAATGCAACATTACGAACAATTTCAAACTTCGTCCTCTTCTTCCTTGTCCTCATTACATCACCCTGATGTGATTGTAGAAAATCAACAATTCAGTAACAATCAAGAAGCACAACAAACGCAGAATGGGTATGATGGGTCTTATAGAGTTCAGAATGGGAAATTAGTCACACATGGAATTGACTTGGGTCAGGCAGTTCAAGCTGTTGATTGTGCAGAAGGTACAAATGGACATTCCTCATATGAACAGTCCCACTATCATAGAATCTACAGACAGGTTGCTAAGCCTGGAGATCAGAGTCAACAAGTGGAAGATCTGTCTTACCAAACAAAGGACCTCACACAACAAACGGAGGATTTGACCCAGCGAACAGAAGATCTTACACAGCAGAGTCAACACATTGGACAGGAATCTTGGAAGCCCGGTCAATTAGAAATTGAAAGTCAACAAGTTCAAGATCTAACTCAACAGACTGAGGATCTTACCCAGCAGACACGGGATCTCACACAACAAACGGAAGATCTTACACAGCAAACAGAAGATCTTACGCAGCAGAGTCAACACATTGGACAAGAATCTTGGAAGCCCGGTCAATTAGAAATTGAAAGTCAACAAGTTCAAGATCTCACCCAACAGACCGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAACCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAACCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACGGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAACCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTCCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCACGATCTCAGTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATACTGTACAGGAATCTTGGAGGCCTGGTCAATTAGAAATTGAAAGTCAACAAGATCAAGATCTCACCCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTTACACAACAAACGGAAGATCTGACCCAGCAAACAGAAGATCTTACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaggTCAACAAGATCAAGATCCTACTCAACAAACGGAAGATTTGACTCAACAATCCGAAGATCTTGGTCAACAAACCGAGGATCTTACCCAACAAACCGAAGATCTTGGTCAACAAACCGAGGATCTTGCCCGACAAAATCAAGATTTCAGACAGCAGTCTTGGACATCTGGTCAATTAGAAATTGAAGGCCAGCAAGTACAAGACCTCACACAACAAACGGAGGATTTGGGGCAACAAACCGCCGATCGTGGCCAACAGACTGCGGATCTTACCCAACAGAATCAGGATTTTGGCCAGCAGCAATCTTGGATCCCTGgtaaattagaaattcaagGCCAGCAGATCCAAGACCTCACACAACAAACAGAGAATCTTGGCCAGCAAACGGAAGATCTGACCCAGCAAACGGAGGGACTTACCCAACAATCTGAAGATTTTTCACAACAAAGTGGCAGTTCCGATGATTACACAGAGCAAATAACAGGAAACTCTGGATTCGGACAGGAGTCTTCTTGGAATTTTCAGAACATAGAAACTACGAGTCAGCAAACAGCCAATTTTGATCAACAAAATCACTTCAGCGGACAACAAACCTCCGTCAATCACATGCAAGAGACGAGACCAGCGCCAAAGCCAGCATATAAACTAAAACATCAGAGAGCCAAGGATTTTCATCCCTCTCAACAAATCAACGTTGAGCTTGAAGACACAACTGTATCAAATGCTGATAGTGATGCAATAGGACACGATGATCTACAAAATAATCCAAAATGGGAATCAAGAAAACCAATCGTTACTCCATCACCAAACAGAGGTGATCAAGGTATTATTGTGAATTCTAATGAACCTGAAGAAACTGATATTCAAGTTGGATCAGTGACACCTAAAGTACCTCATCAAGAAATTGGATATGTTTCGAGGGATCCATATCAATCCAACCAGCAAACTGCAGGTGAGGATCAATTCCGCCCAATTCAACCAACTAAAACTAAGACAAGTCATCGAAAAGGGTATCGCCCTCGTGCTCATTTGAACCAACAACAGCTATCACAAGGATCGCATATTCCTCGTAGCATGCCAACAAAATACGTCGATGGAAGTCAAGAAATTGAATCTGTTCAAAGGAGTCAGCAAATTCAAACGCTCGATTCTGGAATAGAATCAAGACAAGAACAAGGTCCTAAATCAGATCCTGTACCTTTTCCCGATTCAATTGGGCCTAGAATTTTAGAGGCATATGGAGCGAATGGACCATATGGAGAACACGATTCAAGCATATTCGATTCTGCAAAACCAAATTCTGGTGCAGTTTCAATTCCTCCACATGGAGATGATGCTTGGGATATTAGAGTCGCCAACAAAGTTACAACGACTGAGACTCCAGCTCCTccatcaacaacaacaacaacgacaacgcCTGCCCCTAGCACTACAACTTCAGCTCCTGGATTTTGGCATAGAATTGGTAATTCGATAAGCAATACCTACGATAAAGCCAAGGAGAAAGCCAAAAAATTATTTGGCTAA
- Protein Sequence
- MAPTAVAIFVISIFTQALAAPNRCVTCQVGSTEWVNTDNLSQRSANLEDLTRQVESELGRSPNQLAFDDRRPGNWTDVNRYRTADGHGKVYEEHGQRVDGSKRIRFFRKNFTSSYSSGNLGGLGESDFGGFDSSSRQGASRLTSHGSFNQNQNSAYDQSAIRRNSDGYNEASKIHENRESNGRFTENTLAHNGQLSQRGSIAWDRTRPGNWSTHNSYSTDEGNGRVYEERGQYISGPGQVRFYKKNYTTSYTSHGAIPNINLETDGATSFEGEVQGLQRRFDSQGREIHQTSQGLTSGGYSQHNHGYQTQPGQTQVQTNYRYVVQPGSHESQNQNTLNANSRRTYQHTDNFRNQHVSQFDSSSGRDWEPDNSRNPNVGYTTGTHSTSHEFDTQRTQEVSVENPQRRRPFHSGSQTMQAYYDSLSQSQQADYRRRYGTQDLRQRGTSTSDSNGFNTQETQEIFVEHPERAGRPHSGSQTLQAYYDSLSRSQQAEYRRIYGMQGLTQSEVSTGDADRQQGYHHTGSYGSGRPSGQMQHYEQFQTSSSSSLSSLHHPDVIVENQQFSNNQEAQQTQNGYDGSYRVQNGKLVTHGIDLGQAVQAVDCAEGTNGHSSYEQSHYHRIYRQVAKPGDQSQQVEDLSYQTKDLTQQTEDLTQRTEDLTQQSQHIGQESWKPGQLEIESQQVQDLTQQTEDLTQQTRDLTQQTEDLTQQTEDLTQQSQHIGQESWKPGQLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQEPWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQEPWRPGKLEIESQQVQDLTQQTEDLTQQTRDLTQQTEDLTQQTEDLTQQSQHIGQEPWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVHDLSQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHTVQESWRPGQLEIESQQDQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIEGQQDQDPTQQTEDLTQQSEDLGQQTEDLTQQTEDLGQQTEDLARQNQDFRQQSWTSGQLEIEGQQVQDLTQQTEDLGQQTADRGQQTADLTQQNQDFGQQQSWIPGKLEIQGQQIQDLTQQTENLGQQTEDLTQQTEGLTQQSEDFSQQSGSSDDYTEQITGNSGFGQESSWNFQNIETTSQQTANFDQQNHFSGQQTSVNHMQETRPAPKPAYKLKHQRAKDFHPSQQINVELEDTTVSNADSDAIGHDDLQNNPKWESRKPIVTPSPNRGDQGIIVNSNEPEETDIQVGSVTPKVPHQEIGYVSRDPYQSNQQTAGEDQFRPIQPTKTKTSHRKGYRPRAHLNQQQLSQGSHIPRSMPTKYVDGSQEIESVQRSQQIQTLDSGIESRQEQGPKSDPVPFPDSIGPRILEAYGANGPYGEHDSSIFDSAKPNSGAVSIPPHGDDAWDIRVANKVTTTETPAPPSTTTTTTTPAPSTTTSAPGFWHRIGNSISNTYDKAKEKAKKLFG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -