Basic Information

Gene Symbol
-
Assembly
GCA_963170745.1
Location
OY720660.1:6766228-6771606[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 16 5 4.7e+03 -1.9 0.0 38 62 271 295 267 297 0.85
2 16 0.039 36 4.9 2.6 25 59 633 667 631 670 0.69
3 16 0.0041 3.8 8.0 6.5 25 61 681 717 680 721 0.87
4 16 0.0023 2.1 8.8 7.7 25 61 736 772 734 776 0.87
5 16 0.0023 2.1 8.8 7.7 25 61 791 827 789 831 0.87
6 16 0.0023 2.1 8.8 7.7 25 61 846 882 844 886 0.87
7 16 0.0023 2.1 8.8 7.7 25 61 901 937 899 941 0.87
8 16 0.0023 2.1 8.8 7.7 25 61 956 992 954 996 0.87
9 16 0.0041 3.8 8.0 6.5 25 61 1011 1047 1010 1051 0.87
10 16 0.0023 2.1 8.8 7.7 25 61 1066 1102 1064 1106 0.87
11 16 0.0023 2.1 8.8 7.7 25 61 1121 1157 1119 1161 0.87
12 16 0.0028 2.6 8.5 6.6 26 61 1177 1212 1175 1218 0.81
13 16 0.0028 2.6 8.5 7.2 25 62 1231 1268 1230 1271 0.90
14 16 0.015 14 6.2 5.2 26 61 1294 1329 1289 1336 0.52
15 16 0.76 7.1e+02 0.7 1.0 32 63 1348 1379 1343 1383 0.72
16 16 0.013 12 6.4 2.1 34 61 1399 1426 1395 1435 0.48

Sequence Information

Coding Sequence
ATGGCTCCCACCGCAGTGGCAATTTTCGTTATCTCTATATTCACACAAGCGTTAGCAGCTCCGAATCGCTGTGTTACTTGCCAAGTTGGATCAACCGAATGGGTAAACACAGACAACCTATCGCAGAGGTCAGCGAACTTAGAAGACTTAACACGACAAGTTGAAAGTGAGTTAGGAAGATCTCCAAATCAATTAGCTTTCGATGACAGAAGACCTGGAAATTGGACAGATGTAAATCGGTACAGAACAGCTGATGGTCATGGAAAGGTATATGAAGAACACGGCCAACGTGTAGATGGTTCAAAACGGATTAGATTCTTCAGAAAAAACTTCACATCCAGCTACAGTAGTGGAAACTTAGGTGGATTGGGAGAATCTGATTTCGGAGGCTTTGATTCTTCTAGTAGACAAGGAGCCAGCCGCCTGACAAGCCACGGATCTTttaatcagaatcaaaattcagCCTATGACCAATCTGCTATTCGTCGAAATTCTGATGGATACAATGAAGCTTccaaaattcatgaaaatcgtGAAAGTAACGGTCGGTTTACAGAAAATACTCTTGCACATAATGGGCAATTGTCACAGCGAGGATCAATTGCTTGGGATCGAACAAGACCAGGAAATTGGAGTACTCATAATTCTTATAGCACTGATGAAGGTAATGGCAGAGTTTATGAAGAACGAGGACAGTATATATCAGGGCCTGGTCAAGTTCGgttctataaaaaaaattatactacGAGTTACACTTCACATGGAGCTATTCCAAATATAAATCTAGAAACAGATGGGGCAACGAGTTTCGAGGGAGAAGTACAGGGCTTACAGAGACGTTTTGACAGCCAGGGGAGAGAGATTCATCAAACTTCCCAAGGTTTGACAAGTGGTGGATACAGTCAGCACAATCATGGATATCAGACACAGCCTGGCCAGACTCAAGTTCAAACAAATTACAGATATGTAGTACAGCCCGGTAGCCATGAATCCCAAAATCAAAATACCTTGAATGCAAATTCTCGACGAACATACCAGCACACTGATAACTTTAGAAATCAACATGTGTCTCAGTTTGATAGTAGTTCTGGTAGAGACTGGGAACCAGATAATTCAAGAAATCCAAATGTTGGCTACACCACGGGTACTCATAGCACTAGTCATGAATTCGATACACAAAGGACACAAGAAGTATCTGTGGAAAATCCTCAAAGGCGAAGACCTTTCCACTCTGGATCACAGACTATGCAAGCTTATTATGATAGTCTTTCACAGTCACAACAAGCAGACTATAGACGTCGATATGGAACCCAAGACCTGAGGCAACGTGGTACATCTACCAGTGATAGCAATGGATTTAATACTCAGGAAACGCAGGAAATATTTGTGGAACACCCTGAAAGAGCCGGACGTCCTCACTCTGGATCACAAACTCTGCAAGCCTATTATGACAGTCTTTCACGGTCACAGCAAGCAGAGTATAGACGTATTTATGGAATGCAGGGTTTGACCCAAAGTGAAGTTTCTACTGGTGATGCTGACCGTCAACAAGGATATCATCACACTGGTTCATATGGCAGTGGTAGACCATCAGGTCAAATGCAACATTACGAACAATTTCAAACTTCGTCCTCTTCTTCCTTGTCCTCATTACATCACCCTGATGTGATTGTAGAAAATCAACAATTCAGTAACAATCAAGAAGCACAACAAACGCAGAATGGGTATGATGGGTCTTATAGAGTTCAGAATGGGAAATTAGTCACACATGGAATTGACTTGGGTCAGGCAGTTCAAGCTGTTGATTGTGCAGAAGGTACAAATGGACATTCCTCATATGAACAGTCCCACTATCATAGAATCTACAGACAGGTTGCTAAGCCTGGAGATCAGAGTCAACAAGTGGAAGATCTGTCTTACCAAACAAAGGACCTCACACAACAAACGGAGGATTTGACCCAGCGAACAGAAGATCTTACACAGCAGAGTCAACACATTGGACAGGAATCTTGGAAGCCCGGTCAATTAGAAATTGAAAGTCAACAAGTTCAAGATCTAACTCAACAGACTGAGGATCTTACCCAGCAGACACGGGATCTCACACAACAAACGGAAGATCTTACACAGCAAACAGAAGATCTTACGCAGCAGAGTCAACACATTGGACAAGAATCTTGGAAGCCCGGTCAATTAGAAATTGAAAGTCAACAAGTTCAAGATCTCACCCAACAGACCGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAACCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAACCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACGGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAACCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTCCAAGATCTCACTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAAAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaagtcAACAAGTTCACGATCTCAGTCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTCACACAACAAACGGAGGATCTGACCCAGCAAACAGAGGATCTCACACAGCAGAGTCAACATACTGTACAGGAATCTTGGAGGCCTGGTCAATTAGAAATTGAAAGTCAACAAGATCAAGATCTCACCCAACAGACTGAGGATCTTACCCAGCAGACACAGGATCTTACACAACAAACGGAAGATCTGACCCAGCAAACAGAAGATCTTACACAGCAGAGTCAACATATTGGACAGGAATCTTGGAGGCCCGgtaaattggaaattgaaggTCAACAAGATCAAGATCCTACTCAACAAACGGAAGATTTGACTCAACAATCCGAAGATCTTGGTCAACAAACCGAGGATCTTACCCAACAAACCGAAGATCTTGGTCAACAAACCGAGGATCTTGCCCGACAAAATCAAGATTTCAGACAGCAGTCTTGGACATCTGGTCAATTAGAAATTGAAGGCCAGCAAGTACAAGACCTCACACAACAAACGGAGGATTTGGGGCAACAAACCGCCGATCGTGGCCAACAGACTGCGGATCTTACCCAACAGAATCAGGATTTTGGCCAGCAGCAATCTTGGATCCCTGgtaaattagaaattcaagGCCAGCAGATCCAAGACCTCACACAACAAACAGAGAATCTTGGCCAGCAAACGGAAGATCTGACCCAGCAAACGGAGGGACTTACCCAACAATCTGAAGATTTTTCACAACAAAGTGGCAGTTCCGATGATTACACAGAGCAAATAACAGGAAACTCTGGATTCGGACAGGAGTCTTCTTGGAATTTTCAGAACATAGAAACTACGAGTCAGCAAACAGCCAATTTTGATCAACAAAATCACTTCAGCGGACAACAAACCTCCGTCAATCACATGCAAGAGACGAGACCAGCGCCAAAGCCAGCATATAAACTAAAACATCAGAGAGCCAAGGATTTTCATCCCTCTCAACAAATCAACGTTGAGCTTGAAGACACAACTGTATCAAATGCTGATAGTGATGCAATAGGACACGATGATCTACAAAATAATCCAAAATGGGAATCAAGAAAACCAATCGTTACTCCATCACCAAACAGAGGTGATCAAGGTATTATTGTGAATTCTAATGAACCTGAAGAAACTGATATTCAAGTTGGATCAGTGACACCTAAAGTACCTCATCAAGAAATTGGATATGTTTCGAGGGATCCATATCAATCCAACCAGCAAACTGCAGGTGAGGATCAATTCCGCCCAATTCAACCAACTAAAACTAAGACAAGTCATCGAAAAGGGTATCGCCCTCGTGCTCATTTGAACCAACAACAGCTATCACAAGGATCGCATATTCCTCGTAGCATGCCAACAAAATACGTCGATGGAAGTCAAGAAATTGAATCTGTTCAAAGGAGTCAGCAAATTCAAACGCTCGATTCTGGAATAGAATCAAGACAAGAACAAGGTCCTAAATCAGATCCTGTACCTTTTCCCGATTCAATTGGGCCTAGAATTTTAGAGGCATATGGAGCGAATGGACCATATGGAGAACACGATTCAAGCATATTCGATTCTGCAAAACCAAATTCTGGTGCAGTTTCAATTCCTCCACATGGAGATGATGCTTGGGATATTAGAGTCGCCAACAAAGTTACAACGACTGAGACTCCAGCTCCTccatcaacaacaacaacaacgacaacgcCTGCCCCTAGCACTACAACTTCAGCTCCTGGATTTTGGCATAGAATTGGTAATTCGATAAGCAATACCTACGATAAAGCCAAGGAGAAAGCCAAAAAATTATTTGGCTAA
Protein Sequence
MAPTAVAIFVISIFTQALAAPNRCVTCQVGSTEWVNTDNLSQRSANLEDLTRQVESELGRSPNQLAFDDRRPGNWTDVNRYRTADGHGKVYEEHGQRVDGSKRIRFFRKNFTSSYSSGNLGGLGESDFGGFDSSSRQGASRLTSHGSFNQNQNSAYDQSAIRRNSDGYNEASKIHENRESNGRFTENTLAHNGQLSQRGSIAWDRTRPGNWSTHNSYSTDEGNGRVYEERGQYISGPGQVRFYKKNYTTSYTSHGAIPNINLETDGATSFEGEVQGLQRRFDSQGREIHQTSQGLTSGGYSQHNHGYQTQPGQTQVQTNYRYVVQPGSHESQNQNTLNANSRRTYQHTDNFRNQHVSQFDSSSGRDWEPDNSRNPNVGYTTGTHSTSHEFDTQRTQEVSVENPQRRRPFHSGSQTMQAYYDSLSQSQQADYRRRYGTQDLRQRGTSTSDSNGFNTQETQEIFVEHPERAGRPHSGSQTLQAYYDSLSRSQQAEYRRIYGMQGLTQSEVSTGDADRQQGYHHTGSYGSGRPSGQMQHYEQFQTSSSSSLSSLHHPDVIVENQQFSNNQEAQQTQNGYDGSYRVQNGKLVTHGIDLGQAVQAVDCAEGTNGHSSYEQSHYHRIYRQVAKPGDQSQQVEDLSYQTKDLTQQTEDLTQRTEDLTQQSQHIGQESWKPGQLEIESQQVQDLTQQTEDLTQQTRDLTQQTEDLTQQTEDLTQQSQHIGQESWKPGQLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQEPWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQEPWRPGKLEIESQQVQDLTQQTEDLTQQTRDLTQQTEDLTQQTEDLTQQSQHIGQEPWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIESQQVHDLSQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHTVQESWRPGQLEIESQQDQDLTQQTEDLTQQTQDLTQQTEDLTQQTEDLTQQSQHIGQESWRPGKLEIEGQQDQDPTQQTEDLTQQSEDLGQQTEDLTQQTEDLGQQTEDLARQNQDFRQQSWTSGQLEIEGQQVQDLTQQTEDLGQQTADRGQQTADLTQQNQDFGQQQSWIPGKLEIQGQQIQDLTQQTENLGQQTEDLTQQTEGLTQQSEDFSQQSGSSDDYTEQITGNSGFGQESSWNFQNIETTSQQTANFDQQNHFSGQQTSVNHMQETRPAPKPAYKLKHQRAKDFHPSQQINVELEDTTVSNADSDAIGHDDLQNNPKWESRKPIVTPSPNRGDQGIIVNSNEPEETDIQVGSVTPKVPHQEIGYVSRDPYQSNQQTAGEDQFRPIQPTKTKTSHRKGYRPRAHLNQQQLSQGSHIPRSMPTKYVDGSQEIESVQRSQQIQTLDSGIESRQEQGPKSDPVPFPDSIGPRILEAYGANGPYGEHDSSIFDSAKPNSGAVSIPPHGDDAWDIRVANKVTTTETPAPPSTTTTTTTPAPSTTTSAPGFWHRIGNSISNTYDKAKEKAKKLFG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-