Basic Information

Gene Symbol
-
Assembly
GCA_963924015.1
Location
OZ001259.1:23242820-23250199[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 48 1.5 1.6e+03 -0.1 0.3 47 65 66 84 57 84 0.84
2 48 0.013 14 6.4 2.4 36 63 99 126 97 131 0.72
3 48 0.0032 3.3 8.4 0.7 28 58 216 246 209 252 0.74
4 48 0.59 6.3e+02 1.1 0.1 40 62 272 294 259 296 0.81
5 48 8.6 9e+03 -2.6 0.1 26 54 300 328 297 346 0.65
6 48 9.6 1e+04 -2.7 0.2 36 60 335 359 324 361 0.70
7 48 9.2 9.7e+03 -2.7 0.0 26 50 373 397 368 403 0.53
8 48 0.021 22 5.8 8.2 24 63 438 477 436 479 0.86
9 48 1.8 1.9e+03 -0.4 0.2 22 40 485 503 479 507 0.57
10 48 0.0003 0.32 11.7 7.1 28 63 508 543 503 545 0.92
11 48 0.00095 1 10.1 4.0 22 60 544 582 543 587 0.88
12 48 0.11 1.2e+02 3.5 4.7 24 52 581 609 578 617 0.67
13 48 0.0002 0.21 12.3 2.6 24 63 616 655 615 657 0.91
14 48 0.00024 0.25 12.0 7.1 25 63 631 669 629 678 0.64
15 48 0.2 2.1e+02 2.7 6.0 24 62 658 696 655 699 0.72
16 48 0.47 4.9e+02 1.5 5.3 26 63 674 711 669 717 0.72
17 48 0.00033 0.35 11.6 6.5 21 65 704 748 704 748 0.95
18 48 6.2e-05 0.065 13.9 7.6 20 63 752 795 749 797 0.93
19 48 0.0008 0.84 10.3 7.8 22 64 803 845 802 846 0.94
20 48 0.00024 0.25 12.0 5.7 24 64 847 887 844 889 0.91
21 48 0.00019 0.2 12.3 11.1 22 63 887 928 886 929 0.92
22 48 0.00012 0.13 12.9 1.5 30 62 923 955 921 957 0.92
23 48 0.0026 2.8 8.7 0.6 24 64 980 1020 963 1021 0.88
24 48 0.0016 1.7 9.3 4.5 26 62 1017 1053 1007 1056 0.60
25 48 3.3e-05 0.035 14.8 6.2 21 61 1040 1080 1040 1083 0.91
26 48 0.0004 0.42 11.3 8.1 22 53 1069 1100 1069 1107 0.57
27 48 3.7e-06 0.0039 17.8 3.7 25 63 1107 1145 1104 1146 0.94
28 48 0.065 68 4.2 1.2 38 64 1155 1181 1148 1184 0.63
29 48 5.8e-05 0.061 14.0 8.8 21 65 1173 1217 1164 1217 0.93
30 48 3.1e-05 0.033 14.8 3.8 22 64 1202 1244 1201 1245 0.92
31 48 0.0013 1.3 9.7 6.8 25 64 1226 1265 1224 1273 0.77
32 48 0.0019 2 9.1 4.8 26 64 1248 1286 1243 1287 0.89
33 48 0.0048 5.1 7.8 5.9 24 62 1274 1312 1271 1322 0.83
34 48 0.0061 6.4 7.5 2.7 29 62 1314 1347 1309 1350 0.78
35 48 0.00084 0.89 10.3 7.1 24 64 1323 1363 1320 1370 0.83
36 48 0.0043 4.6 8.0 5.0 26 64 1346 1384 1341 1389 0.89
37 48 0.13 1.4e+02 3.2 1.8 32 65 1373 1406 1370 1406 0.83
38 48 0.0016 1.7 9.3 7.4 22 62 1405 1445 1397 1448 0.82
39 48 0.0017 1.8 9.3 4.2 27 64 1445 1482 1442 1483 0.91
40 48 0.0031 3.3 8.4 5.7 26 63 1493 1530 1481 1532 0.76
41 48 0.00062 0.65 10.7 7.4 22 64 1510 1552 1504 1553 0.72
42 48 6e-05 0.064 13.9 4.1 22 62 1524 1564 1522 1567 0.87
43 48 0.0079 8.3 7.1 1.1 24 60 1568 1604 1565 1609 0.85
44 48 6e-05 0.063 13.9 3.4 22 64 1601 1643 1601 1644 0.91
45 48 0.064 67 4.2 4.0 24 65 1638 1679 1637 1679 0.93
46 48 0.043 46 4.8 6.9 25 64 1667 1706 1664 1717 0.70
47 48 0.036 38 5.0 4.0 20 64 1683 1727 1680 1728 0.85
48 48 1.1 1.1e+03 0.3 1.1 32 51 1723 1742 1706 1755 0.56

Sequence Information

Coding Sequence
ATGGCCAATCGTACGTGTCAATGCGGATGCACAGACCCGCCGGAAATGTCGGCCGCAGATCCTCCCCATGAGGGTTCGTGCGGTTGTAGTTACAACCCTTTTGCGGATCAAGGAAGAGAGGGTGAAATCACGGATTTATCATTTGCACTGCGTAAATTAACTCTGATGAAATGCCAGATGAAAAAATGGAGAATAGAACGGCTTCAGTTGGAGAGCGAGACCAGAGGATTGAAACAAGTGCTTCAGGCCCACGGTTTGAACGATGATATTGTCAGACCAGATCCTCTGCTTGCTCATCTTCGTGAGCATAATGAGAGATTGGAGAACGAGAAAGCGGAGCTTGAGGAGAGCGTGAAAAGCCTTTCCGAAACCGTATCGGATTATGAGAATCAGGAATCCTCGTCATCTGATGCTGTGAATAAATTACGTGAAAAAATTCGGACAATAAAGGAAACAAACGCCATTGAAAAAAGGAGATTAAGAGACCTCATATCTGGGCTGAAGATAAGGCTTCAAGAAGCCGAGAACGAGTCGTCATGCGCTGCCTTAAACCGACTGCGAGCAAAACTTAGAGAAATGATGAAAGGCGGTCAAGAAGCTGACCAAAGAGTTTCTATGGTGGTTCAGCGGTCCATAGAAACTTTAACTGAATTAACAGGGAACGTCGACGATCTTAAGGCCGAGATCGAGAGACTTAGAGCCGAGATAAAGAGGCTGAAGGATTTGCTGAAGACATGTGAAGACCGAAAGGACGTCGGAGTTGAGACGATTGTCGTGGACGTTAAACCGGTAGAAAAACCGCTTGACCAAATGGACGACTCGGAATTGTTGAACAGGATCAAGGAACTCGAAGCGTTGATAGCTCAGCTGAgaaaacaattaattgataaagatGCTACGATAAATGGCCTTCATAATCAATTGTTCGAGGCTGGTACAGACAACAAACGTTTGTCCACGGATTTGGACCAGATGAAGGTCAGCTACAACGCCGTTATGCAGGAGGTCAAGGCTATGAAGGATGAGCTGAAGAAGAGGGACGCCAAGGTATCTGAGCTTCTCAAAGACTTGAAAGCATCCGCGATCGATATGCTGGGATTAAACAGACTGCAAAGTGAAATGGACGCAATCAAGCCCCAGATGTACAATCTCGAGGTAGAACGCGGCCAGCTATTATCTGAGCTCGGTAGAGTGCGGGGCGTTGTATCGGAGCGGAATGATCAGATCATTAAAATACTCGAGGATAGGGACAAGCACGTTAAAGCACTGGCCAAGGCATCAAGCGTAATGCAGGCGACGATTGAACCgatgaaagagaaagaaacggTTTTAAAAAATGAGGTTCAAGGATTGAAAGACCGGATAGCAGAGCTTGAACTAGAGTTATCCGAGCTCAGGAAAAAGCTAGCTCAATTGGAAAAGGAAAATTCTAAGATACCTGGACTAGTGGATAAGATTAAGGAGCTCGAAGACGAACTAGAAAAGGTCAGATCTCAATTGGCCGAGGCAAACTCCAGAGTGATTGAGCTTGAAAAAGAAGTAGCTCGGTTCAAATCTGAGAAAAAAGAACTAGAAAATGAGCTTGGCGAGGCGAGGAAGGAGATTGAAAAGCTGAAGGAGGAGCTTGCTAAGGAGAAAGCTGCGAAAGATACTGCTCTAAAAGATCTTGAGGATTGTAGAGCTGAGAACGAGAAGCTCAGAGCAGAAAATGATCGTTTAATTAATGAGCTAAACGCGGCTAAGACTGAGAGTGATAAGCTCAGAAATGATTTGGAGAAACTGAAAgaagaaatggaaaatttaaaaGCTGAGAATGATCGGTTGACAGATCTATTGAACGTGGCCAAAAATGAGATCGACAAGCTCAGAGGTGAGCTCGAGAATCTCAAGGCAGAAAATGACAAGCTCAAAAATGATGTAGAGAAACTTAAAGGAGATATAGATAACCTTAAGACAGAAAATAATAGCCTCAAAGAAGAGCTCGACAAACTTAGAAACCAGCTCGCTGGACTTGAGGATGAAATCGAGAAACTAAAGAACGCTTTAGCGGTAACTAAAGCGGAAGCTGAGAAGCTCAAACAAGATTTAGATAACACGAAAAATGAGCTTGATAAATTAAGAGCTGAGAATGATCAGGTCAAAAGTCAGTTAGCCGATGCCAAAGCTGATAACGAGAGGCTTAAAATTGAACTCGAAAATCTTAAGggagaaatagagaaattaaaacaagaaaacagcGAACTTAAGACAGATCGAGACAAGCTTCAAGCAGAGGTCgaaaaacttggaaaaaaGCTCGATGAGCTTGAATCTGAGaatgaaaaactgaagaaCGCTTTAACAGCGGCTAAAGCTGAGGTTGAAAAGCTCAAAAATGATTTAGACAAAATGAAGTCCGATATCGAGAAGCTCAAGGCAGAAAATGATCAGCTGAATAATGAGCTAGCCAATGCTAAGGCAGAGAACTCGAGACTCGGAAAGGAGCTTGATGAGTTGAAGGGAGAAATGGCGAAGCTgaaagaagagaataaaaaccTTAAGACAGAAAAAGATGGGCTAGAAGCAGAGATCAAAAAACTTAGAGAAGAATTAGACAGTCTTAAGGGTGAGAATGAGAGACTTAAAGGTGAGAATGAGGGACTTAAAGGTGAGAATGAGAGACTTAAAGGTGAGAATGAAAAACTTAAGACTGAGCTTGACAAACTTGAAAGCGAGAACGAGGCCCTGAAATCAGAAAACGAGAAGTTGAAAAAGAGCTTGAGTGATATGGAAGGAAAGCTAAAAGAGCTCGAAGCCTCCAACGCTGATCTCGCAAATAAAATAGCAGAGCTGAAGAATCAAATTGATAAACTTCAGACCGAGTTGGCGTCCGAAAAAGCTGCCAAAGAAGCAGCTGTTAAAGAATTGGCGGCTATTAAGGCCGAGCTAAAAGCTCTGCTGGCGGAAATGGACAAACTTAAGACGGACTGCGACAGCCTAAAAGGACAAGTCGACGATCTCACTAAACAACTATCGGATTTGAAGAACGATTTTGATCAGCTCAAATACAAGTATGCTGAATTGCAAAAAGAGAGGGATAACCTCACGGCTGAACTTGATAAGTTTAGGGAAGAAAACgacaagatgaaaaatgaccTAGACAAGCTCAAGGCAGAGCTTGACACGTTAAAAACAGAGAATGGTAAGCTCAAGGAGGAAAATGGTAAGCTCAAGGAGGAGAATAGTAAGCTCAAGGAGGAAATTGGTAAGCTGAAAGAAGAGAATGGTAAGCTTAAAGAAGAGAACGAGAAGCTAAGGAAAGACTTAGGCGATGCTGAAGCCCGGATAAAAGCCCTTGAGGATGAAGTAAAGGCATGCGAGGAGGAAAAAACAAGACTGCTAAAGGAGATTGAGGGCCTCAAAAATGGGATTGAAGAACTCAAAAAGGAGCTTGCTGCAGAAAAAGCTGCGAAAGAGGCAGCCATGAAGGAATTATCAGCACTTAAAACTGAACTAGCTGCACTGAGAGCAGAGCTGGATAAAGTGAGGGCAGAGAACGCCAAACTAAAGAGTGAACTCGATAAACTGAAAGCAGAAAACGACGAGCTCAAAAATGAGAATAGTAAAATGAAGGGAGAAATTGATAAGCTCAAAGCAGAGGTTAcgaaattagaaaatgatCTAAGCACCTTGAGGGCAGAGAACGCGAAACTCAAAGAAGATCTGGACAAGCTAAATGCAGAAAATAATGCTCTGAGATCTGATAATGACAAGTTGAAAGGAGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGAACGATCTAGCTGCGGCTAAAGAAGAAGCTGCAAAGCTCAAAAATGACTTGGATAAACTAAGATCTGAACGGGATGCGTTGCAAgcagagaataataaaatgaagggTGAGCTTGAAAAGCTGAAAtcagatattataaaattacaagATGACTTGAACACTTTGAAGGCAGAGAATACGAAGCTCAAAGAAGACCTTGACAAACTAAACGCAGAGAACAAAGCTCTGAGATCTGATAATGACAAGTTGAAAGGGGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGCACGATCTAGCTGCGGCTAAAGAAGAAGCCGCAAAACTCAAAAATGATTTAGACAAACTAAAATCCGACTTGGATACGATGCAAgcagagaataataaaatgaaaggtGAGCTTGAAAAGCTGAAATCAAATATTgcaaaattagaaaatgacTTGAACACTCTGAAGGCAGAGAATGCAAAGCTTAAAGAAGACCTTGACAAACTAAACGCAGAGAACAAAGCTCTGAGATCTGATAATGACAAGTTGAAAGGAGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGAACGATCTAGCTGCGGCTAAAGAAGAAGCAGCAAAGCTCAAAAATGACTTGGATAAACTAAAATCCGACTTGGATGCGCTGCAAGCAGAGAACAATAAAATGAAGGATGACTTGAACACTCTGAAGGCAGAGAATGCGAAGCTTAAAGAAGACGTTGACAAACTAAATGCAGAGAATGCAAAGCTTAAAGAAGACGTTGACAAACTAAATGCAGAAAATAAGTCATTGAAAGCTGATAATGACAAGCTGAAGGGAGAATTGGATCAGCTCAAATCTGAGAAcggaaaattgaagaaagatCTAGCTGCGGCTAAAGAAGAATCTGCAAAGCTCAAAAATGACTTAGACAAACTAAAATCCGACTTGGATGCCCTGCAAgcagagaataataaaatcaaggGTGAGCTTGGAAGGTTAAAGTCAGATATTGCAGGATTACAAAATGACCTAAACACCTTGAAAGCAGAGAATGCGAAGCTCAAAGAAGACGTTGACAAACTAAATGCAGAAAATAAGTCATTAAAAGCTGCTAATGACAAGTTGAAAGGAGAACTGGATCAGCAGAAATCTGAGAACGCGAAATTGAAGACCGATTTAGCTGCGGCTAAAGACGAAGCTGCAAAGCTCAAAAATGACTTGAATGAACTAAAATCTGAACGGGATGCGCTGCAAGCAGAAAATAACAAGATTAAGGGAGAGCTTGAAAGGGTGAAATCCGAGAATGCAAAACTACAGAAAGATCTCGATGTCTTGAAGGGAGAGAACGGCAAGCGAAAGTCTGAGGTTGACAAACTGAAAATCGATaatgaaaagttgaataaGGATCTTCAGAAGACAGACTCGGGGTTAGACAAGTATAAAGCTGAGGCCGATAATCAGAAGAAATTAACCAAGAAACTAAAAGatACACCGAGGAAACTCGATACAATGAGGCCAAGTTACACTGTCGAGAAAGAACTGCAGATTAAAGTACCCACCAGCGTTGTTTCATCgggtaaaattattaaaagtgATCAACGACCCTCGGTTATAAAAAAGGACCAAGGTTCACAGGGCGCGGGTTGTGGCGATTACGAAAATGCAAACGAACAGCTGATGAGAAACATGAATATGCAAGAGAGGGCCGTACAGCGTATAcgagaatttataaaattcgtaCTCGGTGAGAGAAGTTCACCACCTGAAATGGCGCAGGAATTGATCCATCGCATGTCTTCAGTAACGCGGAATAATTTCCCCGAAGATATAATGCGAATTTTGAAAGAGTCACAGTTCTTATCGGAATCGATATTCAACGCCGAAACTGAAGTTCAACGTCTGGTCAAACCCGTCGATGAATTAAACAATCTGAATGATGAAGATATCCAGGACATCGATGGACTCGGTGATGCCTTCGATGCCGAGTCGTGgctaaaaACATTGACGTTAACCGAATTGGCCGAACTCCACGACAGGATATGCCTCGTAACTTCGTGCATGGTTCAGCAAGACATAAACCCCGAGGATTACGTAGACGGTATTGAAACCGATGGAATTTGCCATCCTTGTAGTACCGTAACGGAGGAATTTACCGATGATTCAATACCCGATTACGAAGCTTTGAACAGAAGAATCGGAGCTCTTCAAATGCAGATAAACAAGAAACAGGACGAGGCTgctaaaaaagttaaaaaaatgcgCCAATCCATGTGGCGTGAACAAGAAAACCTAATCAAACTATCCGAAGAAATGAACGCCCAAAAACGAAGGAATTTATCGCTGAAGATCAAGATCAATGAAAACATCGAAGTGGATGgggaagaagataaaaataaaaatgaaaaggcgGTTTTATGCGGTGGAAAATTTTCCCCAAAGATAGGGCAGAGGAAGGACGATATGTTTGGTGAGAAAAATGGCAATTCCGGACGTTTTATTGGTGCCAATTCGAACGTCGGGGTGAATTGGGTCCCTTGCTTTAACCCAAAGGAAAATGAGCCCGATTCAGGATCGTCTTTAACCTCGGTTAGGGTTGGGAGACAAAAACCACCTCCGTGTGCGGCTCCGGTTAGACACGCGGATGTACCTTGCTGTGTTAAGCCCTGCTGTCCCTCGGTTGTAAATCGGGATACGCTATTTAGTAAAAAGAGGTAA
Protein Sequence
MANRTCQCGCTDPPEMSAADPPHEGSCGCSYNPFADQGREGEITDLSFALRKLTLMKCQMKKWRIERLQLESETRGLKQVLQAHGLNDDIVRPDPLLAHLREHNERLENEKAELEESVKSLSETVSDYENQESSSSDAVNKLREKIRTIKETNAIEKRRLRDLISGLKIRLQEAENESSCAALNRLRAKLREMMKGGQEADQRVSMVVQRSIETLTELTGNVDDLKAEIERLRAEIKRLKDLLKTCEDRKDVGVETIVVDVKPVEKPLDQMDDSELLNRIKELEALIAQLRKQLIDKDATINGLHNQLFEAGTDNKRLSTDLDQMKVSYNAVMQEVKAMKDELKKRDAKVSELLKDLKASAIDMLGLNRLQSEMDAIKPQMYNLEVERGQLLSELGRVRGVVSERNDQIIKILEDRDKHVKALAKASSVMQATIEPMKEKETVLKNEVQGLKDRIAELELELSELRKKLAQLEKENSKIPGLVDKIKELEDELEKVRSQLAEANSRVIELEKEVARFKSEKKELENELGEARKEIEKLKEELAKEKAAKDTALKDLEDCRAENEKLRAENDRLINELNAAKTESDKLRNDLEKLKEEMENLKAENDRLTDLLNVAKNEIDKLRGELENLKAENDKLKNDVEKLKGDIDNLKTENNSLKEELDKLRNQLAGLEDEIEKLKNALAVTKAEAEKLKQDLDNTKNELDKLRAENDQVKSQLADAKADNERLKIELENLKGEIEKLKQENSELKTDRDKLQAEVEKLGKKLDELESENEKLKNALTAAKAEVEKLKNDLDKMKSDIEKLKAENDQLNNELANAKAENSRLGKELDELKGEMAKLKEENKNLKTEKDGLEAEIKKLREELDSLKGENERLKGENEGLKGENERLKGENEKLKTELDKLESENEALKSENEKLKKSLSDMEGKLKELEASNADLANKIAELKNQIDKLQTELASEKAAKEAAVKELAAIKAELKALLAEMDKLKTDCDSLKGQVDDLTKQLSDLKNDFDQLKYKYAELQKERDNLTAELDKFREENDKMKNDLDKLKAELDTLKTENGKLKEENGKLKEENSKLKEEIGKLKEENGKLKEENEKLRKDLGDAEARIKALEDEVKACEEEKTRLLKEIEGLKNGIEELKKELAAEKAAKEAAMKELSALKTELAALRAELDKVRAENAKLKSELDKLKAENDELKNENSKMKGEIDKLKAEVTKLENDLSTLRAENAKLKEDLDKLNAENNALRSDNDKLKGELDQLKSENAKLKNDLAAAKEEAAKLKNDLDKLRSERDALQAENNKMKGELEKLKSDIIKLQDDLNTLKAENTKLKEDLDKLNAENKALRSDNDKLKGELDQLKSENAKLKHDLAAAKEEAAKLKNDLDKLKSDLDTMQAENNKMKGELEKLKSNIAKLENDLNTLKAENAKLKEDLDKLNAENKALRSDNDKLKGELDQLKSENAKLKNDLAAAKEEAAKLKNDLDKLKSDLDALQAENNKMKDDLNTLKAENAKLKEDVDKLNAENAKLKEDVDKLNAENKSLKADNDKLKGELDQLKSENGKLKKDLAAAKEESAKLKNDLDKLKSDLDALQAENNKIKGELGRLKSDIAGLQNDLNTLKAENAKLKEDVDKLNAENKSLKAANDKLKGELDQQKSENAKLKTDLAAAKDEAAKLKNDLNELKSERDALQAENNKIKGELERVKSENAKLQKDLDVLKGENGKRKSEVDKLKIDNEKLNKDLQKTDSGLDKYKAEADNQKKLTKKLKDTPRKLDTMRPSYTVEKELQIKVPTSVVSSGKIIKSDQRPSVIKKDQGSQGAGCGDYENANEQLMRNMNMQERAVQRIREFIKFVLGERSSPPEMAQELIHRMSSVTRNNFPEDIMRILKESQFLSESIFNAETEVQRLVKPVDELNNLNDEDIQDIDGLGDAFDAESWLKTLTLTELAELHDRICLVTSCMVQQDINPEDYVDGIETDGICHPCSTVTEEFTDDSIPDYEALNRRIGALQMQINKKQDEAAKKVKKMRQSMWREQENLIKLSEEMNAQKRRNLSLKIKINENIEVDGEEDKNKNEKAVLCGGKFSPKIGQRKDDMFGEKNGNSGRFIGANSNVGVNWVPCFNPKENEPDSGSSLTSVRVGRQKPPPCAAPVRHADVPCCVKPCCPSVVNRDTLFSKKR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00938403;
90% Identity
iTF_00939281;
80% Identity
iTF_00939281;