Basic Information

Gene Symbol
-
Assembly
GCA_949628255.1
Location
OX451207.1:22159994-22167186[+]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 41 3.8 4.7e+03 -1.5 0.4 47 65 66 84 56 84 0.67
2 41 0.0069 8.6 7.3 3.8 35 63 98 126 97 128 0.82
3 41 0.0066 8.2 7.4 1.6 28 57 216 245 209 250 0.72
4 41 0.59 7.4e+02 1.1 0.3 40 62 272 294 262 296 0.79
5 41 4 5e+03 -1.5 0.1 25 55 299 329 297 346 0.74
6 41 9.3 1.2e+04 -2.7 0.2 36 60 335 359 324 361 0.70
7 41 9.1 1.1e+04 -2.7 0.0 26 50 373 397 368 403 0.53
8 41 0.029 36 5.3 6.5 23 63 437 477 435 479 0.90
9 41 0.22 2.8e+02 2.5 9.3 28 64 463 502 459 503 0.91
10 41 6e-05 0.075 13.9 8.9 26 63 506 543 503 545 0.94
11 41 0.00048 0.6 11.0 5.0 22 61 544 583 543 587 0.89
12 41 0.013 17 6.4 4.9 22 63 586 627 579 629 0.70
13 41 0.00059 0.74 10.7 10.2 24 63 630 669 615 678 0.84
14 41 0.26 3.3e+02 2.3 7.1 26 61 674 709 671 713 0.89
15 41 0.00014 0.18 12.7 8.3 21 65 704 748 704 748 0.95
16 41 0.00045 0.56 11.1 9.1 24 65 721 762 719 762 0.93
17 41 0.0022 2.8 8.9 8.5 20 63 752 795 749 797 0.91
18 41 0.015 19 6.2 6.4 24 63 784 823 781 829 0.80
19 41 0.0012 1.5 9.8 8.7 16 62 820 864 817 867 0.70
20 41 0.00031 0.38 11.7 7.6 27 63 871 907 865 909 0.69
21 41 0.0026 3.3 8.7 4.8 23 62 902 941 899 943 0.86
22 41 0.00019 0.24 12.3 0.4 24 64 966 1006 954 1007 0.92
23 41 0.0012 1.5 9.7 3.6 28 63 998 1033 977 1035 0.60
24 41 0.00058 0.72 10.8 12.2 21 61 1026 1066 1019 1086 0.83
25 41 1.2e-06 0.0015 19.3 3.8 25 63 1086 1124 1082 1126 0.94
26 41 0.006 7.5 7.5 6.7 26 62 1143 1179 1127 1182 0.81
27 41 2.5e-05 0.031 15.2 4.9 21 64 1180 1223 1179 1228 0.87
28 41 0.0049 6.1 7.8 4.0 27 64 1228 1265 1223 1266 0.88
29 41 0.0025 3.1 8.7 3.8 22 63 1286 1327 1271 1329 0.87
30 41 0.00054 0.68 10.9 4.1 24 64 1323 1363 1320 1364 0.91
31 41 0.0026 3.2 8.7 3.8 22 63 1384 1425 1369 1427 0.87
32 41 0.0011 1.4 9.9 5.9 29 64 1405 1440 1399 1447 0.55
33 41 0.007 8.7 7.3 5.3 25 64 1422 1461 1420 1475 0.91
34 41 0.2 2.5e+02 2.6 1.0 33 64 1451 1482 1446 1483 0.79
35 41 0.001 1.3 10.0 2.4 22 64 1482 1524 1481 1525 0.90
36 41 0.00046 0.58 11.1 3.7 25 64 1520 1559 1518 1560 0.91
37 41 0.0021 2.7 9.0 7.2 29 64 1601 1636 1579 1644 0.60
38 41 0.0021 2.6 9.0 4.2 26 64 1647 1685 1645 1686 0.84
39 41 0.0019 2.3 9.2 5.4 22 63 1678 1719 1677 1721 0.87
40 41 0.77 9.6e+02 0.8 7.1 24 62 1694 1732 1694 1755 0.86
41 41 1.3 1.6e+03 0.1 0.0 31 49 1875 1893 1867 1907 0.65

Sequence Information

Coding Sequence
ATGGCTAATCGTACGTGTCAATGCGGATGCACAGACCCGCCGGAAATGTCGGCCGCAGATCCTCCCCATGAGGGTTCGTGCGGTTGTAGTTACAACCCTTTCGCGGATCAAGGAAGAGAGAGTGAAATCACGGAATTATCATTCGCACTGCGTAAATTAACTCTGATGAAATGCCAGATGAAAAAATGGAGAATGGAACGTCTTCAGTTGGAGAGCGAGGCCAGAGGATTAAAGCAAGTTCTTCAGGCCCACGGTTTGAACGACGATATCGTCAGACCAGATCCTCTGCTTGCTCATCTTCGGGAGCAGAATGAGAGACTGGAGAACGAGAAAGCGGAGCTCGAGGAGAGCGTGAAGAGCCTTTCCGAGGCCGTATCGGAGTATGAGAATCAGGAATCCTCGACGTCCGATGCTGTGAAtaaattgcgtgaaaaaattcGGACAATGAAGGAAACTAACGCCGTTGAGAAAAGGAGATTAAGAGACCTCATATCTGGGCTGAAGATTCGGCTCCAAGAAGCTGAGAACGAGTCGTCATGCGCTGCCTTAAATCGACTGCGAGCAAAACTTCGAGAAATGATGAAAGGCGGTCAAGAAGCTGACCAAAGAGTTTCTATGGTGGTTCAGCGGTCGATAGAAACTCTGACCGAGTTAACGGGGAACGTCGACGATCTCAAGGCCGAGATCGAGAGACTTAGAGCCGAGATAAAGAGGCTGAAGGATTTGCTAAAGACATGCGAAGAGCGAAAGGACGTCGGGGTTGAGACGACGGTCGTGGACGTTAAACCGGTCGAGAAACCGCTCGACGAAATGGACGACTCGGAATTGTTGAACAGGATCAAGGAACTCGAGGCGCTGATAGCTCAGCTGAGAAAACAATTAATCGATAAAGATGCTGCGATGAATGATCTCCATAATCAATTGTTCGATGCTGGTGCAGACAACAAACGTTTGTCCACCGATTTGGACCAGATGAAGGTCAGCTACAACGCCGTTATGCAGGAGGTCAAGGCCATGAAGGATGAACTGAAGAAGAGGGACGCCAAGGTATCCGAACTTCTCAAAGACCTGAAAGCATCCGCGATCGATATGCTGGGATTAAACAGACTGCAGAGTGAAATGGACGCAATCAAGCCCCAGATGTACAATCTCGAGGTAGAACGCGGCCAGCTATTATCTGAGCTCGGTAGAGTGCGGGGCGTTGTATCGGAGCGGAATGATCAGATCATTAAAATACTCGAGGATAGGGACAAGCACGTTAAAGCACTGGCCAAGGCATCAAGCTTAATGCAGGCGACGGTTGAACCgatgaaagagaaagaaacggCTTTAAAAAATGAGGTTCAAGGATTTAAAGACCGGATAGCAGAGCTTGAACTCGAGCTATCCGAGCTCAGGAAGAAGCTAGCTCAATTAGAAGAGGAAAATTCTAAGATACCTGGACTAGTACATAAAATTAAGGAGCTCGAAGACGAACTAGAAAAGCTCAGGTCTCAGTTGACCGAGGCAAACTCCAGGGTGGTTGAGCTTGAGAAAGAAGTAGCCCGGTTCAAAGCTGAGAAAGAAGAACTAGAGAGTGAGCTTGGTGAGGCGAGAAAGGAGCTTGAAAAGCTGAAGGAGGAGCTCGCTAAGGAGAAAGCTGCGAAAGACGCTGCTCTAAAAGATCTTGAGGATTGCAGAGCAGAGAACGAGAAGCTCAGAGCGGAAAATGAACGCCTAATAAATGAGCTAAACGCGGCTAAGGCTGAGAGCGATAAGCTCAGAAACGATTTGGGGAAACTGAAagaagaaatggaaaatttaaaGGCTGAAAATGATCAGTTGAAAGATCAGTTCAACGTGGCCAAAAATGAGATCGACAAGCTCAGAGGTGAGCTCGAGAATCTCAAGGCAGAGAATGACAAGCTCAAAAATGACGTAGAGAAACTTAACGGAGATATGGATAAACTGAAGACGGAAAATAACAGCCTCAAAGAAGAGCTCGACAAACTTAGAAACAAGCTCGCTGGACTTGAGGATGAAATCGAGAAACTAAAGAGCGCTTTGGCTGCAGCTGAAGCTGAGGCTGAGAAGCTCAAACAAGATTTAGAAAACACGAAAAACGAGCTCGATAAATTAAGGGCTGAGAGTGATCAGCTCAAAAGTCAGTTAGCCGATGCCAAAGCTGATAACGAGAGGCTTAAAATTGAACTCGAAAAACTGAAGGGAGAAACAGAGAAactaaaacaagaaaacagcGAACTTAAGGCAGAGAGAGACAAGCTGCAAGCAGACCTCGAAAAACTTGGAAAAAGGCTCGATGAGCTTGAATCTGAAAATGAGAAACTGAAGAACGCTTTAACAGCGGCTAAAGCTGAGGTTGAAAGGCTCAAAAATGATCTAGACAAAATGAAATCCGATATCGAGAAGCTAAAGGCAGAAAATGATCAGCTGAATAACGAGCTGGCCAATGCTAAGGCAGAGAACTCGAGGGTCAGAAAGGAGCTTGATGAGTTGAAGGGAGAAATGGAGAAGCtgagagaagagaataaaaaccTTAAGGCAGAAAGAGACGGGCTAGAAGCGGAGCTCAAAAAACTTAGAGGAGAATTAGACAGTCTTAAGGGTGAGAATGAGAGACTTAAAGGTGAGAATGAAAGACTTAAGGCCGAGCTTGACAAACTTAAAAGCGAGAACGAGGCCTTGAAATCAGAAACCGAGAAGTTGAAAAAGAGCTTGAGCGATGCAGAGGCAAAGCTAAAAGAGCTTGAAGCCTCTAACGCTGATCTCGCGAATAAAATATCAGAGCTGAAGAACCAAATTGATAAACTTCAGACCGAGTTGGCGTCAGAAAAAGCTGCCAAAGAAGCAGCTGTTAAGGAATTGGCGGCTATTAAGGCCGAGCTAAAAGCTCTGCTGGCGGAAATGGACAAACTCAAGGCAGACTGCGACAGGCTAACAGGACAAGTAGACGATCTCACTAAACAACTATCGGATTTGAAGAACGATTTTGATCAGCTCAAGTCCAAATATGCCGAATTGGCGACAGAGAGAGACAAGCTCAAGACTGAGCTCGATAAGTTCAAGGCAGAAAACgacaagatgaaaaatgacCTGGAGAAGCTCAAGGCAGAGCTGGACGCGTTAAAAACCGAGAATGGTAAGCTCAAAGAGGAGAACGGTAAGCTTAAGGAGGAAATTGGTAAGCTAAAAGAAGAGAATGGTAAGTTTAAGGAAGAGAACGAGGAGCTGAGGAAAGACTTGCGCGATGCTGAAGCTCGGATAAAAACACTTGAGAACGAGGTAAAGTCGTGCGAGGAGGAAAAATCAAGGCTGCGAAAAGAGATCGAGGGCCTGAAAAATGGGATTGAAGAACTCAAGAAGGAGCTTGATGCAGAGAAAGCAGCGAAAGAGGCAGCCTTGAAGGAGTTATCAGCTCTAAAAACTGAGCTAGCTGCACTGAGAGCAGAGCTGGATAAAGTAAGGGCAGAGAACGCCAAGTTAAAAAGTGAGCTCGATAAACTGAAATCAGAAAACGACGAGCTCAGGaatgacaataataaaatgaagggAGAGATTGATAAGCTCAAAGCAGAGGTTGCGAAATTAGAAAATGATCTAAGTGCCTTGAAGGCAGAGAACACCAAACTCAAAGAAGATCTGGACAAGCTAGATGCAGAAAATAAAGCTCTGAGATCCGATAATAACAAGTTGAAAGGAGAATTGGATCAGCTCAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGCTACAAAGCTCAAAAATGATTTGGATAAACTGAAATCCGACTTGGATGCGATGCGAgcagagaataataaaatgaagggTGAGCTTGAAAGGCTGAAGTCAGATATTGCAGGATTACAAAACGACCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGCGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGTTACAAAGCTCAAAAATGATTTGGATAAACTAAAATCCGACTTGGATGCGATGCGAGCAGAGAATAATAAGATGAAGGGTGAGCTTGAAAGGCTGAAGTCAGATATTGCAGGATTACAAAACGACCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGAGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGCTACAAAGCTCAAAAATGATTTGGATAAACTGAAATCCGACTTGGATGCGATGCGAgcagagaataataaaatgaagggTGAGCTTGAAAGGCTGAAGTCAGATATTGCGGGATTACAAAACGATCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGCGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGCTAGAGAAGAAGTTACAAAGCTCAAAAATGATTTGGATAAACTAAAATCCGACTTGGATGCGATGCGAGCAGAGAATAATAAGATGAAGGGTGAGCTTGAAAGGCTGAAGTCAGATATTGCAGGATTACAAAACGACCTGAACACCCTGAAAGCAGATAATGCGAAGCTCAAAGAAGACCTCGACAAACTAAATGCAGAAAATAAGTCTCTGAGAGCTGATAACGACAAGTTGAAAGGAGAATTGGATCAGCTAAAATCTGAGAACGCGAAATTGAAGGACGATTTAGCTGCGGTTGGAGAAGAAGTGTCAAGGCTCAAAAATGACTTGGATAAACTGAAATCCGAACGGGACGCGCTGCAAGCGGAGAATAACAAGATTAAAGGAGAGCTTGAAAGGCTGAAATCCGAGAATGCGAAACTACATAAAGATCTTGATGTCTTGAAGCAGGAGAACGGCAAGCTAAAGTCTGAGGTTGACAAACTGAAAATCGATaatgaaaagttgaataaGGATCTTCAGAAGGCAAACTCGGAGTTAGACAAGCACAAAGCCGAGGTCGATAATCTGAAGAAATTAACCAAGAAACTAAAAGAGACACCGAGAAAACTCGATACCCTGAAGCCAAATTACACCGTGGAAAAAGAACTGCAGATTAAAGTACCCCGCGTTGTTCCATCgggtaaaattattaaaagtGATCAACGACCCTCGGTTATAAAAAAGGACCAAGGTTCACAGGGCGCGGGTTGTGGCGATTACGAAAATGCAAACGAACAGCTGATGAGAAACATGAATATGCAAGAGAGGGCCGTACAGCGTATACgagaatttataaaatacgTACTCGGCGAGAGGAGTTCGCCACCCGAAATGGCGCGGGAATTGATCCATCGCATGTCTTCGGTAACGCGGAATAATTTTCCCGAAGACATAATGCGGATTCTGAAAGAGTCGCAGTTCTTATCGGAATCGATATTCAATGCCGAAACCGAGGTTCAACGTCTGGCCAAGCCCGAGGAGGAATTAAACAACCCGAAGGATGaaGATATTCGGGACGTCGATGGACTCGGTGACGCCTTCGACGCCGAGTCGTGGTTGAAAacacTGACGTTAACCGAATTGGCAGAACTCCACGACAGGATATGCCTCGTAACTTCGTGCATGGTTCAGCAGGACATAAACCCCGAGGATTACGTGGACGGTATTGAAACCGATGGAATTTGCCATCCTTGTAGTACCGTAACGGAGGAATTCACCGACGATTCAATTCCCGATTACGAAGCTTTGAACAGAAGAATCGGAGCTCTTCAAATGCAGATAAACAAGAAACAGGACGAGGCTGCtaaaaaggttaaaaaaatgcGCCAATCTATGTGGCGGGAACAGGAAAACCTGATCAAACTATCCGAGGAAATGAACGCccaaaaacgaagaaatttaTCGATGAAGATTAAGATCAATGAGAACATCGAAGCGGATGTAGAAggagaaacaaataaaaatgaaaagacgGTTCTATGCGGTGGAAAACTTCCCCCAAAgatcggACAGAGGACGGACGATATGTTCGgtgagaaaaattgcaattccGGACGTTTTATCGGCGCCAATTCCAACGTCGCGTTGAATTTGGTCCCTTGCTTTAACCCAGAGGAAAATAAGCCCGATTCAGGATCGTCTTTGACCGCGGTTAGGGTTGGGAGAAAAAAACCACCGCTGTGTGCGGCACCGGTTAGACACGCGGACGTACCTTGCCGTGTTAAGCCCCGCTGTCCCTCGGCTATGAATCGAGTTATTTAG
Protein Sequence
MANRTCQCGCTDPPEMSAADPPHEGSCGCSYNPFADQGRESEITELSFALRKLTLMKCQMKKWRMERLQLESEARGLKQVLQAHGLNDDIVRPDPLLAHLREQNERLENEKAELEESVKSLSEAVSEYENQESSTSDAVNKLREKIRTMKETNAVEKRRLRDLISGLKIRLQEAENESSCAALNRLRAKLREMMKGGQEADQRVSMVVQRSIETLTELTGNVDDLKAEIERLRAEIKRLKDLLKTCEERKDVGVETTVVDVKPVEKPLDEMDDSELLNRIKELEALIAQLRKQLIDKDAAMNDLHNQLFDAGADNKRLSTDLDQMKVSYNAVMQEVKAMKDELKKRDAKVSELLKDLKASAIDMLGLNRLQSEMDAIKPQMYNLEVERGQLLSELGRVRGVVSERNDQIIKILEDRDKHVKALAKASSLMQATVEPMKEKETALKNEVQGFKDRIAELELELSELRKKLAQLEEENSKIPGLVHKIKELEDELEKLRSQLTEANSRVVELEKEVARFKAEKEELESELGEARKELEKLKEELAKEKAAKDAALKDLEDCRAENEKLRAENERLINELNAAKAESDKLRNDLGKLKEEMENLKAENDQLKDQFNVAKNEIDKLRGELENLKAENDKLKNDVEKLNGDMDKLKTENNSLKEELDKLRNKLAGLEDEIEKLKSALAAAEAEAEKLKQDLENTKNELDKLRAESDQLKSQLADAKADNERLKIELEKLKGETEKLKQENSELKAERDKLQADLEKLGKRLDELESENEKLKNALTAAKAEVERLKNDLDKMKSDIEKLKAENDQLNNELANAKAENSRVRKELDELKGEMEKLREENKNLKAERDGLEAELKKLRGELDSLKGENERLKGENERLKAELDKLKSENEALKSETEKLKKSLSDAEAKLKELEASNADLANKISELKNQIDKLQTELASEKAAKEAAVKELAAIKAELKALLAEMDKLKADCDRLTGQVDDLTKQLSDLKNDFDQLKSKYAELATERDKLKTELDKFKAENDKMKNDLEKLKAELDALKTENGKLKEENGKLKEEIGKLKEENGKFKEENEELRKDLRDAEARIKTLENEVKSCEEEKSRLRKEIEGLKNGIEELKKELDAEKAAKEAALKELSALKTELAALRAELDKVRAENAKLKSELDKLKSENDELRNDNNKMKGEIDKLKAEVAKLENDLSALKAENTKLKEDLDKLDAENKALRSDNNKLKGELDQLKSENAKLKDDLAAAREEATKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAAREEVTKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAAREEATKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAAREEVTKLKNDLDKLKSDLDAMRAENNKMKGELERLKSDIAGLQNDLNTLKADNAKLKEDLDKLNAENKSLRADNDKLKGELDQLKSENAKLKDDLAAVGEEVSRLKNDLDKLKSERDALQAENNKIKGELERLKSENAKLHKDLDVLKQENGKLKSEVDKLKIDNEKLNKDLQKANSELDKHKAEVDNLKKLTKKLKETPRKLDTLKPNYTVEKELQIKVPRVVPSGKIIKSDQRPSVIKKDQGSQGAGCGDYENANEQLMRNMNMQERAVQRIREFIKYVLGERSSPPEMARELIHRMSSVTRNNFPEDIMRILKESQFLSESIFNAETEVQRLAKPEEELNNPKDEDIRDVDGLGDAFDAESWLKTLTLTELAELHDRICLVTSCMVQQDINPEDYVDGIETDGICHPCSTVTEEFTDDSIPDYEALNRRIGALQMQINKKQDEAAKKVKKMRQSMWREQENLIKLSEEMNAQKRRNLSMKIKINENIEADVEGETNKNEKTVLCGGKLPPKIGQRTDDMFGEKNCNSGRFIGANSNVALNLVPCFNPEENKPDSGSSLTAVRVGRKKPPLCAAPVRHADVPCRVKPRCPSAMNRVI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

90% Identity
iTF_00938428;
80% Identity
iTF_00938428;