Basic Information

Gene Symbol
-
Assembly
GCA_003710045.1
Location
NW:779282-791198[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 39 5 2.3e+03 -2.3 0.1 43 60 10 27 9 30 0.71
2 39 0.00036 0.16 11.0 3.9 30 60 69 99 61 103 0.90
3 39 2.3 1e+03 -1.2 1.4 28 64 131 167 129 195 0.75
4 39 0.0045 2 7.5 2.6 28 55 217 244 214 254 0.90
5 39 0.58 2.6e+02 0.8 2.7 25 62 260 297 247 300 0.64
6 39 1.7 7.6e+02 -0.7 7.9 26 62 289 328 279 337 0.73
7 39 0.0014 0.65 9.1 1.1 24 63 339 378 337 380 0.92
8 39 0.023 10 5.2 5.5 25 62 382 419 380 423 0.73
9 39 0.0061 2.8 7.1 1.5 27 63 405 441 404 443 0.91
10 39 0.016 7.4 5.7 7.5 21 65 420 464 415 466 0.93
11 39 0.02 8.9 5.5 5.1 24 60 444 480 441 485 0.84
12 39 6.7e-05 0.031 13.3 3.4 26 62 502 538 499 540 0.90
13 39 0.018 8.3 5.6 3.5 24 62 528 566 526 569 0.94
14 39 0.52 2.3e+02 0.9 0.0 42 61 574 593 570 596 0.85
15 39 0.00022 0.099 11.7 1.4 33 60 593 620 583 625 0.90
16 39 0.0062 2.8 7.1 9.0 21 63 651 693 633 695 0.96
17 39 0.0091 4.1 6.5 0.4 26 63 705 742 700 744 0.88
18 39 2.9e-05 0.013 14.5 4.9 27 65 748 786 745 786 0.91
19 39 0.003 1.3 8.1 2.8 26 62 789 825 783 827 0.66
20 39 0.00083 0.38 9.9 2.5 32 58 858 884 844 891 0.53
21 39 0.007 3.2 6.9 3.3 24 64 899 939 896 940 0.94
22 39 0.0091 4.1 6.5 4.3 27 63 951 987 947 989 0.84
23 39 0.26 1.2e+02 1.8 7.4 24 64 990 1030 982 1031 0.79
24 39 0.0063 2.9 7.0 0.5 33 65 1027 1059 1025 1066 0.74
25 39 0.0001 0.046 12.8 6.3 21 65 1057 1101 1055 1101 0.93
26 39 0.0051 2.3 7.3 4.2 34 61 1133 1160 1115 1164 0.83
27 39 0.034 15 4.7 0.4 28 64 1148 1184 1146 1185 0.87
28 39 2.4e-05 0.011 14.8 6.7 25 65 1180 1220 1172 1220 0.89
29 39 0.048 22 4.2 1.2 37 64 1227 1254 1223 1255 0.81
30 39 0.0038 1.7 7.7 5.3 27 64 1238 1275 1233 1276 0.88
31 39 0.0019 0.85 8.7 4.1 22 63 1261 1302 1260 1304 0.89
32 39 0.0067 3.1 6.9 0.9 22 58 1296 1332 1295 1337 0.90
33 39 0.74 3.3e+02 0.4 9.0 28 63 1333 1368 1328 1370 0.92
34 39 0.3 1.4e+02 1.7 0.7 29 62 1376 1409 1373 1412 0.82
35 39 1.6 7.3e+02 -0.7 1.0 26 62 1415 1451 1410 1454 0.77
36 39 0.05 23 4.2 0.9 24 60 1469 1505 1466 1509 0.86
37 39 1.5e-05 0.0067 15.5 4.4 22 58 1509 1545 1507 1546 0.94
38 39 0.00081 0.37 9.9 8.5 30 60 1538 1568 1537 1572 0.92
39 39 2.6 1.2e+03 -1.4 0.9 36 51 1751 1766 1730 1774 0.66

Sequence Information

Coding Sequence
atgaaaactaaaattgCATTTCCTGACAGATTAAGGGATACGATATCGCAGTTGAAATTAAAACTTGCCCAGACAGAGGAAGATGTAAGCTGTCCTGCGATATACCGGCTAAGAGCGAAGCTCCGTGAATTAATGAGGGGGGGTCAAACGGCGGACCAGCAGGTTACAAAAGTCGTCGAAAGATCGATCGAAACTTTGGTGGATCTCTCGAAAAGCTGCGAGGATTTACGTCTTGAAAATGAACGTCTTTCGGCCGAATTGGCCGAATTACGCCGTGAATTGGCGGATCTCGAGAAAAAGCAGCTCGCAGACGAGTATCGTGCGGTCGGCCTTCAAGAGGAAACGCTGAAGACAGTTGAAACAACAACAGTTCCAGAGTATATAGATGTGTCGGAGCTGCTGCAGAAGCTTCAAGACTGCGAGCTCGTTGTTTCCGATTTGAAGCATCATCTAGGGGAGAAAGATAAGGTTATCGATGCTCTGAAGAAAGAACTCGAAGCGATGATAGCTCAACAAGATCTGCTGGACCAGATTGCGGCTATGAAAGAAGAGCTCGCTAAAAAGGACGATAAGATACGAGATCTTCTGAATAACATGAGGCAATCGGAAATAAATTTGGTAGGCCTGACTCACCTTAGCTCGGAGCTGGACTCCTTAAAATCTCGATTATCTGAATtcgagaaagagaaggagtCGCTTCTAGACGAGATAAAGGAATTAAAGGAAGCATTGGTGGACAGAGATGCTCAAATAATTGACTTACTAGAACAAAAAAATCAGTTGGAACAGGCGTTGTCGGATAAGAATGCGGAAACGCAAGAGATTCTTAATAACTTGAAGAAGGAGATAGACGATTTAGTGTCGCAAATGGCCAGTTTGCAGGACGAACTCGATGAGTGTAACAAGCGAAACGCAGAGCTTCAGAAGTGTTGTTTCGAAAGGGGCAGACTGTTAGAGAAATTACAGAGTATGGAGGAAGAACTGGCATTGCTCCGAGACGAACTCGGGGCCGCGAAAGACATAATCAGTAGGTTCGAGGAAGAGATAAACACTCTGAAGGCTGACAAGGAAAAATTGCTGTGTGAGCTGGACGAGATTAAGAAACAGGCTGATGCGCTAAGTGACCAATTGGCAGATGAGAGGGCAGCCAAGGAAGCTTTACAAAGAGAGTTTGAATCAATTCGGGACGAAATAGAAGAGTTACGAAGAGAGAATTCAAATCTAAAAACTCAGCTCGAAGTTGCAAGGGAGGAAAACGACAAGCTCAAGGAAGCGAATAATGGTTTTAAAGCAGAACTGGACACGATGAGCGTAGAACTCGACAATttgaaaagagaaaatgaCGAGCTACAAAGGAATTTAGACGCAACGAAGCtagaaaatcaaaaattaaaagaagatgTGGCGGCATTAAAGAATGACCTTAGAGATGCCAGAGAAGAGGTGGTCATGTTGAGAGTCGACAGAGAGGCGTTAAAAGCAGCGGACGAAGACGCAAGGGGTAAGATGGATGAACTCGAGTCGCAAGTGGAAGACCTGAAAGCAGTAAAGGATCGGTTAGCCAGCGAGAATGCCGACGCGAAGGCTAGAAATTTGGAATTGGAACGGGAATTAGACGATGCCATGAAAgcaatggaaaaaataaaagcagAGAACGTTGATTTACTTGATGAATTGGAGCGTCTGAAGGGAGAATTAGCGACGGCTCGTAGCGTGATCGATCAATTGAAAACGGAGATTGGTTCTTTGAAAGACGCTCTCGACAAGTGCGTGGGTGAAATGGAGACGTTAAAAACCCAAAACAGCGAACTCAAGGAGAAAAATGAGTATCTACGGGCGGAGATTGATCAATGCAAAGCCGACGGGGACTCTCTGAGAAACGATTTCGATCGAGTAAAGAAGGAATTAAGTGAGGCGAACGCACGACTAGGCGACGCAGTTAACGAGCTGAACAAATTGAAGGAAGAAAACGAATTGCTTAAACAAAATTTGGACCGGCTGGTTGCTGAAAGGGACAAGCTGCAAGCGGGGATTGAGGAACTTGAAAGCGAAGCGGAGAAGCTTCGAGCGGACTTGGAGAACAGACATAAGATAGAAGCCGCATTAAACGACGAAATCGAACGAATTAAGGGCGAGAAGGATGTGCTGGCGGCAGAATTAAGTAAACTGAGGGATGATTATTCTGCTTTGGAAAAGGAAACGGATCGGTTAAAGACACAGTTCGAAGACTGCAATGCTGAGAATGCGACGCTAAAGTCGGAGAAGGCTCGGTTAGAAGGTGAAAATGAGAGGTTGAGGAGAGAAGGAAATTCGTGTAAAGAAGAGAACGATAAATTAGAAGCTGTAGTTGAAGAATTgcggaaagaaaaagaatcgttGAGTGACGGATTCAATAAGTTGCGAGAGGAGAATAGTAAACTGCGAAAGGATTTGAGTATTTTAGAGAACGAAGTGGATGATTTGAAGACGAAGTTtgagaatgaaagaaaaactaataaaaatatgataaacgATTTGATGGTACTGGACAGCAAGGTAAACGAATTAAGTCAGGCACTCGATGAGGCTCGGGCAAAGAATGAAGCCTTGGCAGAGGAGAATCAGAAGCTTCAGTCGAAAATACGGAACATGGATCAAGAGATAGGGGATTTGAGGTTAGAGTGTGCggatttgaaatttgagatCGAAAATTTGAAGCAGCAAATCGCTAACTTGAAAGCACAGGTTGCTACATTGGAAGAAGATGTGGAATattggaaattggaaaattgtaaagttaaGATAGAGGTAGATAAATTGAAAGCTGATTTGGAGAAGGCATTACTGGATTTGAGAGAGTGTCAGGCCGCGAAAGAAGCATGCCACGCAGAGTTAAATCGACTACAGACTGAAAAAGCCAGCCTAGATAAAAAAATCGCTGATTTAGAATCGGAAGTTGAACAACAGAAAAAAGACCTGCAAGCGAAAAGAACAGAAAAGGACAAGTGCGACGTGGCGCTCGCTGCTCTGAGGGATGAGCTCGAGGCGCTAAAAGAAGAGTTGGAAAATCTGAGGGAGGCTAATAGAAAATACAGAGGAGAACTGGATGACCTTACAAATCAAGTCGCAGCATTGAACAATGATCTTAACAGATGTAGGGAAGAAATCGCAGCGTTGAGAGACAGCAACGAAAAGTTGAAAGCCGATGTGAATGATTTAAATATCGAGAAAAACCGGCTACAGGACGAACGTAACAAATTGAAGTCCGAGGTGGACGTTTTAAGAGCGGAGAACGAGAATCTTCATCAGAACCGTAACAAATTGGAGAACGAAATCAACAGTCTGAGAGGCGAGGGTGACGGACAAAGAGTAGAACTCGATGAACTGAGATCAGATTTGGCTGCGCAGAAGACAGCAGCTGACAGGCTTCAGTCTGAGAATGAGAAATTACAACagcaattaaacgaaatgAAGGACGATTTGGACGGACTAAACGACGAGAAAAGACGCCTGGACGCTAAAGTGGAGGACCTAGCCAAGTCGCTTCGGGCGGCGGAAGCTGCAGTGAAAGCTTTAGAAGATCAGCTGGCGGACTTGCAGGCCGAGAAACAAGAATTACTCAATGAACTTCACCGACTCCGCGACGAGCTGGCACGCCTTATGAACGAGATTGACAAACACAACACTGCTGCTGAAGCAGCCATGAAAGAGCTGGCTGACTTAAAGCGGGAGCTAAGTGCTTTAAGAGCGGAACTGGATAAAACTCGCACTGAAAACGAAGCACTGATCGCCGAGAACGAGAAACTGAGGTCTGACGTGGCGAAATTAAATCAGCAGTTGCGGGCATCAGAAGGCGAGAACGCGGAGTTGAAAAAGGAGAACGCAACGCTATCGTCTGGATTAGCCATGATGAAGGATAAATTAGCACAAACGGAAAATCACTTGAACGATCTGTACAACGAAAATGATAATTTGAAGAAACGGATCATTGATCTTGAAAATGCAGTGCAAGAGTTTGATGCATTGAAAGAACAGTTAAAGGATTGCGGGGCGGAATTGGAAAGATTGAAATTGGAATTGCAACAATCGAAGTTGGAGAGCGAAAATTTGAGGAACGAGTTGCAAAATGCTCGGAACGTACTGAACGGATTAGAAGACGATTTGAGAAAGATGAAAGTTCTGAATGATATGTTGCAAGAAGAAATCGAGAAATTGAACGcggttattaataatttgaaaaacgaTTTGGATAGTCAGAAGACGATGAATGATAAGATGAAGGATGATTTGGAGAATCTGAAGACGATGAATGGGAAACTTAAGAACGATTTAGAGAATCAAAAGACGATCAATGAGAAACTTAAAGATGATTTAGAGAATCAGAAGGAGATCAATGAGAAAGTTAAACATGATTTAGAAAATCAGAAGGCAGTCAATGATGCGATAAAGGACGAGTTGGAAAAGCTGAAAGCAgtgaacaataatttaaagaaagacTACGATAAGTTGTTGGCTGAAACTGACGATCTAAGGTATGAGAGCTCTAAACGAAAGTCCTCCGAGGCGTCATTAAAGGACGATGTGAATAaattgaagaagaagaacgacGAGCTGAAATCCGAAATTGCGaggttgaaaaatgaattaaaggaCTGCCAGGATGAAAACAATAGgttgttgaaagaattggAGAATctgaagaaagaaaacagcagATTGAAAGAGGATGCAGagaGAGCGCCTGTGATGTACAGAGATGTCGTGGATGACAGTGGAGATTACATTAGGGCGAATGATTTACTGAACAAGAAacttgaaaaacaaaatgaagGCGTGCAACGTGTTCGAGATTACATAATTTACTTAGAAGGGAAGGTTAAAGATGAACCGAAAATGGCAGATAAGATGGACGACGATTTAAGTATTGATCCAGAAATGAGAAAAGAGATAGCAGACTTATTGAAAAAGTCGCAAATTCTATCcgacaatatttataaaactgaaaagGAAATTCAAAATGTTGCGAATATCTTGAAACAGCTTCAGGTAACGTTCGATGCTGATTCTTGGCTCAACtCATTAACGTTAACGCAGTTGGCAGATCTTCATGACAAGATTTGCTTGCTAACATCGGATATGGTGCATCAGGACAAACCGAGCGCTGTCAGCTGCATGCCGGGTGTTCCACTAAGAGcagattacaatattttaaatcagCGGATAGCCGCTCTACAGAAACAGATAGCGGAGAAGCAAATAGAAACGGGATGGAAGCTGCAAGAGCTGAAGCGGGCTCTTCGGCAGGAGCAAGCTAACCTAATTCGAATCTCTGACGAAATGAATTTAGAGAGAAAACGTAATTTGGCCCTTCAGTTCAGAATGGATGATGCATCTTAA
Protein Sequence
MKTKIAFPDRLRDTISQLKLKLAQTEEDVSCPAIYRLRAKLRELMRGGQTADQQVTKVVERSIETLVDLSKSCEDLRLENERLSAELAELRRELADLEKKQLADEYRAVGLQEETLKTVETTTVPEYIDVSELLQKLQDCELVVSDLKHHLGEKDKVIDALKKELEAMIAQQDLLDQIAAMKEELAKKDDKIRDLLNNMRQSEINLVGLTHLSSELDSLKSRLSEFEKEKESLLDEIKELKEALVDRDAQIIDLLEQKNQLEQALSDKNAETQEILNNLKKEIDDLVSQMASLQDELDECNKRNAELQKCCFERGRLLEKLQSMEEELALLRDELGAAKDIISRFEEEINTLKADKEKLLCELDEIKKQADALSDQLADERAAKEALQREFESIRDEIEELRRENSNLKTQLEVAREENDKLKEANNGFKAELDTMSVELDNLKRENDELQRNLDATKLENQKLKEDVAALKNDLRDAREEVVMLRVDREALKAADEDARGKMDELESQVEDLKAVKDRLASENADAKARNLELERELDDAMKAMEKIKAENVDLLDELERLKGELATARSVIDQLKTEIGSLKDALDKCVGEMETLKTQNSELKEKNEYLRAEIDQCKADGDSLRNDFDRVKKELSEANARLGDAVNELNKLKEENELLKQNLDRLVAERDKLQAGIEELESEAEKLRADLENRHKIEAALNDEIERIKGEKDVLAAELSKLRDDYSALEKETDRLKTQFEDCNAENATLKSEKARLEGENERLRREGNSCKEENDKLEAVVEELRKEKESLSDGFNKLREENSKLRKDLSILENEVDDLKTKFENERKTNKNMINDLMVLDSKVNELSQALDEARAKNEALAEENQKLQSKIRNMDQEIGDLRLECADLKFEIENLKQQIANLKAQVATLEEDVEYWKLENCKVKIEVDKLKADLEKALLDLRECQAAKEACHAELNRLQTEKASLDKKIADLESEVEQQKKDLQAKRTEKDKCDVALAALRDELEALKEELENLREANRKYRGELDDLTNQVAALNNDLNRCREEIAALRDSNEKLKADVNDLNIEKNRLQDERNKLKSEVDVLRAENENLHQNRNKLENEINSLRGEGDGQRVELDELRSDLAAQKTAADRLQSENEKLQQQLNEMKDDLDGLNDEKRRLDAKVEDLAKSLRAAEAAVKALEDQLADLQAEKQELLNELHRLRDELARLMNEIDKHNTAAEAAMKELADLKRELSALRAELDKTRTENEALIAENEKLRSDVAKLNQQLRASEGENAELKKENATLSSGLAMMKDKLAQTENHLNDLYNENDNLKKRIIDLENAVQEFDALKEQLKDCGAELERLKLELQQSKLESENLRNELQNARNVLNGLEDDLRKMKVLNDMLQEEIEKLNAVINNLKNDLDSQKTMNDKMKDDLENLKTMNGKLKNDLENQKTINEKLKDDLENQKEINEKVKHDLENQKAVNDAIKDELEKLKAVNNNLKKDYDKLLAETDDLRYESSKRKSSEASLKDDVNKLKKKNDELKSEIARLKNELKDCQDENNRLLKELENLKKENSRLKEDAERAPVMYRDVVDDSGDYIRANDLLNKKLEKQNEGVQRVRDYIIYLEGKVKDEPKMADKMDDDLSIDPEMRKEIADLLKKSQILSDNIYKTEKEIQNVANILKQLQVTFDADSWLNSLTLTQLADLHDKICLLTSDMVHQDKPSAVSCMPGVPLRADYNILNQRIAALQKQIAEKQIETGWKLQELKRALRQEQANLIRISDEMNLERKRNLALQFRMDDAS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-