Basic Information

Gene Symbol
-
Assembly
GCA_014825855.1
Location
CM026129.1:11702441-11714524[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 48 0.14 1.4e+02 2.7 0.5 32 57 53 78 47 85 0.54
2 48 3.4 3.3e+03 -1.7 1.6 33 55 99 121 97 124 0.54
3 48 4.8 4.7e+03 -2.2 0.1 40 60 151 171 146 174 0.86
4 48 0.063 62 3.8 0.3 30 55 212 237 204 244 0.80
5 48 0.38 3.8e+02 1.3 0.3 28 63 265 300 263 302 0.89
6 48 0.0057 5.6 7.2 6.2 29 64 345 380 343 381 0.88
7 48 0.0012 1.2 9.4 4.4 24 63 400 439 397 441 0.92
8 48 0.00044 0.43 10.7 0.3 24 58 466 500 464 506 0.83
9 48 0.1 1e+02 3.2 2.7 22 64 520 562 509 563 0.88
10 48 0.012 12 6.1 7.6 20 63 560 603 558 605 0.84
11 48 0.028 28 4.9 8.4 25 58 607 640 596 647 0.62
12 48 0.022 21 5.3 1.2 26 58 643 675 640 682 0.89
13 48 0.067 66 3.7 1.3 21 64 659 702 658 705 0.86
14 48 0.00037 0.37 11.0 4.7 28 65 722 759 707 759 0.89
15 48 0.013 13 6.0 3.1 26 58 748 783 746 797 0.79
16 48 0.014 14 5.9 1.3 25 59 771 805 760 811 0.60
17 48 0.042 42 4.4 0.4 33 57 807 831 791 838 0.59
18 48 0.027 27 5.0 2.1 27 60 826 859 824 864 0.89
19 48 6.1e-05 0.06 13.5 3.6 25 65 869 909 858 909 0.84
20 48 0.0073 7.2 6.8 0.2 33 63 905 935 904 937 0.88
21 48 0.0029 2.8 8.1 4.5 24 58 942 979 939 986 0.80
22 48 0.17 1.7e+02 2.5 0.1 39 63 981 1005 977 1007 0.81
23 48 0.03 29 4.9 0.0 33 61 1003 1031 987 1032 0.84
24 48 0.0035 3.4 7.9 2.8 28 63 1026 1061 1022 1063 0.87
25 48 5.9e-06 0.0058 16.7 4.1 27 62 1060 1095 1057 1098 0.89
26 48 0.18 1.8e+02 2.4 0.8 28 59 1093 1124 1092 1128 0.84
27 48 0.00062 0.61 10.3 0.5 21 63 1117 1159 1116 1160 0.94
28 48 0.0014 1.4 9.1 1.4 26 58 1178 1210 1167 1217 0.80
29 48 0.015 15 5.8 0.8 24 64 1204 1244 1204 1245 0.85
30 48 0.52 5.1e+02 0.9 0.5 24 56 1239 1271 1236 1279 0.82
31 48 0.017 16 5.7 3.1 24 64 1272 1314 1269 1315 0.84
32 48 0.0056 5.5 7.2 2.9 18 62 1324 1368 1320 1371 0.92
33 48 0.13 1.2e+02 2.9 0.9 32 61 1373 1402 1369 1406 0.84
34 48 0.17 1.7e+02 2.4 0.5 25 59 1404 1438 1402 1442 0.70
35 48 0.16 1.5e+02 2.6 0.6 24 63 1417 1456 1414 1458 0.91
36 48 0.0051 5.1 7.3 4.7 27 59 1462 1494 1458 1500 0.90
37 48 0.014 14 5.9 4.0 27 62 1476 1511 1473 1515 0.81
38 48 0.15 1.4e+02 2.7 5.7 25 57 1516 1548 1505 1556 0.82
39 48 2.5 2.4e+03 -1.3 10.4 28 64 1575 1611 1544 1612 0.82
40 48 0.00023 0.22 11.7 4.2 29 64 1604 1639 1600 1640 0.90
41 48 0.00099 0.97 9.6 3.1 24 62 1613 1651 1612 1654 0.72
42 48 0.0058 5.7 7.2 6.0 28 59 1652 1683 1647 1689 0.86
43 48 0.055 54 4.0 6.7 30 63 1685 1718 1681 1738 0.84
44 48 2.5 2.5e+03 -1.3 6.2 31 60 1742 1771 1725 1777 0.72
45 48 0.00043 0.43 10.8 1.8 27 63 1773 1809 1767 1811 0.89
46 48 0.0067 6.6 6.9 0.6 32 63 1813 1844 1809 1846 0.79
47 48 0.0035 3.5 7.8 3.8 21 55 1844 1878 1842 1879 0.93
48 48 0.00098 0.96 9.6 7.5 27 60 1871 1904 1868 1908 0.91

Sequence Information

Coding Sequence
ATGGAGATTTGCCGATGCGGATGCGAGGCATCGTCGCAAGAATCGATCGATCCACCGCACGAGCCGTGCTGTTGCTGCAATTACGATCCATTCAGCGACAAAGAAGCAGAAATCTACGACCTCTCGTTTGCCCTGAAGAAGCTCACTGTAATGAAGTGTCAGATGAAAAAGTGGCGAATGGAAAGACTTCAACTCGAGAGCGAAAATAGGTCTTTGAAACAAGCCCTCCAATCATTCGGTGTAAATGCGGATGAGATATTGAAGCCCGATCCGCTGTTAGTGCACTCCCGAGAAGAAATCGAAAGGCTGAAAAATGCAAACGCGGTGCTGCAAGATAAAGCGAGGGATCTTGAGGAAACTCTCGCCGAGCGAGATTGCTGTGACGATCCCGACGCCACGATCCACTTTCTCAGAGAGAAGATAAGATATCTCAGAGAGGGTTTCGCGCTTGAAAAGAAAGAATTGCGAGACATGATATCGGATTTGAGGTTGAAGCTGGCGCAGACCGAGGAGGACATCAGCTGCCCCGCGATATATCGTTTAAGGGCAAAGCTGCGCGATCTTATGAAAGGTGAAGCGGCTGAACAACAGATTTCCAAAGTTGTGGAGAAATCGATCGAAACCTTGGTGGATCTTTCGAAGAGCTGCGATGATCTGCGTTTGGAAAACGAACGTCTTCTTGCTCAAGTGACAGACTTGCGCGGTGCATTGGTTGATCTCGAAGGAAAGGAAACGCCAGAAACGATATTGCAAACGGCCGAGACAACTACAGTGCCGGAGTATATTGACATTTCAGATTTATTGGATAAGCTGAACAGCTGCGAGGATACCGTGGCTGATCTAAGAAAGCAGTTAGGACAAAGAGACGAGCAGATCGATGCACTGAACAAAGAACTGGAATCGATGATCAGTCGAAAAGGTTTAGAGGAACAGATAGAAGCCATGAAGGAGGAACTTAGGAGGAAAGATGACAAGATCGCAGGACTACTGAACAACCTAAGACAATCAGAAATAGACTTGTTGGGGCTATCTTCTCTGAAATCCGAAGCGGAGAACCTGAAATCAGAGTTACATGATCTGAAATCAGAGAGAACTGAATTACTGAACGAACTAAACAAACTGCGAGAATCACTGAAGGATAGAGACGATCAGATAATAGATTTACTGGAGCAGAGGAACAACTTGGAGAAGGAGTACAAGGATAAGACGGCCGAATTACAGTCAAAACTGGATGAGGCAAATGACGAGATCGATGATTTGAAAGCTGAGATAACCAAACTGAAGAATGAGTTGGAAGAGTGTAAGATGTTAAACGCGAAGCTGGAACAGTGCTGTCTGGAGAAAAACGCAATTTCGGAAAAGCTACACGGCTTCGAAGAGGATTTTGCAGCCGCGAAAGCGATAATAGCGAATCTGGAAAGTGAGGTGGACACTTTAAGGCGAGACAAGGAAAATTTGTCGAACGAACTGGACGAGGCAAGGAAACAGGTGGAGGCGTTCATTGGACGACTGGAGGATGAAAGGGCGGCCAGGACCGCATTGGAGAAGGAACTGGAGAGAAATCGAGATGAGATTGAATTGTTAGAGAGGGAGATATTCGATCTGAAAGGTCAGATCGATGCCGAAAGGAAGGAGAACGGCGAGCTTCGCGAAACGTTAGAAGCATCGGTCGGCGAGAGGGAAAAGTTGAGGGCTCGGTTAGAGCAGCTGGAGAACGAGAACGATGATCTGATGAAAAGGACGAAGGAGCTGGACAATTTGAATAACCAGCTAAGGAACGACCACGATAGCATGAAGCAGGCTTTGGACAATTTGCAAGCAGAGATCAACAAACTGGAGGATGAATTGGCCAAGGCGGAGCAAGAACGGGATGCGTTGTTGAACGAGAATAACGGTGTGAAAAAGCAGCTGGAACAAGCGATGGCGGAGAACGAGAGTCTGACAGGCAAATTGGACGAAGCTGGTAAAGAAGTTAATAAACTGAAACTACAGAGGGACGAGCTACAGAGGAGCCTCGATGGGATCAATCTCGAGAACGATTCACTGAAACGGGATATGAGGGCGTTAAGGGATGACCTTGAGGATTCCAGAAGGCAAGCGGAGGAACTAAAGGCCGCTGGTGACGCGTTAAAAGCGACGGATAAGGATAAGATACTGGAACTTGCAAAGCTGCAAGAACAAGCAGAGAACTGCAAGTTCGAAAATAATCGCTTAACGAAGGAAAATGATGATTTGAAATCTAAAATAATAGAATTGGAAGGAAAGTTGGAGGAGTTGGATAAGTTAAAGGGAATAAATACAGATTTACTGGCTGAAGTAGATCGTTTGAGAAAAGAATTAGAAAAAGCGTTGGAGGACATTGATCAATTGAAATCCGAAATAGGTTCCTTGAAGGATGGACTCGATAATTGTGTGGGCGAGATGCAAAAGCTGAGAGTCGAAAATGGCGAGCTGAAAATACAGAACGAAACCTTGAAGTCTGAGATGCAGGCAATTACCGATCGCTTAATGAAAGACAATGACGATTTAAAAGCGAAAATGTCCGAATTGGAAGAAAAGTTAAGTGAATTGGATAAAATGAGACTAGAAAATGTTGATTTGCTTGATGAAGTAGATCGTTTGAAACAGGAATTGGCAAAAGCCTGGGAAGAAGTCGATCGATTGAAATCCGAAGCAGCATCTCTGAAAAACGCACTCGATAAGTGCGTGGACGAGATGGAAAAGCTGAGAACTGAGAGCGACCAGCTTAAATCGGAGAATCAAGCTTTCAAGTCTGATATTCACGGACTTGATGATCGCTTAACGAAGGAAATCGCCAATCTGAAAGCAAAAAACGCGGAATTGGAAGGAAAATTAGTGGCATTTGATAAATTGAAGTCGGAAAATGAGGATTTACTTGGTGAAGTCGATCGTTTGAGACGTGAATTGGAAAAAGCCTTGGAGGATATGGATCAATTGAAATCTGAGATAGGTTCTTTGAAAAACGGACTGGATAAGTGTGTCGGCGAGATGGATCAGCTGAGAACTGAAAACAGCGGTTTGAAGTCTGAAATTCAGGGAATGAGGGGCGAGGGGGACAGTCTGTCGGCGGAGTTAAATAATCTGAAGAACGAGAATTCTCTTTTGAAAGACGAAAGAGATCGATTGAGCAAGCAATTGAGCGACTGTAAGATGGAAAACGAAAAATTCAGAGCGGAGAAGGCTCACCTGGAAGCTGAAAATGAGAAGCTGAAAGGAGAGATAAACTCGTGCAAGGAAGAGAATGACAAATTAAAGGACGAACTTGGAAAATCACGGGAACAATTGCAGTCATCGAACGACGAATTGAATAAATTAAAGGCTAATCTCGACAGAGCTGAGGAGAAAATTCGATCTCTGGAGCCGCTGATCTCCCGTTTGCACAGTGAAAATGATAAATTGCGTGACGATTTGACGAGTTTGGAGAACGAGGCCAACGATTTCAAAGCAAAATTGGCCCGAGAAACGGCTGACAATGAAAAGATGCAGAACGATCTGAAGATACTGGAGGATCAGGTGCACGATCTAAGTGGGAACCTGGTCAATGCTAGGGCAGAAAATGACACTTTGAAACAGGAAAATCAAAGTCTAAAAGCCAAGTTATTAAATATGGATCATGATCTGTCGAATTTGAAAGCGGAATGTGCGGATCTGAAACGAGAGATTGCTGATCTGAAGAGATTAATCGATGAATCAAAAGAAAAAATTGCTAAACTGGAAGCAGACATAGATCATTGGAAAATGGAGAACTGCAAACTTCAGTTGGACATTGATAAATCGAAAGCTGATCTTGAGAAAGCCTTGAAAGATTTGCTCGAATGCCAGGCTTCGAAGAAAGCACTAGAAGCAGAGATGTACCGCCTGAAGATTGAGAAGGGCGAGCTTGACAAGAAGCTTGTCGGTTTAACGTCTCAACTCGAGCAACAGGGAAAAGCGTTCGAAGCAGAAAAATCGGCCAGAGATAAGGGTGATTCAGAAATCACGGCCATGAAGGAGGAACTGAATGCCTTGAAAAAGGAACTAGAAAAATTGAGAACTGACAGCAACAGATACAGAAATGAAATAGAGGAGCTGGGAAGGCAGCTCGCGGTAACAAAAAATGAACTGGAGAAGTGCAAAGAAGAGGTTTCTGTATTAAGAGATGCCAATAACGCTTTAAAGTCTCAATTGGATCCGTTGAAAAGTTTAAAGGACGAATATAATAAGTTGAAGGCTGATTTAGATTCTCTTAAAGAGGAGAACGTGAACCTTCTGCAAGATAGGAAAAATTTCGAAGACGAGTATACTAGGCTGAAAGGAGAAGGCGATGGACAGAAAGCAGAGATCGATAGATTGAGGTCAAACTTGAATGCAGAGGAGGCAGCTGCGGAAAAATTGAGGGCGGATCTTCAAAATTGCCAAACTGAGAACGATAGACTGAAAAAGCAATTAAACGAAGTGAAAAATGAGTTGGATGAACTAACAAAGGGAAACGATCGTATAAAGAACGAGATGGATAAGCTGAAGAAGGCGCTCGCGGACGCGGAAGCAAGGATAAAGTTGCTGGAAAGTGAGCTATCCGATTTGTTAGACGAGAAAAAAGAATTGGTCAACGAACTGTATCGTTTCCGCGAACAGCTAAACAATCGTACAACCGAGCTAGAAGAGCAGATGGCCACGAAAGATGCGGCCAAGAAGGAATTGGCCGACATGAAGGATGAGCTGACCGCTCTAAAAGCGGCGTTGGATAAGGTTCGCAGCGAAAACGATAAGCTGAGGAACGAGAACGAAAAGGTGAATGTGGAATTAACCAAGTTGAACGGGCAATTAGAAACTCTGAAGGAGGATAATGCGAAGCTGGGAAACGAAAACGCGAATCTGAAGAACGAAAACGCGAATCTAAAGAACGACAATGCGAAATTGGCGGCGGAGTTAACTGGGACGAAAAACAAATTGGCAGAAGCGGAGAAACAGCTGAACGATCTAGAGAAAGAAAACGACGACTTGAATAACAAAATAGCTGATCTCGAGAACACAGTGAACGAGCTCGAACCTTTGAAGAAACAATTAGAAGATGCTAAGAAAGAAGTGGATAGGCTGAGGCCAGAGCTAGATAGATTGAAATCAGAGAATGCAGAACTGCAAAACAATTTAAATGACGCCATAGAGGAATCGAATAGGTTAAGAAATGATTTGGACAAATTAAAAAGCGATTACGACAAATCAAGGTCTGAATTAGCTGACCTGAAGGAGGAGAGAGATAGTCAGAGAGAACGGAACGCAGAATTGGAGAAAGAATTAGGGAAAATAAGGAAAGAGAGTGCGAATCTGAAGGGCGAGTTAGCCGATTGTCAAGCGGAGAACGAAAGATTGCGTAATGGATTGACAGATTTGAAGTCGCAAAATGCAAAACTGCAGGACGATTTAAACAAGGCGAAGAACGAAGCGAATAAATTAAAAGCCGATTTGGATAAATTGAAAAGCGATTATGGTGAATTGCGGTCGGAATTAGGTAAACTAAGGGATGAGAAGAATAGGCACAAAGAACGCGATACTGCGTTGGCCACGGATCTGGATAAATTGAAGAAAGAGAATGACGAGTTAAAAGATGGGAATGAGAAACTGAAAAGCCAGTTATTCGATTGCCAAGAGGAGAGAGAAAGGCTGCGCAAGGAATTGGGAAAGCTGGAAAGAGAAAATGCAAAATTGAAAGAAGTGAAAAAGGTGGAGCCCAAGGAAACGGAGATGGACAAAGATATTTTGGATGACTGCGGTGATTTCATAAAGGCGAATGAATTATTGGGGAAGAAATTTAAAAAACAGAATGAAGGTGTGCTGCGGATTCGGAATTATATTAGATATTTGGAAGGTAAAGCTACAGAACCGAAAATGGCGGACAGGCCGGAAGAAGAGGCAGAAATTGATCCAGTATTAAGGAAAGACATTGCAGATCTGTTGAAGAAATCTCACGATTTATCTAATGATATCTATCAAACGGAACAGGAAATAACAAATCTTGGAGCTAAACCCGGCGAGGGCATTTTCGATCCTGACAGCTGGCTAAACTCGTTGACATTGACACAGTTAGCTGAGCTTCACGATAAGATTTGCCAATTGACATTGGACATGGTGCACCAGGACAGCAGGGGAGCGGTGCCGTGCGATGGTTATGAACCAGCGACTCCGTCAAAGGCGGATTATAATATTTTGAACCAGCGTATAGCTGCTTTGCAGAAACAGATAGCGGAGAAGCAAATGGAGGCAGGCTGGAAGCTTCAGGAACTGAGGAGGGCTCTTCAAAACGAGCAGGCTAACTTGATTCGAATCTCCGATCGGATGAACATTGAGAGAAAACGCAATTTGAACCTTCAGCATAGCATGGACGATTCACCGTAA
Protein Sequence
MEICRCGCEASSQESIDPPHEPCCCCNYDPFSDKEAEIYDLSFALKKLTVMKCQMKKWRMERLQLESENRSLKQALQSFGVNADEILKPDPLLVHSREEIERLKNANAVLQDKARDLEETLAERDCCDDPDATIHFLREKIRYLREGFALEKKELRDMISDLRLKLAQTEEDISCPAIYRLRAKLRDLMKGEAAEQQISKVVEKSIETLVDLSKSCDDLRLENERLLAQVTDLRGALVDLEGKETPETILQTAETTTVPEYIDISDLLDKLNSCEDTVADLRKQLGQRDEQIDALNKELESMISRKGLEEQIEAMKEELRRKDDKIAGLLNNLRQSEIDLLGLSSLKSEAENLKSELHDLKSERTELLNELNKLRESLKDRDDQIIDLLEQRNNLEKEYKDKTAELQSKLDEANDEIDDLKAEITKLKNELEECKMLNAKLEQCCLEKNAISEKLHGFEEDFAAAKAIIANLESEVDTLRRDKENLSNELDEARKQVEAFIGRLEDERAARTALEKELERNRDEIELLEREIFDLKGQIDAERKENGELRETLEASVGEREKLRARLEQLENENDDLMKRTKELDNLNNQLRNDHDSMKQALDNLQAEINKLEDELAKAEQERDALLNENNGVKKQLEQAMAENESLTGKLDEAGKEVNKLKLQRDELQRSLDGINLENDSLKRDMRALRDDLEDSRRQAEELKAAGDALKATDKDKILELAKLQEQAENCKFENNRLTKENDDLKSKIIELEGKLEELDKLKGINTDLLAEVDRLRKELEKALEDIDQLKSEIGSLKDGLDNCVGEMQKLRVENGELKIQNETLKSEMQAITDRLMKDNDDLKAKMSELEEKLSELDKMRLENVDLLDEVDRLKQELAKAWEEVDRLKSEAASLKNALDKCVDEMEKLRTESDQLKSENQAFKSDIHGLDDRLTKEIANLKAKNAELEGKLVAFDKLKSENEDLLGEVDRLRRELEKALEDMDQLKSEIGSLKNGLDKCVGEMDQLRTENSGLKSEIQGMRGEGDSLSAELNNLKNENSLLKDERDRLSKQLSDCKMENEKFRAEKAHLEAENEKLKGEINSCKEENDKLKDELGKSREQLQSSNDELNKLKANLDRAEEKIRSLEPLISRLHSENDKLRDDLTSLENEANDFKAKLARETADNEKMQNDLKILEDQVHDLSGNLVNARAENDTLKQENQSLKAKLLNMDHDLSNLKAECADLKREIADLKRLIDESKEKIAKLEADIDHWKMENCKLQLDIDKSKADLEKALKDLLECQASKKALEAEMYRLKIEKGELDKKLVGLTSQLEQQGKAFEAEKSARDKGDSEITAMKEELNALKKELEKLRTDSNRYRNEIEELGRQLAVTKNELEKCKEEVSVLRDANNALKSQLDPLKSLKDEYNKLKADLDSLKEENVNLLQDRKNFEDEYTRLKGEGDGQKAEIDRLRSNLNAEEAAAEKLRADLQNCQTENDRLKKQLNEVKNELDELTKGNDRIKNEMDKLKKALADAEARIKLLESELSDLLDEKKELVNELYRFREQLNNRTTELEEQMATKDAAKKELADMKDELTALKAALDKVRSENDKLRNENEKVNVELTKLNGQLETLKEDNAKLGNENANLKNENANLKNDNAKLAAELTGTKNKLAEAEKQLNDLEKENDDLNNKIADLENTVNELEPLKKQLEDAKKEVDRLRPELDRLKSENAELQNNLNDAIEESNRLRNDLDKLKSDYDKSRSELADLKEERDSQRERNAELEKELGKIRKESANLKGELADCQAENERLRNGLTDLKSQNAKLQDDLNKAKNEANKLKADLDKLKSDYGELRSELGKLRDEKNRHKERDTALATDLDKLKKENDELKDGNEKLKSQLFDCQEERERLRKELGKLERENAKLKEVKKVEPKETEMDKDILDDCGDFIKANELLGKKFKKQNEGVLRIRNYIRYLEGKATEPKMADRPEEEAEIDPVLRKDIADLLKKSHDLSNDIYQTEQEITNLGAKPGEGIFDPDSWLNSLTLTQLAELHDKICQLTLDMVHQDSRGAVPCDGYEPATPSKADYNILNQRIAALQKQIAEKQMEAGWKLQELRRALQNEQANLIRISDRMNIERKRNLNLQHSMDDSP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00232996;
90% Identity
iTF_00216838;
80% Identity
-