Basic Information

Gene Symbol
-
Assembly
GCA_028455795.1
Location
CM052276.1:362340-372785[+]
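
The Location string packs a sequence accession, a coordinate range, and a strand into one field. A minimal sketch of splitting it apart is shown here; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' string."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    # Expected shape: accession, colon, start-end, strand in square brackets.
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CM052276.1:362340-372785[+]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc, span)
```

Under that coordinate assumption the locus spans 10446 bases on the plus strand of CM052276.1.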

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 50 1.5 1.4e+03 -0.5 0.1 26 50 44 68 40 75 0.67
2 50 1.4 1.4e+03 -0.4 0.8 33 55 89 111 86 114 0.66
3 50 2.1 2e+03 -0.9 1.2 36 60 137 161 128 165 0.71
4 50 0.00052 0.49 10.6 2.7 30 61 203 234 195 237 0.90
5 50 2.8 2.6e+03 -1.3 1.9 29 63 301 335 299 337 0.89
6 50 2.5 2.3e+03 -1.1 3.9 28 58 328 357 320 370 0.69
7 50 0.01 9.8 6.5 1.5 30 55 388 413 378 422 0.57
8 50 3.3 3.1e+03 -1.5 3.7 27 62 427 466 424 469 0.59
9 50 4.8 4.5e+03 -2.1 5.2 33 55 475 497 441 507 0.50
10 50 0.00011 0.1 12.9 4.6 24 63 508 547 505 549 0.94
11 50 0.023 22 5.4 7.5 24 64 536 576 536 577 0.94
12 50 0.51 4.8e+02 1.1 10.0 24 64 550 590 547 591 0.87
13 50 0.0013 1.2 9.4 5.6 26 65 573 612 570 612 0.91
14 50 0.026 25 5.2 7.1 21 60 610 649 610 654 0.94
15 50 1.3e-05 0.013 15.7 2.6 24 63 662 701 661 703 0.95
16 50 0.0052 5 7.4 8.7 24 63 690 729 689 738 0.95
17 50 0.027 25 5.2 9.0 19 63 706 750 703 755 0.84
18 50 0.00095 0.9 9.8 8.4 24 63 739 778 736 787 0.76
19 50 0.029 27 5.1 0.8 22 64 779 821 778 822 0.90
20 50 0.17 1.6e+02 2.6 10.1 24 60 816 852 806 857 0.81
21 50 0.0014 1.3 9.3 6.8 23 63 850 890 848 892 0.84
22 50 0.0019 1.8 8.8 6.1 27 61 890 924 887 926 0.92
23 50 6.2e-05 0.059 13.6 7.4 29 63 920 954 919 956 0.93
24 50 0.00013 0.12 12.5 5.5 22 64 955 997 954 998 0.89
25 50 0.012 12 6.2 3.7 28 61 982 1015 981 1017 0.88
26 50 0.0003 0.28 11.4 7.1 30 63 1012 1045 1004 1047 0.86
27 50 9.1e-05 0.086 13.1 5.0 22 64 1046 1088 1045 1089 0.89
28 50 0.0028 2.6 8.3 14.7 22 65 1095 1138 1087 1138 0.93
29 50 0.0015 1.4 9.2 5.8 25 57 1126 1158 1124 1166 0.87
30 50 0.15 1.4e+02 2.7 0.4 37 63 1166 1192 1159 1194 0.76
31 50 0.0013 1.2 9.4 1.6 36 64 1193 1221 1189 1222 0.88
32 50 0.02 19 5.6 1.5 34 58 1226 1250 1218 1275 0.55
33 50 0.0017 1.6 9.0 6.3 26 63 1281 1318 1278 1320 0.86
34 50 0.046 44 4.4 5.8 28 61 1325 1358 1319 1369 0.57
35 50 0.018 17 5.7 7.4 25 58 1364 1404 1356 1407 0.63
36 50 2.7e-05 0.025 14.8 10.3 21 64 1388 1431 1384 1432 0.92
37 50 0.0006 0.57 10.4 6.2 21 59 1402 1440 1401 1453 0.75
38 50 0.0087 8.3 6.7 8.7 23 53 1460 1490 1458 1505 0.84
39 50 4.9e-06 0.0047 17.1 4.5 26 65 1508 1547 1505 1547 0.91
40 50 0.013 12 6.2 9.9 24 65 1562 1603 1548 1603 0.91
41 50 3.4e-06 0.0033 17.6 13.6 22 64 1588 1630 1586 1631 0.95
42 50 0.035 33 4.8 5.8 29 61 1630 1662 1627 1666 0.86
43 50 0.017 16 5.8 9.1 29 63 1661 1695 1659 1697 0.94
44 50 0.00065 0.61 10.3 4.6 24 64 1705 1745 1697 1746 0.93
45 50 0.055 52 4.2 4.5 25 60 1748 1783 1746 1786 0.82
46 50 0.032 31 4.9 11.8 21 64 1765 1808 1760 1816 0.92
47 50 8.9e-07 0.00084 19.5 4.6 27 64 1831 1868 1828 1869 0.92
48 50 0.00095 0.9 9.8 2.3 25 61 1864 1900 1863 1903 0.91
49 50 5.1 4.8e+03 -2.1 0.0 36 60 1959 1986 1940 1990 0.71
50 50 3.3 3.2e+03 -1.6 1.2 36 51 2092 2107 2071 2115 0.64
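
Each row of the table above carries 13 whitespace-separated values: domain index, domain count, conditional and independent E-values, score, bias, three coordinate pairs (HMM, alignment, envelope), and accuracy. The sketch below parses a few rows copied from the table and keeps those under an independent E-value cutoff. The `DomainHit` type, the `parse_row` helper, and the 0.01 cutoff are illustrative assumptions, not something the record prescribes.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain row of the table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Four rows copied verbatim from the table (domains 4, 9, 41 and 47).
ROWS = """\
4 50 0.00052 0.49 10.6 2.7 30 61 203 234 195 237 0.90
9 50 4.8 4.5e+03 -2.1 5.2 33 55 475 497 441 507 0.50
41 50 3.4e-06 0.0033 17.6 13.6 22 64 1588 1630 1586 1631 0.95
47 50 8.9e-07 0.00084 19.5 4.6 27 64 1831 1868 1828 1869 0.92
"""

hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

I_EVALUE_CUTOFF = 0.01  # assumed threshold, chosen only for illustration
kept = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
best = min(hits, key=lambda h: h.i_evalue)

for h in kept:
    print(h.index, h.i_evalue, h.ali_from, h.ali_to)
```

Among these four sample rows, only domains 41 and 47 pass the assumed cutoff, and domain 47 has the lowest independent E-value.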

Sequence Information

Coding Sequence
ATGGAATCGATGAATCCACCGAACGAGCCGTGTTGTTGCTGCAGTTACAATCCCTTCAGCGACAATGCGAAAGAATCGGAGATTTACGACCTCTCGTTTGCCCTGAGGAAACTGGCCGTAATGAAGTGTCAGATGAAGAAATGGCGAATGGAACGACTTCAGTTCGAGAGCGAAAATAGGTCTTTGAAACAAGCCCTCCAGTCATTCGGTGTAAATGTGGATGAGATATTGAAGCCTGATCCGCTGCTCGTGCAGTCCCGGGAAGAAATCGAATGGCTGCAAAATGCAAACGCGGCGCTCGAAGATAAAGTGAGGGATCTAGAAGAAACCCTCGCCGAACGAGATTGGAGCGACGATCCTTGTGCATCGGTACACTATATCAGGGAGAAGATGAGAGATCTACGGGAGCGCTTTGCACTTGAAAAAAAAGAATTAAGGGACATGATATCGCAGTTGAAATTAAAACTCGCGCAGACTGAGGAGGACGTGAGCTGTCCTGCGATATATCGCCTGAGGGCGAAGCTCCGCGAACTGATGAAAGGAGGTCAAATAGCGGACCAGCAGGTCTCGAAAGTCGTGGAGAGATCGATTGAAACGTTGGTGGATCTGTCGAAGAACTGCGACGATCTGCGTCTTGAAAACGAGCAGCTTTTGGCCCAGATAGCCGAGCTACGTCGTCAATTGGCGGAACTTGAAAAAATGCGACCGGAAGAACCGGTGGTAGAAATACGAATGGAAGAACCGGTGGTAGAAATACGAATGGAAGAACCGGTGGTAGAAATGCGAATGGAAGAACCGGTGGTAGAAATGCGAATGGAAGAACCGGTGGTAGAAATACGAATTGAAGAACCGGTGCGAACAGCCGAGACAACGACGGTTCCTGAATACATAGACGTTTCCGACCTGCTGAAGAAACTCGAAGACTGCAATCTCACTGTTTCCGAATTGAAACATCAATTGGACGAGAAAGATAAGCTTATCGAGGCGCTGAAGCAACAACTTGAAAACATGGTAGATCAGCAGGCTTTGTTGGATGAGATCGCTGCCATGAAGGAGGAGCTCGCGAAAAGAGACGACAAGATGAGGAACCTTCTGAACGAGATGAGGCAATCAGAGATAGACATGTTGGGCATGGGCAACCTAAGATCGGAGCTGGACACCTTGAAGCCTCGATTATCTCAACTCGAGGAGGAGAGGGACTCCTTGACAGACGAAGTGGCGAAGCTGAGAAAATTATTAGCCGAAAGGAACGATCAAATAATTGAGATACTGGAGCACAAGAACAGGTTAGAGCAAGAGCTGGCGGAGAAGGAGGCGGAGGCTCAGCGGATTATCGACAGCTTGAAAAAGGAGATGGACGACTTGCTGGCGCGAATGGCGAATTTGCAGGACGAGCTCGACGAGCGTGATAAGCGAATCGCGGAGCTGGAGAAGTGTTGCTCCGAGAGGGACGAGCTTTTAGAGAAATTACAGGCCGCGCAGGACGAGTTGGCTTCGATGCGAGGCGAACTAGCGTCTGCGAAAGCTACGATAGAGGATCTTAAGGGTGAAGTAGATGCCCTGAAGAAAGACAAGGAGAAACTGCTGAAAGAGTTGGACGAAGTCAAGGAACAGATGAATGCGTTGAGTGACCAACTAGCGGAGGAGAGAGCCGCGAAGGAGGCTCTACAAAAGGAGTTAGAATCTGCCCGGGAAGACCTCGAAGACCTGCGAAAAGAGAACTCCGATCTGAAGGGTCAGCTGGAGCAAGCAAGGGAGGAGAATGACAAGCTCCAAAAAGAGAAGGATGCTTTCAAAGCTGAGCTGGATAACATGACCTCGGAGCTTGATAAACTGAAGAAAGCGAATGACGAGCTGCAGCAGAACTTGGACGCAGCGAAGCTGGAGAACGATAAATTGAAGAACGATCTTGAAGAATCTCGACAAGAGATGGACAGATTGCGAGCTGATGGTGATGCGTTGAAGAGTTTAGATGCGGATACAAAAGCTAAGATGGACAA
ACTCCAATCGCAATTGGACGATTTGAAAGCGGAAAGGGACCGTTTGGCTGGCGAGAATGCGGACATGAAAGCCAAGAATTCGGAATTGGAACGGCAATTGGATGACGCAATGAAATCGTTGGAAAAAATGAGAGCAGAGAATGCTGATCTGTTGGAGGAACTGGAACGTTTGAAGGCAGAACTAGCGAGAGCCAAGAGCACGATTGAGCAATTGAAAGAGGAGATGGAATCCCTGAAGAACGCGTTGGACAAGTGCGTGAGTGAGATGGAGAAGCTAACATCCGAGAATGAGGACCTTAAGGCGAAGAACGAGGCTCTTAAGGCGGAACTTGATCGATGCAAGGCCGATAGGGACTCGCTGATGAAAGACTTAGAGCGAACCAAAGCGGAATTGGACGGTGTGACCAACGAGCTGAACGAGCTGAAGGAGGAGTACGGGTTGCTTAAACAAAATTTCGACCGACTGGAAGGTGAAAGGGATAAGCTGCAGGAGGAGCTGAACAATCTGAAGAACGAGATGAGGGAGCTTCAAGATGAAATAGATGCCGCAAAGCAGAGAGAAGCTGCGTTAAATTCCGAGCTTGATCGTGTTAAGAAGGAGAAGGATGAGTTAGCATCGGAACTGAAGGAACTGAAAGATGAAAATGCAGCTTTGCAGGACGAGATCAAGGATCGATTGAAGAAACAATTGAACGACGCGTATGCTGAGAACGAAAAGCTGAGGGAAGAGCTGGCTAGGTTGAACGACGAACTCGGGAAGCTGGAGGAAGAGAGAGACCAGCTGAAGCAGAACAATGAGAAATTGAATGCGGAAGTTGAGCAACTGCGCAAAGAAAGAGACTCGTTAAACGACGAGCTTGATCGTGTTAAGAAGGAGAAGGATGCGTTAGCATCAGAATTGGACAAATTGAAAGATGAAAATGCAGCTTTGCAGGGCGAGAGGGATCGATTGAAGAAACAATTGGACGATGCGAACGCTGAGATCGAGAAGCTGAAGGAAGAGCTGGCTAGGTTGAACGACGAACTCGGGAAGCTGGGGGAAGAGAGAGACCAGCTGAAGCAAGACAACGAGAAATTGAATGCGGAAGTTGAGCAACTGCGCAAAGAAAGAGACTCGTTAAACGACGAGCTTGATCGTGTTAAGAAGGAGAAGGATGCGTTAGCATCGGAATTGGACAAATTGAAAGATGAAAATGCAGCTTTGCAGGGCGAGAGGGATCGATTGAAGAAACAATTGGACGATGCGAACGCTGAGATCGAGAAGTTGAAGGAAGAACTGGCTAGGTTGAAAGACGAACTCGAGAAGCTGGAGAAAGAGAGAGATCAGTTGAAGCAGGACAACGAGAAACTGAACGCCGTAGTTGAACAATTGCGCAAAGAAAGAGACTCGTTAAACGACGATTTAAAGCAGCTGCGAGATGAGAATAGCAGGTTACAAGCCGAGATCAATGATTTGAGGGAGAAGCTCAATGATGAGAGGAAGGCCAACGAGCTCTTGAAGAAAGATTTGATGATGTTAGACAGCGAGGTGAAGGATTTGCGTAAGGCTCTTGATGAGGCCAGGGCGAAAAATGCTGCCATAGCGGAGGAGAATCAAGCACTTCAGTCAAAGTTAAAGGATTTGCAAGATGAGCTTGGCAATTTGAAGGCAGAATGCGTTGATTTGAAGACGGAGATTGAAGACTTGAAGCAGCAAATCGCTGATTTGAAGGCGAAGATCGCTAAATTGGAGGAGGACGTGGAACATTGGAAGTTAGAGAACTGTAAGATCATGATAGAGGTTGACAAACTGAAGGACGACTTGGAGAAGGCGTTAAAGGAGCTGAATGACTGCAAGGTCGCGAAGGATGCGTGCCAGGCAGAGTTGAACCGTCTCAAAGCCGAGAAAGCCGATCTAGAGAAGAAACTCGCAGATTTACAGTCAGAAATTGAACAGTTGAAGAAGTCGCTTGAAGCAGAAAAATCGGCTAGAAGCAAATGTGAGGCGGAGCTTGCAGCCC
TGAGGAATGAGCTGGACGCGTTGAAGAAGGAGTTGGAAAACCTTCGAGACGAGAGCAGAAGGTGCAAAGGAGAAATAGAAGATCTCAGAAAGAAGATCTCGGAGCTGACCGATGACCTGAACAGGTGCAAGGAAGAAGTGAACGCGTTGAGGGATGCGAACGAAAAGTTGAAGGCGGAGGTGAACGCGTTGAACAACGAGAAGAACAAACTCCAAGAGGAGCGAGATAGATTGAAGAACGAAGCGGAATCCCTTCGGGCAGAGAACGCAGACCTTCGTCAAGATCGAGACAAGCTAGCGAACGAAACTAACAGGCTGAGGGGCGAGGGTGATGGACAGAAATCGGAGCTTGACAACTTGAAGTCTGACTTGTCTGCGCAGAAGGCTGCTGTTCAGAAGCTGCAAAACGATTTGAAGAATTGTCAAGACGAGAACGACAGGTTGAAACAGCAACTGGCTGAGATGGAGAAGTTGAAGGACGAGAAGAATCGATTGGAAAAGGCGTTAGAAGATGCGGAAGCTAAAGCCAAGGGTCTGGAAGGCCAGCTGAAAGATCTGCAGGCTGAGAAGCAAGAGCTGCTCAATGAGCTGCAGCGGCTGCGCGACGAGATAAGTCGCCTTATAAACGAGATTGACAAACATAAAGCTGCTGCTGAAGCAGCCATGAAAGAACTGGCTGCCTTGAAGCAGGAGTTAAGCAATTTGAGAGCTACGCTGGATAAAGCCCGCGCCGAAAACGAAGCATTGAACAAAGAGAACGAGAAACTGAAGTCGGACATGGCGAAATTAAATCAGCAGCTGCAGGCATTGCAGGAAGAGAATGCGAAGCTGAAGGAGGAGAACGAGCAGTTGGCCTCGGAATTAGCAGCGACAAAGGAGAAATTAAAAGAAGCCGAAGACCGGTTGAACAATCTGCGCAAGGAAAATGATGATTTGAAGAAGAGGATCGCTGATCTTGAAAATACTGTGAAGGAGTTGGAGACGCTGAAGAAACAGCTGGAGGATTGCAGGGCCGAGTTGGCGAGATTGAAGGCGGAATTGGAGAAATTGAAAGAGGACAATGAGAATTTGAAAAACCAGCTAAGGGATGCTCAAAATGAGTTGAAGGAGGTTAAGGACTTGAAGGATAAGCTGCAGCAGGAACTTGATGCTTTGAAGGCTGATCATAATAGATTGAAGGACGAGTTAGGGAAACTGAAGGAGATTAATGACAAATTGAAGGATGAGTTGGACGATTTGAAGGCATTGAATGATAAATTGAAGGATGATTTGAAAAATCAGAAGGCGATCAATGATAAGCTGAAGGATGAGGCAGAGAAGTTGCAGAAAGAACTTGAGGCGCAGAAAGCTGCCAATGATATGCTGAAGGATGAGGCAGAGAAGTTGAAGAAAGAAATTGAAGCGCAGAAGGCTGCCAATGATAAGTTGAAGGATGATCATAAGAAGTTGTTGGCTGACATTGATAAAGTAAAGAATGATGGTAGAAGTTCTGAGGTAGAGTTAACGAATGAATTGGATAGGTTGAAGAAGGAAAATAATGACTTGAAGTCTGAAATTGAGAGGTTGAAGAAGGAAATTGATGATTTGAAGTCTGAAATTGAGAGGTTGAAAAAGGAATTGAATAATTGCCAGGATTATAATGATAGGCTGCGCAGAGAACTGGATGGCTGCAAAGACGAACTTAAGAAACTGAAAGCTACTGCAGAGCACGATACAGTGACTTTAAGAAGCCTAGCAGACGATTGTGGCGATTACATCAGGGCAAATGATTTGTTAAGGGAAAAACTTGACAAGCAAAACGAAGGTGTACAACGTGTCCGTGACTATATAACTTTCCTGGAGGGTAAAGTTGAAAATGAACCCAAAATGGCGGACGAATTGGAGGACGATTACAGCATTGACCCAGAAATGAAAAAGGAGATAGGAGAGTTATTGAAAAAGTCGCGGGATTTATCTGAAAATATTTATAACACTGAAAAAGAAATTCAGAATATTGCTGCT
ATTCTGAAACAACTTAAGGTTGCTCCACAGGAAGCTCCACTTAGCGACAAGGCGTTTGATCCCGATTCGTGGCTTAACTCGTTGAAGTTAACACAATTGGCAGATCTCCACGACAAGATTTGCCTATTGACATCGGACATGGTGCATCAGGACAAGCCGAGGGCAGTTAATTGTATGCCAGGGAGTCCACTAAGTGCGGATTACAATATTTTAAATCAGCGGATAGCCGCTCTGCAGAAACAGATAGCGGAGAAGCAAATGGAGACGGGATGGAAGCTAGAAGAGCTGAAGCGAGCTCTTCGGCAGGAGCAGGCTAACCTAATTCGAATCTCCGACGAAATGAATTTAGAGAGAAAACGTAATTTGGCCCTTCAGTTAGGAATGGATGATGCATCTTAA
Protein Sequence
MESMNPPNEPCCCCSYNPFSDNAKESEIYDLSFALRKLAVMKCQMKKWRMERLQFESENRSLKQALQSFGVNVDEILKPDPLLVQSREEIEWLQNANAALEDKVRDLEETLAERDWSDDPCASVHYIREKMRDLRERFALEKKELRDMISQLKLKLAQTEEDVSCPAIYRLRAKLRELMKGGQIADQQVSKVVERSIETLVDLSKNCDDLRLENEQLLAQIAELRRQLAELEKMRPEEPVVEIRMEEPVVEIRMEEPVVEMRMEEPVVEMRMEEPVVEIRIEEPVRTAETTTVPEYIDVSDLLKKLEDCNLTVSELKHQLDEKDKLIEALKQQLENMVDQQALLDEIAAMKEELAKRDDKMRNLLNEMRQSEIDMLGMGNLRSELDTLKPRLSQLEEERDSLTDEVAKLRKLLAERNDQIIEILEHKNRLEQELAEKEAEAQRIIDSLKKEMDDLLARMANLQDELDERDKRIAELEKCCSERDELLEKLQAAQDELASMRGELASAKATIEDLKGEVDALKKDKEKLLKELDEVKEQMNALSDQLAEERAAKEALQKELESAREDLEDLRKENSDLKGQLEQAREENDKLQKEKDAFKAELDNMTSELDKLKKANDELQQNLDAAKLENDKLKNDLEESRQEMDRLRADGDALKSLDADTKAKMDKLQSQLDDLKAERDRLAGENADMKAKNSELERQLDDAMKSLEKMRAENADLLEELERLKAELARAKSTIEQLKEEMESLKNALDKCVSEMEKLTSENEDLKAKNEALKAELDRCKADRDSLMKDLERTKAELDGVTNELNELKEEYGLLKQNFDRLEGERDKLQEELNNLKNEMRELQDEIDAAKQREAALNSELDRVKKEKDELASELKELKDENAALQDEIKDRLKKQLNDAYAENEKLREELARLNDELGKLEEERDQLKQNNEKLNAEVEQLRKERDSLNDELDRVKKEKDALASELDKLKDENAALQGERDRLKKQLDDANAEIEKLKEELARLNDELGKLGEERDQLKQDNEKLNAEVEQLRKERDSLNDELDRVKKEKDALASELDKLKDENAALQGERDRLKKQLDDANAEIEKLKEELARLKDELEKLEKERDQLKQDNEKLNAVVEQLRKERDSLNDDLKQLRDENSRLQAEINDLREKLNDERKANELLKKDLMMLDSEVKDLRKALDEARAKNAAIAEENQALQSKLKDLQDELGNLKAECVDLKTEIEDLKQQIADLKAKIAKLEEDVEHWKLENCKIMIEVDKLKDDLEKALKELNDCKVAKDACQAELNRLKAEKADLEKKLADLQSEIEQLKKSLEAEKSARSKCEAELAALRNELDALKKELENLRDESRRCKGEIEDLRKKISELTDDLNRCKEEVNALRDANEKLKAEVNALNNEKNKLQEERDRLKNEAESLRAENADLRQDRDKLANETNRLRGEGDGQKSELDNLKSDLSAQKAAVQKLQNDLKNCQDENDRLKQQLAEMEKLKDEKNRLEKALEDAEAKAKGLEGQLKDLQAEKQELLNELQRLRDEISRLINEIDKHKAAAEAAMKELAALKQELSNLRATLDKARAENEALNKENEKLKSDMAKLNQQLQALQEENAKLKEENEQLASELAATKEKLKEAEDRLNNLRKENDDLKKRIADLENTVKELETLKKQLEDCRAELARLKAELEKLKEDNENLKNQLRDAQNELKEVKDLKDKLQQELDALKADHNRLKDELGKLKEINDKLKDELDDLKALNDKLKDDLKNQKAINDKLKDEAEKLQKELEAQKAANDMLKDEAEKLKKEIEAQKAANDKLKDDHKKLLADIDKVKNDGRSSEVELTNELDRLKKENNDLKSEIERLKKEIDDLKSEIERLKKELNNCQDYNDRLRRELDGCKDELKKLKATAEHDTVTLRSLADDCGDYIRANDLLREKLDKQNEGVQRVRDYITFLEGKVENEPKMADELEDDYSIDPEMKKEIGELLKKSRDLSENIYNTEKEIQNIAA
ILKQLKVAPQEAPLSDKAFDPDSWLNSLKLTQLADLHDKICLLTSDMVHQDKPRAVNCMPGSPLSADYNILNQRIAALQKQIAEKQMETGWKLEELKRALRQEQANLIRISDEMNLERKRNLALQLGMDDAS
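
The relationship between the Coding Sequence and the Protein Sequence can be spot-checked by translating the nucleotides. The sketch below assumes the standard genetic code, which the record does not name explicitly; `translate` is a hypothetical helper. It strips whitespace first, since the sequence is wrapped across several lines in this record.

```python
# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove line wraps and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of the Coding Sequence above.
head = translate("ATGGAATCGATGAATCCACCGAACGAGCCG")
tail = translate("GATGATGCATCTTAA")
print(head, tail)
```

The two fragments translate to `MESMNPPNEP` and `DDAS`, matching the start and end of the Protein Sequence listed above.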

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00860130
90% Identity
-
80% Identity
-