Basic Information

Gene Symbol
-
Assembly
GCA_000346575.1
Location
KB639272.1:967558-977838[-]
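
The location string above packs the scaffold accession, the coordinate range, and the strand into a single field. A minimal sketch of splitting it apart follows. The `Location` type and `parse_location` function are hypothetical names introduced for illustration, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' field."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "KB639272.1:967558-977838[-]".
_LOC_RE = re.compile(
    r"^(?P<seqid>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location field into scaffold, start, end, and strand."""
    m = _LOC_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


loc = parse_location("KB639272.1:967558-977838[-]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc.seqid, loc.strand, span)
```

The `[-]` suffix marks the reverse strand, so a sequence extracted from the scaffold with these coordinates would need to be reverse-complemented before it lines up with the coding sequence listed further down.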

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 49 1.5 1.4e+03 -0.5 0.1 26 50 44 68 40 75 0.67
2 49 1.4 1.3e+03 -0.4 0.8 33 55 89 111 86 114 0.66
3 49 5.7 5.1e+03 -2.3 1.0 36 60 137 161 128 164 0.76
4 49 0.00039 0.35 11.0 1.4 30 60 203 233 196 236 0.89
5 49 1.4 1.3e+03 -0.4 2.8 28 63 268 303 266 305 0.91
6 49 0.071 64 3.8 0.2 29 55 355 381 345 390 0.65
7 49 6.8 6.2e+03 -2.6 4.6 27 55 395 427 392 437 0.58
8 49 9.6e-05 0.087 13.0 2.0 24 63 476 515 474 517 0.92
9 49 0.041 37 4.6 6.9 24 64 504 544 504 545 0.94
10 49 0.48 4.3e+02 1.1 7.2 24 58 518 552 516 554 0.87
11 49 0.00031 0.28 11.4 4.5 26 63 541 578 538 580 0.90
12 49 0.21 1.9e+02 2.3 6.3 24 60 581 617 578 622 0.90
13 49 0.00018 0.16 12.2 2.2 24 63 630 669 629 671 0.95
14 49 0.0026 2.4 8.4 7.1 23 63 657 697 656 699 0.95
15 49 0.013 12 6.1 7.3 17 63 672 718 670 724 0.81
16 49 9.7e-05 0.088 13.0 2.7 33 62 723 752 718 761 0.84
17 49 0.027 24 5.1 0.5 22 64 747 789 746 790 0.91
18 49 0.0051 4.6 7.5 8.6 24 62 777 815 773 825 0.57
19 49 0.0024 2.1 8.5 4.0 21 54 830 863 815 864 0.62
20 49 0.009 8.2 6.7 6.3 27 64 857 894 855 895 0.85
21 49 6.1e-06 0.0055 16.8 6.9 22 63 894 935 893 937 0.93
22 49 0.00062 0.56 10.4 4.9 22 64 936 978 935 979 0.89
23 49 6.4e-06 0.0058 16.8 6.9 22 63 985 1026 983 1028 0.93
24 49 0.00046 0.41 10.8 4.3 22 64 1027 1069 1026 1070 0.90
25 49 0.0024 2.2 8.5 7.6 27 64 1053 1090 1052 1091 0.91
26 49 6.5e-05 0.059 13.5 10.5 22 65 1076 1119 1075 1119 0.92
27 49 0.0012 1.1 9.5 1.9 28 57 1110 1139 1109 1147 0.87
28 49 1.9 1.7e+03 -0.8 0.1 37 64 1147 1174 1140 1175 0.79
29 49 0.00015 0.13 12.4 5.5 25 64 1177 1216 1154 1217 0.90
30 49 0.0071 6.4 7.0 1.5 24 63 1211 1250 1210 1256 0.93
31 49 0.00013 0.12 12.6 7.0 24 64 1260 1300 1259 1301 0.93
32 49 0.067 61 3.9 7.7 27 64 1305 1342 1301 1350 0.68
33 49 0.023 21 5.3 3.2 29 64 1328 1363 1327 1364 0.91
34 49 0.049 45 4.3 9.3 24 58 1344 1385 1341 1388 0.62
35 49 2.4e-05 0.022 14.9 10.1 21 64 1369 1412 1365 1413 0.92
36 49 0.00078 0.7 10.1 6.8 21 58 1383 1420 1381 1425 0.63
37 49 0.0037 3.3 7.9 7.5 23 53 1441 1471 1439 1485 0.86
38 49 8.5e-06 0.0077 16.4 7.5 26 65 1489 1528 1486 1528 0.91
39 49 0.013 12 6.2 10.4 24 65 1543 1584 1526 1584 0.91
40 49 1.7e-05 0.015 15.4 12.7 22 64 1569 1611 1568 1612 0.94
41 49 0.32 2.9e+02 1.7 4.9 24 61 1606 1643 1605 1647 0.83
42 49 0.018 16 5.7 12.6 28 63 1641 1676 1637 1685 0.92
43 49 0.005 4.6 7.5 9.9 26 65 1688 1727 1683 1739 0.68
44 49 0.057 52 4.1 6.0 24 61 1728 1765 1725 1769 0.84
45 49 3 2.8e+03 -1.4 17.6 23 65 1748 1790 1747 1810 0.54
46 49 3e-06 0.0027 17.8 6.1 27 64 1812 1849 1805 1850 0.91
47 49 0.0011 0.97 9.6 2.3 24 61 1844 1881 1843 1884 0.91
48 49 1.7 1.5e+03 -0.6 0.9 33 60 1940 1967 1927 1986 0.65
49 49 3.3 3e+03 -1.5 1.2 36 51 2069 2084 2048 2092 0.64
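
Most of the 49 domain rows above have weak independent E-values, so a reader will usually want to filter them. The sketch below parses rows in the 13-column layout of this table and keeps those under an i-Evalue cutoff. `DomainHit`, `parse_row`, and the 0.01 cutoff are illustrative assumptions, not values taken from the database, and only four rows copied from the table are used as sample input.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the per-domain table (hypothetical field names)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-delimited row with exactly 13 fields."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *map(int, f[6:12]),
        float(f[12]),
    )


# Rows 4, 21, 46 and 49, copied from the table above.
sample = """\
4 49 0.00039 0.35 11.0 1.4 30 60 203 233 196 236 0.89
21 49 6.1e-06 0.0055 16.8 6.9 22 63 894 935 893 937 0.93
46 49 3e-06 0.0027 17.8 6.1 27 64 1812 1849 1805 1850 0.91
49 49 3.3 3e+03 -1.5 1.2 36 51 2069 2084 2048 2092 0.64
"""

hits = [parse_row(line) for line in sample.splitlines() if line.strip()]

I_EVALUE_CUTOFF = 0.01  # assumed threshold, chosen only for illustration
kept = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
best = max(kept, key=lambda h: h.score)

for h in kept:
    print(h.index, h.i_evalue, h.score, f"{h.ali_from}-{h.ali_to}")
```

Filtering on the independent E-value rather than the conditional one is a design choice here; either column could be used, and the appropriate cutoff depends on the analysis.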

Sequence Information

Coding Sequence
ATGGAATCGATGAATCCACCGAACGAGCCATGTTGTTGCTGCAGTTACAATCCCTTCAGCGACAATGCGAAAGAATCGGAGATTTACGACCTCTCGTTTGCCCTGAGGAAACTGGCCGTAATGAAGTGTCAGATGAAGAAATGGCGAATGGAACGACTTCAGTTCGAGAGCGAAAATAGGTCTTTGAAACAAGCCCTCCAGTCATTCGGTGTAAATGTGGATGAGATATTGAAGCCTGATCCGCTGCTCGTGCAGTCCCGGGAAGAAATCGAATGGCTGCAAAATGCAAACGCGGCGCTCGAAGATAAAGTGAGGGATCTAGAAGAAACCCTCGCCGAACGAGATTGCTGCGACGATCCTTGTGCATCGGTACACTATATCAGGGAGAAGATGAGAGATCTGCGGGAGCGTTTTGCACTTGAAAAAAAAGAATTAAGGGACGTGATATCGAACTTGAGATTAAAACTCGCGCAGACTGAGGAGGACGTGAGCTGTCCTGCGATATATCGCCTGAGGGCGAAGCTCCGCGAACTGATGAAAGGAGGTCAAACAGCGGACCAGCAGGTCTCGAAAGTCGTGGAGAGATCGATTGAAACGTTGGTGGATCTGTCGAACAGCTGCGACGATCTGCGTCTTGAAAACGAGCAGCTTTTGGCCCAGATAGCCGAGCTACGTCGTCAATTGGCGGAACTTGAAAAAATGCGACCGACAGAACCTGTGGGAGTAAGGCGAATGGTAGAACCTGTGGAAGAAATGCTACGAACAGCCGAGACAACGACGGTTCCTGAATACATAGACGTCTCCGACCTGCTGCAGAAACTCGAAGACTGCGATCACACTGTTTCCGAATTGAAACGTCAATTAGACGAGAAAGATAAGCTTATTGAGGCGCTGAAGAAGCAACTTGAAAACATGGTAGATCAGCAGGCTTTGCTGGATGAGATCGGTGCCATGAAGGGGGAACTCTCGAAACGAGACGATAAGATGAGAAACCTTCTGAACGAGATGAGGCAATCAGAGATAGACCTGTTGGGCATGAGCAACCTAAGATCGGAGCTGGACACCTTGAAGCCTCGATTATCTGAACTCGAGGATGAGAGGGACTCGTTGATAGACGAAGTGGCGAAGATGAGAAAGTTATTAGCCGAAAGGAACGATCAAATAATTGATATACTGGAGCACAAGAACAAGTTAGAGCAGGAGCTGGCGGAGAAGGAGGCGGAGGCTCAGCAGATTATCGACAGCTTGAAACAGGAGATGGACGACTTGCAGGCGCGAATGGTGAATTTGCAGGGCGAGCTCGACGAGCGTGATAAGCGAATCGCGGAGCTGGAGAAGTGTTGCTCCGAGAGGGACGAGCTTTTAGAGAAATTACAGGCCGCGCAGGACGAGTTGGCTTCGCTGCGAGACGAACTAGCGTCTGCGAAAGCTACAATAGGGGATCTTAAGGGTGAAGTAGATGCCCTGAAGAAAGACAAGGAGAAGCTGCTGAGAGAGTTGGACGAAATCAAGGAGCAGATGAATGCGTTGAGTGACCAACTAGCGGAGGAGAGAGCCGCGAAGGAGGCTCTCCAAAAGGAGCTAGAAACTGCCCGGAAGGACCTCGAAGACCTGCGAAAGGAGAACTCCGATCTGAAGGGTCAGCTGGAGCAAGCAAGGGAGGAGAATGACAAGCTCAAAAAAGAGAACGATGCTTTCAAAGCCGAGCTGGATAACATGACCTCGGAGCTTGATAATCTGAAGAAAGCGAATGACGAGCTGCAGAAGAACTTGGACGCAGCGAAGCTGGAGAACGATAAGTTGAAGAACGATCTAGAAGAATCTCGACAAGAGATGGAAAGGTTGCGAGCTGATGGTGATGCGTTGAAGAGTTTAGATGCGGATACAAAAGCTAAGATGGACAAACTTCAGTCGCAATTGGACGATCTGAAAGGGGAAAGGGATCGGTTGGCTGGCGAGAATGCGGACATGAAGGCCAAGAATTTGGAATTGGAACGGCA
ATTGGATGACGCGACGAAAGCGTTGGAAAAAATGAGAGCAGAGAATGCTGGTCTGCTGGAGGAGCTGGAACGTTTGAAGGCAGAACTAGCGAGAGCCAATAGCACGATTGAACAATTGAAAGAGGAGATGGAATCCCTGAAGAACGCGTTGGACAAGTGCGTGGGTGAGATGGAGAGGCTGACATCCGAGAACGAGGACCTTAAGGCGAAGAACGAGGCTCTTAAAGCGGAGCTTGATCGATGCAAGGCCGACAGGGACTCGCTGCAGAACGACTTGGAGCGAACCAAGGCGGAATTGGACGGTGTGATCAACGAACTGAACAAGCTGAAGGAGGAGCACGGGTTGCTTAAACAAAATTTCGACCGACTGGAAGGTGAAAGGGATAAGCTGCAGGAGGAGCTGAACAATCTGAAGAACGAGATGGGGAAGCTTCGAGAGGAAATAGATGCCGCGAAGCAGAGAGAAGCTGCGTTAAATTCCGAGCTTGATCGTGTCAAGAAGGATGACGATGCGTTGGCATCGGAATTGGGTAAACTGAAAGATGAAAATGCAGCTCTGCAGGACGAGAGGGATCGATTGAAGAAACAATTGAACGACGCGAACGCTGACATCGAGAAGTTGAAGGAAGAGCTGGATAGGTTGAAGGACGAGCTGGATAGGTTGAAGGACGAGCTGGCTAGGTTGAAGGACGATCACGAGAACCTGGGGAAGGAGAGAGATCAGTTGAAGCAGGACAATGAGAAACTGAATGCCGAAGTTGAGCAACTTCGCAAAGAAAGAGACTCGTTAAACGACGAGCTTGATCGTGTTAAGAAGGAGAAGGAGGCATTGGCATCGGAATTGGGCAAACTGAAAGATGAAAATTCAGCTTTGCAGGGCGAGAGGGATCGATTGGAGAAACAATTGGACGACGCGAACGCTGAGATCGAGAAGTTGAAGGAAGAGCTGGCTAGGTTGAAGGACGATCACGAGAACCTGGGGAAGGAGAGAGATCAGTTGAAGCAGGACAATGAGAAACTGAATGCCGAAGTTGAGCAACTTCGCAAAGAAAGAGACTCGTTAAACGACGAGCTTGATCGTGTTAAGAAGGAGAAGGATGCGTTAGCATCGGAATTGGGCAAACTGAAAGATGAAAATTCAGCTTTGCAGGGCGAGAGGGATCGATTGGAGAAACAATTGGACGACGCGAACGCTGAGATCGAGAAGCTGAAGGAAGAGCTGGCTAGGTTGAGGGACGAACTCGAGAAGCTGGGAAAAGAGAGAGATCAGTTGCAGCAGGACAACGAGAAACTGAACGCCGAAGTTGAACAACTGCGCAAAGAAAGAGACTCGTTGAACGACGATTTAAAGAAGCTGCGGGATGAGAATAGCAGGTTACAAGGCGAGGTGAATGATTTGAGGGAAAAACTGAATGATGAGAGAAAGGCCAACGAGATCTTGAAGAAAGATTTGATGATGTTAGACAGCGAGGTGAAGGATTTGGGTAAGGCTCTTGATGAGGCCAGGGCGAAAAATGCTGCCTTAGCGGAGGAGAATCAAGCGCTTCAGTCAAAGTTGAAGGATTTGCAAGATGAGCTTGGCAGCTTGAGGACAGAGTGCGCTGATTTGAAGTCGGAGATCGAAGACTTGAAGCAGCAAATTGCTGATTTGAAGGCGAAGATCGCTAAATTGGAGGAGGACGTGGAACATTGGAAGTTAGAGAACTGTAAGATCAAGATGGAGGTGGATAAACTGAAGGACGACTTGGAGAACGCGTTAAAGGAGCTGAATGACTGCAAGGCCGCGAAGGAAGCGTGCCAGGCAGAGGTGAACCGTCTCAAAGCCGAGAAAGCCGATCTAGAGAAGAAGCTCGCAGATTTACAGTCAGAAATTGAACAGTTGAAGCAGTCGCTTGAAGCAGAAAAATCGGCTAGAAGCAAGTGTGAGGCGGAGCTTGCAGCCCTGAGGAATGAGCTGGACGCGTTGAAGAAGGAGTTGGAAAACCTTCGAGACGAGAGCA
GAAGGTGCAAAGGGGAATTAGAAGAGCTTAGGAAGAAGATCTCGGAGCTGACCAACGACCTGAACAAGTGCAAAGAAGAGGTCAACGCGTTGAGAGATGCGAACGAAAAGTTGAAGGCGGAGGTGAACGCGTTGAACAACGAGAAGAACAAACTTCAAGAGGAGCGAGATAGATTGAAGAACGAAGCGGAATCTCTTCGGGCAGAGAACGCAGACCTTCGTCAAGATCGAGACAAGCTAGCGAACGAAACTAACCGGCTGAGGGGCGAGGGTGATGGACAGAAATCGGAGCTCGACAACTTGAGGTCTGACTTATCTGCGCAGAAGGCTGCTGTTCAGAAGCTGCAAGACGATTTGAAGAAATGCCAGGACGAGAACGACAGGTTGAAACAGCAACTGGCTGAGATGGAGAAGTTGAAGGACGAGAAGAATCGATTGGAAAAGGCGTTAGCAGATGCGGAAGCTAAAGCGAAGGCTCTGGAAGACCAGCTGAAAGATCTGCAGGCTGAGAAACAAGAGCTGCTCAATGAGCTGCAACGGCTGCGCGACGAGCTGGGTCGCCTTATGAACGAGATTGACAAACATAAAGCTGCTACTGAAGCAGCCATGAAAGAACTGGCTGCCTTGAAGGAGGAATTAAGCAATTTGAGAGCCACGCTGGATAAAGCCCGCGCCGAAAACGAAGCATTGAACAAAGAGAACGAGAAACTGAAGTCGGACATGGCGAAATTAAATCAGCAGCTGCAGGCATTGCAGGAAGAGAACGCGAAGCTGAAGACGGAGAACGAGCAGTTGGCCTCGGAATTAGCAGCGGCAAAGGAGAAATTGAAAGAAGCCGAAGACATGTTGAACAATCTGCGCATAGAAAATGATGATTTGAAGAAGAGGATCGCTGATCTTGAAAATACTGTGAAGGAGTTGGAGACGCTGAAGAAACAGCTGGAGGATTGCAGGGCCGAGTTGGCGAGGTTGAAGGCGGAATTGGAGAAATTGAAAGAGGAGAATGAGAATTTGAAAAACCAGCTAAGGGATGCTCAAAATGAGTTGAAGGAGGTTAAGGACTTGAAGGATAAGCTGCAGCAGGAACTTGATGCTTTGAAGGCTGATCATGATAAATTGAAGGACGAGTTAGAGAAACTGAAGGAGATTAATGAGAAATTGAAGGACGAGTTGGACGCTTTGAAGGCATTAAATGATAAATTGAAGGATGATTTGAAAAATCAGAAGGCGATCAATGATAAGCTGAAGGAGGAGGCAGAGAAGTTGAAGAATGAACTTGAAGCGCAGAAGGCTGCCAATGATATGCTGAAGGATGAGGCAGAGAAGTTGAAGAAAGAACTTGAAGCGCAGAAGGCTGCCAATGAGAAGTTGAAGGATGATCAGAAGAAGTTGTTGGCTGACATTGATAAATTAAAGGATGCGGGTAGAAGTTCTGAGGCAGGGTTAACGAATGAATTGGATAGGTTGAAGAAGGAGAATAATGACTTGAAGTCTGAAATTGAGAGGTTGAAGAAGGAAAATGATGAATTGAAGTCTGAAATTGAGAGGTTGAAGAAGGAATTGAATAATTGCCAGGATTATAATGATAGGCTGCGCAGAGAAATGGAAGGCTGCAAAGACGAACTTACGAAACTGAAAGCTGCTGCAGATCAAGATACAGTGACTTTAAGAAGCCTAGCAGACGATTGTGGCGATTACATAAGGGCAAATGATTTGTTAAGGGAAAAACTTGACAAGCAAAACGAAGGTGTACAACGTGTCCGTGACTACATAACTTTCCTGGAGGGTAAAGTTGAAAATGAACCCAAAATGGCGGACGAATTGGAGGACGATTCCAGCATTGACCCAGAAATGAAAAAGGAGATAGAGGAGTTACTGAAAAAGTCGCGGGATTTATCTGAAAATATTTATAACACTGAAAAAGAAATTCAGAATATTGCTGCTATTCTGAAACAACTTAAGGATGTTCCAGTTAGCGACAAGGCGTTTGATCCCGATTCG
TGGCTTAACTCGTTGAAGTTAACGCAATTGGCAGATCTCCACGACAAGATTTGCCTATTGACATCGGACATGGTGCATCAGGACAAGCCGAGGGCAGTTAGTTGTGTCCCAGGGAGTCCACTAAGTGCGGATTACAATATTTTAAATCAGCGGATAGCCGCTCTGCAGAAACAGATAGCGGAGAAGCAAATGGAGACGGGATGGAAGCTAGAAGAGCTGAAGCGAGCTCTTCGGCAGGAGCAGGCTAACCTAATTCGAATCTCCGACGAAATGAATTTAGAGAGAAAACGTAATTTGGCCCTTCAGTTAGGAATGGATGATGCATCTTAA
Protein Sequence
MESMNPPNEPCCCCSYNPFSDNAKESEIYDLSFALRKLAVMKCQMKKWRMERLQFESENRSLKQALQSFGVNVDEILKPDPLLVQSREEIEWLQNANAALEDKVRDLEETLAERDCCDDPCASVHYIREKMRDLRERFALEKKELRDVISNLRLKLAQTEEDVSCPAIYRLRAKLRELMKGGQTADQQVSKVVERSIETLVDLSNSCDDLRLENEQLLAQIAELRRQLAELEKMRPTEPVGVRRMVEPVEEMLRTAETTTVPEYIDVSDLLQKLEDCDHTVSELKRQLDEKDKLIEALKKQLENMVDQQALLDEIGAMKGELSKRDDKMRNLLNEMRQSEIDLLGMSNLRSELDTLKPRLSELEDERDSLIDEVAKMRKLLAERNDQIIDILEHKNKLEQELAEKEAEAQQIIDSLKQEMDDLQARMVNLQGELDERDKRIAELEKCCSERDELLEKLQAAQDELASLRDELASAKATIGDLKGEVDALKKDKEKLLRELDEIKEQMNALSDQLAEERAAKEALQKELETARKDLEDLRKENSDLKGQLEQAREENDKLKKENDAFKAELDNMTSELDNLKKANDELQKNLDAAKLENDKLKNDLEESRQEMERLRADGDALKSLDADTKAKMDKLQSQLDDLKGERDRLAGENADMKAKNLELERQLDDATKALEKMRAENAGLLEELERLKAELARANSTIEQLKEEMESLKNALDKCVGEMERLTSENEDLKAKNEALKAELDRCKADRDSLQNDLERTKAELDGVINELNKLKEEHGLLKQNFDRLEGERDKLQEELNNLKNEMGKLREEIDAAKQREAALNSELDRVKKDDDALASELGKLKDENAALQDERDRLKKQLNDANADIEKLKEELDRLKDELDRLKDELARLKDDHENLGKERDQLKQDNEKLNAEVEQLRKERDSLNDELDRVKKEKEALASELGKLKDENSALQGERDRLEKQLDDANAEIEKLKEELARLKDDHENLGKERDQLKQDNEKLNAEVEQLRKERDSLNDELDRVKKEKDALASELGKLKDENSALQGERDRLEKQLDDANAEIEKLKEELARLRDELEKLGKERDQLQQDNEKLNAEVEQLRKERDSLNDDLKKLRDENSRLQGEVNDLREKLNDERKANEILKKDLMMLDSEVKDLGKALDEARAKNAALAEENQALQSKLKDLQDELGSLRTECADLKSEIEDLKQQIADLKAKIAKLEEDVEHWKLENCKIKMEVDKLKDDLENALKELNDCKAAKEACQAEVNRLKAEKADLEKKLADLQSEIEQLKQSLEAEKSARSKCEAELAALRNELDALKKELENLRDESRRCKGELEELRKKISELTNDLNKCKEEVNALRDANEKLKAEVNALNNEKNKLQEERDRLKNEAESLRAENADLRQDRDKLANETNRLRGEGDGQKSELDNLRSDLSAQKAAVQKLQDDLKKCQDENDRLKQQLAEMEKLKDEKNRLEKALADAEAKAKALEDQLKDLQAEKQELLNELQRLRDELGRLMNEIDKHKAATEAAMKELAALKEELSNLRATLDKARAENEALNKENEKLKSDMAKLNQQLQALQEENAKLKTENEQLASELAAAKEKLKEAEDMLNNLRIENDDLKKRIADLENTVKELETLKKQLEDCRAELARLKAELEKLKEENENLKNQLRDAQNELKEVKDLKDKLQQELDALKADHDKLKDELEKLKEINEKLKDELDALKALNDKLKDDLKNQKAINDKLKEEAEKLKNELEAQKAANDMLKDEAEKLKKELEAQKAANEKLKDDQKKLLADIDKLKDAGRSSEAGLTNELDRLKKENNDLKSEIERLKKENDELKSEIERLKKELNNCQDYNDRLRREMEGCKDELTKLKAAADQDTVTLRSLADDCGDYIRANDLLREKLDKQNEGVQRVRDYITFLEGKVENEPKMADELEDDSSIDPEMKKEIEELLKKSRDLSENIYNTEKEIQNIAAILKQLKDVPVSDKAFDPDS
WLNSLKLTQLADLHDKICLLTSDMVHQDKPRAVSCVPGSPLSADYNILNQRIAALQKQIAEKQMETGWKLEELKRALRQEQANLIRISDEMNLERKRNLALQLGMDDAS
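
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. As a quick consistency check, the sketch below translates the first 30 nucleotides of the coding sequence and its trailing 12 nucleotides (read in the frame that ends on the stop codon). It assumes the standard genetic code; the record does not say which translation table was used. `translate` and `CODON_TABLE` are names introduced for this example.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (assumption: the record uses the standard code).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, reading from its first base.

    Unknown codons become 'X'. With to_stop=True, translation ends
    at the first stop codon, which is not included in the result.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGGAATCGATGAATCCACCGAACGAGCCA"  # first 30 nt of the coding sequence
tail = "GATGCATCTTAA"                    # last 12 nt, ending in the stop codon

print(translate(head))  # compare with the start of the protein sequence
print(translate(tail))  # compare with the end of the protein sequence
```

The two results can be compared with the first ten and last three residues of the protein sequence above.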

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2