Basic Information

Gene Symbol
-
Assembly
GCA_028455805.1
Location
CM052190.1:8509191-8519591[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 42 1.4 1.3e+03 -0.3 0.1 26 50 44 68 40 75 0.67
2 42 1.3 1.2e+03 -0.2 0.8 33 55 89 111 86 114 0.66
3 42 1.9 1.8e+03 -0.8 0.9 39 60 140 161 128 165 0.72
4 42 0.00057 0.53 10.5 1.8 30 60 203 233 196 236 0.90
5 42 5.1 4.8e+03 -2.1 1.3 52 63 283 294 258 321 0.57
6 42 0.0059 5.5 7.3 1.6 30 62 340 372 337 380 0.62
7 42 3.5 3.3e+03 -1.6 3.6 27 58 386 421 383 435 0.56
8 42 2.7e-05 0.025 14.8 3.9 24 63 467 506 464 507 0.94
9 42 0.012 11 6.3 3.8 24 63 495 534 494 536 0.93
10 42 9.9e-05 0.092 13.0 2.5 27 64 533 570 529 571 0.89
11 42 0.046 42 4.4 7.2 21 60 569 608 569 613 0.94
12 42 3.4e-05 0.032 14.4 2.5 24 63 621 660 620 662 0.94
13 42 0.17 1.6e+02 2.5 10.9 24 63 649 688 648 704 0.84
14 42 0.1 97 3.3 9.0 29 62 675 708 662 718 0.52
15 42 0.00012 0.12 12.6 2.9 26 63 714 751 709 753 0.77
16 42 0.022 21 5.4 5.4 22 61 766 805 765 811 0.64
17 42 0.0011 1 9.6 3.8 23 63 809 849 807 851 0.90
18 42 0.084 79 3.6 10.7 22 56 865 899 849 900 0.61
19 42 0.00076 0.71 10.1 17.3 22 65 872 915 864 915 0.91
20 42 0.00011 0.1 12.8 11.5 24 65 895 936 893 936 0.91
21 42 0.0021 2 8.7 3.9 28 57 927 956 926 964 0.87
22 42 1.7 1.6e+03 -0.6 0.1 37 64 964 991 957 992 0.79
23 42 0.00052 0.48 10.6 2.0 33 64 988 1019 982 1020 0.89
24 42 0.00032 0.3 11.3 2.9 26 61 995 1030 991 1047 0.83
25 42 0.87 8.1e+02 0.3 1.0 30 63 1034 1067 1027 1083 0.61
26 42 0.0066 6.1 7.1 7.5 24 63 1077 1116 1076 1122 0.81
27 42 0.048 45 4.3 9.2 26 65 1121 1160 1116 1160 0.87
28 42 0.022 21 5.4 7.8 25 58 1162 1202 1160 1206 0.61
29 42 2.8e-05 0.026 14.7 10.5 21 64 1186 1229 1182 1230 0.92
30 42 0.00082 0.77 10.0 7.8 21 58 1200 1237 1197 1241 0.63
31 42 0.0081 7.5 6.8 8.4 23 53 1258 1288 1256 1300 0.87
32 42 1.6e-05 0.015 15.5 5.9 26 65 1306 1345 1303 1345 0.90
33 42 0.016 15 5.9 9.6 24 65 1360 1401 1346 1401 0.91
34 42 5.2e-05 0.048 13.8 12.7 22 64 1386 1428 1385 1429 0.94
35 42 0.0014 1.3 9.3 4.1 24 59 1423 1458 1421 1460 0.92
36 42 0.044 41 4.5 10.9 28 63 1458 1493 1455 1495 0.93
37 42 0.00027 0.26 11.5 3.4 24 64 1503 1543 1500 1544 0.93
38 42 0.035 32 4.8 10.2 21 60 1563 1602 1560 1605 0.92
39 42 0.18 1.7e+02 2.5 5.9 21 60 1605 1644 1599 1648 0.83
40 42 1.5e-05 0.014 15.5 3.5 28 60 1651 1683 1647 1688 0.88
41 42 0.029 27 5.0 0.6 34 62 1678 1706 1677 1708 0.88
42 42 3 2.8e+03 -1.4 1.2 36 51 1896 1911 1875 1919 0.64

Sequence Information

Coding Sequence
ATGGAATCGATGAATCCACCGAACGAGCCATGTTGTTGCTGCAGTTACAATCCCTTCAGCGACAATGCGAAAGAATCGGAGATTTACGACCTCTCGTTTGCCCTGAGGAAACTGACCGTAATGAAGTGTCAGATGAAGAAATGGCGAATGGAACGACTTCAGTTCGAGAGCGAAAATAGGTCTTTGAAACAAGCCCTCCAGTCATTCGGTGTAAATGTGGATGAGATATTGAAGCCTGATCCGCTGCTCGTGCAGTCCCGGGAAGAAATCGAATGGCTGCAAAATGCAAACGCGGCGCTCGAAGATAAAGTGAGGGATCTAGAAGAAACCCTTGCCGAACGAGATTGGAGCGACGATCCTTGTGCATCGGTACACTATATCAGGGAGAAGATGAGAGATCTCCGGGAGCACTTTGCACTCGAAAAAAAAGAATTAAGGGACATGATATCGCAGTTGAAATTAAAACTCGCGCAGACTGAGGAGGACGTGAGCTGTCCTGCGATATATCGCCTGAGGGCGAAGCTCCGCGAACTGATGAAAGGAGGTCAAACAGCGGACCAGCAGGTCTCGAAAGTCGTGGAGAGATCGATTGAAACGTTGGTGGATCTGTCGAACAGCTGCGACGATCTGCGTCTTGAAAACGAGCAGCTTTTGGCCCAGATAGCCGAGCTACGTCGCCAATTGGCGGAACTTGAAAAAATGCAACCGAAAGTACCTGTGGAAGAAATGTTACGAACAGTCGAGACAACGACGGTTCCTGAATACATAGACGTCTCCGACCTGCTGAAGAAACTCGAAGACTGCAACCTCACTGTTTCCGAATTGAAACATCAATTAGACGAGAAAGATAAGCTTATCGAGGCGCTGAGGAAAGAACTTGAAAACATGGTAGATCAGCAGGCTTTGTTGGATGAGATCACTGCAATGAAGGGGGAACTCTCGAAAAGAGACGATAAGATGAGAGATCTTCTGAACGATATGAGGCAATCAGAGATAGACATGTTGGGCCTGAGCAACCTAACATCGGAACTGGACACCTTGAAGCCTCAATTATCTCAACTCGAGGATGAGAGGGACTCGTTGACAGACGAAGTGGCGAAGCTGCGAAAATTATTAGCCGAGAGGAACGATCAAATAATTGAGATACTGGAGCACAAGAACAGGTTAGAGCAGGAGCTGGCGGAGAAGGAGGCGGAGGCTCAGCGGATTATCGACAGCTTGAAAAAGGAGATGGACGACTTGCTGGCGCGAATGGCGAATTTGCAGGACGAGCTCGGCGAGCGTGATAAGCGAATCGCGGAGCTGGAGAAGTGTTGCACCGAGAGGGACGAGCTGTTAGAGAAATTACAGGCCGCGCAGGACGAGTTGGCTTCGCTGCGAGGCGAACTAGCGTCTGCGAAAGCTACGATAGAGGATCTCAAGGGTGAAGTTGATGACCTGAAGAAAGACAAGGAGAAACTGCTGAAAGAGTTGGACGAAGTCAAGGAACAGATGAATGCGTTGAGTGACCAACTAGCGGAGGAGAGAGCCGCGAAGGAGGCTCTACAAGAGGAGCTCGATTCTGCCCGGAAAGACCTCGAAGACCTGCGAAAAGACAACTCCGATCTGAAGGGTCAGCTGGAGCAAGCAATGGAGGAGAATGACAAGCTTAAAAAAGAGAACGATGCCTTCAAAGCTGAGCTGGATAACATGACCTCGGAGTTTGATAAACTGAAGAAAGCGAATGACGAGCTGCAGAAGAACTTGGACGCAGCGAAGCTGGAGAACGATAAGTTGAAGAACGATCTTGAAGAATCTCGACAAGAGATGGACAGGTTGCGAGCTGATGGCGATGCGTTGAAGAGTTTGGATGCGGATACAAAAGCTAAGGTGGACAAACTTCAATCGCAATTGGACGATTTGAAAGCGGAAAGGGACCGGTTGGCTGGCGAGAATGCGGACATGAAGGCCAAGAATTCGGAATTGGAACGGCAATTGGATGACGCGGTGAAAGCGTTGGAAAATATGAGAGCAGAGAATGCTGATCTGCTGGAGGAGCTGGAACGTTTGAAGGCAGAACTAGCGAGAGCCAAGAGCACGATTGAGCAATTGAAAGAGGAGATGGAGTCCCTGAAGAACGCGTTGGACAAGTGCGTGGGTGAGATGGAGAAGCTAACATCCGAGAATAAGGACCTTAAGGCAAAGAACGAGGCTCTTACGGCGGAGCTTGATCGATGCAAGGAAGATAGGGACTCGCTGAAGAACGACTTAGAGCGAACCAAGGCGGAACTGGACGGTGTGACCAACGAGCTGAACAAGCTGAAGGAGGATTACGGGTTGCTTAAACAAAATGTCGACCGACTGGAAGGTGAAAGGGATAAGCTGCAGGAGGAGTTGAACAATCTGAAGAACGGAATGAGGGAGCTTCAAGATGAAATAGATGCCGCAAAGCAGAGAGAAGCTGCGTTAAATTCCGAGCTTGATCGTGTTAAGAAGGAGAAGGATGCGTTAGCATCGGAATTGGGTAAACTGAAAGATGAAAATGCAGCTTTACAGGACGAGATCAAGGATCGATTGAATAAACAATTGAACGACTCGAATGCTGAGATCGAGAAGATGAAGGAAGAGCTGGCTAGGTTGAAGGACGAACTCGAGAAGCTGGGGGAAGAGAGAGACCAGCTGAAGCAGGACAACGAGAAACTGAATGCCGAAGTTGAACAACTGCGCAAAGAAAGAGACCAGCTGAAGCAGGACAACGAGAAACTGAATGCCGAAGTTGAACAACTGCGCAAAGAAAGAGACTCGTTAAACGACGATTTAAAGAAGTTGCGAGATGAGAATAACAGGTTACAAGGCGAGGTGAATGAACTGAGGGAACAGCTCAATGATGAGAGAAAGGCCAACGAGATCTTGAAGAAAGATTTGATGATGTTAGACAGTGAGGTGAAGGATTTGGGTAAGGCTCTTGATGAGGCCAGGGCGAAAAATGCTGCCTTGGCGGAGGATAACCAAGCACTTCAGTCAAAGCTGAAGAATTTGCAAGATGAGCTGGGCAGCTTGAGGGCAGAGTGCGCTGATTTGAAGTCGGAGATCGGAGACTTGAAGCAGGAAATCGGTGATATGAAGGCGACGATCGCTAAATTGGAGGAGGACCTGGAATATTGGAAGTTAGAGAACTGTAAGATCAAGATGGAGGTGGATAAACTGAAGGACGACTTGGAGAAGGCGTTAAAGGAGCTGAATGACTGCAAGGCCGCGAAGGAAGCGTGCCAGGCAGAGTTGAACCGTCTTAAAGCCGAGAGAGCCGATCTAGAGAAGAAACTCGCAGATTTACAGTCAGAAATTGAACAGTTGAAGAAGTCGCTCGAAGCAGAAAAATCGGCTAGAAGCAAGTGCGAGGAGGAGCTAGCAGCCCTGAGGAATGAGCTGGACGCGTTGAAGAAGGAGTTGGAAAACCTTCGAGAGGAGAGCAGAAGATGCAAAGGAGAATTGGAAGAGCTTAGAAAGAAGATCTCGGAGCTGACCAACGACCTGAACAATTGCAAAGAAGAAGTCAACGCGTTGAGAGATGCGAACGAAAAGTTGAAGGCGGAGGTGAACGCGTTGAACAACGAGAAGAACAAACTTCAAGAGGAGCGAGATAGATTGAAGAACGAAGCGGAATCTCTTCGGGCAGAGAACGCAGACCTTCGTCAAGATCGAGACAAGCTAGCGAACGAAACTAACAAGCTGAGGGGTGAGGGTGATGGACAGAAATCAGAGCTTGACAACTTGAGGTCTGACTTGTCTGCGCAGAAGGCTGCTGTTCAGGAGCTGCGAAACGATTTAAAGAAATGCCAGGATGAGAACGACAGGTTGAAACAGCAACTGGCTGAGATGGAGAAGTTGAAGGACGAGAAGAATCGATTGGAAAAGGCGTTAGCAGATGCGGAAGCTAAAGCGAAGGCTCTGGAAGGCCAGCTGAAAGATCTGCAGGCTGAGAAGCAAGAGCTGCTCAATGAGCTGCAGCGGCTGCGCGACGAGATAAGTCGCCTTATGAACGAGATTGACAAACATAAAGCTGCTGCTGAAGCAGCCATGAACGAACTGGCTGCCTTGAAGCAGGAGTTAAGCAATTTGAGAGCTATGCTGGATAAAGCCCGCGCCGAAAACGAAGCATTGAACAAAGAGAACGAGAAACTGAAGTCGGACATGGCGAAATTAAATCAGCAGCTGCAGGCATTGCAGGAAGAGAACGCGAAGCTGGCGAAGGAGAACGAGCAGTTGGTCTCGGAATTAGCAGCGGCGAAGGAGAAATTAAAAGAAGCTGAAGACCGGTTGAACGATCTGCGCAAGGAAAATGATGATTTGAAGAAGAGGATCGCTGATCTTGAAAATTCTGTGAAGGAGTTGGAGACGCTGAAGAAACAGCTGGAGGATTGCAGGGACGAGTTGGCGAGATTGAAGCTGGAATTGGAGAAATTGAAAGAGGACAATGAGAATTTGAAAAACCAGCTAAGGGATGCTCAAAATGAGTTGAAGGAGGTTAAGGACTTGAAGGATAAGCTGCAGCAGGAACTTGATGCTTTGAAGGCTGATCATGATAGATTGAAGGACGAGTTAGGGAAACTGAAGGAGATTAATGAGAAATTGAAGGACGAGTTCAACGACTTGAAGGCATTGAATGATAAATTGAAGGATGATTTGAAAAATCAGATGGCGATCAATGATAAGCTGAAGGATGAGGCAGAGAAGTTGAAGAAAGAACTTGAAGCGCAGAAGGCTGCCAATGATATGCTGAAGAATGAGGCAGAGAAGTTGAAGAAGGAAATTGAAGCGCAGAAGGCTGCCAATGATATGCTGAAGGATGAGGCAGAGAAGTTGAAGAAGGAAATTGAAGCGCAGAAGGCTGTCAATGATAAGTTGAAGGATGATCAGAAAAAGTTGTTGGCTGACATTGATAAATTAAAGAATGAGGGTAGAAGTTCTGAGGCAGGGTTAACGAATGAATTGGATAGGTTGAAGAAGGAAAATGATGACTTGAAGTCTGAAATTGAGAGGTTGAAAAGGGAATTGAATAATTGCCAGGATTATAATGATAAGCTGCGCAGAGAAATGGAAGGCTTCAAAGACGAACTTAAGAAACTGAAAGCTACTGCAGAGCAAGATACAGTCACTTTAAGAAGCCTAGCAGACGATTGTGGCGATTACATCAGGGCAAATGATTTGTTAAGGGAAAAACTTGACAAGCAAAACGAAGGTGTACAACGTGTCCGTGACTATATAACTTTCCTGGAGGGTAAAGTTGAAAATGAACCCAAAATGGCGGACGAATTGGAGGACGATTCCAGCATCGACCCAGAAATGAAAAAGGAGATAGGAGAGTTATTGACAAAGTCGCGGGATTTATCCGAAAATATTTATAACACTGAAAAAGAAATTCAGAATATTGCTGCTATTCTGAAACAACTTAAGGAACTTAAGGATGCTCCAGTTAGCGACAAGGCGTTTGATCCCGATTCGTGGCTTAACTCGTTGAAGTTAACGCAATTGGCAGAGCTCCACGACAAGATTTGTCTATTGACATCGGACATGGTGCATCAGGACAAGCCGAGGGCAGTTAGCTGTATGCCAGGGAGTCCATTAAGTGCGGATTACAATATTTTAAATCAGCGGATAGCCGCTCTGCAGAAACAGATAGCGGAGAAGCAAATGGAGACGGGATGGAAGCTAGAAGAGCTGAAGCGAGCTCTTCGGCAGGAGCAGGCTAACCTAATTCGAATCTCCGACGAAATGAATTTAGAGAGAAAACGTAATTTGGCCCTTCAGTTAGGAATGGATGATGCATCTTAA
Protein Sequence
MESMNPPNEPCCCCSYNPFSDNAKESEIYDLSFALRKLTVMKCQMKKWRMERLQFESENRSLKQALQSFGVNVDEILKPDPLLVQSREEIEWLQNANAALEDKVRDLEETLAERDWSDDPCASVHYIREKMRDLREHFALEKKELRDMISQLKLKLAQTEEDVSCPAIYRLRAKLRELMKGGQTADQQVSKVVERSIETLVDLSNSCDDLRLENEQLLAQIAELRRQLAELEKMQPKVPVEEMLRTVETTTVPEYIDVSDLLKKLEDCNLTVSELKHQLDEKDKLIEALRKELENMVDQQALLDEITAMKGELSKRDDKMRDLLNDMRQSEIDMLGLSNLTSELDTLKPQLSQLEDERDSLTDEVAKLRKLLAERNDQIIEILEHKNRLEQELAEKEAEAQRIIDSLKKEMDDLLARMANLQDELGERDKRIAELEKCCTERDELLEKLQAAQDELASLRGELASAKATIEDLKGEVDDLKKDKEKLLKELDEVKEQMNALSDQLAEERAAKEALQEELDSARKDLEDLRKDNSDLKGQLEQAMEENDKLKKENDAFKAELDNMTSEFDKLKKANDELQKNLDAAKLENDKLKNDLEESRQEMDRLRADGDALKSLDADTKAKVDKLQSQLDDLKAERDRLAGENADMKAKNSELERQLDDAVKALENMRAENADLLEELERLKAELARAKSTIEQLKEEMESLKNALDKCVGEMEKLTSENKDLKAKNEALTAELDRCKEDRDSLKNDLERTKAELDGVTNELNKLKEDYGLLKQNVDRLEGERDKLQEELNNLKNGMRELQDEIDAAKQREAALNSELDRVKKEKDALASELGKLKDENAALQDEIKDRLNKQLNDSNAEIEKMKEELARLKDELEKLGEERDQLKQDNEKLNAEVEQLRKERDQLKQDNEKLNAEVEQLRKERDSLNDDLKKLRDENNRLQGEVNELREQLNDERKANEILKKDLMMLDSEVKDLGKALDEARAKNAALAEDNQALQSKLKNLQDELGSLRAECADLKSEIGDLKQEIGDMKATIAKLEEDLEYWKLENCKIKMEVDKLKDDLEKALKELNDCKAAKEACQAELNRLKAERADLEKKLADLQSEIEQLKKSLEAEKSARSKCEEELAALRNELDALKKELENLREESRRCKGELEELRKKISELTNDLNNCKEEVNALRDANEKLKAEVNALNNEKNKLQEERDRLKNEAESLRAENADLRQDRDKLANETNKLRGEGDGQKSELDNLRSDLSAQKAAVQELRNDLKKCQDENDRLKQQLAEMEKLKDEKNRLEKALADAEAKAKALEGQLKDLQAEKQELLNELQRLRDEISRLMNEIDKHKAAAEAAMNELAALKQELSNLRAMLDKARAENEALNKENEKLKSDMAKLNQQLQALQEENAKLAKENEQLVSELAAAKEKLKEAEDRLNDLRKENDDLKKRIADLENSVKELETLKKQLEDCRDELARLKLELEKLKEDNENLKNQLRDAQNELKEVKDLKDKLQQELDALKADHDRLKDELGKLKEINEKLKDEFNDLKALNDKLKDDLKNQMAINDKLKDEAEKLKKELEAQKAANDMLKNEAEKLKKEIEAQKAANDMLKDEAEKLKKEIEAQKAVNDKLKDDQKKLLADIDKLKNEGRSSEAGLTNELDRLKKENDDLKSEIERLKRELNNCQDYNDKLRREMEGFKDELKKLKATAEQDTVTLRSLADDCGDYIRANDLLREKLDKQNEGVQRVRDYITFLEGKVENEPKMADELEDDSSIDPEMKKEIGELLTKSRDLSENIYNTEKEIQNIAAILKQLKELKDAPVSDKAFDPDSWLNSLKLTQLAELHDKICLLTSDMVHQDKPRAVSCMPGSPLSADYNILNQRIAALQKQIAEKQMETGWKLEELKRALRQEQANLIRISDEMNLERKRNLALQLGMDDAS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00860130;
90% Identity
iTF_00866453;
80% Identity
-