Basic Information

Gene Symbol
-
Assembly
GCA_947507545.1
Location
OX382187.1:2436903-2444246[+]
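
The Location value appears to follow a `contig:start-end[strand]` pattern. A minimal sketch of splitting it into fields is shown below. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc):
    """Split a 'contig:start-end[strand]' string into its parts."""
    m = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    contig, start, end, strand = m.groups()
    return {"contig": contig, "start": int(start), "end": int(end), "strand": strand}

loc = parse_location("OX382187.1:2436903-2444246[+]")
# Assumption: coordinates are 1-based and inclusive.
span = loc["end"] - loc["start"] + 1
print(loc["contig"], loc["strand"], span)
```

The genomic span computed this way covers the whole locus and so may include introns; it is not the coding sequence length.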

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins act as homo- or heterodimers. Each subunit contains a highly basic DNA-binding region adjacent to an α-helical region, and the helices of the two subunits fold together as a coiled coil.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 42 4 3.8e+03 -1.7 0.2 31 57 62 88 57 93 0.66
2 42 0.32 3.1e+02 1.8 0.4 39 62 108 131 103 134 0.60
3 42 8.3 7.9e+03 -2.7 0.4 40 55 161 176 147 182 0.55
4 42 0.0016 1.5 9.2 0.6 30 57 223 250 214 252 0.84
5 42 6.5 6.2e+03 -2.4 0.0 28 59 385 416 383 421 0.68
6 42 0.0098 9.4 6.7 0.1 33 64 471 502 468 503 0.89
7 42 0.017 16 5.9 7.7 27 63 503 539 496 541 0.89
8 42 0.0021 2 8.8 2.4 28 60 532 564 529 573 0.77
9 42 2.3 2.2e+03 -0.9 0.0 27 50 566 589 564 593 0.76
10 42 0.028 27 5.2 2.6 22 60 596 634 595 636 0.93
11 42 0.02 19 5.7 5.8 21 62 630 671 629 674 0.91
12 42 1.1e-05 0.011 16.1 7.2 24 60 675 711 674 716 0.92
13 42 0.28 2.7e+02 2.0 3.2 32 60 718 746 715 751 0.76
14 42 0.0019 1.8 8.9 4.2 29 64 757 792 751 804 0.83
15 42 0.00089 0.85 10.0 1.3 23 62 772 811 770 813 0.89
16 42 5.9e-05 0.056 13.8 1.3 29 62 806 839 800 842 0.88
17 42 0.022 21 5.5 0.0 37 65 842 870 838 870 0.88
18 42 0.001 0.99 9.8 1.4 27 64 874 911 871 912 0.91
19 42 0.0054 5.2 7.5 3.0 24 62 913 951 910 964 0.50
20 42 0.33 3.2e+02 1.8 2.1 29 62 974 1007 969 1016 0.60
21 42 0.0043 4.1 7.8 0.3 24 62 1046 1084 1043 1087 0.89
22 42 0.0014 1.4 9.4 0.5 24 63 1095 1134 1094 1135 0.92
23 42 3.2 3.1e+03 -1.4 3.7 32 61 1166 1195 1141 1199 0.80
24 42 5.9e-06 0.0056 17.0 1.0 29 61 1205 1237 1201 1258 0.92
25 42 0.0013 1.3 9.5 1.3 24 63 1263 1302 1260 1303 0.94
26 42 0.009 8.5 6.8 4.1 25 63 1306 1344 1304 1346 0.87
27 42 0.0047 4.5 7.7 0.2 28 63 1344 1379 1341 1381 0.88
28 42 0.00081 0.77 10.1 1.7 29 62 1394 1427 1389 1429 0.88
29 42 0.012 11 6.4 7.9 22 64 1415 1457 1414 1458 0.92
30 42 5.9 5.6e+03 -2.2 0.0 47 61 1475 1489 1470 1492 0.79
31 42 0.0041 3.9 7.9 5.2 19 63 1496 1540 1494 1542 0.91
32 42 0.016 15 6.0 3.0 24 64 1543 1583 1540 1584 0.92
33 42 0.025 24 5.3 3.7 26 64 1580 1618 1577 1619 0.84
34 42 0.76 7.2e+02 0.6 7.1 31 64 1662 1695 1620 1696 0.87
35 42 0.1 99 3.4 4.0 22 56 1737 1771 1729 1775 0.60
36 42 0.014 13 6.2 1.7 28 60 1778 1810 1772 1814 0.89
37 42 0.0088 8.4 6.8 3.4 22 63 1827 1868 1826 1870 0.86
38 42 1.3 1.2e+03 -0.1 1.6 23 63 1870 1910 1868 1914 0.71
39 42 5.3 5.1e+03 -2.1 0.5 26 58 1901 1933 1896 1935 0.53
40 42 1e-06 0.00097 19.4 0.5 21 62 1959 2000 1958 2003 0.92
41 42 0.44 4.2e+02 1.4 0.2 28 56 2103 2131 2100 2140 0.74
42 42 1.6 1.5e+03 -0.4 1.0 26 50 2195 2234 2185 2236 0.55
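
Each row above is one per-domain hit with 13 whitespace-separated fields: domain number, total domains, conditional E-value, independent E-value, bit score, bias, then the start and end coordinates on the HMM, the alignment, and the envelope, and finally the alignment accuracy. The sketch below parses a few of these rows and picks out the strongest hits. The names `DomainHit` and `parse_hit` are hypothetical, and the 0.01 i-Evalue cutoff is only an illustrative assumption, not a threshold stated by this record.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # domain number ("#")
    total: int       # total domains reported ("of")
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # mean posterior probability of aligned residues

def parse_hit(line):
    """Parse one whitespace-separated per-domain row."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(int(f[0]), int(f[1]),
                     float(f[2]), float(f[3]), float(f[4]), float(f[5]),
                     *map(int, f[6:12]), float(f[12]))

# Four rows copied from the table above.
rows = [
    "4 42 0.0016 1.5 9.2 0.6 30 57 223 250 214 252 0.84",
    "12 42 1.1e-05 0.011 16.1 7.2 24 60 675 711 674 716 0.92",
    "24 42 5.9e-06 0.0056 17.0 1.0 29 61 1205 1237 1201 1258 0.92",
    "40 42 1e-06 0.00097 19.4 0.5 21 62 1959 2000 1958 2003 0.92",
]
hits = [parse_hit(r) for r in rows]

best = min(hits, key=lambda h: h.i_evalue)
# Assumption: 0.01 is an illustrative cutoff only.
confident = [h for h in hits if h.i_evalue <= 0.01]
print(best.index, [h.index for h in confident])
```

Sorting by i-Evalue rather than c-Evalue is a design choice here: the independent E-value treats each domain as if it were the only one found, which makes it the more conservative of the two.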

Sequence Information

Coding Sequence
ATGGCTTGCGATCGTTGTCCCATGTCCCGTTGTCCTTGTGGTTGCAATGCTCCAAAGACGGGAAAATTCACGGAGCCACCGCACGAGCCTTGTTGTTGCTGCAACTACAATCCCTTCAGCGACGATTCCAAGGAATCCGAGATTCACGATCTGTCATTCGCCCTGCGAAAGCTCACGGTGATGAAGTGTCAGATGAAAAAGTGGCGCATGGAGCGTCTGCAGTTGGAAAGTGAATGCAGATCGATGAAGGAAGCTCTTCAGTCATTCGGTGTAAACACGGACGAAGGCCTGAAAACGGATCCCCTGATTGTTCACTTCAGAGCCGAGGCCATGAGACTGCAAAATGCGAACGAGCTCCTTGATGAGAAAATAAAAGTGCTCGAGGAAACATTGGCTGAGCGTGACCTCGACGAGGACCCCTGCGAAGGGGTTCTATCGATGAGAGAGAGGATGAGAACGCTTAGGGACCAATTTCTCGGCGAGAAAAAACAGTTAAAAGACACGATCACACAGTTGCGAATACAATTGGCCGAAGCCGAAGACGACACTTCCTGTGCTGCGTTGAATCGCTTGAGGGCAAAGCTTCGGGAGTTGATGAAGGGTGAACAGAAAGCCGACAAGCAGGTCACCAAAGTTGTGCAAAAGTCCATCGAGACCCTCGTGGATTTGTCGAAGGGCTGTGACGATTTGAAACTGGAGAACGAAAAGCTCCTTGCTGAAGTGCACACTCTGCGGAAGGCCCTGAAGAACCTGCCTCCGGATGACAGGTCCATTCGCATCGACGACAAAGAGTTGGTACCAATCGTAACTGAAGACAAAGAACCTGCAGCAACGACAATCAAAGGAATCCCCTCTTCAGAAGTGCCGAAGAGGTTGGAACCCGATGACTGCTCAGAGTACCAAGACTTGATTAGCGACCTGAAGAAACGTTTGGCCGAGAGTGAGATGGTGACAGCAGATTTGCGGATGCAACTCAAGGAACGCAGTGACACTGTTGCGCAGTTGTCGGTGGATCTCGACAGAATGACCGTCAGTCATCAAGCCATGATGACAGAAGTAACGGACGCTAAGAACGAGTTGATGAAGCGAGACCAGAAGATCATGGAGCTGCTGGAGAACGTGAGTAGAACGTCCCTCGATGTGTTAGGCATGAACGAGTTGAAGAACGACATCGACAAGATGAAGCCGCAGCTGTACGACCTCCAAATCGAGAGAGACAACCTTTTGGACGAGCTGCACAAAGTGCGTGGCGTTGTGTCGGATCGCAATGACCAAATCATCGAGGTATTGGAAAAACGGGACAAGGCGATCGGCGACTACACCGCGAAAATCGAGGCTATGCAAGCCAGGATAGACGCGTTGCTTGGACAGGAAGTCATATTGAAAAACGAGGCTGGCGATGCTAACGTGCAAATTGCGGAACTGGAGGACGAAATTGCCCGACTCAAATCGCGGATTGATGAGCTGGAGAAAGGGTGCGGTGACACCGAGGCACTTAACCACAAATTGGAAGAATTGAACGATGCGTTGAAGGAATTGCATAAAGAGAACAGCGCTCTTCGAGATGAGAATACCAATCTGACGTTGACGAACGAAAAGCTTAAGGAGAACGCGGCTAAGCTGAAGAGCGAGAGCGAGAAGCTTTCCGCGGAGATTGATAAATTGCGGATGGAAAATGACACACTCGGCGGCACGGTTGACGGCCTACAAAAAGATTTGGCTGACGCAAGGGCGGAAAACAATGCGCTGGGTGAACAGCTGAGCACCATGAAGGATGCGGATGGGAAACAGAAGGATAATGAAAATAATCTGAAGGGGCGGTTGGATAATCTGACAGAGCAATTGAAGTCGCTCGGTGAGGAAAACGCCATTCTAAAAAGCGACTTCAACAAGTTAAAGGCTGAAAGTGAGGGTTTGGAAAAACAGCTACGAAGTTTGACGGAAGATTTTAATAACCTCGAGGAAAAGAACAGGATGCTTAATAAAGAGCTCGATAAAGG
TGAAGCAGAATTGTTGGAAATTAAAGAGAAAAACGATTCATTGAGAAAGGAGCTGGAAGACCTGAGGAAGGAAAATGAAAAGCTTACGAGCGAAGCTGATGAACTACGCGAAGCTCTTGAAGAGATTAATAAAGCGTTTGCGAAGTGCAAGAAAGAAAACGCCGACTACGAACAAGAGTTGCAGCAGCTAAAGGCAAAGAATTTGTCTTTGAATGCGGAACTGGATAATTCGAAATACGAAAACGATTCCCTGAAGCCGAAATTAATTGAATTAAACAACGAGTTCGATAAGCTGAAGACCGAGAACGACGCTCTCCGGGCTCAAAATTCCACTTTGCAAACCAAAGTCGACGCAATGAAATTAGAAAATAATGAGTTGCAACGAGTTGCCGGTGAACTTAAAGCCGAGTTAGGAAACACTCATGGTGAATTAGCGAAACTTCGGGACGAAAATAGTCGGCTAAAGGCCGAAGTTCGCAAACTCGAAGAAGCTAATGCGGGTTTGAAAGTCGATTTCGACAAGGCCACTTCGGTTAATGACCAATTGAAAGCACAAGTGGCTTCTCTAGCGAGTGACCTTGACGCGACGAGAGAAAAGTTCGATCAATTGAACGCGGACAACAGTGCATTAAGGGATTCGGTGGAAAAGCTTGAAGCGGAGAATGATAATCTCAAAGCCGATGCAGTCAAGTTACGTGAAGATTCGGAAAGTATAAAAGAAATGTTGGAGAAGATGAAGAACGAAAGATCCGCCCTCAAAGATGAAATGGACAAGCTGCTGAGCGATAATACAATCCTAAAAGACAATCTCGCCAAGTTGACCGGGGAAAACGACCAATTAAAGACTGAAGTTGATACCCGGAAGTCAGAAATTGAAAAACTTAAGGCCGCTGGTGATGGAGAAATGCAAAATGGTAAGCGTTTGGAGGCCGAAATCGATGATCTGAAACACGAAAGAAATGGGATGAAGCTTGAGTTGGATGAGTCACGATCCGAGAATCTCGCCCTTAAGGGAGAAAGGGATGCTTTGAAGCAAGCACTCGAAGATTCGATAGCTGGTAGCAAGAGTGTTAAGGCCGCACTTGACAATTTGAAGCTGGAGAACGAAAAAATCCAGGCTGACCTTGAAGCTACCAAAGCGGACAGAGATAAGCTCGGAGCGGAAGCTGCAAAATTGAAAGCGGATGCAGATGCCCTCGGAGACAAGGTGAAAAGTTTAACGGAACAACTACAAAAATGTGCGGGCGAAGTGGAGAACGTGAAGAGCGAGCTCAGCGATGCGAAGAAGACTATTGGGACTCTCGAGTCTCAGCTCGTTGACCTCAACGCGGACAAAGCTCAGCTTGAGAAGGATATGAAAGCACTCAATGCCGCGTACGAGGATTTGAAAGGTCAATTGAATTCGGAAAAAGAGGGTAAACAAGCTGCGCAACTGGAGCTGACGGCCTTGAAGGGTGAACTGAAAGAGTTGCTGGGAAAACTGGACAATTTTCACTCGGAGAAGACCAATGAAAGTGCTGAGAAAGAGGAATTGAAGAAAAAGCTCATTGAAATGCAGGAGCATCTTTCGAAAGTTTCGGCGGAAAACGTTGATCTTGCGAAAAGCTTTAAAACTTTAAGCTCCGAAGTGGCAGCGCTGAAGAACGAAAACGAAGCACTGCGTGCCGAAGTTGGCCAACTTGAAAATGACCTGGATCACTTTAAAACGGAAAATTGTAAGCTGAAGATGAGTGCTGAAAAGTGCTCCAGTGATGTTGAAAAATTGAAGAAGGATTTGGATGCTAGTCGAGCGAAAATTAAGGAGCTTGAGGACCAATTAGGGAACTTGGAGGCAGAGAAAAAACACACTTCACCGACGATCGACCGTCTGGAAAATGAACTATCGACCTTGAAGGAGAATCTGACTTCCGAGAAGACTGCCAAGGAAAATTGTCTTAAGAGGATTGCCGCATTGGAGGAAGAACTCGAGGCTTTGAAAGCTGAACTATTGAAATGCAGGA
ATGAAAAGGACCGACTCGGAAACAACGTTGACGATTTGAAAAAACAACAAGCCGTATTCGACGCCGAAGCGGATAAATTACGCGCCGAAATAGGAGCTTTGCGAGGGCAGAATGACAAATTGAAGTCAGATCTAGAACGAACGGTAACTGATTCCAATGGCGCCAAAGGCGATTACAAGCGATTGCAAGACACATCGGAATCTTTGAAATCTGCGAACGCCACCCTGAAGAGTGAAAATGATAAACTGAACGCTGCTAACGACGAACTGAAAGCCGAGCTCGGGCTTGTGAAATCCGAGAAGGAGAAATTAAAGGCTGAGTTGAGTGAAGTGAGAAATGAAGCTGCAAAGCTCAAAGACAACTTGGAACAATGTCGTGTAGACGGTGATGCTCTCAAGGCTGATTTGGGCAAAGCGAATGACGAAATAAAAGGGTTAAGAAGCGAACTCGACAAACTAAAATCCCAATCAGCTGACACACTGAAAGCTCGCCAACATTCAGACTCCAAAACGAAAAATCTCGAGACCCAACTGACGGCATCTCAAGAAGAAAAACGAAAACTTATCGAAGATCTCGGTGCCTTAAGAACCGAAACTGATAAGCTTCGAAAGGAGCTATCCGCTGATAAGGCAGCCAAGGAAAACCTTCTTAAAGAACACGCTGCAGCCAAGTCCGAGTTGGAAGCTTTGAAATCTGAGTTTGACAAAATACGGACCGATAACGAAACGTTGAAGAAAGAGTTGGATACCCTGAAGAAACAGCTGGAGAATTTGAATTCGGCGAATGCGCGTTTAACCGACGAGATCGATAGGGCGAAGGGCGAGTTAAATACTTTATTGCCGGAGGTTGAGAAGTTGAAGGGAGAACTGACGAAAGCGAGGGGCGAATTGGCCGATAAATTGGAAAAACTAGCCGAGAAAGAGAACGAAATCAAATTGGCGAAGCTTGAAAATGAGAAGCTCAAGAAATTATATACGGACAGCGAGAGCCGTGTCAAAGCTCTGGAAAATGAGCTGGGAGCTTTGCGAGCCGAGGAAGACAAACTCGAAATGGAAATAAAGGACTTGATAGGAGAGATCAGCAAACACGGCAAGAATTTGGCGGATGAAAAGGCAGCCAGAGCAGCCTCGTTGAAAGAGCTCGAGGCACTCAAGGATACGCTTGCCGGAATGAAAAACGATTTGGACAAGTGTAGAGCGGAAATGGCAAAAATGAAGAACGAACTCGACGGCGCTCGAGCTGAAACAGTGAAATTGCGCGAACAATCGAATAGATTAAAGGCGGAAAATAGCGGTTTACAGAATCAACTTGAAAAATCAAATGCGGATGCAAGCAAATTCCAATCTGATTTGGAAAAGCTCCAGGCGATGAACAGCAAACTAACGAGTGACCTTGAGACCGCGAAATCCGAGATCGAGAAGTTGAAGGATTTGGAAAATCTCAAAGCTGCTTATGAAAAGTCAATGACTGACCTTGAGAAGGAAAGGGCCGAAGCGAAAAAGTTGAAGGGTCAGTTGGACGGCCTGAAAAAGGAGAACGATGCATTAAAGAGTGATCTGGAACACACTAAAGCTGACACAGAGAAATTAAAGAGTGACTTGGACAAGGCTAATACCGAGATCGATAATCTCAAGGACGACAAGGAACGTTCCAAAGCGGAGGCTTTAAAGCTGAAGAATGATTTCGCCAAGGCGAAAATGGAGAACGATCGATTGAGCGACGAATTGCGGCGAGAGAAAGACAATTCCGCGAAATTAATGGAGGACAGTACCAAATTAAGAGGCGAAGTCGAGAAGCTTGAACTCGGAGCAGCGGCAGGACGAAAGGCAGCCAGAGCCTCCAATGCGGAAGTCAGTTTCTTAAACGACGAGATCGATCGTCTGACGGCTCAAAATGCTAAGTTGGAGAGTGAGATTGCGGATTGTCAGGCGGAAAAGACTGCTTTGAGGAATGACATCAAGAGGCTCGAAGCTGAAGTCGACAAAACAAGAGGTACTGTT
GCAAGGCGCCGGTCAAAGGCCATAGTAAAATCGGCAAAAGAACCGGATGTTAACCCTGACGAGTGTGGTGACTATGTGAGGTCAAACGAGTTGTTGGCTCGTTTTCTGGTAAAGCAGAATGAAGGTGTGGCGCGTGTTCGTGAGTACATTGGTCACATTGATGGCAGAGTTCCAACGAGACCCACGATGACACAAAGCCCCGACGAGGATGCGGATATCAGCGATTTTCTCGAGCCCGCCGTTGTCGAGACTCTACAAAAGTCCCAGGAACTATCGAGGAATATCGTCGGGGTCGAGGTCGATATACAGTCGATTGGAAATCGACTGAGCGATTGCAGAAAGGAGAACGACAGGTTGAAGAAATTACTCGAGGATCTCGGAGTCGACAATCAGGAGATCAAAGAGTTCCGCGAGGAAATGGGCGATGCCTTCGATGCCGAATCATGGCTGAGATCGTTGACGTTGTCGCAATTGGCCGAACTTCATGATCGCATCTGCTTGCTAACTTCGAGTATGGTGAAACAAGACAACAGCCAAAGTGGACAGTTGGACGAGGAGGACACCGAAGATCCACTCAAGGAAAACTACGACGCATTGAACAAACGGATAGCGGCATTACAGAAACAAATTGCCGGCAAACAGGCGGAAGCTGCGAAGAAAGTCTACGACATGAGGCAGGCTCTTCGTCAGGAACAGGCCCAGCTCATCGTGATCTCGGACGCGATGAATCGCGAGCGGAAACGTAATTTGGCTCTTCATCTTACCATGGATGGGTCAGCATGTTGGGTCAACAAATGCAAGGCCGAATGCAGTGGAAATATCGATCAGTTGATCGCTCTCGATGATCCTGAAGAGACTCTGTTGCTCGAAGACAAGACTGCGAAGAATCGCTGA
Protein Sequence
MACDRCPMSRCPCGCNAPKTGKFTEPPHEPCCCCNYNPFSDDSKESEIHDLSFALRKLTVMKCQMKKWRMERLQLESECRSMKEALQSFGVNTDEGLKTDPLIVHFRAEAMRLQNANELLDEKIKVLEETLAERDLDEDPCEGVLSMRERMRTLRDQFLGEKKQLKDTITQLRIQLAEAEDDTSCAALNRLRAKLRELMKGEQKADKQVTKVVQKSIETLVDLSKGCDDLKLENEKLLAEVHTLRKALKNLPPDDRSIRIDDKELVPIVTEDKEPAATTIKGIPSSEVPKRLEPDDCSEYQDLISDLKKRLAESEMVTADLRMQLKERSDTVAQLSVDLDRMTVSHQAMMTEVTDAKNELMKRDQKIMELLENVSRTSLDVLGMNELKNDIDKMKPQLYDLQIERDNLLDELHKVRGVVSDRNDQIIEVLEKRDKAIGDYTAKIEAMQARIDALLGQEVILKNEAGDANVQIAELEDEIARLKSRIDELEKGCGDTEALNHKLEELNDALKELHKENSALRDENTNLTLTNEKLKENAAKLKSESEKLSAEIDKLRMENDTLGGTVDGLQKDLADARAENNALGEQLSTMKDADGKQKDNENNLKGRLDNLTEQLKSLGEENAILKSDFNKLKAESEGLEKQLRSLTEDFNNLEEKNRMLNKELDKGEAELLEIKEKNDSLRKELEDLRKENEKLTSEADELREALEEINKAFAKCKKENADYEQELQQLKAKNLSLNAELDNSKYENDSLKPKLIELNNEFDKLKTENDALRAQNSTLQTKVDAMKLENNELQRVAGELKAELGNTHGELAKLRDENSRLKAEVRKLEEANAGLKVDFDKATSVNDQLKAQVASLASDLDATREKFDQLNADNSALRDSVEKLEAENDNLKADAVKLREDSESIKEMLEKMKNERSALKDEMDKLLSDNTILKDNLAKLTGENDQLKTEVDTRKSEIEKLKAAGDGEMQNGKRLEAEIDDLKHERNGMKLELDESRSENLALKGERDALKQALEDSIAGSKSVKAALDNLKLENEKIQADLEATKADRDKLGAEAAKLKADADALGDKVKSLTEQLQKCAGEVENVKSELSDAKKTIGTLESQLVDLNADKAQLEKDMKALNAAYEDLKGQLNSEKEGKQAAQLELTALKGELKELLGKLDNFHSEKTNESAEKEELKKKLIEMQEHLSKVSAENVDLAKSFKTLSSEVAALKNENEALRAEVGQLENDLDHFKTENCKLKMSAEKCSSDVEKLKKDLDASRAKIKELEDQLGNLEAEKKHTSPTIDRLENELSTLKENLTSEKTAKENCLKRIAALEEELEALKAELLKCRNEKDRLGNNVDDLKKQQAVFDAEADKLRAEIGALRGQNDKLKSDLERTVTDSNGAKGDYKRLQDTSESLKSANATLKSENDKLNAANDELKAELGLVKSEKEKLKAELSEVRNEAAKLKDNLEQCRVDGDALKADLGKANDEIKGLRSELDKLKSQSADTLKARQHSDSKTKNLETQLTASQEEKRKLIEDLGALRTETDKLRKELSADKAAKENLLKEHAAAKSELEALKSEFDKIRTDNETLKKELDTLKKQLENLNSANARLTDEIDRAKGELNTLLPEVEKLKGELTKARGELADKLEKLAEKENEIKLAKLENEKLKKLYTDSESRVKALENELGALRAEEDKLEMEIKDLIGEISKHGKNLADEKAARAASLKELEALKDTLAGMKNDLDKCRAEMAKMKNELDGARAETVKLREQSNRLKAENSGLQNQLEKSNADASKFQSDLEKLQAMNSKLTSDLETAKSEIEKLKDLENLKAAYEKSMTDLEKERAEAKKLKGQLDGLKKENDALKSDLEHTKADTEKLKSDLDKANTEIDNLKDDKERSKAEALKLKNDFAKAKMENDRLSDELRREKDNSAKLMEDSTKLRGEVEKLELGAAAGRKAARASNAEVSFLNDEIDRLTAQNAKLESEIADCQAEKTALRNDIKRLEAEVDKTRGTV
ARRRSKAIVKSAKEPDVNPDECGDYVRSNELLARFLVKQNEGVARVREYIGHIDGRVPTRPTMTQSPDEDADISDFLEPAVVETLQKSQELSRNIVGVEVDIQSIGNRLSDCRKENDRLKKLLEDLGVDNQEIKEFREEMGDAFDAESWLRSLTLSQLAELHDRICLLTSSMVKQDNSQSGQLDEEDTEDPLKENYDALNKRIAALQKQIAGKQAEAAKKVYDMRQALRQEQAQLIVISDAMNRERKRNLALHLTMDGSACWVNKCKAECSGNIDQLIALDDPEETLLLEDKTAKNR
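
The protein sequence above can be checked against the coding sequence by translating codons in frame. The sketch below does this for the first and last few codons only. It assumes the standard nuclear genetic code, which this record does not state explicitly, and the name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last four codons of the coding sequence above.
print(translate("ATGGCTTGCGATCGTTGTCCCATGTCCCGT"))
print(translate("AAGAATCGCTGA"))
```

To translate the full record, the line breaks inside the coding sequence would need to be removed first so that the reading frame is preserved.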

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-