Basic Information

Gene Symbol
-
Assembly
GCA_905147325.1
Location
LR990188.1:6025975-6057017[+]
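
The location string above packs a sequence accession, a coordinate range, and a strand into one field. A minimal sketch of how it could be split into parts follows. The names `Locus`, `parse_locus`, and `span_length` are hypothetical helpers, not part of any database API, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    seqid: str   # sequence accession, e.g. "LR990188.1"
    start: int   # leftmost coordinate as written in the record
    end: int     # rightmost coordinate as written in the record
    strand: str  # "+" or "-"


# Pattern for strings shaped like "ACCESSION:START-END[STRAND]".
_LOCUS_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(locus: Locus) -> int:
    """Genomic span in bp, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("LR990188.1:6025975-6057017[+]")
print(locus.seqid, locus.strand, span_length(locus))
```

The span covers the whole locus, introns included, so it is expected to be longer than the coding sequence listed further down.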

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  5   0.00017   0.084     13.1   0.3   30        63      330       363     324       364     0.89
2  5   0.0017    0.79      10.0   0.4   32        61      729       758     720       760     0.87
3  5   0.001     0.48      10.7   2.0   32        59      1276      1303    1251      1307    0.81
4  5   0.001     0.48      10.7   2.0   32        59      1736      1763    1711      1767    0.81
5  5   0.00039   0.19      12.0   0.1   32        61      2151      2180    2147      2183    0.87
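
The domain rows above can be read programmatically to rank hits or to pull out the aligned protein coordinates. The sketch below assumes the thirteen whitespace-separated columns appear in the order given in the header row; `DomainHit` and `parse_hits` are hypothetical names introduced for illustration and are not part of HMMER.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int       # domain number within the protein
    of: int          # total number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # mean posterior accuracy of the alignment


ROWS = """\
1 5 0.00017 0.084 13.1 0.3 30 63 330 363 324 364 0.89
2 5 0.0017 0.79 10.0 0.4 32 61 729 758 720 760 0.87
3 5 0.001 0.48 10.7 2.0 32 59 1276 1303 1251 1307 0.81
4 5 0.001 0.48 10.7 2.0 32 59 1736 1763 1711 1767 0.81
5 5 0.00039 0.19 12.0 0.1 32 61 2151 2180 2147 2183 0.87
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse domain rows; lines that are not 13-field data rows are skipped."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13 or not f[0].isdigit():
            continue  # header or blank line
        hits.append(DomainHit(
            int(f[0]), int(f[1]), float(f[2]), float(f[3]),
            float(f[4]), float(f[5]),
            int(f[6]), int(f[7]), int(f[8]), int(f[9]),
            int(f[10]), int(f[11]), float(f[12]),
        ))
    return hits


hits = parse_hits(ROWS)
best = min(hits, key=lambda h: h.i_evalue)  # lowest independent E-value
print(best.index, best.ali_from, best.ali_to, best.ali_to - best.ali_from + 1)
```

Sorting on the independent E-value is one reasonable choice for ranking; the bit score would give the same top hit for these five rows.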

Sequence Information

Coding Sequence
ATGGTCCGCTCGGGCCGAACCCCCTTTCCACGGTGGGCTCTCACCCGCCTGGACAGGGAGGCCGCTGAGGAGGCGGCGCTCCTCCAGGACTGGCTCGCGCCTCCACTCAGAGTCCTGGACGTGGACGCCGCAGCGGTCCAGCTGAGGGGGTCTCTGACCGAGAAAACTACTGACCATGGCATGAACAAAGGACCTGATTCGCAATCGGATTCGGACAAGAAGAAGGGTGCAGAACGCATTGACTCGGGGGAGTCCGCGTCTGCTTTTAGACGGGGAGGGGAGCTGATGGCTGCCCTCAAAGAATTAACGAGGACCGGAAGCGTGGAGTCGATGTCGTATGACAGTGACGACTCCACATTCTCGAGACCAGACAGTCGGGTGTCGAACAGCGGCTCGTACCGCAAGCGACCCCGACGAGAAGACTCGGACGATGACTTGGGCGGCTCGGGTGACGAGGAGCGCCCCGCCCAAAAGGCCCCAAAGGGGAAAAGCGGCAGAGGCCGCGCGTCCCCTGCACAGGTACGGGACTCAAAGGGACGCTTCCTGAAGCATTCTGATGCGTCGGGGAGTCAGCCCCAAGGGGAAGATCGGTCGCACTCGGGCGGCAGGAGCGACGGGAGCTCCGCCGCTCTGAAGAGCACCGACACCCCGAGAGGAAGANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGCCGAAATGGTCGCTGCGGAACGGGCGGCTATGGAGCTCCAGGCCGAGAAGGAGGTGGCGGAGTCGGCGCGCAAGGAGTATGAACTCCGGCCGATCCGCGTGCCTGATTACTTCGACGGGGCATCCACGCCCGAATTAAAGGGACATGTTCTCCGCAACTTGGACCTTATTAGCCAGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCTTATGAGCCGGTCCACCATCGCGTCGGAAACCATGAAATTGTCCATGGACAATGACCGGCTAAAGAAGGAGATAGAGGATCTCCGGAGTGAGATGGCCGTTTTGCGGGCTGAATTGCAGCGCCCGCGCAGCAACCCGGACGAGATAATCCCCCCACCGCCTCCAAGTGCTCAGTTTGAGCCTGAGTCGCTCAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATCGTCCTCTCGGGCCTTGTCGGCGGATGCCCCGCCGGCTAGAGTTAGCCCTATGTTGAAGCTGGAGGTCCCTCCGGCGCCCACTGACCTCGGTCCTCCCCCTCCCACGACTTCGCGTGAGGGAGGGAAGACCAAGTCCAAGGGAAAGGGTAAGGGGAAAAAGCGGGCGGTGGTAGCAAACCCCGCTCCGCCAGTAATGACAGAGGAGGAGAGACCAGCGGAGGCCTGGTCTACAGTGACCCGCCGCAAGGCGAAACCCAGGGCGGACCCGGCAGTGATGGATGGTAGGCGTGACTGCGTGATGTATGATTGGAAGTCGCCGAAAGGAGACGGCGATAATAATGGACCGAAACTGCATAACGAATTGGACTTGATGATTGGTGAAGAGCGTGCGGACTGCGAAAAGTCTGCGCTTGCTGGGGAAGGTGACGAGCGGCTTGCCGCTAGTAAGGAAACCGAGGAGAGGAGAGATGAGGCGGACTCCGACGCTGAGTCTCTGATGAGCGCGTACAGTGCGCAAAGTGAAGACGGATCTAACCGTCCACCCCGCAAACGTGGCAGGCCCCCTACCACTGGTGAACATGTGGGGAAGTGGAAAATCAAGAAGGAGATGGAGGCTGCCAAGCGTGCGAACTCACTGGAAGTGAGGATTGCCGAGGCGGACCGAATAGCAGCCGAAATCGCCATGAGCGAGGCAGCTAGCGACCGAGGCTCCCGTGCGGCATCGCGCGCCTCATC
TAGGCTTGACGCGGACGAACTGCCGGTGGAGGGCCTTAAGAAGAGGCTAGAAGAATCCCTCGCGGGAATTCGGGAGGTTTCGAGGGTGTCCAAGGGGCTTAAAGGGGTGTGCCAAAAGACGCTCAAGCAGGCGGCAAGCAGTATTGAAGACGTGGCAGAGGAACTGCTATCACGCACGGCCACTGATGAGATACGCCGCCTGCAAATTGAGAACGCCCGCCTGAAGAGGCAAATGGGCGAGCTCCGTAAAGAGCTCGACTGCGTGCGGGCTATGGTGATTGGAACCGGGGAAGAGAGGCCAGCAAACCCAACTGAGACCCCTAAATCTACGGAGTCGGACCTTGCGCAGGTCGTAAGGGAGGAGATAGGCAAGGTCCTCCAAAGGGTCCAGTTTCTGGAGAGCAAAATTCTCCGGCCGCCCTTGGCATCTGACAAGGTCAAACAGACGACACCCGCACCAGAAGTAAGGCGAGTCGAAAAGACACGTCCTTCTAAACCACCCGAGCGGGCGCCCACACTCGTGACGGAACAGACTCCGACCACGTCGACCGCCCCCAAAGCTGCCGCCAAAGTCCAGCGCGACAAAAAAGTTAACGGAAAGGAGAAGGCGAACAGCCAAAGGCAGAAGGCCACCATTGCCCAGCCTCAGGCGCCCGCTAGGCCCGATAGCGGAGAGTGGCAGGTGGTTGGTGCCAATAAGGCCGCCAAAAAGGCAAAGCGTTTATCGGGGGTGGGGTGCGTGCTGGACAACCTCCTCGTGGTGCACCCTGGGCACCGCCGACCACCTCTTCAGCATCCCTGTCGAGGTGGTCGACGGTGGCGGATAGTCCGGCTAGGGAGTCGCGTTTTGTCTGCTTCTGGCACTCTTTTCGGAGAGGGTCGGGGGCAGACCGGCGACGGGGGGCGTGATGTATGTAAGAGGAGAGGGTGCAATGTCCGCGAAGACTGGTGCTGCGGCCTGCCCAATGCGAGGGGTCTTGCGTGCCCGAACGCTGAAAGGCGGGGGGGCGAAACCCCGTCATTAGCGCAGGGGGGGATGGGTATAGGGACCTCACCCTTCAAAGGAAACAACAACAATATGGAGCAAGCTGATCAACACATAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGTCTTCGGGCGCACCCGACCTTGGCGATCCTAACGGACTCGATAATGCTTCTGATATGGCGAGTAGGCGTGTTGACCTGGTGAGGTCCGCGTCTGTTGGAAAAGGAGAGGAGCGGATTTCCGCCAGTAAGGATACGGAGAGGAGATTTGAGGTGGACTCTGACGCTGAGTCATTGATGAGCGCGTCTAGTGCTATGAGTTTAGACGAACGCGGCTCACTGCCCCGCAAACGAGGAAGGCCACCTACCACTGGCGAGCATGTAGGGAAATGGAAGGTTAAAAAGGAGATGGAGGCTGCCAAGCGGGCGAACTCACTTGAAGTGAGGATCGCTGAGGCTGATAGACTGGCAGCTGAAACTGCAATGAGTGAGGCAATTAGCGACCGAGGCTCCCGAGCGGTATCACGCGCTTCATTGAGACTCGACGCTGACGAACTGCCGGTTGAGGGTCTAAAAAAGAGGCTCCAAGAGTCCCTCGATGGGATCCGGGATGTGTCGAGGGTTTCTAAAGGCCTCAAAGGGGTGTGCCAAAAAGCACTCAAGCAGGCGGCGAACAATATCGAAGACGTGGCAGAGGAGCTGCTATCGCGTACGGCAACCGATGAAGTCCGCCGCCTGCAAATTGAGAACGCCCGTCTGAAGAGGCAGATGGCTGAACTGCGAAAAGAGCTCGACTGCGTCCGGGAAATGGTGGTTGGAGTCAGGGAAGAGAGTCGGCCTGCAAATCCTGCTGCAGCCCCTAAAACCACTGAGTCGGACCTTGCGCAGGTTGTAC
GAGAGGAGATCGGCAAGGTCCTGCAAAGGGTACAATTTCTGGAGGGCAAGATCCTTCGGCCGCCCTTGGCGTCTGACAAGGTCAGACAGCCGCCACCAGGCCCAGAAGTAAAGCAGGCACGACCTGCTAAAACATCCGAGCGGGCGCCCTCTTCCATGACGGTACCGGCACCCTCCACTTCGGCCGTTCCCAAAGTTGCCGCCAAAGTCGTGCGCGACAAAACAGAGAACGCAAAGGGGAAGGCGAACAGCAAAAGCCAGAAGGCCATTGCTGTCCAGCCTCAGCGTGATGTATGTAAGAGGAGAGGGTGCAATGTCCGCGAAGACTGGTGCTGCGGCCTGCCCAATGCGAGGGGTCTTGCGTGCCCGAACGCTGAAAGGCGGGGGGGCGAAACCCCGTCATTAGCGCTGGGGGGGATGGGTATAGGGACCTCACCCTTCAAAGGAAACAACAACAATATGGAGCAAGCTGATCAACACATAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGTCTTCGGGCGCACCCGACCTTGGCGATCCTAACGGACTCGATAATGCTTCTGATATGGCGAGTAGGCGTGTTGACCTGGTGAGGTCCGCGTCTGTTGGAAAAGGAGAGGAGCGGATTTCCGCCAGTAAGGATACGGAGAGGAGATTTGAGGTGGACTCTGACGCTGAGTCATTGATGAGCGCGTCTAGTGCTATGAGTTTAGACGAACGCGGCTCACTGCCCCGTAAACGAGGAAGGCCACCTACCACTGGCGAGCATGTAGGGAAATGGAAGGTTAAAAAGGAGATGGAGGCTGCCAAGCGGGCGAACTCACTTGAAGTGAGGATCGCTGAGGCTGATAGACTGGCAGCTGAAACTGCAATGAGTGAGGCAATTAGCGACCGAGGCTCCCGAGCAGTATCACGCGCTTCATTGAGACTCGACGCTGACGAACTGCCGGTTGAGGGTCTAAAAAAGAGGCTCCAAGAGTCCCTCGATGGGATCCGGGATGTGTCGAGGGTTTCTAAAGGCCTCAAAGGGGTGTGCCAAAAAGCACTCAAGCAGGCGGCGAACAATATCGAAGACGTGGCAGAGGAGCTGCTATCGCGTACGGCAACCGATGAAGTCCGCCGCCTGCAAATTGAGAACGCCCGTCTGAAGAGGCAGATGGCTGAACTGCGAAAAGAGCTCGACTGCGTCCGGGAAATGGTGGTTGGAGTCAGGGAAGAGAGTCGGCCTGCAAATCCTGCTGCAGCCCCTAAAACCACTGAGTCGGACCTTGCGCAGGTTGTACGAGAGGAGATCGGCAAGGTCCTGCAAAGGGTACAATTTCTGGAGAGCAAGATCCTTCGGCCGCCCTTGGCGTCTGACAAGGTCAGACAGCCGCCACCAGGCCCAGAAGTAAAGCAGGCACGTCCTGCTAAAACATCCGAGCGGGCGCCCTCTTCCATGACGGTACCGGCACCCTCCACTTCGGCCGTTCCCAAAGTTGCCGCCAAAGTCGTGCGCGACAAAACAGAGAACGCAAAGGGGAAGGCGAACAGCAAAAGGCAGAAGGCCATTGCTGTCCAGCCTCAGGTTAGGTTGGGGTGCGTGCTGGTTGGGGTGCGTGCTGGACAACCTCCTCGTGGTGCACCCTGGGCACCGCCGACCACCTCTTCAGCATCCCTGTCGAGGTGGTCGACGGTGGCGGATAGTCCGGCTAGGGAGTCGCGTTTTGTCTGCTTCTGGCACTCTCTTCGGAGAGGGTCGGGGGCAGACCGGCGACGGGGGGGCGATAATAATGGACCGAAACTGCATCACGAATTGGACTTGTTGATTGGCGAAGAGCGTGCGGACTACGAAAAGTCTGCGTCTGCTGGGGAAGGTTTGGAGCGGCTTGCCGCTAGTAAGGATACCGAGGCGAGAAGAGAT
GAGGCGGACTCTGACGCTGAGTCTCTGATGAGCACGTACAGTGCTCAAAGCGAAGACGGATCTAACCGTCCACCCCGCAAACGTGGCAGGCCCCCTACCACTGGTGAACATGTGGGCAAGTGGAAGATTAAGAAGGAGATGGAGGCTGCCAAGCGTGCGAATTCACTAGAAGTGAGGATTGCCGAGGCGGACCGTATAGCAGCCGAAATCGCCATGAGCGAGGCAGCTAGCGACCGAGGCTCCCGTGCGGTATCGCGCGCCTCATCGAGGCTTGACGTTGATGAGTTGCCGGTGGAGGGCCTAAAGAAGAGACTGGAAGAATCCCTCGCAGGAATTCGAGAGGTGTCGAGAGTGTCTAAGGGCCTCAAAGGGGTGTGCCAGAAGACGCTCAAACAGGCGGCGACCAGTATCGAAGACGTGGCAGAGGAGCTGCTATTGCGTACAGCAACCGATGAGGTCCGCCGCCTACAAATTGAGAACGCCCGCCTGAAGAGGCAAATGGGTGAACTCCGAAAAGAGCTCGACTGTGTTCGGGCAATGGTTATCGGAGTCAGGGAAGAGAGTCGGCCAGCAAATTCTGCTGCGGCCCCTAAATCCACTGAGTCGGACCTTGCGCAGGTAGTAAGGGAGGAGATTGGCAAGGTCCTCCAAAGGGTCCAGTTTCTGGAGAGCAAAATCCTCCGGCCGCCCTTGGCGTCTGACAAGATCAAACAGACGCCACCGGCCCCAGAAGTAAAGCAGGCACGTCCCGCTAAAACACCCGAGCGGGCGCCCTCATCCATGACGGCACCGGCACCAACCACCTCGGCCGCTTCCAAAGTAGCCGCCAAAGGCGCGCGCGACAGAACAGGGAATGGAAAGGAGAAGGCGAACAGCGAAAGGCAGAAGGCCATTGCTGTCCAGCCTCAGGTGCCTACTGAACCCAATAGCGGAGAGTGGCAGGTAGTTGGCGCCAAGAAAGCTGCCAAAAAGGCGAAGCAGAAGAGAAGAAGGACAAGAAATAAGCTCAGGGCACCACGCTCGACGGCAGTGGTCCTCACTCTCGCACCTGAAGCCGAAAAGCGCGGCGTCACGTATGCTGCGCTGTTGCATAGTGCCAGGGCGTGGCCTGCGGGCGTCGGGGAGGGCAACCCGGCGTCCGACCAGTAA
Protein Sequence
MVRSGRTPFPRWALTRLDREAAEEAALLQDWLAPPLRVLDVDAAAVQLRGSLTEKTTDHGMNKGPDSQSDSDKKKGAERIDSGESASAFRRGGELMAALKELTRTGSVESMSYDSDDSTFSRPDSRVSNSGSYRKRPRREDSDDDLGGSGDEERPAQKAPKGKSGRGRASPAQVRDSKGRFLKHSDASGSQPQGEDRSHSGGRSDGSSAALKSTDTPRGRXXXXXXXXXXXXXXXXAEMVAAERAAMELQAEKEVAESARKEYELRPIRVPDYFDGASTPELKGHVLRNLDLISQXXXXXXXXXXXXXXXXXXXXXXXXXXXXLMSRSTIASETMKLSMDNDRLKKEIEDLRSEMAVLRAELQRPRSNPDEIIPPPPPSAQFEPESLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSSSRALSADAPPARVSPMLKLEVPPAPTDLGPPPPTTSREGGKTKSKGKGKGKKRAVVANPAPPVMTEEERPAEAWSTVTRRKAKPRADPAVMDGRRDCVMYDWKSPKGDGDNNGPKLHNELDLMIGEERADCEKSALAGEGDERLAASKETEERRDEADSDAESLMSAYSAQSEDGSNRPPRKRGRPPTTGEHVGKWKIKKEMEAAKRANSLEVRIAEADRIAAEIAMSEAASDRGSRAASRASSRLDADELPVEGLKKRLEESLAGIREVSRVSKGLKGVCQKTLKQAASSIEDVAEELLSRTATDEIRRLQIENARLKRQMGELRKELDCVRAMVIGTGEERPANPTETPKSTESDLAQVVREEIGKVLQRVQFLESKILRPPLASDKVKQTTPAPEVRRVEKTRPSKPPERAPTLVTEQTPTTSTAPKAAAKVQRDKKVNGKEKANSQRQKATIAQPQAPARPDSGEWQVVGANKAAKKAKRLSGVGCVLDNLLVVHPGHRRPPLQHPCRGGRRWRIVRLGSRVLSASGTLFGEGRGQTGDGGRDVCKRRGCNVREDWCCGLPNARGLACPNAERRGGETPSLAQGGMGIGTSPFKGNNNNMEQADQHIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSSGAPDLGDPNGLDNASDMASRRVDLVRSASVGKGEERISASKDTERRFEVDSDAESLMSASSAMSLDERGSLPRKRGRPPTTGEHVGKWKVKKEMEAAKRANSLEVRIAEADRLAAETAMSEAISDRGSRAVSRASLRLDADELPVEGLKKRLQESLDGIRDVSRVSKGLKGVCQKALKQAANNIEDVAEELLSRTATDEVRRLQIENARLKRQMAELRKELDCVREMVVGVREESRPANPAAAPKTTESDLAQVVREEIGKVLQRVQFLEGKILRPPLASDKVRQPPPGPEVKQARPAKTSERAPSSMTVPAPSTSAVPKVAAKVVRDKTENAKGKANSKSQKAIAVQPQRDVCKRRGCNVREDWCCGLPNARGLACPNAERRGGETPSLALGGMGIGTSPFKGNNNNMEQADQHIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSSGAPDLGDPNGLDNASDMASRRVDLVRSASVGKGEERISASKDTERRFEVDSDAESLMSASSAMSLDERGSLPRKRGRPPTTGEHVGKWKVKKEMEAAKRANSLEVRIAEADRLAAETAMSEAISDRGSRAVSRASLRLDADELPVEGLKKRLQESLDGIRDVSRVSKGLKGVCQKALKQAANNIEDVAEELLSRTATDEVRRLQIENARLKRQMAELRKELDCVREMVVGVREESRPANPAAAPKTTESDLAQVVREEIGKVLQRVQFLESKILRPPLASDKVRQPPPGPEVKQARPAKTSERAPSSMTVPAPSTSAVPKVAAKVVRDKTENAKGKANSKRQKAIAVQPQVRLGCVLVGVRAGQPPRGAPWAPPTTSSASLSRWSTVADSPARESRFVCFWHSLRRGSGADRRRGGDNNGPKLHHELDLLIGEERADYEKSASAGEGLERLAASKDTEARRD
EADSDAESLMSTYSAQSEDGSNRPPRKRGRPPTTGEHVGKWKIKKEMEAAKRANSLEVRIAEADRIAAEIAMSEAASDRGSRAVSRASSRLDVDELPVEGLKKRLEESLAGIREVSRVSKGLKGVCQKTLKQAATSIEDVAEELLLRTATDEVRRLQIENARLKRQMGELRKELDCVRAMVIGVREESRPANSAAAPKSTESDLAQVVREEIGKVLQRVQFLESKILRPPLASDKIKQTPPAPEVKQARPAKTPERAPSSMTAPAPTTSAASKVAAKGARDRTGNGKEKANSERQKAIAVQPQVPTEPNSGEWQVVGAKKAAKKAKQKRRRTRNKLRAPRSTAVVLTLAPEAEKRGVTYAALLHSARAWPAGVGEGNPASDQ
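
The runs of `N` in the coding sequence correspond to the runs of `X` in the protein sequence: a codon containing an undetermined base cannot be assigned an amino acid. The sketch below illustrates that relationship with a small translator that assumes the standard genetic code; it is not the pipeline that produced this record, and `translate` is a hypothetical helper. Whitespace is stripped first because the sequence is wrapped over several lines in this record.

```python
# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0.

    Any codon containing a character other than A, C, G, or T (for
    example N) becomes 'X'. Trailing bases that do not fill a codon
    are ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First seven codons of the coding sequence in this record.
print(translate("ATGGTCCGCTCGGGCCGAACC"))
# A gap of six undetermined bases between two ordinary codons.
print(translate("AGANNNNNNGGC"))
```

The first call reproduces the opening residues of the protein sequence listed above, and the second shows how each all-`N` codon yields one `X`.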

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-