Basic Information

Gene Symbol
-
Assembly
GCA_948098915.1
Location
OX402546.1:34157-63992[-]

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domians group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 5 0.0036 4.6 8.2 0.4 33 62 223 252 219 254 0.83
2 5 0.0016 2.1 9.3 1.8 25 52 612 639 608 646 0.72
3 5 0.0036 4.6 8.2 2.3 33 61 917 945 907 948 0.89
4 5 0.0036 4.6 8.2 2.3 33 61 1311 1339 1301 1342 0.89
5 5 0.0036 4.6 8.2 0.4 33 62 1765 1794 1761 1796 0.83

Sequence Information

Coding Sequence
ATGTTGGTGGCAGGCATACCTGCGAATGCGAATTTCGTCGCCAAGGAAGCTGTGGTGGTCCTGAAGAGAGTGGACACCGTGGTGACCCCGGTGGCAGAAATGCTACACAAGACCAGATGTCCATCCCTGGTCAGTAGCTTCGGGAGTGCGGAGTCCCTGAACTCTGTTATGTCGGTCGTTTCCGACAGCCGGACTAGGTTCTGGCGAAAGAGGCCCAACGCCTCTAGGAGGAATAGTGGGAGCAGTGGGTCCGAGTCAGACCAAAACCCTCTCCCCGCAAGGACGGCCACGTCGGCCACAAAACGCGGTCGCGGTAGGCCCCCAACAACGGGTCAGTATGTCGGCTTGGCCAAAGCCAAGGCCGCATACGTAGCCGCCGAGCGTCAGGCGATAGAATTGGAGGCGGAGAAAGCGGTGGCGGCAGCAACGCCCGCATCGCGTTCTGACGCCGCAAATAAGTACCTGAGTACAGTGACGTCCTGGGGCCCGACGAGAGAGGCCCCAGCCGTCGCCTTGTCCACCCGGGTAAAGGAGAGCCTGACGGCGATTGAATTGGTCGGTAGCAAATCGGGCCACCTTAAAGGCACTTTTCAAAGTGCCTTAAAAAAGTCGGTGGAGGTGCTGCGAGAGGTGTTTGAGGAGTTGGCCTCTCGAACGCCTGCGGATGAGGTGAGTCGGCTCCAAGCGGAGAACCACCTCCTGAAGGGCCAGTTGGACCGGCTCGGTGAAGAGTTGGCAGCTCTCAGAGCCGAGGTGCGGAGCAACAGAGAGCGCGCTGCTCCCACTCCGGTGGTTGAGCTTGAAGCCTTAGGGCAGGATCTCCGAGCTTCTTTGGCAAGAGTCGATGCTCGGCTTGAGAGCCTGGAGCTAGACGGGCGGCTTTTGCCAGCTAAACGCCACCGTCCACCTTTGGCGGTAGATCGATCGATCGAAAGAGAGAAGGAAGTGGCGGCACCTGAGCCACCGGCCCCGGCCCCGGCCCCTAAGAAGGCTAGGCCCCTAGGACCGGCTCAAAAGCTTGCCCCAACTCTGGCTGCAACGGCGAATAAGAAGACGCCAGTGGCTCCAAAAGTTGCCGCCAGACCTGGCAAGAAGAAGGGGAAAGAGGGCAGTGGTACTGCACCCTCCCCAGCTCCCTCTCCTTCGGCTGACCCTGCTCCGACAAGCAAGGCTGGTCGGACTCAGGATGAGGAAGGGTGGCAGACGGTAGGCCCCCGGCGCAATAAGCAGCGCAAGGCCCCGGCAGACAAGGCGCCAAGGACATCGGCGCCCATCAAGCCAGTAGCATCCCAACCGGAGGTGCCCGTTTCGGCGGTCTGCCTGCATGGCAGAGCGGACGAGTCTTCGGACTCTGAAATGTCGGCCGACTTTGAGTCGGACGCCGGTCCAACGGCGCAATCGCGGAAGAGGGCTTTCCTCGAAGATGTGTCCAACTCGGACTCGTCTAAGGGGAAGGCCAAAAAGAAGGTCAAGGCCAAAAGGCGCTCCGCCATCGCCATTGTCGACACCCACGACGTTGGTCTGGCAGAGGCGGAGCGCCAGCTGAGCTTGgcaaggcgggcacaggtccagcaggacctagaggagcaggtcgctgccaaggagcagcaggcgcgaactgcgcgggccagggccgggtacggcgcgtctagcgtcctctacacggaccactccctggccgagctggagcgcatggctcaggaggatgcgggggagatcgtagaggtcgcctccaagtcctccaacctgaaggggacacttcagaagtcgctgaagtgtcgtgccgcgagcatgcgtggcatcgtcggtgagctggtgcagcgcaccgccactgaagagacgcggcagctgcaggcgcgggtcgaccgcctgcagtcggaggtcagccagctgcaccaaaagctggctgaagttacggcccggcctacccaggctgagcccaaatcggcggcagcgggcccctcctcctcctcctccaaccttgaggatattatccgaaaggttgttatggaggagagggcgttcacaagagcctgcctcgatggcatcgaggataggctcttgcccgagaaacgcctgcgtccgccactagcggcagacaaaagggcgaAAAAGCCGGGGCCACCTCCCCCAGAAAGGGAGGACGCCCAACTCCCTTCGACCACCCCATCCAACCAGTCGTGGTCGGAGGTGGTCAAGAAGGGAAAGGGCAAGGCGAAAGCGACCGCCGCccctaccacccctgccacctcatcctgcccgccccaggcggcagAGGCTTCGATTCGACGGAAGCGGCCGTCGTCCTTGTTCTTCGCGCCCGGGTGTTTTGACTCGGACACGGAGACCGAGGGAACGGCGACCGCAAAGGTCAGTACTGCCCGGAGAGGGAGGGGGAAAACCACGGTGGCGGGTCCTTCGACCCGTAAAGACTTTGCAAAGCCTGTGTCGTCGGATGGAGACCTGGACGAAATCCTTCGACGCAGGCAGTATAAGGGTACAGCGACGGAGGCTTCTTGCTCAGCAACCGACACCGGTGCTGAGTCAGATGCCGTGCCGATGGTAGGCGACCTGACGCAATTAAacgcgcaggagttgagggcccaggcaggtgagaagctcgcctgtattttggaggttgccagaaagtctgggaacctcaaaggcgagtttgttggccgcctcaaaagttccgcgtcaaccctgagggaggtcgtggatgccctggcgtgcaggagcgaagcggaggaaacgcgacagctgagggcaacaaacagccgcctgaagctggaagtagaagcgatgaagttggagttgaaagctctacgtcgtggctttactgaagccaggacggccgcaaccgccgccgccaccacggcagctacagaaacggaggctctccccgcaactcctttggggacggccgatctcttcgagaagctggagcggtccctaacaataaggttagggaacatgctggacgcccggatatcgggactcaaagatcagtctgtgccagcaccgccggtccgccctcctttaggggcCAAGAGATCGGCACAGGCAGCAGCGGCAGCGGCTGCCACCAAATCACCAAAGCCGGCAGCCAAGCCGGCTCCCAATCTCGCCCCTAAAGGGGCGGCCCACAATGAGGGACCCTCTACCCAGAGCGCAAGGCAGGCTGCTCCACAAGAGCAAACCTGGGCCAAAGTGGTGGgaaggggaaaaggaaagggaaagtcttcccttccctcttctgcacctgctgcccccgcggcccaggctccagctaagaagcaggcggctaacatggccgacaaggccaagaagacggccctgaaaccccataagtccgcggcagtcgtgctggctcttcagccggaagccgcggagaaaggggtgacgtacagcgccgtccttcaaatggcgcaggataaggtggatctcaaggagctggAGGCTTCGATTCGACGGAAGCGGCCGTCGTCCTTGTTCTTCGCGCCCGGGTGTTTTGACTCGGACACGGAGACCGAGGGAACAGCGACCGCTAAGGTCAGTACTGCCCGGAGAGGGAGGGGAAAAACCACGGTGGCGGGTCCTTCGACCCGTAAAGACTTCGTAAAGCCTGTGTCGTCGGATGGAGACCTGGACGAAATCCTTCGACGCAGGCAGTATAAGGGTACAGCGACGGAGGCTTCTTGCTCGGCAACCGACACCGGCGCTGAGTCAGATGCCGTGCCGATGGTAGGCGACCTCACGCAATTAAacgcgcaggagttgagggcccaggcaggtgagaagctcgcctgtattttggaggttgccagaaagtctgggaacctcaaaggcgagtttgttggccgcctcaaaagttccgcgtcaaccctgagggaggtcgtggatgccctggcgtgcaggagcgaagcggaggaaacgcgacagctgagggcaacaaatagccgcctaaagctggaagtggaagccatgaagttggagctgaaagctctacgtcgtggcttcactgaagccaggacggccgcaaccgccgccgccaccacggcagctacagagacggaggctctccccgcaactcctttggggacggccgatctcttcgagaagctggagcggtccctaacaataaggttagggaacatgctggacgcccggatatcgggactcaaggatcggtctgtgccagcaccgccggtccgccctcctttaggggcCAAGAGAACGGCACCGGCAGCAGCGGCAGCGGCTGCCACCAAACCACCAAAGCCGGCAGCCAAGCCGGCTCCCGACTTCGCCCCTAAAGAGGCGGCTGACAATGAGGGACCCTCCACCCAGGGCCCAAGGCAGGCTGCTCCACAGGAGCAAACCTGGGCCAAAGTGGTGGgaaggggaaaaggaaagggaaaatcttcccttccctcttctgcacctgctgcccccgcggcccgggctcctgctaagaagcaggcagcaaacatggccgacaaggccaagaagacggcactgaaaccccacaagtccgcggcagtcgtcctggctttgcagccggaagccgtggagaagggagtgacgtacagcgccgtcctccaaatggcgcaggagaaggtggacctcaaggagctggGCATACCTGCGAATGCGAATTTCGTCGCCAAGGAAGCTGTGGTGGTCCTGAAGAGAGTGGACACCGTGGTGACCCCGGTGGCAGAAATGCTACACAAGACCAGATGTCCATCCCTGGTCAGTAGCTTCGGGAGTGCGGAGTCCCTGAACTCTGTTATGTCGGTCGTTTCCGACAGCCGGACTAGGTTCTGGCGAAAGAGGCCCAACGCCTCTAGGAGGAATAGTGGGAGCAGTGGGTCCGAGTCTGACCAAAACCCTCTCCCCGCAAGGACGGCCACGTCGGCCACAAAACGCGGTCGCGGTAGGCCCCCAACAACGGGTCAGTATGTCGGCTTGGCCAAAGCCAAGGCCGCATACGTAGCCGCCGAGCGTCAGGCGATAGAATTGGAGGCGGAGAAAGCGGTGGCGGCAGCAACGCCCGCATCGCGTTCTGACGCCGCAAATAAGTACCTGAGTACAGTGACGTCCTGGGGCCCAACGAGAGAGGCCCCAGCCGTCGCCTTGTCCACCCGGGTAAAGGAGAGCCTGACGGCGATCGAATTGGTCGGTAGCAAATCGGGCCACCTTAAAGGCACTTTTCAAAGTGCCTTAAAAAAGTCGGTGGAGGTGCTGCGAGAGGTGTTTGAGGAGTTGGCCTCTCGAACGCCTGCGGATGAGGTGAGTCGGCTCCAAGCGGAGAACCACCTCCTGAAGGGCCAGCTGGACCGGCTCGGTGAAGAGTTGGCAGCTCTCAGAGCCGAGGTGCGGAGCAACAGAGAGCGCGCTGCTCCCACTCCGGTGGTTGAGCTTGAAGCCTTAGGGCAGGATCTCCGAGCTTCTTTGGCAAGAGTCGATGCTCGGCTTGAGAGCCTGGAGCTAGACGGGCGGCTTTTGCCAGCTAAACGCCACCGTCCACCTTTGGCGGTAGATCGATCGATCGAAAGAGAGAAGGAAGTGGCGGCACCTGAGCCACCGGCCCCGGCCCCGGCCCCTAAGAAGGCTAGGCCCCTAGGACCGGCTCAAAAGCTTGCCCCAACTCTGGCTGCAACGGCGAATAAGAAGACGCCAGTGGCTCCAAAAGTTGCCGCCAGACCTGGCAAGAAGAAGGGGAAAGAGGGCAGTGGTACTGCACCCTCCCCAGCTCCCTCTCCTTCGGCTGACCGTGCTCCGACAAGCAAGGCTGGTCGGACTCAGGATGAGGAAGGGTGGCAGACGGTAGGCCCCCGGCGCAATAAGCAGCGCAAGGCCCCGGCAGACAAGGCGCCAAGGACATCGGCGCCCGTCAAGCCAGTAGCATCCCAATCAGGTACTaaaaaactccggacccgccgttccactgcggttgtgatcgcgctgcaacagggggctgaagagagagggcggacgtacgggaacctgatcgccgaggcaaagtccaaactggacttggcagatctcgggatcccatcagtacgctttcggcaagcggcgactggggcacgaattttagaggtcccaggagcagcaagtggggacaaggccgatctattggctgaaaagctccgggggctgtttgcctcggaggacgtgaaggtctccaggccggtgactacagtcgaccttagggtaaccggtctggatgactcggttacctcggaggagctggcgactgcggtcgctaaagctggagggtgttcgccggacaacgtaaagtgtggcgaaataaagaagggaccctccggcgcggggacagcactagtaaagtgccccgttgcagccgccaaaaaagtggtagtcggtcgcctccgggtcggatgggtggcggtcggggtgaagatcctcgacaccactccactccggtgtttccggtgctttgcgcagggtcatatttgggcccagtgtttgtcagaggtcgaccgcagtcagaattgtctgcgttgtggtcaagcgggccacaaagcccgagagtgctcctccaagccccactgccccgtatgtgaggcagctggaaaaaaggcggaccaccttgctgggggcaaggcttgtgccaagcccccaacaaagaaggcagcgaagaaggccgcggcaaaagccaaaacagctaagccgcagcctgccaacccgcctccagcgggtggctcggaggatccaatggaggttcattaa
Protein Sequence
MLVAGIPANANFVAKEAVVVLKRVDTVVTPVAEMLHKTRCPSLVSSFGSAESLNSVMSVVSDSRTRFWRKRPNASRRNSGSSGSESDQNPLPARTATSATKRGRGRPPTTGQYVGLAKAKAAYVAAERQAIELEAEKAVAAATPASRSDAANKYLSTVTSWGPTREAPAVALSTRVKESLTAIELVGSKSGHLKGTFQSALKKSVEVLREVFEELASRTPADEVSRLQAENHLLKGQLDRLGEELAALRAEVRSNRERAAPTPVVELEALGQDLRASLARVDARLESLELDGRLLPAKRHRPPLAVDRSIEREKEVAAPEPPAPAPAPKKARPLGPAQKLAPTLAATANKKTPVAPKVAARPGKKKGKEGSGTAPSPAPSPSADPAPTSKAGRTQDEEGWQTVGPRRNKQRKAPADKAPRTSAPIKPVASQPEVPVSAVCLHGRADESSDSEMSADFESDAGPTAQSRKRAFLEDVSNSDSSKGKAKKKVKAKRRSAIAIVDTHDVGLAEAERQLSLARRAQVQQDLEEQVAAKEQQARTARARAGYGASSVLYTDHSLAELERMAQEDAGEIVEVASKSSNLKGTLQKSLKCRAASMRGIVGELVQRTATEETRQLQARVDRLQSEVSQLHQKLAEVTARPTQAEPKSAAAGPSSSSSNLEDIIRKVVMEERAFTRACLDGIEDRLLPEKRLRPPLAADKRAKKPGPPPPEREDAQLPSTTPSNQSWSEVVKKGKGKAKATAAPTTPATSSCPPQAAEASIRRKRPSSLFFAPGCFDSDTETEGTATAKVSTARRGRGKTTVAGPSTRKDFAKPVSSDGDLDEILRRRQYKGTATEASCSATDTGAESDAVPMVGDLTQLNAQELRAQAGEKLACILEVARKSGNLKGEFVGRLKSSASTLREVVDALACRSEAEETRQLRATNSRLKLEVEAMKLELKALRRGFTEARTAATAAATTAATETEALPATPLGTADLFEKLERSLTIRLGNMLDARISGLKDQSVPAPPVRPPLGAKRSAQAAAAAAATKSPKPAAKPAPNLAPKGAAHNEGPSTQSARQAAPQEQTWAKVVGRGKGKGKSSLPSSAPAAPAAQAPAKKQAANMADKAKKTALKPHKSAAVVLALQPEAAEKGVTYSAVLQMAQDKVDLKELEASIRRKRPSSLFFAPGCFDSDTETEGTATAKVSTARRGRGKTTVAGPSTRKDFVKPVSSDGDLDEILRRRQYKGTATEASCSATDTGAESDAVPMVGDLTQLNAQELRAQAGEKLACILEVARKSGNLKGEFVGRLKSSASTLREVVDALACRSEAEETRQLRATNSRLKLEVEAMKLELKALRRGFTEARTAATAAATTAATETEALPATPLGTADLFEKLERSLTIRLGNMLDARISGLKDRSVPAPPVRPPLGAKRTAPAAAAAAATKPPKPAAKPAPDFAPKEAADNEGPSTQGPRQAAPQEQTWAKVVGRGKGKGKSSLPSSAPAAPAARAPAKKQAANMADKAKKTALKPHKSAAVVLALQPEAVEKGVTYSAVLQMAQEKVDLKELGIPANANFVAKEAVVVLKRVDTVVTPVAEMLHKTRCPSLVSSFGSAESLNSVMSVVSDSRTRFWRKRPNASRRNSGSSGSESDQNPLPARTATSATKRGRGRPPTTGQYVGLAKAKAAYVAAERQAIELEAEKAVAAATPASRSDAANKYLSTVTSWGPTREAPAVALSTRVKESLTAIELVGSKSGHLKGTFQSALKKSVEVLREVFEELASRTPADEVSRLQAENHLLKGQLDRLGEELAALRAEVRSNRERAAPTPVVELEALGQDLRASLARVDARLESLELDGRLLPAKRHRPPLAVDRSIEREKEVAAPEPPAPAPAPKKARPLGPAQKLAPTLAATANKKTPVAPKVAARPGKKKGKEGSGTAPSPAPSPSADRAPTSKAGRTQDEEGWQTVGPRRNKQRKAPADKAPRTSAPVKPVASQSGTKKLRTRRSTAVVIALQQGAEERGRTYGNLIAEAKSKLDLADLGIPSVRFRQAATGARILEVPGAASGDKADLLAEKLRGLFASEDVKVSRPVTTVDLRVTGLDDSVTSEELATAVAKAGGCSPDNVKCGEIKKGPSGAGTALVKCPVAAAKKVVVGRLRVGWVAVGVKILDTTPLRCFRCFAQGHIWAQCLSEVDRSQNCLRCGQAGHKARECSSKPHCPVCEAAGKKADHLAGGKACAKPPTKKAAKKAAAKAKTAKPQPANPPPAGGSEDPMEVH

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-