Basic Information

Gene Symbol
-
Assembly
GCA_947458855.1
Location
OX375845.1:33253-55055[-]
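
The location string above packs a sequence accession, a coordinate range, and a strand into one field. A minimal sketch of splitting it apart follows. The names `Locus`, `parse_locus`, and `span_length` are hypothetical helpers, not part of any database API, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a 'seqid:start-end[strand]' location."""
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like 'OX375845.1:33253-55055[-]'.
_LOCUS_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a location string into accession, start, end, and strand."""
    match = _LOCUS_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate exceeds end coordinate")
    return Locus(match["seqid"], start, end, match["strand"])


def span_length(locus: Locus) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("OX375845.1:33253-55055[-]")
print(locus.seqid, locus.strand, span_length(locus))
```

Under that assumption the span covers 21,803 bases on the minus strand of OX375845.1.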

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins are homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1   3   0.00013   0.17      13.0   0.5   31        60      254       283     245       287     0.83
2   3   4.8e-06   0.0063    17.6   0.6   32        63      1090      1121    1071      1122    0.91
3   3   1.5e-05   0.02      16.0   1.5   31        63      1437      1469    1426      1470    0.88
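
The three domain rows above can be read programmatically. The following is a minimal sketch, assuming each row holds the thirteen whitespace-separated fields named in the header. `DomainHit` and `parse_hit` are hypothetical names introduced for illustration, and ranking by independent E-value is one possible choice, not a rule taken from this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One per-domain row of the Hmmscan output (hypothetical container)."""
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into typed fields."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Rows copied from the record above.
ROWS = [
    "1 3 0.00013 0.17 13.0 0.5 31 60 254 283 245 287 0.83",
    "2 3 4.8e-06 0.0063 17.6 0.6 32 63 1090 1121 1071 1122 0.91",
    "3 3 1.5e-05 0.02 16.0 1.5 31 63 1437 1469 1426 1470 0.88",
]

hits = [parse_hit(row) for row in ROWS]

# Aligned length on the protein, assuming inclusive coordinates.
ali_lengths = [hit.ali_to - hit.ali_from + 1 for hit in hits]

# Domain with the smallest independent E-value.
best = min(hits, key=lambda hit: hit.i_evalue)
print(best.index, best.i_evalue, ali_lengths)
```

With these rows, the second domain (protein positions 1090 to 1121) has the lowest independent E-value, and the aligned stretches are 30, 32, and 33 residues long.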

Sequence Information

Coding Sequence
ATGAATAACCAGGAGGGTCCTTCCGAGCTTCGCGCCGGAGGGGAATCCCTCTGTGTTGCGACTTCAGCCACTACTGCATCAGTAGAGACGGAAGCCGCAACATGCAGCCACTTATTTGGTACGGCTGATTCCTTGCTTTACATTACAACGACCCAGGCTACGAAAAAGGTACCGAAAATAATACTAACAAGGATTGAGGATATACAGGCAACAACGACCACAGGCGATAACATGGATACGGATTACCAACCCCAAACTTCCGACCCAACCAAAGGGTCAGAAGAAAATACGGAGGAAGCGCAGCCGAGTAGAGTTGCTGATCGAACACGGTCAGCAACTAAAACTCTTGCGGGAAAAATGTGGTCACAGCGAAAGCGTGTGGCTACTGAGGAGCTTGGCGAGGACACGACCAGTGACGACGGCTCCGTCAGGAGTCGTAGTAATTGCTCAAAGCGTGGTCGTGCTTCCCCTGCAAATTCTTTAGACCGACTCACTACCAAACCGGTCCAACAAAGAAGAAAGGAATTCCTTCTACAGGGTGAAAAGAGCGAGGAATCCCAAACAGAACTGGAAGGTAACTCTTCGTCTTCCGACGAGCGTTCGGCGGACACTCTGCTCACCGAGGCTCTTCGGGCGGCAGAAGCGGTACGTGATATTGCTAAAAAATCGACCAATATCAAAGGCAACCTTCGACATGAACTAAAGGTTGCCTCTAATTCATTTAAAAAAATTGCGAGGGTCCTGAGCCGTAGAACTACCTCAGAGGAAGTGAAACAACTGCAAGCGGAGAACAACCGTTTGAGACGGGAGATGGATGAGATCCGGAAGGATTTCTCGGCCATCCGCGAGGAGATTGGGCAGAGCAAGCGCACGCCTTCAAGAGAAGAAGGTACGCCGCGAAATCAACAATCCGTGGTGCCAACCGTATCCTCAATGGAAGAGTTCCAAAGGATGATCatggtatcagtgggcaccatgattaatgcgagacttgagggtttggaggaccgtctcctaccTTCAACAATTATCAGACCTGCACTAGGAAGGAAGGTCCCAGACCCGACGGCTGTTTGTGTCGCACCTGTCGCATCGGAGCCAACAATTCCGTCGGCTTCCTCTTCTTCAGTGAAGAAGTCTGATAAAAAGCAGATACCGCAACAGCAACCTCCGGCCCCAGTTACAAAGCCCGAATCTTGGGCAACGGTAACCCGACGGGGAAAGAAAACCGCGCGAACAGTACTCGAAGGCCCTACGCCTGCTGCTGCTACTGCAACAAgcggaggctggtcacctCCGCTGAAGCTAACGGTCCCAAGTGGGGCCCCTCTCGGCACTTCTCTGTTGAGGCGGAGTTGGTTGGGATGGGGCCACAACCCCGGCCGCCGTTCATTCGGTGGCCGAGGAGGAACGGGGCGGCGGGGGACATGGTGCTCGTTCCCCCCGCCCTGTGGGGCCTGGGGCGGCGGATGCGGGGGCATTCGTCGCGTCGAGGGCCACgagggttcctccgagctgcgcgccggaggggaatccctctgtgTTGTGACTTCCGGAACTACTGCACCAGCAGATCCAGTAGCAACGAAAGCCACAACATGCGGTCCCACTGTCTTATCTGGGGGGGCCAATTCAGTGCCGAAATTAAACAATACAACGACAAGTGGCGATAATAAGGATATGGATTTCACAACTGAACGGACGAAAATGAACAGCACAATGACCAATAGCGAGAATATGGTAAAGGATTTCACTATGGAAATGGACCTAGAAGAGGTGGCAAGCTCTTCGAGGGAAGAGAGGGAAGAGACGCCGATCGGCGCAAAGCGGTCACGCGCCTTAAAGCGTGGGCCCAAGCGGCGCCTTTGGGCGCTTAGCTCAGGTTCGGAATCGGCCTCGAGCGGGGAAGGGGTGGAAAGTTCACCCACAAGAGGTAGAGGAAGGGACCGCACCCCTACCACTGGGGATATCGGCCTACCAAAAACTAAAGCCGACTACTT
AAAGGCGCAAAGAGATGAAATGGAGTTTGCTGCCGAGAACGAGGTCGCGGTTATATCTGAAAGGATCGCCACCAGGAACCAACGACTTGGCATCCCGGAAGACACCACTAGAGATGACGAAGACGAACGCCAGGCCACTGACCTCAATAACCAGCTGCTAGACAGTTGGGGCCCCTCCCGGCACTTCTCTGAGGAGGCGGAGTTGGTTGGGATGGGGCCACAACCCCGGCTGCCGTTCATTCGGTGGCCGAGGAGGAACGGGGCGGCGGGGGACATGGTGCTCGTTCCCCCCGCCCTGTGGGGCCTGGGGCGGCGGATGCGGGGGCATTCGTCGCGTCGAGGGCCACATAAACTGAAATTACATCCCCGCCCTCTGCTCATCAGAGAGTGTTTCGGATGTAATATGAATGTGTCGAAAAGTGTCGATTTCCTTGACGTCGATTTTATGAGTAAGGCTGGAGAGCCTATAAACACTCCTGAGGGTAGGGAGCAGTGCCCTCAGTACTGGgagggtccctccgagcttcgcgccggaggggaatccctctgtgttgcGACTTCAGCCACTACTGCATCAGTAGAGACGGAAGCCGCAACATGCAGCCACTTATCTGGTACGGCTGATTCCTTGCTCAACATTACCACGACCCAGGCTACGAAAAAGGTACCGAAAATCATATTGACAAGGATTGATGATTATAGACAGGCAACCACGAACACAGGCGAGATAAATATGGATACCGATAATCAACACCATACAGAGGAAGAGAATACAAAAGCCCAGGAGGGGGAGTCCGAAGCGGATACTCTGGAGGAGCCCCTCGCAAGGCGGACGCGTTCCGCTACGAAAGGCTTGGTGGGGCAACTCTGGGCACAGCGCAAGCGCACGGTTGAGGATCGCGGCGAATCCAGCGACGAAAGCCCTGCAAGGAGCAGAAGCTGCGGTTCCAAGCGTGGACGGGGTTCCCCAACAACAAGGCGGGCACCCGCCAAGGCGGCGCAGCCAAGACGGGAAGAGTTTTTATTGCTCAACCCGGAAGAGCAGCCTGAGCCGGACAGCGGGAGGAGCGGGCTAGAGGGAGATTCTTCGTCTGAAGACGAAAGAACTTCCGATACCCTGCTCAAAGAGGCCACGGAGGCAGCTGAGGCGGTCAGATCAATCGCCATTAAGTCCAGCAACATTAAGGGTGACCTGCGAAGGGACCTTAAGGTTGCTGCGACTTCATTCAAGAAGATCGCCAAGCTGCTTAGCCGGCGTACCTCCTCGGAGGAGGTGAGGAAGGTGAAGGCGGAGAATGACCGCTTGCGTCGGGAGGTGGACGAACTCCGCAAGGAGTTCGCTACAATCCGACAAGAAGTGCAGGGGGAAAACCGCCAAAGCACCCCGCCTATATTACGGCCCACCATCTCGCCGACCGAATCCCCGATGGaggacttccatcgggctataatgatctcggttggcacgatgctgaacgccagactggaaagtttggaggagcgccttctcccggCTAAGATAATAAGGCCTCCACTAGGCGTTAAGGCGCCTGCCGCGTCCGCCGTGACTCTTGCCGCCGGCAAAGCAAGCGCACCGCTACCTTCTTCTTCTCTCCAAACTCCGGGGaagaagaagaggaaaggcgaagctaccccatcggccgctccccacccactttcacttgcaccgACGACCCCTGCTAGTATAACGGATTTGGAAATACAAGACATAAGTATGAATGATGCGGTTTCTACTGAAGTACGGACGCCAGTTGTATTACTGGACCGTACTGACGTAGATAAACAGAAAGAGGATGACTTTAAGTCGGCAACAGAAGGACCGGATGAGCTCTGCCCAAGTGTGGCCACGCGAACGCGTTCCGCGGCTAAATGCGAATCGGGGAAATTTTGGACTCGTAAGCGCACGGCAGAGGTTTGCGGTGAAACCAGCGACGAGGGCTCCGTAAGGAGCAGCAGTTGTGGGTCGAAACGCGGGCGAA
CAACTGGGTCTACAACAAGGCGGCCACCGACAAAACAAGCGAATCGAAAGCGGGAGGAATTCCTCGTCGATTCGGAGGAGCAACAAGACCTAGACTTCGATAGCGCCAGGAGCGGGTTAGAAGGAGATTCTTCATCTTCGGATGAAAGATCTGCCGATACCCTGCTCAAAGAGGCAAGGGAGGCAGCTGAGGCGGTCAAAGCTATTGCTTTGAAGTCCAGCAACATTAAGGGAGACCTGCGGAGGGACCTTAAAGTTGCTGCGACTTCCTTCAAGAAGATCGCCAAAATATTAAGCCGCCGGACCACTTCAGAAGAGGTGAAGAGGCTGAAAACGGAAAATGACCGCCTGCGTCGGGAGATGGATGAACTCCGAAAGGAATTTACCACAATCCGACAGGAGGTGCGTGAAAACACCCCTCAGACTGGGCCGCGACCCCAGTCTATCACggtgccgacaggatccctgatggaggaattccaacgggcgataatggtatctgttggcaccatgttaaatgctaggcttgagggtctagaggagcgccttctgccacccaCTCGCACCCGGCCTGCACTGGGCAAACGTGCCCCAGAGTCGGCTCCGGCTGCTGTTTGTGTCGCAGCGGTCGCAGCACAGGACGCAAATTCTTCTTTGGTCGCTTCTCCTCAGCTGGACGCGGAAAAGAAAGAACCGAAGAAGAAAAAGAAGGAAAATACAAAGAAAAGTCCCAAGGCGCCAGAGTGCCAACTAGGACTAACCCCGGCTCCTGAAGAACAGACTTGGGCGAAGGTGGTTGGGCgcaaacaaaagaaggcgaacaggcccgccccgactaagcctAAGCGTGATTTATCAGTCCCTTGCGGGACAGTTAAATTGCAGTTACATCCCCACTCTCTGCACATCAGACAGTGTTTCGGATGTAACATGTATATGTCGACTTCAGAAAGATTCACTCGCTGA
Protein Sequence
MNNQEGPSELRAGGESLCVATSATTASVETEAATCSHLFGTADSLLYITTTQATKKVPKIILTRIEDIQATTTTGDNMDTDYQPQTSDPTKGSEENTEEAQPSRVADRTRSATKTLAGKMWSQRKRVATEELGEDTTSDDGSVRSRSNCSKRGRASPANSLDRLTTKPVQQRRKEFLLQGEKSEESQTELEGNSSSSDERSADTLLTEALRAAEAVRDIAKKSTNIKGNLRHELKVASNSFKKIARVLSRRTTSEEVKQLQAENNRLRREMDEIRKDFSAIREEIGQSKRTPSREEGTPRNQQSVVPTVSSMEEFQRMIMVSVGTMINARLEGLEDRLLPSTIIRPALGRKVPDPTAVCVAPVASEPTIPSASSSSVKKSDKKQIPQQQPPAPVTKPESWATVTRRGKKTARTVLEGPTPAAATATSGGWSPPLKLTVPSGAPLGTSLLRRSWLGWGHNPGRRSFGGRGGTGRRGTWCSFPPPCGAWGGGCGGIRRVEGHEGSSELRAGGESLCVVTSGTTAPADPVATKATTCGPTVLSGGANSVPKLNNTTTSGDNKDMDFTTERTKMNSTMTNSENMVKDFTMEMDLEEVASSSREEREETPIGAKRSRALKRGPKRRLWALSSGSESASSGEGVESSPTRGRGRDRTPTTGDIGLPKTKADYLKAQRDEMEFAAENEVAVISERIATRNQRLGIPEDTTRDDEDERQATDLNNQLLDSWGPSRHFSEEAELVGMGPQPRLPFIRWPRRNGAAGDMVLVPPALWGLGRRMRGHSSRRGPHKLKLHPRPLLIRECFGCNMNVSKSVDFLDVDFMSKAGEPINTPEGREQCPQYWEGPSELRAGGESLCVATSATTASVETEAATCSHLSGTADSLLNITTTQATKKVPKIILTRIDDYRQATTNTGEINMDTDNQHHTEEENTKAQEGESEADTLEEPLARRTRSATKGLVGQLWAQRKRTVEDRGESSDESPARSRSCGSKRGRGSPTTRRAPAKAAQPRREEFLLLNPEEQPEPDSGRSGLEGDSSSEDERTSDTLLKEATEAAEAVRSIAIKSSNIKGDLRRDLKVAATSFKKIAKLLSRRTSSEEVRKVKAENDRLRREVDELRKEFATIRQEVQGENRQSTPPILRPTISPTESPMEDFHRAIMISVGTMLNARLESLEERLLPAKIIRPPLGVKAPAASAVTLAAGKASAPLPSSSLQTPGKKKRKGEATPSAAPHPLSLAPTTPASITDLEIQDISMNDAVSTEVRTPVVLLDRTDVDKQKEDDFKSATEGPDELCPSVATRTRSAAKCESGKFWTRKRTAEVCGETSDEGSVRSSSCGSKRGRTTGSTTRRPPTKQANRKREEFLVDSEEQQDLDFDSARSGLEGDSSSSDERSADTLLKEAREAAEAVKAIALKSSNIKGDLRRDLKVAATSFKKIAKILSRRTTSEEVKRLKTENDRLRREMDELRKEFTTIRQEVRENTPQTGPRPQSITVPTGSLMEEFQRAIMVSVGTMLNARLEGLEERLLPPTRTRPALGKRAPESAPAAVCVAAVAAQDANSSLVASPQLDAEKKEPKKKKKENTKKSPKAPECQLGLTPAPEEQTWAKVVGRKQKKANRPAPTKPKRDLSVPCGTVKLQLHPHSLHIRQCFGCNMYMSTSERFTR
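
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code, uppercases the input because the record mixes upper- and lower-case bases, stops at the first stop codon, and writes `X` for any codon containing a non-ACGT character. `translate` is a hypothetical helper, not a function from an existing library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0 up to the first stop codon."""
    # Drop whitespace so a sequence wrapped over several lines still works.
    seq = "".join(cds.split()).upper()
    protein = []
    # A trailing partial codon, if any, is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First seven codons and last four codons of the coding sequence above.
print(translate("ATGAATAACCAGGAGGGTCCT"))
print(translate("TTCACTCGCTGA"))
```

The first call reproduces the opening residues of the protein sequence (`MNNQEGP`) and the second reproduces its final three residues (`FTR`) before the `TGA` stop codon.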

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-