Basic Information

Gene Symbol
-
Assembly
GCA_963971325.1
Location
OZ020225.1:30781477-30788888[+]
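
The Location field packs the sequence accession, start, end, and strand into a single string. A minimal sketch of how it could be split into parts follows; the function name `parse_location` and the reading of the coordinates as 1-based and inclusive are assumptions made for illustration, not something this record states.

```python
import re

# Pattern for strings of the form "ACCESSION:START-END[STRAND]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(location):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return (
        match["seqid"],
        int(match["start"]),
        int(match["end"]),
        match["strand"],
    )


seqid, start, end, strand = parse_location("OZ020225.1:30781477-30788888[+]")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_length = end - start + 1
print(seqid, start, end, strand, span_length)
```

Under that assumption the locus spans 7412 bases on the forward strand.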

Transcription Factor Domain

TF Family
TF_bZIP
Domain
bZIP domain
PFAM
AnimalTFDB
TF Group
Basic Domains group
Description
bZIP proteins act as homo- or heterodimers that contain highly basic DNA-binding regions adjacent to α-helical regions that fold together as coiled coils.
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
1   9     0.015        16    6.4   0.0        31      62       587     618       579     621  0.87
2   9    0.0029       3.1    8.7   0.0        26      59       644     677       641     683  0.87
3   9       1.5   1.7e+03   -0.0   0.0        32      59       685     712       677     714  0.81
4   9      0.34   3.7e+02    2.1   0.0        33      59       726     752       725     755  0.85
5   9    0.0051       5.5    7.9   0.1        27      61       754     788       751     792  0.78
6   9    0.0053       5.8    7.8   0.1        27      58       809     840       799     845  0.64
7   9     0.046        50    4.8   0.0        29      61       845     877       840     881  0.73
8   9     0.068        73    4.3   0.0        31      60       895     924       887     929  0.75
9   9     0.012        13    6.7   0.3        31      61       943     973       935     977  0.71
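
The per-domain rows above are whitespace-separated, so they can be read into records and ranked or filtered. The sketch below is one way to do that; the `DomainHit` type, the `parse_hits` helper, and the i-Evalue cutoff of 10 are hypothetical choices made for illustration and are not part of this record or of HMMER.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # domain number ("#" column)
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein


# Data rows copied from the Hmmscan Out table (13 fields per row).
ROWS = """\
1 9 0.015 16 6.4 0.0 31 62 587 618 579 621 0.87
2 9 0.0029 3.1 8.7 0.0 26 59 644 677 641 683 0.87
3 9 1.5 1.7e+03 -0.0 0.0 32 59 685 712 677 714 0.81
4 9 0.34 3.7e+02 2.1 0.0 33 59 726 752 725 755 0.85
5 9 0.0051 5.5 7.9 0.1 27 61 754 788 751 792 0.78
6 9 0.0053 5.8 7.8 0.1 27 58 809 840 799 845 0.64
7 9 0.046 50 4.8 0.0 29 61 845 877 840 881 0.73
8 9 0.068 73 4.3 0.0 31 60 895 924 887 929 0.75
9 9 0.012 13 6.7 0.3 31 61 943 973 935 977 0.71
"""


def parse_hits(text):
    """Parse whitespace-separated domain rows, skipping malformed lines."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(
            DomainHit(
                index=int(fields[0]),
                c_evalue=float(fields[2]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


hits = parse_hits(ROWS)

# Highest-scoring domain match.
best = max(hits, key=lambda h: h.score)

# Hypothetical cutoff: keep domains with an independent E-value of at most 10.
I_EVALUE_CUTOFF = 10.0
passing = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

print(best)
print([h.index for h in passing])
```

With these rows, the highest-scoring match is domain 2 (protein residues 644 to 677), and only domains 2, 5, and 6 fall under the illustrative cutoff.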

Sequence Information

Coding Sequence
ATGGAGGGCGACATTGCCCCTAAAGAAGACCCCTTTGACATCAAAATACCCAATTTTGAAGACCCCATCACCAAAACTGCCGAGAACAAGATCGAAATCTTGAAGAATGACTTATTATCGATGCAAGGCTGGACCAGGGATGACCTAAGTGACGTCGAATCGCAAATTGACGAGATTGTCCACCATGCTGAACTCTTAGTCAATTCTTTGCTCATTGGGGATGGCTGCAGACTTGGCAGATTCGATTATTTTGAACGATACGAACTCCAAGAACATATTTTGGAAGCACCGGAAGAATTTCGTAGCGTTTGCGAGCAAGATCGAGGTGATGGTGCATCATTAGACCTTATAGCATCGCAATGTCCCCAAAAGACTGGCACACAGAGTGGAGTTGAACCAGAACGAGCGTCCTCCGAGCATGACGATGATAATGACCCAGCACCGGAAGCACTGGGCCTTATATTGGCCAGTGTTTCCGCAGCAAGGCCCTGTGAGAACGTTGATACAACGTTGGCACATTGCGACGACCGTGTGAACTGCATCTTAACTAATCTTGATGAAATCAGTGAACAAATCGATGGTGCCAATGCCAGTCTTAAAGAAATACAAGATTTGTTTGGTGAGGAATTGAAATTAGACGTGAATAGAGCAGTACAAGATGGAAATAGACCAGTACGAGATGAACTAGCTACAACACAATATTTATACAACCAGGAATCCGAAGATAACACAGATAAGATAGCAGCCGGTGAGATAACGAACACAACGGTTGATTCACTCATGAAAACAGATGCGGAACGCGGAGGTGTATCGAGTGGGATTCCAAAGAAACCTGAATTGATCAATGATGCACTTTACTTGCGGCGACCTTGTCCGTTTTACGTTGTAAACACAGTCGCCAGTGAACGCGTCGAGCGTCGAGAACTGTTTAATAAAGAAATCGAAAGTGTACAAATGAGTGGAGTAAACAAAGTTGATGGTGAACAAGTGCAAGCCGATCGTGTAACAGGTATTGGTGATAACGATTTCTGCGAATTCTTCGATGGTTTAGAAAGTGATATGTTTCCCTCGAGTTTCTTGAAGACCATTGAAGAAGAAGAGGACGAAGTGATGTGTTCTAGTGGTAATAACAATGTTCTGATGCGGAATAATAATGTTGTTGGTGGCGTCAAATTACCTGAACCACTGGAAGTCATCGAAGAAGTTGAGGAAAGGAGATCGGATGGTTTAGATTTTAGAAAtgtagacacgaataaatttaATTACGAAAGCTACTTTGACTCTCTATTGTCTACTAAAAGGATTAGTCAATTGTCTAATAGGTCCAATCCATTGCCTAGTAAAAGCAATAGGTCTAGTCTATCGTCTCGTAAACAAGACAAATCTGGTTTGTTTACAAATAATCAACCAATCAATCCTGAGTTACCTGATGAATTGACTCTGGAAGTCATCACCGCCAAGTTGAACGATTTGGATCTAAACGGGGTGCTAAAATACAGGAGATTTGATGATTTGGGTCCAGGTAACAGAGCAGATGACAGGGAGACTCAGGAGGACGCCGAGTTTGAGTTTATAGAAGAATTATTGAGGAGTTTAGAAGCCGATGAGGGATTGAGTAAACTTATAGACAATAATGGCGACAGAGACATGAGTAATGGTAATAATAACGGTGCCTTGAGGAATGTTTGTGAAAGTGGTATCAAAATGGGTGACTTGAGCAATGGAAATGGTGACTTAAGGAATGCAATTGGTGCCGTAAGCAATGAAACTCGTGACTTAAGCAATGAAACTGGTGGCTTGAGAAACGAAACTGGTGACTTAAGCAACGAAACTCATGATTTAAGCAACAAAACTCATGATTTCACCAATGAAACTCATGATTTAAGCGAAATTCATGACTTAAGCAGTGAATCTGGCGGCTTAACCAATGAAATTGATGGCTTAACCATTGAAACTCATGACTTAACCAAGGAAAGTCATGACTTAACGAAGGAAACTCA
TGACTTAACCAAGGAAACTCATGACTTCACCAAGGAAACTCGTGATTTCACCAGCGAAACTCATGACTTCAGCAAGGAAACTCATGACTTAACAAAAGAAACTCATGACTTCACCAATGAAACTCATGATTTAAGCGAAATTGATGACTTTACCAAGGAAACTTGTGATTTCACCAAAACTCATGACTTAACCAAGGAAACTCATGACTTAACAAAAGAAACTCATGACTTCACCAATGAAACTCATGATTTAAGCGAAATTGACGACTTAACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAATGAAACTCATGACTTCACCAATGAAACTCATGACTTAACCAAGGAAATTAGTGATTTCACCAGTGAAACTCATGATTTAAGCGAAACTCATGACTTCACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAAGGAAACTCATGACTTAACAAAAGAAACTCATGACTTCACCAATGAAACTCATGATTTAAGCGAAATTGACGACTTTACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAATGAAACTCATGACTTCACCAATGAAACTCATGACTTAACCAAGGAAATTAGTGATTTCACCAGTGAAACTCATGATTTAAACGAAACTCATGACTTCACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAATGAAACTCATGACTTCACCAATGAAACTCATGACTTAACCAAGGAAATTAGTGTTTTCACCAGTGAAACGCATGATTTAAGCGAAACTCATGACTTCACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGATTTAACCAACGAAACTCATGACTTAACTAATGAAACTCGCCACTTAACCAACAAAACTCGTGATATCATCACCCAAACTCGTGTTGAGGAGTGTATATCTAACGGTATTGTGCCAATGAACATTGAGCCAACGAACATTGAGCCAACAATGACCCAGTCACCGTCCAGCGATCGGAGTGTTAGCAGACGATGCGTGAGTGTGATGGTGTGCGGCAAGGTTGAAGAATATCCGACGGTGGAAGCGTGGAGCAGCACCTCGAAGGACGGTGGTTGTAGTGACGGCGTTGCGTGCGCCACGAAGACGGCGACATCGATCGCGTCCACCGGCGTCGAATACGGCGCGCAACCACCCGGGTGCAGTCGCGATCCGGCCGCCACGGTCTCCACTACGTCCGATACCGAACTTGGCACCGTCGCGCAGCAAACACAACAATCTATAACCTTAAAAAATGACCGAAGTGCGTGCGTCTCGAATTTAAGTTGGGGCTTCAGGAACGGCCGGCTAGTGTTCGATGTTGCAAGTGATGAAGATGCCCGTAACAAAGATGACGAAGGTTCAGATATCCTCAAGTCTAAAGATCTAAACAAGAAGCGCTCTTCTGAAGGTAGGTTGCGATTCCTCGAAGAGAAACTCAAAGAAGCTGGGATTACCGATGGTAAGAATGGAAGCGCAACCACAAACGAAGAGAAACCGGACAGTATAGGTCTTCAAGCTTCATCTTTCGAAGACCTAACATCTGATTCTTCGGAAGAAGACCTAATTCCATCAGATCCTGACAATTTAACCACCTCGTTTTGTCAATTCGACCCACAGGAGATACACAAGTTCCAAAACTCGTTCTTCGAGATATACGATGAGTTAGACGACACGGACGACGACCAAGAATCAATCATGAGAGGAGACGGAAGTAACACAATCGCAGAACGTAAGCAACACAATCCAAATCGCGATAACCTCAAGTCACTCCTCAAGAAACCTGGTAGAAACAAGGAGAAGAAGAACAGGGTGATCTTCAATGAAACGAAGAACGAGTTTTTCGACGCCGATTACATTATTTTGATCAGAGAGGAGTGCGACTACGACGAAGAGGAAGATGACGGCGTCTGCACCTGTAACCAACACGAGATGGTGCGTC
TTACGTGCTGTGAACCGAATTGTAATTGTAACGTTTACGAGGGATTTGATCCAACGCCTCAATCGCCGAAATTTGCTCCACCGCTGGAGTTCGTGGATGCTGTTACGTTGAGTCCTCCTGAAGAGTACAAGGATATGGAGCTTGAGGAACAACAGCTACTCGCGTTGCAGCAGCAGATGGCCAGAAGAGGACAAAGAGCTCCAGTGTGCAGGGAGTGCAGTGCTTCACATGACGATGAAGAAGTGATTGGACAGATTGCTGCAGTAATTTTGCTGCAGCTAACTGCAATTTCTGATAGCTTAATCGATTCCAGGCATACCAATACAATTCCGATAGCCGTGCTTAACGTAAATGGAACAATTCCCTGGGATAGTTCCGATGCATTAGTTGATAGGACAAGGATTTTCAACAATCTATAA
Protein Sequence
MEGDIAPKEDPFDIKIPNFEDPITKTAENKIEILKNDLLSMQGWTRDDLSDVESQIDEIVHHAELLVNSLLIGDGCRLGRFDYFERYELQEHILEAPEEFRSVCEQDRGDGASLDLIASQCPQKTGTQSGVEPERASSEHDDDNDPAPEALGLILASVSAARPCENVDTTLAHCDDRVNCILTNLDEISEQIDGANASLKEIQDLFGEELKLDVNRAVQDGNRPVRDELATTQYLYNQESEDNTDKIAAGEITNTTVDSLMKTDAERGGVSSGIPKKPELINDALYLRRPCPFYVVNTVASERVERRELFNKEIESVQMSGVNKVDGEQVQADRVTGIGDNDFCEFFDGLESDMFPSSFLKTIEEEEDEVMCSSGNNNVLMRNNNVVGGVKLPEPLEVIEEVEERRSDGLDFRNVDTNKFNYESYFDSLLSTKRISQLSNRSNPLPSKSNRSSLSSRKQDKSGLFTNNQPINPELPDELTLEVITAKLNDLDLNGVLKYRRFDDLGPGNRADDRETQEDAEFEFIEELLRSLEADEGLSKLIDNNGDRDMSNGNNNGALRNVCESGIKMGDLSNGNGDLRNAIGAVSNETRDLSNETGGLRNETGDLSNETHDLSNKTHDFTNETHDLSEIHDLSSESGGLTNEIDGLTIETHDLTKESHDLTKETHDLTKETHDFTKETRDFTSETHDFSKETHDLTKETHDFTNETHDLSEIDDFTKETCDFTKTHDLTKETHDLTKETHDFTNETHDLSEIDDLTKETHDLTKETHDLTNETHDFTNETHDLTKEISDFTSETHDLSETHDFTKETHDLTKETHDLTKETHDLTKETHDFTNETHDLSEIDDFTKETHDLTKETHDLTNETHDFTNETHDLTKEISDFTSETHDLNETHDFTKETHDLTKETHDLTNETHDFTNETHDLTKEISVFTSETHDLSETHDFTKETHDLTKETHDLTNETHDLTNETRHLTNKTRDIITQTRVEECISNGIVPMNIEPTNIEPTMTQSPSSDRSVSRRCVSVMVCGKVEEYPTVEAWSSTSKDGGCSDGVACATKTATSIASTGVEYGAQPPGCSRDPAATVSTTSDTELGTVAQQTQQSITLKNDRSACVSNLSWGFRNGRLVFDVASDEDARNKDDEGSDILKSKDLNKKRSSEGRLRFLEEKLKEAGITDGKNGSATTNEEKPDSIGLQASSFEDLTSDSSEEDLIPSDPDNLTTSFCQFDPQEIHKFQNSFFEIYDELDDTDDDQESIMRGDGSNTIAERKQHNPNRDNLKSLLKKPGRNKEKKNRVIFNETKNEFFDADYIILIREECDYDEEEDDGVCTCNQHEMVRLTCCEPNCNCNVYEGFDPTPQSPKFAPPLEFVDAVTLSPPEEYKDMELEEQQLLALQQQMARRGQRAPVCRECSASHDDEEVIGQIAAVILLQLTAISDSLIDSRHTNTIPIAVLNVNGTIPWDSSDALVDRTRIFNNL
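
The protein sequence can be checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code, which this record does not state explicitly; `translate` is a hypothetical helper. It strips whitespace and uppercases the input first, since the coding sequence may be wrapped across lines and contains lowercase bases.

```python
# Standard genetic code, built in TCAG order (assumed to apply to this record).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, normalise case
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First six codons and last five codons of the coding sequence above.
print(translate("ATGGAGGGCGACATTGCC"))
print(translate("TTCAACAATCTATAA"))
```

The two fragments translate to MEGDIA and FNNL, which match the start and end of the protein sequence listed above.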

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-