Agra021411.1
Basic Information
- Insect
- Aphodius granarius
- Gene Symbol
- -
- Assembly
- GCA_963971325.1
- Location
- OZ020225.1:30781477-30788888[+]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 9 0.015 16 6.4 0.0 31 62 587 618 579 621 0.87 2 9 0.0029 3.1 8.7 0.0 26 59 644 677 641 683 0.87 3 9 1.5 1.7e+03 -0.0 0.0 32 59 685 712 677 714 0.81 4 9 0.34 3.7e+02 2.1 0.0 33 59 726 752 725 755 0.85 5 9 0.0051 5.5 7.9 0.1 27 61 754 788 751 792 0.78 6 9 0.0053 5.8 7.8 0.1 27 58 809 840 799 845 0.64 7 9 0.046 50 4.8 0.0 29 61 845 877 840 881 0.73 8 9 0.068 73 4.3 0.0 31 60 895 924 887 929 0.75 9 9 0.012 13 6.7 0.3 31 61 943 973 935 977 0.71
Sequence Information
- Coding Sequence
- ATGGAGGGCGACATTGCCCCTAAAGAAGACCCCTTTGACATCAAAATACCCAATTTTGAAGACCCCATCACCAAAACTGCCGAGAACAAGATCGAAATCTTGAAGAATGACTTATTATCGATGCAAGGCTGGACCAGGGATGACCTAAGTGACGTCGAATCGCAAATTGACGAGATTGTCCACCATGCTGAACTCTTAGTCAATTCTTTGCTCATTGGGGATGGCTGCAGACTTGGCAGATTCGATTATTTTGAACGATACGAACTCCAAGAACATATTTTGGAAGCACCGGAAGAATTTCGTAGCGTTTGCGAGCAAGATCGAGGTGATGGTGCATCATTAGACCTTATAGCATCGCAATGTCCCCAAAAGACTGGCACACAGAGTGGAGTTGAACCAGAACGAGCGTCCTCCGAGCATGACGATGATAATGACCCAGCACCGGAAGCACTGGGCCTTATATTGGCCAGTGTTTCCGCAGCAAGGCCCTGTGAGAACGTTGATACAACGTTGGCACATTGCGACGACCGTGTGAACTGCATCTTAACTAATCTTGATGAAATCAGTGAACAAATCGATGGTGCCAATGCCAGTCTTAAAGAAATACAAGATTTGTTTGGTGAGGAATTGAAATTAGACGTGAATAGAGCAGTACAAGATGGAAATAGACCAGTACGAGATGAACTAGCTACAACACAATATTTATACAACCAGGAATCCGAAGATAACACAGATAAGATAGCAGCCGGTGAGATAACGAACACAACGGTTGATTCACTCATGAAAACAGATGCGGAACGCGGAGGTGTATCGAGTGGGATTCCAAAGAAACCTGAATTGATCAATGATGCACTTTACTTGCGGCGACCTTGTCCGTTTTACGTTGTAAACACAGTCGCCAGTGAACGCGTCGAGCGTCGAGAACTGTTTAATAAAGAAATCGAAAGTGTACAAATGAGTGGAGTAAACAAAGTTGATGGTGAACAAGTGCAAGCCGATCGTGTAACAGGTATTGGTGATAACGATTTCTGCGAATTCTTCGATGGTTTAGAAAGTGATATGTTTCCCTCGAGTTTCTTGAAGACCATTGAAGAAGAAGAGGACGAAGTGATGTGTTCTAGTGGTAATAACAATGTTCTGATGCGGAATAATAATGTTGTTGGTGGCGTCAAATTACCTGAACCACTGGAAGTCATCGAAGAAGTTGAGGAAAGGAGATCGGATGGTTTAGATTTTAGAAAtgtagacacgaataaatttaATTACGAAAGCTACTTTGACTCTCTATTGTCTACTAAAAGGATTAGTCAATTGTCTAATAGGTCCAATCCATTGCCTAGTAAAAGCAATAGGTCTAGTCTATCGTCTCGTAAACAAGACAAATCTGGTTTGTTTACAAATAATCAACCAATCAATCCTGAGTTACCTGATGAATTGACTCTGGAAGTCATCACCGCCAAGTTGAACGATTTGGATCTAAACGGGGTGCTAAAATACAGGAGATTTGATGATTTGGGTCCAGGTAACAGAGCAGATGACAGGGAGACTCAGGAGGACGCCGAGTTTGAGTTTATAGAAGAATTATTGAGGAGTTTAGAAGCCGATGAGGGATTGAGTAAACTTATAGACAATAATGGCGACAGAGACATGAGTAATGGTAATAATAACGGTGCCTTGAGGAATGTTTGTGAAAGTGGTATCAAAATGGGTGACTTGAGCAATGGAAATGGTGACTTAAGGAATGCAATTGGTGCCGTAAGCAATGAAACTCGTGACTTAAGCAATGAAACTGGTGGCTTGAGAAACGAAACTGGTGACTTAAGCAACGAAACTCATGATTTAAGCAACAAAACTCATGATTTCACCAATGAAACTCATGATTTAAGCGAAATTCATGACTTAAGCAGTGAATCTGGCGGCTTAACCAATGAAATTGATGGCTTAACCATTGAAACTCATGACTTAACCAAGGAAAGTCATGACTTAACGAAGGAAACTCATGACTTAACCAAGGAAACTCATGACTTCACCAAGGAAACTCGTGATTTCACCAGCGAAACTCATGACTTCAGCAAGGAAACTCATGACTTAACAAAAGAAACTCATGACTTCACCAATGAAACTCATGATTTAAGCGAAATTGATGACTTTACCAAGGAAACTTGTGATTTCACCAAAACTCATGACTTAACCAAGGAAACTCATGACTTAACAAAAGAAACTCATGACTTCACCAATGAAACTCATGATTTAAGCGAAATTGACGACTTAACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAATGAAACTCATGACTTCACCAATGAAACTCATGACTTAACCAAGGAAATTAGTGATTTCACCAGTGAAACTCATGATTTAAGCGAAACTCATGACTTCACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAAGGAAACTCATGACTTAACAAAAGAAACTCATGACTTCACCAATGAAACTCATGATTTAAGCGAAATTGACGACTTTACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAATGAAACTCATGACTTCACCAATGAAACTCATGACTTAACCAAGGAAATTAGTGATTTCACCAGTGAAACTCATGATTTAAACGAAACTCATGACTTCACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGACCTCACCAATGAAACTCATGACTTCACCAATGAAACTCATGACTTAACCAAGGAAATTAGTGTTTTCACCAGTGAAACGCATGATTTAAGCGAAACTCATGACTTCACCAAGGAAACTCATGACTTAACCAAGGAAACTCATGATTTAACCAACGAAACTCATGACTTAACTAATGAAACTCGCCACTTAACCAACAAAACTCGTGATATCATCACCCAAACTCGTGTTGAGGAGTGTATATCTAACGGTATTGTGCCAATGAACATTGAGCCAACGAACATTGAGCCAACAATGACCCAGTCACCGTCCAGCGATCGGAGTGTTAGCAGACGATGCGTGAGTGTGATGGTGTGCGGCAAGGTTGAAGAATATCCGACGGTGGAAGCGTGGAGCAGCACCTCGAAGGACGGTGGTTGTAGTGACGGCGTTGCGTGCGCCACGAAGACGGCGACATCGATCGCGTCCACCGGCGTCGAATACGGCGCGCAACCACCCGGGTGCAGTCGCGATCCGGCCGCCACGGTCTCCACTACGTCCGATACCGAACTTGGCACCGTCGCGCAGCAAACACAACAATCTATAACCTTAAAAAATGACCGAAGTGCGTGCGTCTCGAATTTAAGTTGGGGCTTCAGGAACGGCCGGCTAGTGTTCGATGTTGCAAGTGATGAAGATGCCCGTAACAAAGATGACGAAGGTTCAGATATCCTCAAGTCTAAAGATCTAAACAAGAAGCGCTCTTCTGAAGGTAGGTTGCGATTCCTCGAAGAGAAACTCAAAGAAGCTGGGATTACCGATGGTAAGAATGGAAGCGCAACCACAAACGAAGAGAAACCGGACAGTATAGGTCTTCAAGCTTCATCTTTCGAAGACCTAACATCTGATTCTTCGGAAGAAGACCTAATTCCATCAGATCCTGACAATTTAACCACCTCGTTTTGTCAATTCGACCCACAGGAGATACACAAGTTCCAAAACTCGTTCTTCGAGATATACGATGAGTTAGACGACACGGACGACGACCAAGAATCAATCATGAGAGGAGACGGAAGTAACACAATCGCAGAACGTAAGCAACACAATCCAAATCGCGATAACCTCAAGTCACTCCTCAAGAAACCTGGTAGAAACAAGGAGAAGAAGAACAGGGTGATCTTCAATGAAACGAAGAACGAGTTTTTCGACGCCGATTACATTATTTTGATCAGAGAGGAGTGCGACTACGACGAAGAGGAAGATGACGGCGTCTGCACCTGTAACCAACACGAGATGGTGCGTCTTACGTGCTGTGAACCGAATTGTAATTGTAACGTTTACGAGGGATTTGATCCAACGCCTCAATCGCCGAAATTTGCTCCACCGCTGGAGTTCGTGGATGCTGTTACGTTGAGTCCTCCTGAAGAGTACAAGGATATGGAGCTTGAGGAACAACAGCTACTCGCGTTGCAGCAGCAGATGGCCAGAAGAGGACAAAGAGCTCCAGTGTGCAGGGAGTGCAGTGCTTCACATGACGATGAAGAAGTGATTGGACAGATTGCTGCAGTAATTTTGCTGCAGCTAACTGCAATTTCTGATAGCTTAATCGATTCCAGGCATACCAATACAATTCCGATAGCCGTGCTTAACGTAAATGGAACAATTCCCTGGGATAGTTCCGATGCATTAGTTGATAGGACAAGGATTTTCAACAATCTATAA
- Protein Sequence
- MEGDIAPKEDPFDIKIPNFEDPITKTAENKIEILKNDLLSMQGWTRDDLSDVESQIDEIVHHAELLVNSLLIGDGCRLGRFDYFERYELQEHILEAPEEFRSVCEQDRGDGASLDLIASQCPQKTGTQSGVEPERASSEHDDDNDPAPEALGLILASVSAARPCENVDTTLAHCDDRVNCILTNLDEISEQIDGANASLKEIQDLFGEELKLDVNRAVQDGNRPVRDELATTQYLYNQESEDNTDKIAAGEITNTTVDSLMKTDAERGGVSSGIPKKPELINDALYLRRPCPFYVVNTVASERVERRELFNKEIESVQMSGVNKVDGEQVQADRVTGIGDNDFCEFFDGLESDMFPSSFLKTIEEEEDEVMCSSGNNNVLMRNNNVVGGVKLPEPLEVIEEVEERRSDGLDFRNVDTNKFNYESYFDSLLSTKRISQLSNRSNPLPSKSNRSSLSSRKQDKSGLFTNNQPINPELPDELTLEVITAKLNDLDLNGVLKYRRFDDLGPGNRADDRETQEDAEFEFIEELLRSLEADEGLSKLIDNNGDRDMSNGNNNGALRNVCESGIKMGDLSNGNGDLRNAIGAVSNETRDLSNETGGLRNETGDLSNETHDLSNKTHDFTNETHDLSEIHDLSSESGGLTNEIDGLTIETHDLTKESHDLTKETHDLTKETHDFTKETRDFTSETHDFSKETHDLTKETHDFTNETHDLSEIDDFTKETCDFTKTHDLTKETHDLTKETHDFTNETHDLSEIDDLTKETHDLTKETHDLTNETHDFTNETHDLTKEISDFTSETHDLSETHDFTKETHDLTKETHDLTKETHDLTKETHDFTNETHDLSEIDDFTKETHDLTKETHDLTNETHDFTNETHDLTKEISDFTSETHDLNETHDFTKETHDLTKETHDLTNETHDFTNETHDLTKEISVFTSETHDLSETHDFTKETHDLTKETHDLTNETHDLTNETRHLTNKTRDIITQTRVEECISNGIVPMNIEPTNIEPTMTQSPSSDRSVSRRCVSVMVCGKVEEYPTVEAWSSTSKDGGCSDGVACATKTATSIASTGVEYGAQPPGCSRDPAATVSTTSDTELGTVAQQTQQSITLKNDRSACVSNLSWGFRNGRLVFDVASDEDARNKDDEGSDILKSKDLNKKRSSEGRLRFLEEKLKEAGITDGKNGSATTNEEKPDSIGLQASSFEDLTSDSSEEDLIPSDPDNLTTSFCQFDPQEIHKFQNSFFEIYDELDDTDDDQESIMRGDGSNTIAERKQHNPNRDNLKSLLKKPGRNKEKKNRVIFNETKNEFFDADYIILIREECDYDEEEDDGVCTCNQHEMVRLTCCEPNCNCNVYEGFDPTPQSPKFAPPLEFVDAVTLSPPEEYKDMELEEQQLLALQQQMARRGQRAPVCRECSASHDDEEVIGQIAAVILLQLTAISDSLIDSRHTNTIPIAVLNVNGTIPWDSSDALVDRTRIFNNL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -