Malb025230.2
Basic Information
- Insect
- Macrophya alboannulata
- Gene Symbol
- -
- Assembly
- GCA_949628255.1
- Location
- OX451211.1:14990383-14995158[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 13 0.00039 0.48 11.3 7.5 25 63 537 575 535 577 0.91 2 13 0.00049 0.61 11.0 7.1 25 63 593 631 592 633 0.92 3 13 0.00035 0.44 11.5 4.9 24 61 655 692 652 696 0.91 4 13 0.0026 3.3 8.7 8.8 25 64 705 744 694 745 0.85 5 13 0.0078 9.7 7.2 11.6 25 63 719 757 713 762 0.80 6 13 0.00035 0.44 11.5 4.9 24 61 781 818 778 822 0.91 7 13 0.0026 3.3 8.7 8.8 25 64 831 870 820 871 0.85 8 13 0.0078 9.7 7.2 11.6 25 63 845 883 839 888 0.80 9 13 0.00073 0.91 10.4 5.3 25 63 901 939 900 941 0.79 10 13 0.0019 2.4 9.1 3.0 32 63 957 988 953 990 0.93 11 13 0.017 21 6.1 7.1 25 56 1006 1037 1005 1044 0.58 12 13 0.12 1.4e+02 3.4 2.6 29 64 1089 1124 1084 1132 0.69 13 13 0.15 1.9e+02 3.0 4.6 25 61 1127 1163 1120 1180 0.65
Sequence Information
- Coding Sequence
- ATGGCTCCCGCGgtggtggcaatttttttaatcactctATTCGTGGAAGGATTATCAGCTCCAAGTTGGTGTGGAGATTGCCAAACATGGGAATCGCACAGTGGCTCTCAGACTCATAGAGGATTTGGGAGGCAAATAAATCCCGAAAATTTGTCTCAGAGGTCAGAAAACTTGGAAGATTTAACACAACAAGCCGAAACCGAGTTAAACAGATCTCCCAATCAATTACCTTTCGATAATACAAGGCCTGGAAATTGGACCGATGTTAATCATTACAGAACATCGGATGGCCATGGAAGAGTATACGAAGAACAAGGCCAGCGTGTAGATGGATCAAATCGAATAAGATTCTCTAGAAGAAATTTCACTTCCAGTTATAGCAGTGGAAGCTTAGGTTCCTTTGGAGAAACTAATCTGGGACATATATATCCCAACGTTAGACAAGATGAGAGCCAGTTATTGAACCGCGAATCTTTGGATCAGTCACAAAATTCGGCTTATGATCGATTCGCCACTGGACGAAATTTCCATACTACACAGGACTCTTTACACTCTACAGAAAGGGTGAACAGTCATAACGATGCATCCAGATATTATGAAAATCATGGCAATAGTGGTCGGATCAGTGGAATTACTTCTGGCCAATCATCGCAACAAGGAATCAATGTATTGGATCGAACAAGACCAGGAAATTGGAGCACGGTTAATACTTTTAGAACCAATGAGGGTAATGGCAGAGTTTACGAAGAACGAGGGCAGATTGTAACAGGGCCGAGGCGAGTTCATTTTTATGCAAGAAATTACACTTCAAGTTATGCCTCTGGCGGAGGTATTCCGACTCTTGGTTTGGAGGGCGACAATACAAGGAACATCGAGAGTACCGTGCAGCAGATGCAGAGACAATTTGATAGTTATGGAAGAGAGCTTCATCAAAGTACTGAAGGTTCAACAAATGGTGATTACACTCAGCATTATCCTGGAGATTATACATCACCTAGTCAAACGTCGAGACAAACAAACTATAGATATGTATCAAGACCCAGTAACTATGAATCGCAAAATCAGAATACTTTGGATTCAAATTCTCACCAAACGTACCAACACACAACTAACTTAGGAAATCGGCATGTGTCTCAGTCTAGCAGTAGTTCTTTTGGTGGATCTGGACAACTTAATGGAAGAAATCCAGATTCTGGATACTCAACGGGCAGTTATACCAGTGGTAGTGGATACAATCATCAGGGAACTTTGGAAATACCATCTTCAGGACACACTGGACACCAAATTCCATATTACAATCAATTTCAAACCACCTCTGACTCCTCCTCTGCTGCCTTTGTATCTCGTCCCGATACCGATCTAAGAACTATTCAATCTGGTAGCGATCTAGAAACACAGCGAACGCTTAATACACACAACAGCTTTGATCAAACTACTAATAATAATCGGATTTATAGGATACAGAATGGGCAACTAGTTACACAGGGTATTGATTTGGGACAAATAGCACAAGCTCCTGATTGTGCAGAAGGTACAAATGGATATAGCTCATACGAACAGTCCTCCCGTAGAATCTATAGAGGGGCTGCCGAGCCTCATGATCTTTCGCAACAAGTGCAAGATCTTACCCAGCAGACGGAGGATCTTACCCAGCAAACGGAAGATCTAACTCAGCAGACACAGGATCTTACACAACAAACAGAAGATCTTACACAGCAAAATCAAGATTTTGGACAACAATCTTCTTGGAGACCAGGTAAATTGGAGGTTGGCAGTCAGCAGGTTGAAGATCTCACCCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACCCAGCAATCGCAGGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAAGATTTTGGACAGCAATCTTCTTGGAGGCCAGGTAAATTGGAAGTTGGTAGTCAGCAGATTGAAAATCTCCCACAACAAACCGAAGGTCTTACTCAGCAAACGGAAGATCTTACCCAGCAGACGGAGGATCTTACTCAACAAGCTGAAGATCTGACACAACAAAATCAAGATTTCGGACAGCAACCTTCTCGGCGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACCCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACCCAGCAAACGGAAGATCTTACCCAGCAAACAGAAGATCTCACTCAGCAAACGGAGGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAAGATTTTGGACAGCAATCTTTTTGGAGGCCAGGAAAATTGGAAGTTGGTAGTCAGCAGATTGAAAATCTCCCACAACAAACCGAAGGTCTTACTCAGCAAACGGAAGATCTTACCCAGCAGACGGAGGATCTTACTCAACAAGCTGAAGATCTGACACAACAAAATCAAGATTTCGGACAGCAACCTTCTCGGCGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACCCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACCCAGCAAACGGAAGATCTTACCCAGCAAACAGAAGATCTCACTCAGCAAACGGAGGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAAGATTTTGGACAGCAATCTTTTTGGAGGCCAGGAAAATTGGAAGTTGGTAGTCAGCAGATTGAAAATCTCCCACAACAAACCGAAGGTCTTACCCAGCAAACAGAAGATCTTACTCAGCAAACGGAGGATCTTACACAACAAACGGAAGATCTGACACAACAAAATCAAGATTTCGGACCGCAATCTTCTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACCCAGCAAACAGAGCATCTTACCCAGCAAACGGAGGATGTTACACAACAAACGGAAGATCTGACACAACAAAATCAAGATTTCGGACCGCAATCTTTTTGGCGACCAGGTAAATTAGAAGTTGGTAGTCAGCAGGTTGAAGATCTCACCCAACAAACGGAAGATCTTACGCAACAAACGGAAGATCTTACGCAACAAACAGAAGACCTTACTCAACAAACAGAAGGCGAGACGCAACAAAATCTTATACCACCTCATTTCCAACCTTGGCACCATGAAAGATGGCAAGCTGCAGACCCAAATTATGTACCCCTACAAACAGTGTACGAAGGAGCAGTGAGACCTGAAAACTCAGATATCACCAGTCAACAACCGGAGGATCTAACTCAACAAACTGAAGATTTTGGGCAACAAACACAGGATCTTACTCAGCAAAGTGAAGATCTTGGTCAACAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAAAAAACTGAAGATCTTGGTCAACAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGTCAACAAACTGAATATTTTGGTCAACAAACACAGGATCTTGGTCAACAAACTGAAGATCTTGGTCAGCAAACGCAAGGTATCATCCAAGAAACTGATGGCCAATCACagcaaaatgaaaattttaatggtTGGAGGGAACAGATAACAAGTGGCCCAGGATTTGGACAGGAGTCTCCTTGGAACTCTGACAATCTGGAAATTGGAGGTCAACaaaccgaaaatttttatcaagaaaatcaatttggcAAACACCAAACAATCATCCATCCCGGACAACCGACAAGACCAGCACCAAAGCCTGCACCTAAACCAAAACGTCCAAGTCACGGGAATTTCCATCATACTCAAGAGATTAATATAGAGATTGAAGAACCAACTGTATCTAATGCAGATAGTCATACGGTGCAACATAATGATCAGcaaaatagtgaaaaatggGTATCAACGGGTGTTCCTCCCACTCCACAAAGAGGTGATCAAGGTATCAACGTAAACTCTAACGAACCTGAAGAAGCAGATATTAATATTGAATCAGATATACCCAAGATACCTGAACATCAAATTCAATATGTGTACCCGTACCCGGATTCATCCAGCCAACAAACTACAAGTGGAAATCAATTCAGAGAAACTCAACCTACTAAAACTAAGACAAGTCGCCGAAGAGGGAATAATGCTGTTCAATACCAAGGTCCACAAGGATGGCATTCTCGTGATTTGTCAATCAGTCAAGACCCAACAATCAGACTTGTTGATAGACGTATAAACTCAGGTGACTTAAACTTGCCGCAATCAGCAAATACCGGACAAGTTATACAAGACTTTCAACAACATTTGACTAATCCTAAAGAAATTGAACAACTTGAATCTGGGCAAACAGTTCAGAGAATTCAACCTCTTGGTGCGGCTATAGAATCAAGACAACGGAGCAGTGGTCAATcagaaaaaattgtctttcCCGAATCTTCAGAAGTCTCTTTTAGTCCTAGAATTTTAGAGGCATTTGGAGCGAATGGACCATACGGCGAACATGATTTGGATATATTTGATTCTGCCAAACAGTATCCTGACACTACAACAGTTTTAACACCGCCTGAAAATGGAAATGATTGGGATATTCGTGAGGTTGATCGGATAGTTACAACCACAACTGAGGCTCCAACTCCTTTAccatcaacaacaacaactcCTCTGCCAACAACAACACCACCTCCGCCTCCAACTCCGGCTCCtggattttggaaaaaactgGGTAACACGTTTAGTACTACCGTAGAAAAAGCCAAGGACAAGGCGAGAGACTGGTTCGGTTAA
- Protein Sequence
- MAPAVVAIFLITLFVEGLSAPSWCGDCQTWESHSGSQTHRGFGRQINPENLSQRSENLEDLTQQAETELNRSPNQLPFDNTRPGNWTDVNHYRTSDGHGRVYEEQGQRVDGSNRIRFSRRNFTSSYSSGSLGSFGETNLGHIYPNVRQDESQLLNRESLDQSQNSAYDRFATGRNFHTTQDSLHSTERVNSHNDASRYYENHGNSGRISGITSGQSSQQGINVLDRTRPGNWSTVNTFRTNEGNGRVYEERGQIVTGPRRVHFYARNYTSSYASGGGIPTLGLEGDNTRNIESTVQQMQRQFDSYGRELHQSTEGSTNGDYTQHYPGDYTSPSQTSRQTNYRYVSRPSNYESQNQNTLDSNSHQTYQHTTNLGNRHVSQSSSSSFGGSGQLNGRNPDSGYSTGSYTSGSGYNHQGTLEIPSSGHTGHQIPYYNQFQTTSDSSSAAFVSRPDTDLRTIQSGSDLETQRTLNTHNSFDQTTNNNRIYRIQNGQLVTQGIDLGQIAQAPDCAEGTNGYSSYEQSSRRIYRGAAEPHDLSQQVQDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQNQDFGQQSSWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQSQDLTQQTEDLTQQNQDFGQQSSWRPGKLEVGSQQIENLPQQTEGLTQQTEDLTQQTEDLTQQAEDLTQQNQDFGQQPSRRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSFWRPGKLEVGSQQIENLPQQTEGLTQQTEDLTQQTEDLTQQAEDLTQQNQDFGQQPSRRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSFWRPGKLEVGSQQIENLPQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGPQSSWRPGKLEVGSQQVEDLTQQTEHLTQQTEDVTQQTEDLTQQNQDFGPQSFWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEGETQQNLIPPHFQPWHHERWQAADPNYVPLQTVYEGAVRPENSDITSQQPEDLTQQTEDFGQQTQDLTQQSEDLGQQTEDFGQQTQDLTQKTEDLGQQTEDFGQQTQDLTQQTEDLGQQTEYFGQQTQDLGQQTEDLGQQTQGIIQETDGQSQQNENFNGWREQITSGPGFGQESPWNSDNLEIGGQQTENFYQENQFGKHQTIIHPGQPTRPAPKPAPKPKRPSHGNFHHTQEINIEIEEPTVSNADSHTVQHNDQQNSEKWVSTGVPPTPQRGDQGINVNSNEPEEADINIESDIPKIPEHQIQYVYPYPDSSSQQTTSGNQFRETQPTKTKTSRRRGNNAVQYQGPQGWHSRDLSISQDPTIRLVDRRINSGDLNLPQSANTGQVIQDFQQHLTNPKEIEQLESGQTVQRIQPLGAAIESRQRSSGQSEKIVFPESSEVSFSPRILEAFGANGPYGEHDLDIFDSAKQYPDTTTVLTPPENGNDWDIREVDRIVTTTTEAPTPLPSTTTTPLPTTTPPPPPTPAPGFWKKLGNTFSTTVEKAKDKARDWFG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -