Mann004979.1
Basic Information
- Insect
- Macrophya annulata
- Gene Symbol
- -
- Assembly
- GCA_963924015.1
- Location
- OZ001258.1:4701414-4706957[+]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 23 0.0066 7 7.4 11.4 25 60 544 579 534 584 0.56 2 23 0.0037 3.9 8.2 10.7 25 63 551 589 547 594 0.89 3 23 0.0045 4.8 7.9 4.6 32 64 607 639 606 640 0.89 4 23 0.0023 2.4 8.9 4.3 25 60 628 663 627 665 0.90 5 23 0.0073 7.7 7.2 5.5 32 63 677 708 675 710 0.78 6 23 0.0032 3.3 8.4 6.4 25 60 712 747 708 749 0.75 7 23 0.002 2.1 9.1 7.0 25 61 761 797 760 800 0.80 8 23 0.0064 6.7 7.4 4.2 25 54 796 825 794 828 0.87 9 23 0.0026 2.7 8.7 5.0 32 64 838 870 834 871 0.86 10 23 0.03 31 5.3 3.8 25 50 866 891 865 895 0.69 11 23 0.0054 5.6 7.7 8.3 25 61 908 944 907 946 0.80 12 23 0.0019 2 9.1 8.8 25 63 929 967 928 973 0.93 13 23 0.0011 1.2 9.9 7.6 25 64 985 1024 984 1025 0.92 14 23 0.045 48 4.7 3.8 25 50 1020 1045 1019 1050 0.70 15 23 0.0011 1.2 9.9 7.6 25 64 1062 1101 1061 1102 0.92 16 23 0.019 20 5.9 4.1 31 58 1096 1123 1095 1127 0.49 17 23 0.0077 8.1 7.2 5.2 32 62 1139 1169 1135 1172 0.81 18 23 0.0015 1.6 9.5 5.0 25 61 1174 1210 1172 1212 0.87 19 23 0.0063 6.6 7.5 5.6 26 61 1224 1259 1222 1262 0.68 20 23 0.0009 0.95 10.2 9.1 25 63 1244 1282 1242 1288 0.93 21 23 0.0023 2.4 8.9 5.8 25 64 1300 1339 1299 1340 0.82 22 23 0.022 24 5.7 2.8 25 63 1356 1394 1355 1396 0.90 23 23 0.096 1e+02 3.7 3.6 25 61 1398 1434 1391 1438 0.65
Sequence Information
- Coding Sequence
- ATGGCTCCCGCGGCGGtggcaatatttttaatctctcTATTCGTGGAAGGATTATCAGCTCCAAGTTGGTGCGGAGATTGTCAAACATGGGAATCGCACAGAGGCTCTCAAACTCATAGAGGATTTGGCAGGGaaataaatcaagaaaatttgTCTCAACGGTCAGAAAATTTGGAAGATTTAACACAACAAGCGGAAACCGAGTTCAACAGATCTCccaatcaatttaattttgataatacgAGGCCTGGAAATTGGACTGATGTTAATCATTACAGAACATCTGATGGTCATGGAAGAGTATACGAAGAACAAGGCCAGCGTGTAGATGGATCAAGTCGAATTAGATTCTCtaaaagaaatttcacttCCAGCTATAGTAGTGGAAACTTAGGTTCCTTTGGAGAAACTAATCTAGGACGTATATATCCTAACATAAGACAAGATGAGAGCCAGTTATTGAACCGTGAATCTTTAGATCAGTCACAAAATTCAGCTTATGATCGATTCGCCACTGGACGAAATTCTCATACTACACAGGACTCTTTACATTCTACCGAACGGGTGAACAGCCATAACGATGCATCCAGATATTATGAAAATCTTGGCAATAGTCAGATAAGCAAAATTACTTCTGGTCAATCAGTGCAGCAAGGAATTAATGCATTGGATCAAACAAGACCAGCAAATTGGAGCACGGTTAATACGTTTAGAACCAATGATGGTAATGGCAGGGTTTACGAAGAACGAGGGCAGTTTGTAACCGGGCCGAAGCGGGTTCATTTTTATGTAAGAAATTACACTTCAAGTTATGCCTCTGACGGAGGTATTCCGACTCTTGATTTAGGGGGTGAAGATACAAGGAATGTCGAAAGCAGCATACGGCAGCAGCAGAGACTATTCGATAGTTACGGAAGAGAGCTTCATCAAAGTAATGAGGGTTCAACAAATGGTGGTTACACTCAGCATTATCCTGGACATTATACATCACCCAGTCAAACGACGAGACAAACAAACTACAGATATGTATCGAGACCTGGTAGCTACGAATTACAAAATCAGAATGCTTTAGattcaaattctcaccaaactTACCAGCACACATCTAACTCAGGAAATCAGCATGTGTCTCAGTCTAGCAGTAGTTCTTTTGATGGATTGGGACAAGTTAATGGAAGAAATCCAAATTCTGCATATTCAACAGGCAGTTATACCAGTAATAGCAGATACAATCATCTGGGAACTGTGGAAATACCTTCTTCGGGACACACTGGACACAGGATTCCATATTACAATCAATTTCAAACCACCTCTGATTCCTCCTCTGCTGCAATATCTCATCCTGACACCGATTTAAGAACTATTCAATCTGGTAGCGATCAAGAAACACAGCGTACGCTTAATTCACACAACAGCTTTGATCAAACTACTAAGAATAATCAAAAGAGGATTTATAGGGTACAGAATGGGCAACTAGTTACACAGGGAATTGATTTGGGACAAATAGCACAAGCTCCTGATTGTGCAGAAGGTACAAATGGATATAGCTCATATGAACAGTCCTACCGTAGGGTCTATAGAGGGGCTCCCGAATCTCATGATTTTTCGCAACAAGTGCAAGATCTTACCCAGCAAACAGAGGATCTTACCCAGCAAACGGAAGATCTCACCCAGCAAACGGAAGATCTCACCCAGCAAACGGAAGATCTTACCCAGCAGACGCAGGATCTTACACAACAAACAGAAGATCTAACACAACAAAATCAAGATTTTGGACAGCAATCTTTTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAACAGGTTGAAGATCTCACTCAGCAAACAGAGGATCTTACTCAACAAACGGAAGATCTTGCCCAGCAGACAGAAGGTCTTACTCAGCAATCAGAGGATCTTACCCAACAAACGGAAGGGCTAACCCAGCAAACGGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAAACAGAAGATCTTACTCAGCAATCAGAGGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACTCAGCAGACGGAAGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACCGAAGATCTTACTCAACAAACAGAAGACCTTACTCAGCAAACAGAGGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACCCAGCAGACAGAAGATCTTACACAACAGAATCAGGATTTCGAACAGCAATCTTCTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTTACTCAGCAAACAGAGGATCTTACTCAGCAAACAGAAGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACCCAGCAGACGGAAGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAAACAGAAGACCTTACTCAGCAAACAGAGGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACAGAAGATCTTACCCAGCAGACAGAAGATCTTACACAACAGAATGAGGATTTCGTACAACAATCTTCTTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAATCAGAGGATCTTACCCAACAAACGGAGGGGCTCACCCAGCAAACAGAAGATCTCACCCAGCAGACGGAAGATCTTTCACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAATCAGAGGATCTCACCCAACAAACGGAGGGGCTCACCCAGCAAACAGAAGATCTCACCCAGCAGACGGAAGATCTTTCACAACAAACAGAAGATCTTACACAACAAAATCAGCATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTTACGCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTACTCAGCAAACAGAAGATCTTACTCAGCAATCAGAGGATGTTACCCAACAAACGGAAGGGCTCACCCAGCAAACAGAAGATCTGACCCAGCAGACGGAAGATCTTACACAACAAACAGAAGATCTTACACAACAAAATCAGGATTTCGGGCAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTGGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCACCAAACGGAAGATCTTACCCAACAAACGGAAGGGCTCACCCAGCAAACGGAAGATCTTACACAACAAACAGAAGATCTCACTCAGCAAACAGAGGATCTTACCCAACAAACGGAAGATCTTACACAACAAAATCAGGATTTCGGACAGCAATCTTATTGGAGACCAGGTAAATTGGAAGTTGGTAGTCAGCAGGTTGAAGATCTGACTCAGCAAACAGAGGATCTTACTCAACAAACAGAAGATCTTAGTCAACAAACACAGGATCTTACTCAGCAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGTCaacaaattgatgattttgGTCAACAAACACAGGATCTTACTCAGCAAACTGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGtcaacaaattgaaaatcttggtCAACAAACCGAAGATTTTGGTCAACAAACACAGGATCTTACTCAACAAACTGAAGATCTTGGtcaacaaattgaaaatcttggtCAACAAACCGAAGATTTTGGTCAACAAACACAGGATCTTGGTCAACAAACTGAAGATCTTAATCAGCAGACGTTAGGTATTTTCCAAGAAACTGATGGCCAATCACAGCaacctgaaaatttcaatagttGGGGGGAACAGATACCAAGTAGCCCAGGATTTGGACGGGAGTCTTCTTGGAACTCTGACAATCTGGAAATTGGGGGTCAAcaaactgaaaatttcaatcaagaaAATCAATTTGGTGACCAACAAACAATCATCCATCCCGAACAATCGAACAGACCAGCACCAAAGCCTGCACCTAAACCAAGACGTCAAAGACCTGGGAATTTCCATCATACTCAACAGATTAATATAGAGATTGAAGAACCAACTGTATCTAATGCACATAGTCATACAGCGCAACATAACAATCAgcaaaatagtgaaaaatggGTATCAACGGGTGTTCCTCCCATTCCACAAAGAGGTGATCAAGACTCAACCAGCCAGCAAACTACAAGTGGGAATCAATTCAGACAAACTCAACCTACTAAAACTAAGACAAGTCGCCGAAGAGGGAATGACCGTGCTCAATACCAAGGTCCACAAGGATGGCATTCTCGTGATTTGTCGATTAGTCAAGATCCATTGATGAGACTTGTTGATAGACGTATGAACTCAGATGATTCAAACTTGCCCCAATCAGCAAATACTGGGCAAGTTAGACAAGATTTTCAACAACATTTGACTAATCCTAATGAAACTGAACAGCTCGAATCTAGGCAAACAGTTCAACCTCTTGGTGCGGCTATAGAATCAAGACAACGGAGCAGTGATCAATCAGATAGAATTGTCTTTCCCGACTCTTCAGAAGCCTCTTTCAGCCCTAGAATTTTAGAGGCATTTGGGGCGAATGGACCATACGGCGAACATGATTTAGGTATATTTGATTCTGCCAAAAAGTATCCTGACACTACAACAGTTTTAACACCCCCTGAAAATGGAAATGATTGGGATATTCGTGAAGTTGATCGGACAGTTACAACCACAACTGAGGCTCCAACTCCTCTACCAACAACTACTACAACAACACCAGCTCCGCCTCCAACTCCGGCTCCTGGATTTTGGAAAAAACTGGGTAACACCTTCAGTACTACGCTAGAAAAAGCCAAGGAGAAGGCGAGCGGCTGGTTTGGCTAA
- Protein Sequence
- MAPAAVAIFLISLFVEGLSAPSWCGDCQTWESHRGSQTHRGFGREINQENLSQRSENLEDLTQQAETEFNRSPNQFNFDNTRPGNWTDVNHYRTSDGHGRVYEEQGQRVDGSSRIRFSKRNFTSSYSSGNLGSFGETNLGRIYPNIRQDESQLLNRESLDQSQNSAYDRFATGRNSHTTQDSLHSTERVNSHNDASRYYENLGNSQISKITSGQSVQQGINALDQTRPANWSTVNTFRTNDGNGRVYEERGQFVTGPKRVHFYVRNYTSSYASDGGIPTLDLGGEDTRNVESSIRQQQRLFDSYGRELHQSNEGSTNGGYTQHYPGHYTSPSQTTRQTNYRYVSRPGSYELQNQNALDSNSHQTYQHTSNSGNQHVSQSSSSSFDGLGQVNGRNPNSAYSTGSYTSNSRYNHLGTVEIPSSGHTGHRIPYYNQFQTTSDSSSAAISHPDTDLRTIQSGSDQETQRTLNSHNSFDQTTKNNQKRIYRVQNGQLVTQGIDLGQIAQAPDCAEGTNGYSSYEQSYRRVYRGAPESHDFSQQVQDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTQDLTQQTEDLTQQNQDFGQQSFWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLAQQTEGLTQQSEDLTQQTEGLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQSEDLTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEGLTQQTEDLTQQTEDLTQQNQDFEQQSSWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQTEGLTQQTEDLTQQTEDLTQQNEDFVQQSSWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQSEDLTQQTEGLTQQTEDLTQQTEDLSQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQSEDLTQQTEGLTQQTEDLTQQTEDLSQQTEDLTQQNQHFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLTQQTEDLTQQSEDVTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGGQQVEDLTQQTEDLTHQTEDLTQQTEGLTQQTEDLTQQTEDLTQQTEDLTQQTEDLTQQNQDFGQQSYWRPGKLEVGSQQVEDLTQQTEDLTQQTEDLSQQTQDLTQQTEDFGQQTQDLTQQTEDLGQQIDDFGQQTQDLTQQTEDFGQQTQDLTQQTEDLGQQIENLGQQTEDFGQQTQDLTQQTEDLGQQIENLGQQTEDFGQQTQDLGQQTEDLNQQTLGIFQETDGQSQQPENFNSWGEQIPSSPGFGRESSWNSDNLEIGGQQTENFNQENQFGDQQTIIHPEQSNRPAPKPAPKPRRQRPGNFHHTQQINIEIEEPTVSNAHSHTAQHNNQQNSEKWVSTGVPPIPQRGDQDSTSQQTTSGNQFRQTQPTKTKTSRRRGNDRAQYQGPQGWHSRDLSISQDPLMRLVDRRMNSDDSNLPQSANTGQVRQDFQQHLTNPNETEQLESRQTVQPLGAAIESRQRSSDQSDRIVFPDSSEASFSPRILEAFGANGPYGEHDLGIFDSAKKYPDTTTVLTPPENGNDWDIREVDRTVTTTTEAPTPLPTTTTTTPAPPPTPAPGFWKKLGNTFSTTLEKAKEKASGWFG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -