Etri020222.1
Basic Information
- Insect
- Eupithecia tripunctaria
- Gene Symbol
- Zbtb41
- Assembly
- GCA_955876795.1
- Location
- OY041615.1:753698-756863[-]
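The location string packs the contig accession, the coordinate range, and the strand into one field. A minimal sketch of splitting it into its parts is shown here; the `GenomicInterval` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomicInterval(NamedTuple):
    """Hypothetical container for one parsed location field."""
    contig: str
    start: int
    end: int
    strand: str


# Layout assumed from this record: <contig>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> GenomicInterval:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomicInterval(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OY041615.1:753698-756863[-]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc, span)
```

The `[-]` suffix indicates the gene lies on the reverse strand of the contig.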
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
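| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 14 | 0.16 | 1.4e+03 | 0.4 | 0.1 | 26 | 46 | 198 | 218 | 188 | 225 | 0.87 |
| 2 | 14 | 0.42 | 3.6e+03 | -0.9 | 0.1 | 21 | 33 | 221 | 233 | 217 | 252 | 0.75 |
| 3 | 14 | 0.6 | 5.1e+03 | -1.4 | 0.0 | 26 | 46 | 253 | 273 | 247 | 280 | 0.78 |
| 4 | 14 | 0.4 | 3.4e+03 | -0.8 | 0.1 | 22 | 52 | 278 | 308 | 273 | 310 | 0.76 |
| 5 | 14 | 0.00085 | 7.3 | 7.7 | 0.0 | 18 | 53 | 329 | 365 | 323 | 366 | 0.84 |
| 6 | 14 | 0.026 | 2.2e+02 | 2.9 | 0.1 | 22 | 52 | 362 | 392 | 360 | 392 | 0.85 |
| 7 | 14 | 0.00018 | 1.6 | 9.9 | 0.1 | 23 | 44 | 391 | 412 | 383 | 416 | 0.89 |
| 8 | 14 | 0.016 | 1.4e+02 | 3.6 | 0.4 | 26 | 46 | 422 | 442 | 416 | 449 | 0.89 |
| 9 | 14 | 0.37 | 3.2e+03 | -0.8 | 0.0 | 22 | 46 | 446 | 470 | 441 | 478 | 0.80 |
| 10 | 14 | 0.17 | 1.5e+03 | 0.3 | 0.1 | 21 | 31 | 473 | 483 | 469 | 499 | 0.82 |
| 11 | 14 | 0.0027 | 23 | 6.1 | 0.0 | 21 | 46 | 501 | 526 | 490 | 529 | 0.89 |
| 12 | 14 | 0.00029 | 2.5 | 9.2 | 0.0 | 21 | 49 | 529 | 557 | 526 | 560 | 0.87 |
| 13 | 14 | 0.0018 | 15 | 6.7 | 0.0 | 21 | 52 | 557 | 588 | 554 | 588 | 0.88 |
| 14 | 14 | 0.00054 | 4.6 | 8.3 | 0.0 | 21 | 48 | 585 | 612 | 583 | 618 | 0.88 |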
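The per-domain hmmscan rows above can be read programmatically to rank or filter hits. The sketch below copies the 14 rows from this record and assumes each row has 13 whitespace-separated fields in the column order given in the header. The `DomainHit` type, the `parse_rows` helper, and the i-Evalue cutoff of 10 are illustrative assumptions, not thresholds used by the database.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record holding a subset of the hmmscan domain columns."""
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from this record's "Hmmscan Out" field.
ROWS = """\
1 14 0.16 1.4e+03 0.4 0.1 26 46 198 218 188 225 0.87
2 14 0.42 3.6e+03 -0.9 0.1 21 33 221 233 217 252 0.75
3 14 0.6 5.1e+03 -1.4 0.0 26 46 253 273 247 280 0.78
4 14 0.4 3.4e+03 -0.8 0.1 22 52 278 308 273 310 0.76
5 14 0.00085 7.3 7.7 0.0 18 53 329 365 323 366 0.84
6 14 0.026 2.2e+02 2.9 0.1 22 52 362 392 360 392 0.85
7 14 0.00018 1.6 9.9 0.1 23 44 391 412 383 416 0.89
8 14 0.016 1.4e+02 3.6 0.4 26 46 422 442 416 449 0.89
9 14 0.37 3.2e+03 -0.8 0.0 22 46 446 470 441 478 0.80
10 14 0.17 1.5e+03 0.3 0.1 21 31 473 483 469 499 0.82
11 14 0.0027 23 6.1 0.0 21 46 501 526 490 529 0.89
12 14 0.00029 2.5 9.2 0.0 21 49 529 557 526 560 0.87
13 14 0.0018 15 6.7 0.0 21 52 557 588 554 588 0.88
14 14 0.00054 4.6 8.3 0.0 21 48 585 612 583 618 0.88
"""


def parse_rows(text: str) -> List[DomainHit]:
    """Parse rows; fields are #, of, c-Evalue, i-Evalue, score, bias,
    hmm from/to, ali from/to, env from/to, acc (13 in total)."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue  # skip blank or malformed lines
        hits.append(DomainHit(
            index=int(fields[0]),
            c_evalue=float(fields[2]),
            i_evalue=float(fields[3]),
            score=float(fields[4]),
            ali_from=int(fields[8]),
            ali_to=int(fields[9]),
        ))
    return hits


hits = parse_rows(ROWS)
I_EVALUE_CUTOFF = 10.0  # illustrative cutoff only, not a database setting
kept = [h.index for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
best = max(hits, key=lambda h: h.score)
print(kept, best)
```

Under this illustrative cutoff only a few of the 14 domain matches pass, and the highest-scoring one is domain 7 (protein residues 391 to 412).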
Sequence Information
- Coding Sequence
- ATGCCGTCTTCTGTATGTAGAATCTGTTTATGTAGTGTTAAGAAACTGtactacatagaaaacaaatatCCACAAGAAGTTTACCAGAGGTTTACTGGAAGTTTGCTCCTGTCAGGAGACTACAGACCCAGCACTGTGTGCTACATGTGTGACGCACAGTTAGTAAGATGCTGGAAGTTCTTTGACATGTGTCAGAAGTCGGAGAAAATCTGCACAGACCTGTATCAGAACAACCTAGAGATAACAGAGGAGAGTCTAGCTAAGCACAACCTAGGGTGGATATACAACTTGTCCACATCACCTGTGAAATTAGTAACTTATGTTGAAGAGACTGAAATTACTATTAAAGAAGAAGACACTTCAGATGATGAAGTCCTGATGCCCATGTTTGTTGAAGAAATCAAAAGTGAATTAATTCAAGTCAATTTGCCTTATAGTAATGAAACTGTTGATAATAAATCTAAGACCACAAGTATGCATGATAGCTCAAATGGAAATTGTAATGAAAAggatataaatagtaaaaaaagagGCAAAACAAAATCTAAACTGAAAGGAGAAACCCGTAGAGGCAATGTAGCTACACTATTTAAGTGTGATGTTTGTAAAAAGCCATTTACAAAGAGCTCTTACTTGAAAAGGCACATGAGGATACACACTGAGGAAGAGCCATATTCTTGCAAAATATGCACTGAGAGCTTTGGTTGTATGGATTCATTAAAAACTCATATGCTTGGACATACCGCTGAGATGCCCACCTGTGATGTCTGTGCCAAAAGCTTTGCAACAAAAACAGTTCTCAATAGACACATTATGCAAGTCCACTCTGGACAAAAACCATTGGAATGCAAAGTTTGCAATAAATCGTTTCCTGTCAGAGATAGTCTACGGGCGCATATGAAAATTCATGTTGGATACAAGCCATTTGTCTGCAATGTCTGTGGGAAAACTTTTTTTGACAAAAGTCAATTGAAAGTCCACACCAGGACACATTCTGGAGAAAAGCCATTTACATGTGACTTGTGCAAAAAGTCATTTACTCAATTAACTATATTAAAAAGTCATATGCATCTGCACACGGGAGAGAAGCCGTATGTTTGTAAAATATGCAATAAGTCATTTCGACATCGTTCCACATTGGCCTATCATTCCCAACTTCATACTGGGGGTAAACCATATACATGCCCTATATGCAACAGTGTCTTTGATCTGAAGCCTCTTTTACGCAAGCATATGGCGACTCACACCGACAACACCTCCCATGTTTGCAAAGTTTGTAACAAGGCATTTATCCAAAGTGTTTGTCTAAGGAAGCACATGAAAATTCACACAGAAGAAAGACGATATACTTGCAATGTGTGTGGCAAATCCTTCTTCAGGAGTACTGGCTTAAACAATCACATGAAAACACACACTGGAGAAAAACCATACACATGCAATGTGTGTGGAAAATCATTCTCCCTTGATTCAACTTTAGTGAAACACAAAAGAATACATACGGGTGAGCGACCATACTCCTGTGACCTGTGCAATAAGAAATTCAGGCTCAGCGGCGCTTTGAGTAGACACATCAAAATGCACACTGGAGAAAAGCCATTCACCTGCAACATTTGCAACAAATCATTCATTGGGAGTGGAGAATTGAAAAAACATTTGAAAGTACACAGTGAAGAAAAGCCATATACTTGCTACACTTGCACAAAATCTTTCAGAGATAAAAAGTCTTTAAAAATTCATATTCGATTGCACACTGGAGAAAAGCCATACACATGCAGTATCTGTGGTAAATCATTTACCCAAAGTAGTAGTTTGAGTATGCATGTGAAGAGAGTTCACAGTGAAGACAAACCCTTCGGCTGCACTGTTTGTGGAAAATCGTTTTCAGTCAATTATATTTTGACCAAACACATGAGAATTCATACTAAAGAAGAGCCACAATAG
- Protein Sequence
- MPSSVCRICLCSVKKLYYIENKYPQEVYQRFTGSLLLSGDYRPSTVCYMCDAQLVRCWKFFDMCQKSEKICTDLYQNNLEITEESLAKHNLGWIYNLSTSPVKLVTYVEETEITIKEEDTSDDEVLMPMFVEEIKSELIQVNLPYSNETVDNKSKTTSMHDSSNGNCNEKDINSKKRGKTKSKLKGETRRGNVATLFKCDVCKKPFTKSSYLKRHMRIHTEEEPYSCKICTESFGCMDSLKTHMLGHTAEMPTCDVCAKSFATKTVLNRHIMQVHSGQKPLECKVCNKSFPVRDSLRAHMKIHVGYKPFVCNVCGKTFFDKSQLKVHTRTHSGEKPFTCDLCKKSFTQLTILKSHMHLHTGEKPYVCKICNKSFRHRSTLAYHSQLHTGGKPYTCPICNSVFDLKPLLRKHMATHTDNTSHVCKVCNKAFIQSVCLRKHMKIHTEERRYTCNVCGKSFFRSTGLNNHMKTHTGEKPYTCNVCGKSFSLDSTLVKHKRIHTGERPYSCDLCNKKFRLSGALSRHIKMHTGEKPFTCNICNKSFIGSGELKKHLKVHSEEKPYTCYTCTKSFRDKKSLKIHIRLHTGEKPYTCSICGKSFTQSSSLSMHVKRVHSEDKPFGCTVCGKSFSVNYILTKHMRIHTKEEPQ
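The domain description mentions a Cys2-His2 zinc finger core. One rough way to locate candidate fingers in a protein sequence is a regular expression for the commonly cited simplified pattern C-X(2,4)-C-X(12)-H-X(3,5)-H. The sketch below applies it to a short fragment copied from the protein sequence above. The pattern is a textbook approximation and not the Pfam HMM, so it can both miss real fingers and report spurious ones; `C2H2_RE` and `find_c2h2` are hypothetical names.

```python
import re
from typing import List

# Simplified C2H2 pattern: Cys, 2-4 residues, Cys, 12 residues,
# His, 3-5 residues, His. An approximation, not the Pfam model.
C2H2_RE = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(protein: str) -> List[str]:
    """Return non-overlapping substrings matching the simplified pattern."""
    return [m.group(0) for m in C2H2_RE.finditer(protein)]


# Fragment copied from the protein sequence in this record.
fragment = "LFKCDVCKKPFTKSSYLKRHMRIHTEEEPYSCKICTESFGCMDSLKTHMLGHTAEMP"
matches = find_c2h2(fragment)
for m in matches:
    print(m)
```

Because `finditer` returns non-overlapping matches scanned left to right, closely packed or overlapping candidates would be reported only once.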
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00705280; iTF_00697927; iTF_00697569; iTF_00701276; iTF_00696641; iTF_00706249; iTF_00699435; iTF_00706605; iTF_00699762; iTF_00704243; iTF_00702196; iTF_00696981; iTF_00700310; iTF_00700662; iTF_00702552; iTF_00704606; iTF_00701632;
- 90% Identity
- iTF_00696641;
- 80% Identity
- iTF_00705280;