Lole015021.1
Basic Information
- Insect
- Lacanobia oleracea
- Gene Symbol
- NPLOC4
- Assembly
- GCA_950371165.1
- Location
- OX493392.1:27543061-27548738[+]
Transcription Factor Domain
- TF Family
- zf-MIZ
- Domain
- zf-MIZ domain
- PFAM
- PF02891
- TF Group
- Zinc-Coordinating Group
- Description
- This domain has SUMO (small ubiquitin-like modifier) ligase activity and is involved in DNA repair and chromosome organisation [1][2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 13 0.0035 35 5.7 0.0 39 48 173 182 162 184 0.81 2 13 0.0038 38 5.6 0.0 40 48 225 233 203 235 0.82 3 13 0.0042 43 5.5 0.0 40 48 274 282 256 284 0.84 4 13 0.0048 49 5.3 0.2 40 48 309 317 295 319 0.83 5 13 0.0048 49 5.3 0.2 40 48 344 352 330 354 0.83 6 13 0.0039 40 5.5 0.2 39 48 378 387 365 389 0.83 7 13 0.0039 40 5.5 0.2 39 48 413 422 400 424 0.83 8 13 0.0047 48 5.3 0.0 40 48 449 457 439 459 0.84 9 13 3.4 3.4e+04 -3.8 0.0 40 47 499 506 495 507 0.84 10 13 0.0048 49 5.3 0.2 40 48 534 542 520 544 0.83 11 13 0.0045 45 5.4 0.0 40 48 579 587 568 589 0.83 12 13 0.0044 44 5.4 0.0 40 48 626 634 614 636 0.83 13 13 0.0048 49 5.3 0.2 40 48 661 669 647 671 0.83
Sequence Information
- Coding Sequence
- ATGCCTCCTCACAAGCAAGCACCGGGGCCGCGGGGATGCGAACAGCCCCCGGATCGGAAAAGGGTTGGACCGTTGCAGGAGCGCGACGCGTACGGCAACGAGGTGGGCGTGTCGGCCAAGCGCGTGCCCGTGGCCTACCTGCTGGTGGACGTGCCGTGCGGCGTGGCGGCCGACGGCGGCGCGAGCACGTtcagcgcgcgcgccgccttcCCGCCCGCCCACCGCCCGCTGCAGCAGCACCTGCAGACGCTGCGCGCGCTGCACGCGCACCTGCAGGCGGCACCCAGCTTCCTGGAGGCCGCGTCCGACCTGCACGTGCTGCTGTTCCTGGCTAGCAACGAGGCGCTGCCACTGAGCCCGGCGGCGCTGGAGCCGCTGCTGGCGGCCGTGCGCGCGCaggacgccgccgccgccgacgcCTGGCGCGCCACGCCCACCGCTGCCACGCTCCACCAGCTCACCAGCGCCGCCGCCGACCACGACGACGACAGCATGCTGCTGGGCGCGGAGGGCGGCGCCGGCGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGGTAcgcacacgcacgcacacacacacacacacacacacacacacacagcatgcTGCTGGGCGCGGAGGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGGTacgcacacgcacacacacacacacacacacacacacacagcatacTGCTGGGTGCGGAAGGTGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGcatgcTGCTGGGCGCGGAGGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGcatgcTGCTGGGCGCGGAGGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGcatgcTGCTGGGCGCGGAGGGCGGCGCCGGCGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGcatgcTGCTGGGCGCGGAGGGCGGCGCCGGCGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGcatacTGCTGGGTGCGGAAGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGGTACGCAcacgcgcacacacacacacacacacacacacacacacacacacacagcatgcTGGAGGGCGGCGCCGGCGTGTGGACGTATCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGcatgcTGCTGGGCGCGGAGGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGGTacgcacacgcacacacacacacacacagcatgcTGCTGGGCGCGGAGGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGGTacgcacacgcacacacacacacacacacacacagcatgcTCCTGGGCGCGGAGGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGcatgcTGCTGGGCGCGGAGGGCGGCGCCTACGTGTGGACGTGTCCGCTGTGCACCTTCCACAACGCGCCGCACAACGACGCCTGCGAGATGTGCGCCATGCCCAGAGAATGTTCTGCAGAAGGCATTCGGCCCTCGCTAAATTGGAGTACTCCCATCGGGTATAGAGGACGACGGAGGATCGGCGATTCGATACCGATGCGAAAGCCTCCAATCACTAGTTATGTAAATCCCGAGGTGGCTTCTGGCTAG
- Protein Sequence
- MPPHKQAPGPRGCEQPPDRKRVGPLQERDAYGNEVGVSAKRVPVAYLLVDVPCGVAADGGASTFSARAAFPPAHRPLQQHLQTLRALHAHLQAAPSFLEAASDLHVLLFLASNEALPLSPAALEPLLAAVRAQDAAAADAWRATPTAATLHQLTSAAADHDDDSMLLGAEGGAGVWTCPLCTFHNAPHNDACEMCAMPRYAHARTHTHTHTHTHSMLLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPRYAHAHTHTHTHTHSILLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPSMLLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPSMLLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPSMLLGAEGGAGVWTCPLCTFHNAPHNDACEMCAMPSMLLGAEGGAGVWTCPLCTFHNAPHNDACEMCAMPSILLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPRYAHAHTHTHTHTHTHTHSMLEGGAGVWTYPLCTFHNAPHNDACEMCAMPSMLLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPRYAHAHTHTHSMLLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPRYAHAHTHTHTHSMLLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPSMLLGAEGGAYVWTCPLCTFHNAPHNDACEMCAMPRECSAEGIRPSLNWSTPIGYRGRRRIGDSIPMRKPPITSYVNPEVASG
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -