Cpyr001263.1
Basic Information
- Insect
- Cosmia pyralina
- Gene Symbol
- -
- Assembly
- GCA_946251865.1
- Location
- CAMIUE010000036.1:344707-347956[-]
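The Location field packs contig, coordinates, and strand into one string. A minimal sketch for splitting it apart is shown here; the `Location` type and `parse_location` helper are hypothetical names, and the span calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern inferred from the single example in this record (an assumption).
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("CAMIUE010000036.1:344707-347956[-]")
# Genomic span, assuming 1-based inclusive coordinates (not stated in the record).
span = loc.end - loc.start + 1
print(loc, span)
```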
Transcription Factor Domain
- TF Family
- zf-MIZ
- Domain
- zf-MIZ domain
- PFAM
- PF02891
- TF Group
- Zinc-Coordinating Group
- Description
- This domain has SUMO (small ubiquitin-like modifier) ligase activity and is involved in DNA repair and chromosome organisation [1][2].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  24    0.0031        21    6.4   0.6        12      33        37      58        30      61  0.87
 2  24     0.013        89    4.4   1.1        12      33        65      86        58      90  0.85
 3  24    0.0037        25    6.1   1.1        11      33        92     114        87     118  0.87
 4  24       1.6   1.1e+04   -2.3   0.0        12      33       121     142       114     148  0.76
 5  24    0.0027        18    6.5   0.5        12      33       149     170       143     173  0.87
 6  24    0.0036        24    6.2   1.0        11      33       176     198       171     201  0.87
 7  24     0.011        78    4.5   0.9        12      33       205     226       198     230  0.85
 8  24    0.0038        26    6.1   1.1        11      33       232     254       227     258  0.87
 9  24       1.7   1.2e+04   -2.5   0.0        12      33       261     282       255     287  0.75
10  24    0.0028        19    6.5   0.6        12      33       289     310       283     314  0.87
11  24    0.0037        25    6.1   1.2        11      33       316     338       310     341  0.86
12  24     0.021   1.5e+02    3.7   1.3        12      33       345     366       339     370  0.85
13  24    0.0031        21    6.3   1.5        11      35       372     396       367     399  0.86
14  24    0.0029        20    6.4   0.8        11      33       400     422       395     426  0.87
15  24    0.0024        17    6.7   0.3        12      33       429     450       423     453  0.88
16  24    0.0028        19    6.5   0.5        12      33       457     478       452     483  0.88
17  24    0.0025        17    6.6   0.5        12      33       513     534       507     540  0.88
18  24    0.0038        26    6.1   1.2        11      33       540     562       535     566  0.87
19  24     0.017   1.2e+02    4.0   1.2        12      33       569     590       561     594  0.84
20  24    0.0026        17    6.6   1.3        11      35       596     620       590     623  0.85
21  24    0.0028        19    6.5   0.8        11      33       624     646       619     649  0.87
22  24    0.0027        19    6.5   0.4        12      33       653     674       648     678  0.88
23  24    0.0046        32    5.8   1.1        12      33       681     702       675     706  0.87
24  24     0.003        21    6.4   2.0        11      35       708     732       703     735  0.86
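Each hmmscan domain row above carries 13 whitespace-separated fields. The sketch below shows one way such rows might be read into typed records and ranked; the `DomainHit` type and `parse_hit` helper are hypothetical, only three rows from this record are embedded for brevity, and the `score > 0` filter is an illustrative cut-off, not one used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one per-domain hmmscan row (13 fields)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Convert one whitespace-separated row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Three rows copied from this record (domains 1, 4, and 15).
ROWS = """\
1 24 0.0031 21 6.4 0.6 12 33 37 58 30 61 0.87
4 24 1.6 1.1e+04 -2.3 0.0 12 33 121 142 114 148 0.76
15 24 0.0024 17 6.7 0.3 12 33 429 450 423 453 0.88
"""

hits = [parse_hit(line) for line in ROWS.splitlines() if line.strip()]
best = max(hits, key=lambda h: h.score)             # highest bit score
positive = [h.index for h in hits if h.score > 0]   # illustrative cut-off only
print(best.index, positive)
```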
Sequence Information
- Coding Sequence
- ATGATTGTGATTGGTCGCGACACTAGCGCCACCCCGCGCACGTACGAACGTAACTACAGGCTTTACTGCATCGCGCTACAGCAGTATGTCTGTGTTCAGGTTCCTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCACGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAACGCTGTTTGTTCGCACAGTTATTGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGAGCTGTTTGTTCGCACAGTTACTATATCGCGCTACAGCAGTATGTCTCTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTATTGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCACGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAACGCTGTTTGTTCGCACAGTTATTGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGAGCTGTTTGTTCGCACAGTTACTATATCGCGCTACAGCAGTATGTCTCTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTATTGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCACGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGAGCTGTTTGTTCGCACAGTTACTGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCGCGCTACAACAGTATGTCTGTGTTCAGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCACGCTACAGCAGTATGTCTTTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCACGCTACAGCAGTATGTCTTTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGAGCTGTTTGTTCGCACAGTTACTATATCGCGCTACAGCAGTATGTCTCTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTATTGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCACGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGAGCTGTTTGTTCGCACAGTTACTGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCGCGCTACAACAGTATGTCTGTGTTCAGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCACGCTACAGCAGTATGTCTTTGTTCTGGTTCTTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTAC
TGTATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTATTGACATAACATTATACGCTAAGAGCGCTGTTTGTTCGCACAGTTACTGCATCGCGCTACAGCAGTATGTCTGTGTTCTGGTTCTTGACATAACATTACACgctaagagcgctgtttgctcGCACAGTTACTGTATCGCGCTACAGCAGTATGTCTGTGTTCAGGTTCCTGACATAGCATTATACGCTAAGAGCGCTTCGCACACTGCATCGCGCTACAGCAGTATGAAAGCAGCAGCAGCGGATGTTGCTGTAATACAGGCGCATGAGAAGTTTCAGCTTTATCTCTAG
- Protein Sequence
- MIVIGRDTSATPRTYERNYRLYCIALQQYVCVQVPDITLYAKSAVCSHSYCITLQQYVCVLVLDITLYAKNAVCSHSYCIALQQYVCVLVLDITLYAKSAVCSHSYCIALQQYVCVLVLDITLYAKRAVCSHSYYIALQQYVSVLVLDITLYAKSAVCSHSYCIALQQYVCVLVLDITLYAKSAVCSHSYCITLQQYVCVLVLDITLYAKNAVCSHSYCIALQQYVCVLVLDITLYAKSAVCSHSYCIALQQYVCVLVLDITLYAKRAVCSHSYYIALQQYVSVLVLDITLYAKSAVCSHSYCIALQQYVCVLVLDITLYAKSAVCSHSYCITLQQYVCVLVLDITLYAKRAVCSHSYCIALQQYVCVLVLDITLYAKSAVCSHSYCIALQQYVCVQVLDITLYAKSAVCSHSYCITLQQYVFVLVLDITLYAKSAVCSHSYCITLQQYVFVLVLDITLYAKSAVCSHSYCIALQQYVCVLVLDITLYAKRAVCSHSYYIALQQYVSVLVLDITLYAKSAVCSHSYCIALQQYVCVLVLDITLYAKSAVCSHSYCITLQQYVCVLVLDITLYAKRAVCSHSYCIALQQYVCVLVLDITLYAKSAVCSHSYCIALQQYVCVQVLDITLYAKSAVCSHSYCITLQQYVFVLVLDITLYAKSAVCSHSYCIALQQYVCVLVIDITLYAKSAVCSHSYCIALQQYVCVLVLDITLHAKSAVCSHSYCIALQQYVCVQVPDIALYAKSASHTASRYSSMKAAAADVAVIQAHEKFQLYL
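As a quick consistency check between the two sequences above, the start and end of the coding sequence can be translated and compared with the protein. This is a minimal sketch that assumes the standard genetic code; the `translate` helper is a hypothetical name, and input is upper-cased because the coding sequence contains some lowercase bases.

```python
from itertools import product

# Standard genetic code (assumed), built in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 57 and last 15 nucleotides of the coding sequence in this record.
head = translate("ATGATTGTGATTGGTCGCGACACTAGCGCCACCCCGCGCACGTACGAACGTAACTAC")
tail = translate("CAGCTTTATCTCTAG")
print(head, tail)
```

The translated head and tail correspond to the first 19 and last 4 residues of the listed protein sequence.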
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -