Carc028680.2
Basic Information
- Insect
- Coenonympha arcania
- Gene Symbol
- Evi5
- Assembly
- GCA_036785405.1
- Location
- JAWDAA010000032.1:2702983-2719356[-]
Transcription Factor Domain
- TF Family
- TF_bZIP
- Domain
- bZIP domain
- PFAM
- AnimalTFDB
- TF Group
- Basic Domians group
- Description
- bZIP proteins are homo- or heterodimers that contain highly basic DNA binding regions adjacent to regions of α-helix that fold together as coiled coils
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 16 0.15 1.7e+02 3.1 3.2 37 62 92 117 87 120 0.89 2 16 0.54 6.4e+02 1.3 0.3 38 57 128 147 118 154 0.70 3 16 0.00044 0.51 11.2 3.0 23 48 207 232 206 246 0.81 4 16 0.00044 0.51 11.2 3.0 23 48 254 279 253 293 0.81 5 16 0.00044 0.51 11.2 3.0 23 48 301 326 300 340 0.81 6 16 0.00044 0.51 11.2 3.0 23 48 348 373 347 387 0.81 7 16 0.00044 0.51 11.2 3.0 23 48 395 420 394 434 0.81 8 16 0.00044 0.51 11.2 3.0 23 48 442 467 441 481 0.81 9 16 0.00044 0.51 11.2 3.0 23 48 489 514 488 528 0.81 10 16 0.00044 0.51 11.2 3.0 23 48 536 561 535 575 0.81 11 16 0.00044 0.51 11.2 3.0 23 48 583 608 582 622 0.81 12 16 0.00044 0.51 11.2 3.0 23 48 630 655 629 669 0.81 13 16 0.00085 0.99 10.3 3.7 23 48 677 702 676 716 0.84 14 16 4.1 4.9e+03 -1.5 2.6 21 62 748 791 745 793 0.71 15 16 4.8 5.6e+03 -1.7 1.1 26 59 814 858 807 862 0.70 16 16 1.8 2.1e+03 -0.4 0.0 37 51 885 899 879 905 0.50
Sequence Information
- Coding Sequence
- ATGGACGTATTCCTGTCCGAGGGGATAGAGATCGTCTTCAAAGTCGCCCTCGCACTTCTAACTCTGGGCAAAGATGATCTTTTGTCACTGGATATGGAAAACATCTTAAAGTTCATGCAAAAAGAGCTGCCACAGAAGGCCGAAGCTGATGAAGACGCGTTTATGAATCTCGCCTACTCCATCAAAGTTAACCCCGAGAAAATGAAGAAATTAGAAAAGGAATACACTGTTATCAAGACTAAGGAACAAGGAGACATAGCAGTTCTCAGATGTTTACGCCAGGAAAATCGTCTACTCAAACAAAGTGTTGAATTACTGGAGAAAGAAAGTTCAGCCTTAGTCGAAAGACTTGTCCAGGGTCAAGTGGACCGAGCTGAAGGCGAAGAGAAGACTTTTGCTTTGGCCCGAGAAGTGCAAGCTCTGCGTCGCGCAAATATGGATGCCCAGCAACGCCTTGCTGTTGCCCAGGATGAGATACGGAGCTTGGAAATGACTATAGCTGagAACAACTCCAGGCAATCGTCGCTAGAACGCACAGaggcgcacaacgcgaagggcgaagagctggctcgttgcctccagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagcgcgagctggtgcgggccaggctcgacgcagcggagcgGCGAGCCGCGGAGAGGGAGCTCCACGCTAGGGTCGCGGAGCTGGAGgacgagaacaagagcctgaggaaacagcgggtcgacaacaacgtagctcacttgcagGAACACAGACAAGAggctccgccgccgccgcccagtCAGTCAAACGTGGTCTCCGACATCATGGCCACTCCGAAGAAGCTTCTAAGAGCGTGGGAGGGCAGGTCCTCTGACATGCAAAAACTGGAAGAAGACTTGATGACTGTTAAAATTAAGGAAGTGGAGGCACTCACCGAGCTGAAGGAGCTCAGACTTAAGGAAATGGAGCTTCGTACCCAAGTGCAAGTATCGACCGACCAGCTGAGGAGGCAGGACGAGGAGCTGCGGCAGCTGCGCGAGGCGCTGCAGCGGGAGCGCGCCCTGCAGACCCGCCAGCGGGAGTTCCAGCACAAATACGCAGACCTGGAGAGCGAGGCTAAATATGAATCGATGCAAGCCAACATTCGCAACATGGAAGACGCACAGCGTATTGCCGAGTTGAAAATCGAAGTTTCAGAGTATAAATTAAAGCATGAAGTGATGGCGACGGAGGGTGCACTTCGGAGCAACAACAACACGGAGGACTCTGAACCGGTTCGTGGACTGCAGGATCAAATCACTGAACTGCGGACCGAGGTTATGCGGTTAGAGGCATGGAAGGCACGATTTCTCGGCCACTCGCCCGTTCGCGCTATCTCTGTGGACGAGGACCTCACTGAAGACGACAAATGCGTATCTATCGATCTCAACGACAAGAGTATGTCGTAG
- Protein Sequence
- MDVFLSEGIEIVFKVALALLTLGKDDLLSLDMENILKFMQKELPQKAEADEDAFMNLAYSIKVNPEKMKKLEKEYTVIKTKEQGDIAVLRCLRQENRLLKQSVELLEKESSALVERLVQGQVDRAEGEEKTFALAREVQALRRANMDAQQRLAVAQDEIRSLEMTIAENNSRQSSLERTEAHNAKGEELARCLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQRELVRARLDAAERRAAERELHARVAELEDENKSLRKQRVDNNVAHLQEHRQEAPPPPPSQSNVVSDIMATPKKLLRAWEGRSSDMQKLEEDLMTVKIKEVEALTELKELRLKEMELRTQVQVSTDQLRRQDEELRQLREALQRERALQTRQREFQHKYADLESEAKYESMQANIRNMEDAQRIAELKIEVSEYKLKHEVMATEGALRSNNNTEDSEPVRGLQDQITELRTEVMRLEAWKARFLGHSPVRAISVDEDLTEDDKCVSIDLNDKSMS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00352799;
- 90% Identity
- iTF_00352799;
- 80% Identity
- iTF_00352799;