Cbeh010863.1
Basic Information
- Insect
- Colias behrii
- Gene Symbol
- Vps13
- Assembly
- GCA_029959075.1
- Location
- JARWMB010000009.1:8685847-8695140[+]
Transcription Factor Domain
- TF Family
- TSC22
- Domain
- TSC22 domain
- PFAM
- PF01166
- TF Group
- Basic Domians group
- Description
- These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 1.1 1.3e+04 -2.6 0.1 35 47 38 50 35 51 0.82 2 15 0.0047 54 5.0 0.1 8 26 496 514 494 535 0.81 3 15 0.0047 54 5.0 0.1 8 26 558 576 556 597 0.81 4 15 0.0047 54 5.0 0.1 8 26 620 638 618 659 0.81 5 15 0.0047 54 5.0 0.1 8 26 682 700 680 721 0.81 6 15 0.0047 54 5.0 0.1 8 26 744 762 742 783 0.81 7 15 0.0047 54 5.0 0.1 8 26 806 824 804 845 0.81 8 15 0.0047 54 5.0 0.1 8 26 868 886 866 907 0.81 9 15 0.0047 54 5.0 0.1 8 26 930 948 928 969 0.81 10 15 0.0047 54 5.0 0.1 8 26 992 1010 990 1031 0.81 11 15 0.0047 54 5.0 0.1 8 26 1054 1072 1052 1093 0.81 12 15 0.0047 54 5.0 0.1 8 26 1116 1134 1114 1155 0.81 13 15 0.0047 54 5.0 0.1 8 26 1178 1196 1176 1217 0.81 14 15 0.0047 54 5.0 0.1 8 26 1240 1258 1238 1279 0.81 15 15 0.0047 54 5.0 0.1 8 26 1370 1388 1368 1409 0.81
Sequence Information
- Coding Sequence
- ATGGTTTTCGAATCTATTGTAGTGGATGTTTTAAACCGGTTCCTTGGGGACTACGTTGAAAATCTAAATAGATCACAGTTAAAATTAGGAATATGGGGTGGCGACGTTGTATTGGAAAATTTGATACTTAAACAAAATGCTCTTGAAGAACTCAATATTCCAGTACAAACAGTATATGGTCACTTAGGAAAGTTAGTGCTCAAGATACCATGGAAAAATTTATATGGCGCTTCAGTAGAAGCAACAATAGAAAGACTCTTTCTCATTGTTAATCCTAGTGCTGAAGTAAAATATGATGcggaaaaagaagaaaaaatggCTCTACAAGCCAAACAAGCAGAGCTTGCAAGGGTTGAGGAGGCCAAAAAGAGAGAAGCTGAAAAAGATGAAATTAAATTGGATGAAACATTTGTTGAAAAATTGGTGacccaaataataaaaaatgtccAACTCAAAATTACTGACATACATATTCGTTATGAAGATAGTATAACTAATCCTAAAGCACCTTTTTCCTTTGGTATAACACTCCACAATTTATCTGTGCACACAACTGATGAAAACTGGAAACAGACTGTAATACAAGAGGCTGTCACTAAAATCTTTAAAATTTTGAGCTTGGAAGGATTAGCTATTTATTGGAATCCAACAACAGAACTGTATTCTAAAACCAGCCCCGATGAGATTAAGAACCGTTTGCAAAAAGAAATTGCAACTAAAGTAGTTCTTCCTGAAAATTATAACTATGCACTTGGTCCTATAAATGCGACGGCAAAATTAAAACTCAATCCGAAGCCTGAAGGAGATACGCCTAAATTTAGTATTCCTAAAGTAATCCTTAGTTTGCATATGGAACAGCTTGCTGTTAATCTTAATAAAGCACAGTACCAAGATATGATGCTGTTGGCAGACTCCATGGATCGGATGAGCAAAGGAGCACCATACAGAAAATATCGCCCAGACGCAAAAACTTACAAAGGTCACTATAAAGAATGGTGGCATTTTGCATACAAGTGCATTTTGGAAGAAGAAGTACTGAGACGTCGTAGGAATTGGGATTGGAACCACATGTTGTCCCACCGACAGCTTTGTAAAGACTATGCTAATGCTTACCAATGCAAGCTCACTAGTAAGGGAAAAGTAGCAATTGAGTACCAATGTGTGTTGGATAAAGCTGAGAAATCATTGGATCTGTTCAATTTGGTTGTAATCAGGCAACAAATTGAGTTAGAGGTGGAAAGATTAGGCAAATTGGAAGCAGAAGCAAAAAAATCTCGTGGTTGGTTCAGTGGCTGGTGGTCTGGAGCGAGTTCTAAGGATGAGGAATTATCAGAAGGAGTTGCTATCATGAAGCAATTCGAGAAGGCGATGACAGGCGAGGAGAAGGAGAAACTATTCCGCGCGATCGACTACCAGGAGAACACGGCTCCGTTGCACTTGCCCATCGAATACGTGGCTGTGGAGGGCAGCTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAA
- Protein Sequence
- MVFESIVVDVLNRFLGDYVENLNRSQLKLGIWGGDVVLENLILKQNALEELNIPVQTVYGHLGKLVLKIPWKNLYGASVEATIERLFLIVNPSAEVKYDAEKEEKMALQAKQAELARVEEAKKREAEKDEIKLDETFVEKLVTQIIKNVQLKITDIHIRYEDSITNPKAPFSFGITLHNLSVHTTDENWKQTVIQEAVTKIFKILSLEGLAIYWNPTTELYSKTSPDEIKNRLQKEIATKVVLPENYNYALGPINATAKLKLNPKPEGDTPKFSIPKVILSLHMEQLAVNLNKAQYQDMMLLADSMDRMSKGAPYRKYRPDAKTYKGHYKEWWHFAYKCILEEEVLRRRRNWDWNHMLSHRQLCKDYANAYQCKLTSKGKVAIEYQCVLDKAEKSLDLFNLVVIRQQIELEVERLGKLEAEAKKSRGWFSGWWSGASSKDEELSEGVAIMKQFEKAMTGEEKEKLFRAIDYQENTAPLHLPIEYVAVEGSFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGGAARVCGARGGGPAAAAQCQCSQNTSAWTGCSWPCGTTWRCCARVWSTWRWTCSSGPVPMLSGTYTLTEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGGAARVCGARGGGPAAAAQCQCSQVHIH
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -