Basic Information

Insect
Colias behrii
Gene Symbol
Vps13
Assembly
GCA_029959075.1
Location
JARWMB010000009.1:8685847-8695140[+]

Transcription Factor Domain

TF Family
TSC22
Domain
TSC22 domain
PFAM
PF01166
TF Group
Basic Domians group
Description
These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Drosophila protein bunched [1] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 15 1.1 1.3e+04 -2.6 0.1 35 47 38 50 35 51 0.82
2 15 0.0047 54 5.0 0.1 8 26 496 514 494 535 0.81
3 15 0.0047 54 5.0 0.1 8 26 558 576 556 597 0.81
4 15 0.0047 54 5.0 0.1 8 26 620 638 618 659 0.81
5 15 0.0047 54 5.0 0.1 8 26 682 700 680 721 0.81
6 15 0.0047 54 5.0 0.1 8 26 744 762 742 783 0.81
7 15 0.0047 54 5.0 0.1 8 26 806 824 804 845 0.81
8 15 0.0047 54 5.0 0.1 8 26 868 886 866 907 0.81
9 15 0.0047 54 5.0 0.1 8 26 930 948 928 969 0.81
10 15 0.0047 54 5.0 0.1 8 26 992 1010 990 1031 0.81
11 15 0.0047 54 5.0 0.1 8 26 1054 1072 1052 1093 0.81
12 15 0.0047 54 5.0 0.1 8 26 1116 1134 1114 1155 0.81
13 15 0.0047 54 5.0 0.1 8 26 1178 1196 1176 1217 0.81
14 15 0.0047 54 5.0 0.1 8 26 1240 1258 1238 1279 0.81
15 15 0.0047 54 5.0 0.1 8 26 1370 1388 1368 1409 0.81

Sequence Information

Coding Sequence
ATGGTTTTCGAATCTATTGTAGTGGATGTTTTAAACCGGTTCCTTGGGGACTACGTTGAAAATCTAAATAGATCACAGTTAAAATTAGGAATATGGGGTGGCGACGTTGTATTGGAAAATTTGATACTTAAACAAAATGCTCTTGAAGAACTCAATATTCCAGTACAAACAGTATATGGTCACTTAGGAAAGTTAGTGCTCAAGATACCATGGAAAAATTTATATGGCGCTTCAGTAGAAGCAACAATAGAAAGACTCTTTCTCATTGTTAATCCTAGTGCTGAAGTAAAATATGATGcggaaaaagaagaaaaaatggCTCTACAAGCCAAACAAGCAGAGCTTGCAAGGGTTGAGGAGGCCAAAAAGAGAGAAGCTGAAAAAGATGAAATTAAATTGGATGAAACATTTGTTGAAAAATTGGTGacccaaataataaaaaatgtccAACTCAAAATTACTGACATACATATTCGTTATGAAGATAGTATAACTAATCCTAAAGCACCTTTTTCCTTTGGTATAACACTCCACAATTTATCTGTGCACACAACTGATGAAAACTGGAAACAGACTGTAATACAAGAGGCTGTCACTAAAATCTTTAAAATTTTGAGCTTGGAAGGATTAGCTATTTATTGGAATCCAACAACAGAACTGTATTCTAAAACCAGCCCCGATGAGATTAAGAACCGTTTGCAAAAAGAAATTGCAACTAAAGTAGTTCTTCCTGAAAATTATAACTATGCACTTGGTCCTATAAATGCGACGGCAAAATTAAAACTCAATCCGAAGCCTGAAGGAGATACGCCTAAATTTAGTATTCCTAAAGTAATCCTTAGTTTGCATATGGAACAGCTTGCTGTTAATCTTAATAAAGCACAGTACCAAGATATGATGCTGTTGGCAGACTCCATGGATCGGATGAGCAAAGGAGCACCATACAGAAAATATCGCCCAGACGCAAAAACTTACAAAGGTCACTATAAAGAATGGTGGCATTTTGCATACAAGTGCATTTTGGAAGAAGAAGTACTGAGACGTCGTAGGAATTGGGATTGGAACCACATGTTGTCCCACCGACAGCTTTGTAAAGACTATGCTAATGCTTACCAATGCAAGCTCACTAGTAAGGGAAAAGTAGCAATTGAGTACCAATGTGTGTTGGATAAAGCTGAGAAATCATTGGATCTGTTCAATTTGGTTGTAATCAGGCAACAAATTGAGTTAGAGGTGGAAAGATTAGGCAAATTGGAAGCAGAAGCAAAAAAATCTCGTGGTTGGTTCAGTGGCTGGTGGTCTGGAGCGAGTTCTAAGGATGAGGAATTATCAGAAGGAGTTGCTATCATGAAGCAATTCGAGAAGGCGATGACAGGCGAGGAGAAGGAGAAACTATTCCGCGCGATCGACTACCAGGAGAACACGGCTCCGTTGCACTTGCCCATCGAATACGTGGCTGTGGAGGGCAGCTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAACAGAACACTTCTGCTTGGACCGGCTGCAGCTGGCCGTGCGGCACGACGTGGAGGTGCTGCGCGCGTGTGTGGAGCACGTGGAGGTGGACCTGCAGCAGCGGCCCAGTGCCAATGCTCTCAGGTACATATACACTAA
Protein Sequence
MVFESIVVDVLNRFLGDYVENLNRSQLKLGIWGGDVVLENLILKQNALEELNIPVQTVYGHLGKLVLKIPWKNLYGASVEATIERLFLIVNPSAEVKYDAEKEEKMALQAKQAELARVEEAKKREAEKDEIKLDETFVEKLVTQIIKNVQLKITDIHIRYEDSITNPKAPFSFGITLHNLSVHTTDENWKQTVIQEAVTKIFKILSLEGLAIYWNPTTELYSKTSPDEIKNRLQKEIATKVVLPENYNYALGPINATAKLKLNPKPEGDTPKFSIPKVILSLHMEQLAVNLNKAQYQDMMLLADSMDRMSKGAPYRKYRPDAKTYKGHYKEWWHFAYKCILEEEVLRRRRNWDWNHMLSHRQLCKDYANAYQCKLTSKGKVAIEYQCVLDKAEKSLDLFNLVVIRQQIELEVERLGKLEAEAKKSRGWFSGWWSGASSKDEELSEGVAIMKQFEKAMTGEEKEKLFRAIDYQENTAPLHLPIEYVAVEGSFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGGAARVCGARGGGPAAAAQCQCSQNTSAWTGCSWPCGTTWRCCARVWSTWRWTCSSGPVPMLSGTYTLTEHFCLDRLQLAVRHDVEVLRACVEHVEVDLQQRPSANALRYIYTNRTLLLGPAAAGRAARRGGAARVCGARGGGPAAAAQCQCSQVHIH

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-