Bvim031715.1
Basic Information
- Insect
- Brachylomia viminalis
- Gene Symbol
- sip1
- Assembly
- GCA_937001565.2
- Location
- CAKZJP020000374.1:900028-929976[-]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 18 1.2e-51 3.2e-47 163.0 1.4 1 230 365 576 365 584 0.96 2 18 0.041 1.1e+03 0.5 0.0 213 230 622 639 617 646 0.81 3 18 0.041 1.1e+03 0.5 0.0 213 230 685 702 680 709 0.81 4 18 0.041 1.1e+03 0.5 0.0 213 230 748 765 743 772 0.81 5 18 0.041 1.1e+03 0.5 0.0 213 230 811 828 806 835 0.81 6 18 0.041 1.1e+03 0.5 0.0 213 230 874 891 869 898 0.81 7 18 0.041 1.1e+03 0.5 0.0 213 230 937 954 932 961 0.81 8 18 0.041 1.1e+03 0.5 0.0 213 230 1000 1017 995 1024 0.81 9 18 0.041 1.1e+03 0.5 0.0 213 230 1063 1080 1058 1087 0.81 10 18 0.041 1.1e+03 0.5 0.0 213 230 1126 1143 1121 1150 0.81 11 18 0.041 1.1e+03 0.5 0.0 213 230 1189 1206 1184 1213 0.81 12 18 0.041 1.1e+03 0.5 0.0 213 230 1252 1269 1247 1276 0.81 13 18 0.041 1.1e+03 0.5 0.0 213 230 1315 1332 1310 1339 0.81 14 18 0.041 1.1e+03 0.5 0.0 213 230 1378 1395 1373 1402 0.81 15 18 0.041 1.1e+03 0.5 0.0 213 230 1441 1458 1436 1465 0.81 16 18 0.041 1.1e+03 0.5 0.0 213 230 1504 1521 1499 1528 0.81 17 18 0.041 1.1e+03 0.5 0.0 213 230 1567 1584 1562 1591 0.81 18 18 3.1e-08 0.00087 20.6 0.0 213 275 1630 1692 1625 1692 0.94
Sequence Information
- Coding Sequence
- ATGTCGGACGACGAAGTAATGCGGTTTGAAATCACCGACTATGATTTGGACAATGAATTCAACCCAAACAGAACCCGAAAAGCTAAAAAAGAGCACCAGATTTACGGTGTTTGGGCCAAGGACAGTGACGAGGAGGATAATGAAGACAATGTCAGACAAAGGTCTCGCAAACCCAAAGACTTCACGGCACCCATTGGGTTTGTGGCCGGTGGCGTACAACAGGCTGGCAAGAAGAAGGAGGAAAGTAAAGAAATAGAATCATCAGAAGCTTCAACGTCTCGTCCAAAGTTCGCTGACAGCTCTGATGAAGATGAACAGAATGCGCCGGATGCTAGTGAGACTGCTGGCATTAGAAGACAGGGGCAAGGCATGAAGCCGGTCAACCTTGGAGGCAATGTGGGGAATTGGGAGAAGCATACTAAGGGTATTGGAGCTAAGCTGCTGTTGAAGATGGGTTACCAACCAGGTATGGGTCTAGGTAAAGATCTGCAGGGTATCTCAGCACCCGTTGAGGCCACTGTCAGGAGAGGAAGGGGTGCTATTGGTGCCTATGGACCTGAAAAAGCAGCTCAAAAAGCCAAAAAAGAAGAGGAACTCCGTCGTCTGAAAGAGAAGGgagatgaaaaagatacaaaagaAAAGAACTACAATTGGAAGAAGTCGCACAAAGGTCGCTACTTCTACCGAGATGCCGCAGACGTCATACAAGAGGGTAAACCTACGATGCATACTATCACCAGCAACGAGCTCTCCCGTGTGCCTGTAATTGATATGACAGGCAAGGAGAAGCGTGTGCTGAGTGGCTACCACGCGCTTAGAGCCGCTGCACCGCGCTTCGAACACGAGCCGCGACGCAAGTGTGACAACTTTGCCGCGCCACAGCTGGTGCACAACCTTGAACTGATGGTGGAATGCTGTGAACAGGACATAATTCAAAACGCCCGCGAGCTTCAAACAGCGGAAGACGAGATAGTTGTCCTAGAGAGGGACCTAGAGGAATGCAACATAAAGCTGTCGGAACAAGACAAGGTGATTTGCAAGGTGGAAGGTATACTCGAGCGTGTGGAGATGCTCAACAAGCCAGAGGTGTCTTTAGAAAGGGCCCACGATGTGCTGGCTGAGTTGCAGGACACCTACTCTGTAGAATACGAGGTGTTCGGCCTGGGCACCATAGCTGGTAACATAGTGAGCCCGCTGCTAAGTTCCTTACTCGCTACCTGGGAACCCCTCGCGGCCCCTGACGAGCACATACCCACCTTCCTGAAATGGAGGAAGTTGCTGACAGAAGAGGCGTACAACAGTCTACTGTGGCAACACTTTGTGCCGCGACTTACTGCGGCTGCTGAAACCTGGAACCCCCGCATGCCAACACCCATGCTCCGCGCAGTATCCGCATGGCAGGCCGCCTCCCCCCCCTGGCTGTCGCGGGCATGCGTCACACGCTGCGTGGTACCGCGAGTGCTAGCGGCCGTGAGGGCCTGGGACCCCACCAACGACACGCAGCCGCTGCACCAGTGGGTGCTGCCCTGGCATGATATGGCAGGAGAAGCCCTGGCCGGCTCGGTGTACCCGCTCATCCGCTCGCGCCTGGCGGCGGCGCTGGCGGCGTGGCACCCGGCGGACGGCTCGGCGCGCGGCGTGCTGCGCGCGTGGCGCGGCGCGTGGGGCGCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACAGTACGTAGCCCGCACTCACACTCTTAGGCTGAGTCGTCGATCTGAACACCCGGCGGACGGCTCGGCGCTGCACGCGCTGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTCGTGGGCAGGGAGAACACGGCCTGGCTATGGTGCGTAGAATGGCTGGAGCTCCTCGGGGCCCCGACAGTAGCGGCCATCGCGGCGCGTGCGCTGCTCCCTCGCTGGCTGGCCGCGCTCGCTGCGTGGCTCAACACCAACCCGCCGCATGCTACTGTGCTTAACTCTTACACGGACTTTAAGAAAATGTTCCCGGAAGAAGTCCTCAAAGAGCCCGCTGTACGAGACGCCTTCCGCAAAGCCTTAGACATGATGAATCGAAGCGCAGACATCGACTCAGTGGAACCTCCCCCGCCACCGCGCTACACCATGCCCGAGCTGAAGGAAACTTCCCGTATCGTAGAAGTTTTAACATCCGTCACACAAGCTAAAAGCTTCTCAGAATTACTTGAATGTAGATGTATCGAAAAAGGGATCACGTTCGTACCTATAGCGGGCAAAACTAGGGAAGGTAGACCGCTGTATAAAATTGGCGATCTGCAGTGTTATGTTATAAGAAATGTGATCATGTTTTCCAATGATAGTGGTAGGAGTTTCAACCCTATTAGTATGGATAAGTTGTTGAATATGGTTGAGGATtga
- Protein Sequence
- MSDDEVMRFEITDYDLDNEFNPNRTRKAKKEHQIYGVWAKDSDEEDNEDNVRQRSRKPKDFTAPIGFVAGGVQQAGKKKEESKEIESSEASTSRPKFADSSDEDEQNAPDASETAGIRRQGQGMKPVNLGGNVGNWEKHTKGIGAKLLLKMGYQPGMGLGKDLQGISAPVEATVRRGRGAIGAYGPEKAAQKAKKEEELRRLKEKGDEKDTKEKNYNWKKSHKGRYFYRDAADVIQEGKPTMHTITSNELSRVPVIDMTGKEKRVLSGYHALRAAAPRFEHEPRRKCDNFAAPQLVHNLELMVECCEQDIIQNARELQTAEDEIVVLERDLEECNIKLSEQDKVICKVEGILERVEMLNKPEVSLERAHDVLAELQDTYSVEYEVFGLGTIAGNIVSPLLSSLLATWEPLAAPDEHIPTFLKWRKLLTEEAYNSLLWQHFVPRLTAAAETWNPRMPTPMLRAVSAWQAASPPWLSRACVTRCVVPRVLAAVRAWDPTNDTQPLHQWVLPWHDMAGEALAGSVYPLIRSRLAAALAAWHPADGSARGVLRAWRGAWGAALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTRAAGARGQGEQYVARTHTLRLSRRSEHPADGSALHALLHQHIVPKLDHCLQHAPLELVGRENTAWLWCVEWLELLGAPTVAAIAARALLPRWLAALAAWLNTNPPHATVLNSYTDFKKMFPEEVLKEPAVRDAFRKALDMMNRSADIDSVEPPPPPRYTMPELKETSRIVEVLTSVTQAKSFSELLECRCIEKGITFVPIAGKTREGRPLYKIGDLQCYVIRNVIMFSNDSGRSFNPISMDKLLNMVED
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -