Alot010674.1
Basic Information
- Insect
- Agrochola lota
- Gene Symbol
- sip1
- Assembly
- GCA_947364205.1
- Location
- OX376241.1:12651054-12668429[+]
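The Location field packs the sequence accession, start, end, and strand into a single string. A minimal Python sketch for splitting it apart follows. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, since the record does not state its coordinate convention.

```python
import re

# Pattern for strings such as "OX376241.1:12651054-12668429[+]".
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)


def parse_location(location: str) -> dict:
    """Split a 'seqid:start-end[strand]' string into its parts."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("OX376241.1:12651054-12668429[+]")
print(loc["seqid"], loc["strand"], loc["span"])
```

Under the inclusive-coordinate assumption, the locus spans 17376 bp on the forward strand.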
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
   #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
   1  19   3.1e-35   3.9e-31  109.1   2.9         1     171       365     516       365     520  0.95
   2  19   1.3e-09   1.7e-05   25.1   0.5       121     171       520     565       516     569  0.92
   3  19   1.3e-09   1.7e-05   25.1   0.5       121     171       569     614       565     618  0.92
   4  19   1.3e-09   1.6e-05   25.1   0.5       121     171       618     663       614     667  0.92
   5  19   1.4e-09   1.8e-05   25.0   0.6       121     171       667     712       664     716  0.92
   6  19   1.4e-09   1.7e-05   25.0   0.5       121     171       716     761       712     765  0.92
   7  19   1.7e-09   2.1e-05   24.7   1.0       121     171       765     810       762     817  0.92
   8  19   1.2e-09   1.5e-05   25.2   0.4       121     171       814     859       811     864  0.92
   9  19   1.4e-09   1.8e-05   25.0   0.5       121     171       863     908       860     912  0.92
  10  19   1.5e-09   1.8e-05   24.9   0.9       121     172       912     958       909     965  0.92
  11  19   1.4e-09   1.8e-05   25.0   0.5       121     171       961    1006       959    1010  0.92
  12  19   1.4e-09   1.8e-05   25.0   0.5       121     171      1010    1055      1008    1059  0.92
  13  19   1.1e-09   1.4e-05   25.3   0.3       121     171      1059    1104      1054    1108  0.92
  14  19   1.3e-09   1.6e-05   25.1   0.5       121     171      1108    1153      1104    1157  0.92
  15  19   1.4e-09   1.8e-05   25.0   0.5       121     171      1157    1202      1155    1206  0.92
  16  19   1.3e-09   1.6e-05   25.1   0.5       121     171      1206    1251      1202    1255  0.92
  17  19   1.4e-09   1.7e-05   25.0   0.5       121     171      1255    1300      1252    1304  0.92
  18  19   1.3e-09   1.7e-05   25.1   0.5       121     171      1304    1349      1301    1353  0.92
  19  19   3.1e-34   3.8e-30  105.9   0.7       121     275      1353    1503      1350    1503  0.96
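Each domain row in the hmmscan output above carries thirteen whitespace-separated fields: hit number, hit count, conditional and independent E-values, score, bias, the HMM, alignment, and envelope coordinate pairs, and the accuracy. The following Python sketch shows one way to load such rows and filter them. The names `DomainHit`, `parse_hits`, and `SAMPLE_ROWS`, and the `1e-10` cutoff, are illustrative assumptions and not part of the record. Raw hmmscan output also includes flag and boundary-marker columns that this record omits, so the sketch would need adapting for unprocessed output.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float

    @property
    def ali_length(self) -> int:
        # Assumption: alignment coordinates are inclusive.
        return self.ali_to - self.ali_from + 1


def parse_hits(text: str) -> list:
    """Parse one hit per line, thirteen whitespace-separated fields each."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        if len(f) != 13:
            raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
        hits.append(DomainHit(
            int(f[0]), int(f[1]),
            float(f[2]), float(f[3]), float(f[4]), float(f[5]),
            int(f[6]), int(f[7]), int(f[8]), int(f[9]),
            int(f[10]), int(f[11]), float(f[12]),
        ))
    return hits


# Rows 1, 2, and 19 copied from the record.
SAMPLE_ROWS = """
1 19 3.1e-35 3.9e-31 109.1 2.9 1 171 365 516 365 520 0.95
2 19 1.3e-09 1.7e-05 25.1 0.5 121 171 520 565 516 569 0.92
19 19 3.1e-34 3.8e-30 105.9 0.7 121 275 1353 1503 1350 1503 0.96
"""

hits = parse_hits(SAMPLE_ROWS)
# Illustrative cutoff only; choose a threshold suited to the analysis.
strong = [h.index for h in hits if h.i_evalue <= 1e-10]
print(strong, [h.ali_length for h in hits])
```

With this cutoff, only the first and last hits pass; the seventeen intermediate hits all align to HMM positions 121 to 171 or 172 with much weaker E-values.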
Sequence Information
- Coding Sequence
- ATGTCGGACGACGAAGTAATGCGGTTTGAAATCACCGACTATGATTTGGACAATGAATTCAATCCAAACAGAACCCGAAGAGCTAAAAAAGAGCACCAGATTTACGGTGTTTGGGCCAAGGATAGTGACGAGGAGGATAATGAAGACAATGTCAGACAAAGGTCTCGCAAACCCAAAGACTTCACGGCACCCATTGGGTTTGTGGCCGGTGGAGTACAACAGGCTGGCAAGAAGAAGGAGGAAAATAAAGAAATAGAATCATCAGATGCTTCCACCTCTCGTCCAAAGTTTGCTGACAGCTCTGATGAAGATGAACAGAATGCCCCGGATGCTAGTGAGACTGCTGGTATTAGAAGACAGGGGCAAGGCATGAAGTCGGTCAACCTTGGAGGCAATGTTGGGACTTGGGAAAGGCATACTAAGGGTATTGGAGCTAAGCTGCTGTTGAAGATGGGGTACCAACCAGGCATGGGTCTAGGTAAAGACCTGCAGGGTATCTCAGCACCTGTGGAGGCCACTGTCCGGAGAGGAAGGGGAGCTATAGGTGCCTATGGACCTGAGAAGGCAGCTCAAAAAGCCAAAAAAGAAGAAGAACTCCGTCGTCTGAAAGAGAAGGGAGATGAGAAAGAGACAAAAGAAAAGAACTACAACTGGAAGAAGTCGCACAAGGGTCGCTACTTCTACCGCGATGCTGCAGACGTCATACAAGAGGGTAAACCCACCATGCATACTATTACCAGCAACGAGCTCTCCCGCGTGCCTGTAATAGACATGACAGGCAAAGAGAAGCGAGTACTGAGCGGCTACCACGCGCTCAGAGCCGCCGCGCCGCGCTTCGAACACGAGCCGCGGCGCAAGTGTGACAACTTCGCCGCGCCACAGCTGGTGCACAACCTTGAACTGATGGTGGAATGCTGTGAACAGGACATAATCCAAAACGCTCGCGAGCTACAAACAGCGGAAGACGAGATAGTTGTCCTAGAGAGAGACCTAGAGGAATGCAACATAAAGCTGTCGGAACAAGACAAGGTGATCTGCAAGGTGGAAGGTATACTAGAGCGCGTGGAGATGCTCAACAAGCCTGATGTGTCATTAGAAAGGGCCCACGATGTGCTAGCTGAGTTGAAGGACTCCTACGCCGTAGAATACGAGATGTTCGGCTTAGGCACTATAGCGGGTAACATCGTGAGCCCGCTGCTAAGTTCCTTACTAGCTACCTGGGAACCTCTCTCGGCCCCCGACGAGCACATACCCACCTTCCTGAAATGGAGGAAGTTGCTCACTGAAGAGGCGTACAACAGCCTGCTGTGGCAACATTTTGTGCCGCAACTTACTGCGGCTGCTGAAACATGGAACCCACGCATGCCGACGCCGATGCTCCGCGCAGTGTCGGCGTGGCAGGCCGCGGCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCG
CCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACAC
GCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGAGAAGCCCTAGCCAGCTCGGTGTACCCTCTGATCCGCTCGCGCCTGGCGGCCGCGCTGTCGGCGTGGCACCCGGCGGACGGCTCGGCGCGCGGCGTGCTGCGGGCCTGGCGCGGGGCCTGGGGCCCGGCCCTGCAGGCCATGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTGGTGGGCAGGGAGAACACGGCGTGGCTGTGGTGCGTGGAGTGGCTGGAGCTGCTGGGCGCGGCCACGGTGGCGACGCTGGCGGCGCGCGCGCTGCTGCCGCGCTGGCTGGGCGCGCTGGCCGCCTGGCTCAACACCAACCCGCCGCACGCCACCGTGCTCAACTCCtacactgattttaagAAAATGTTCCCGGAAGAAGTCCTCAAAGAGCCTGCCGTAAGAGACGCCTTCCGCAAAGCCTTAGACATGATGAATCGAAGCGCAGACATCGACTCAGTGGAACCTCCCCCACCACCACGCTTCACCATGCCCGAGCCGAAGGAAACATCCCGTATAACTGAAGTTTTAGCATCTGCTACACAAGCGAAAAGCTTCTCAGAGTTACTTGAGACTAGGTGTATCGAAAAAGGGATCACTTTTGTGCCTATAGCCGGCAAGACCAGAGAAGGTAGACCGCTGTACAAAATTGGCGATCTGCAGTGTTATGTTATAAGAAATGTGATCATGTTTTCCAATGATAGTGGTAGGAGTTTCAACCCTATTAGTATGGATAAGTTGTTGAATATGGTTGAGGATTGA
- Protein Sequence
- MSDDEVMRFEITDYDLDNEFNPNRTRRAKKEHQIYGVWAKDSDEEDNEDNVRQRSRKPKDFTAPIGFVAGGVQQAGKKKEENKEIESSDASTSRPKFADSSDEDEQNAPDASETAGIRRQGQGMKSVNLGGNVGTWERHTKGIGAKLLLKMGYQPGMGLGKDLQGISAPVEATVRRGRGAIGAYGPEKAAQKAKKEEELRRLKEKGDEKETKEKNYNWKKSHKGRYFYRDAADVIQEGKPTMHTITSNELSRVPVIDMTGKEKRVLSGYHALRAAAPRFEHEPRRKCDNFAAPQLVHNLELMVECCEQDIIQNARELQTAEDEIVVLERDLEECNIKLSEQDKVICKVEGILERVEMLNKPDVSLERAHDVLAELKDSYAVEYEMFGLGTIAGNIVSPLLSSLLATWEPLSAPDEHIPTFLKWRKLLTEEAYNSLLWQHFVPQLTAAAETWNPRMPTPMLRAVSAWQAAAPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGEALASSVYPLIRSRLAAALSAWHPADGSARGVLRAWRGAWGPALQAMLHQHIVPKLDHCLQHAPLELVGRENTAWLWCVEWLELLGAATVATLAARALLPRWLGALAAWLNTNPPHATVLNSYTDFKKMFPEEVLKEPAVRDAFRKALDMMNRSADIDSVEPPPPPRFTMPEPKETSRITEVLASATQAKSFSELLETRCIEKGITFVPIAGKTREGRPLYKIGDLQCYVIRNVIMFSNDSGRSFNPISMDKLLNMVED
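The Coding Sequence and Protein Sequence fields can be cross-checked by translating the former. The sketch below is a minimal pure-Python translator; it assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and the function name `translate` is hypothetical. It removes whitespace and upper-cases the input first, so line breaks and lower-case bases in the coding sequence do not affect the result.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons containing non-ACGT symbols become 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 15 bases of the coding sequence in this record.
print(translate("ATGTCGGACGACGAAGTAATG"))  # MSDDEVM
print(translate("ATGGTTGAGGATTGA"))        # MVED
```

The two fragments reproduce the first seven and last four residues of the listed protein sequence; passing the full coding sequence to `translate` would allow the complete comparison.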
Similar Transcription Factors
Sequence clustering by similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -