Basic Information

Gene Symbol
sip1
Assembly
GCA_947364205.1
Location
OX376241.1:12651054-12668429[+]
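
The Location field packs the contig accession, start, end, and strand into one string. A minimal sketch of splitting it into parts is shown here; the `Location` type, the `parse_location` helper, and the regular expression are illustrative names rather than part of this database, and the span length assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# Pattern inferred from the single Location value shown in this record.
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("OX376241.1:12651054-12668429[+]")
# Assumption: coordinates are 1-based and inclusive at both ends.
span = loc.end - loc.start + 1
print(loc, span)
```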

Transcription Factor Domain

TF Family
GCFC
Domain
GCFC domain
PFAM
PF07842
TF Group
Unclassified Structure
Description
This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm-from hmm-to ali-from ali-to env-from env-to acc
1 19 3.1e-35 3.9e-31 109.1 2.9 1 171 365 516 365 520 0.95
2 19 1.3e-09 1.7e-05 25.1 0.5 121 171 520 565 516 569 0.92
3 19 1.3e-09 1.7e-05 25.1 0.5 121 171 569 614 565 618 0.92
4 19 1.3e-09 1.6e-05 25.1 0.5 121 171 618 663 614 667 0.92
5 19 1.4e-09 1.8e-05 25.0 0.6 121 171 667 712 664 716 0.92
6 19 1.4e-09 1.7e-05 25.0 0.5 121 171 716 761 712 765 0.92
7 19 1.7e-09 2.1e-05 24.7 1.0 121 171 765 810 762 817 0.92
8 19 1.2e-09 1.5e-05 25.2 0.4 121 171 814 859 811 864 0.92
9 19 1.4e-09 1.8e-05 25.0 0.5 121 171 863 908 860 912 0.92
10 19 1.5e-09 1.8e-05 24.9 0.9 121 172 912 958 909 965 0.92
11 19 1.4e-09 1.8e-05 25.0 0.5 121 171 961 1006 959 1010 0.92
12 19 1.4e-09 1.8e-05 25.0 0.5 121 171 1010 1055 1008 1059 0.92
13 19 1.1e-09 1.4e-05 25.3 0.3 121 171 1059 1104 1054 1108 0.92
14 19 1.3e-09 1.6e-05 25.1 0.5 121 171 1108 1153 1104 1157 0.92
15 19 1.4e-09 1.8e-05 25.0 0.5 121 171 1157 1202 1155 1206 0.92
16 19 1.3e-09 1.6e-05 25.1 0.5 121 171 1206 1251 1202 1255 0.92
17 19 1.4e-09 1.7e-05 25.0 0.5 121 171 1255 1300 1252 1304 0.92
18 19 1.3e-09 1.7e-05 25.1 0.5 121 171 1304 1349 1301 1353 0.92
19 19 3.1e-34 3.8e-30 105.9 0.7 121 275 1353 1503 1350 1503 0.96
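
Each row of the table above has 13 whitespace-separated fields. As a rough sketch, the rows can be read into typed records and the spacing between consecutive alignment starts computed. The `DomainHit` type and `parse_hit` helper are hypothetical names introduced for illustration, and only rows 2 to 5 are copied here to keep the example short.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the per-domain hmmscan table (field names are illustrative)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated table row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Rows 2-5 copied from the table above.
ROWS = """
2 19 1.3e-09 1.7e-05 25.1 0.5 121 171 520 565 516 569 0.92
3 19 1.3e-09 1.7e-05 25.1 0.5 121 171 569 614 565 618 0.92
4 19 1.3e-09 1.6e-05 25.1 0.5 121 171 618 663 614 667 0.92
5 19 1.4e-09 1.8e-05 25.0 0.6 121 171 667 712 664 716 0.92
"""

hits = [parse_hit(line) for line in ROWS.strip().splitlines()]

# Distance between the alignment start of each hit and the next one.
spacings = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
print(spacings)
```

For these rows the alignment starts are 49 residues apart, and each hit covers the same HMM segment (positions 121 to 171). The sketch reports the regular spacing and does not interpret it.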

Sequence Information

Coding Sequence
ATGTCGGACGACGAAGTAATGCGGTTTGAAATCACCGACTATGATTTGGACAATGAATTCAATCCAAACAGAACCCGAAGAGCTAAAAAAGAGCACCAGATTTACGGTGTTTGGGCCAAGGATAGTGACGAGGAGGATAATGAAGACAATGTCAGACAAAGGTCTCGCAAACCCAAAGACTTCACGGCACCCATTGGGTTTGTGGCCGGTGGAGTACAACAGGCTGGCAAGAAGAAGGAGGAAAATAAAGAAATAGAATCATCAGATGCTTCCACCTCTCGTCCAAAGTTTGCTGACAGCTCTGATGAAGATGAACAGAATGCCCCGGATGCTAGTGAGACTGCTGGTATTAGAAGACAGGGGCAAGGCATGAAGTCGGTCAACCTTGGAGGCAATGTTGGGACTTGGGAAAGGCATACTAAGGGTATTGGAGCTAAGCTGCTGTTGAAGATGGGGTACCAACCAGGCATGGGTCTAGGTAAAGACCTGCAGGGTATCTCAGCACCTGTGGAGGCCACTGTCCGGAGAGGAAGGGGAGCTATAGGTGCCTATGGACCTGAGAAGGCAGCTCAAAAAGCCAAAAAAGAAGAAGAACTCCGTCGTCTGAAAGAGAAGGGAGATGAGAAAGAGACAAAAGAAAAGAACTACAACTGGAAGAAGTCGCACAAGGGTCGCTACTTCTACCGCGATGCTGCAGACGTCATACAAGAGGGTAAACCCACCATGCATACTATTACCAGCAACGAGCTCTCCCGCGTGCCTGTAATAGACATGACAGGCAAAGAGAAGCGAGTACTGAGCGGCTACCACGCGCTCAGAGCCGCCGCGCCGCGCTTCGAACACGAGCCGCGGCGCAAGTGTGACAACTTCGCCGCGCCACAGCTGGTGCACAACCTTGAACTGATGGTGGAATGCTGTGAACAGGACATAATCCAAAACGCTCGCGAGCTACAAACAGCGGAAGACGAGATAGTTGTCCTAGAGAGAGACCTAGAGGAATGCAACATAAAGCTGTCGGAACAAGACAAGGTGATCTGCAAGGTGGAAGGTATACTAGAGCGCGTGGAGATGCTCAACAAGCCTGATGTGTCATTAGAAAGGGCCCACGATGTGCTAGCTGAGTTGAAGGACTCCTACGCCGTAGAATACGAGATGTTCGGCTTAGGCACTATAGCGGGTAACATCGTGAGCCCGCTGCTAAGTTCCTTACTAGCTACCTGGGAACCTCTCTCGGCCCCCGACGAGCACATACCCACCTTCCTGAAATGGAGGAAGTTGCTCACTGAAGAGGCGTACAACAGCCTGCTGTGGCAACATTTTGTGCCGCAACTTACTGCGGCTGCTGAAACATGGAACCCACGCATGCCGACGCCGATGCTCCGCGCAGTGTCGGCGTGGCAGGCCGCGGCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCC
GGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGC
AGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGTAACTTAACGTCGCCGGCCTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGCGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGATATGGCAGGAGAAGCCCTAGCCAGCTCGGTGTACCCTCTGATCCGCTCGCGCCTGGCGGCCGCGCTGTCGGCGTGGCACCCGGCGGACGGCTCGGCGCGCGGCGTGCTGCGGGCCTGGCGCGGGGCCTGGGGCCCGGCCCTGCAGGCCATGCTGCACCAGCACATCGTGCCCAAGCTGGACCACTGTCTGCAGCACGCGCCGCTGGAGCTGGTGGGCAGGGAGAACACGGCGTGGCTGTGGTGCGTGGAGTGGCTGGAGCTGCTGGGCGCGGCCACGGTGGCGACGCTGGCGGCGCGCGCGCTGCTGCCGCGCTGGCTGGGCGCGCTGGCCGCCTGGCTCAACACCAACCCGCCGCACGCCACCGTGCTCAACTCCtacactgattttaagAAAATGTTCCCGGAAGAAGTCCTCAAAGAGCCTGCCGTAAGAGACGCCTTCCGCAAAGCCTTAGACATGATGAATCGAAGCGCAGACATCGACTCAGTGGAACCTCCCCCACCACCACGCTTCACCATGCCCGAGCCGAAGGAAACATCCCGTATAACTGAAGTTTTAGCATCTGCTACACAAGCGAAAAGCTTCTCAGAGTTACTTGAGACTAGGTGTATCGAAAAAGGGATCACTTTTGTGCCTATAGCCGGCAAGACCAGAGAAGGTAGACCGCTGTACAAAATTGGCGATCTGCAGTGTTATGTTATAAGAAATGTGATCATGTTTTCCAATGATAGTGGTAGGAGTTTCAACCCTATTAGTATGGATAAGTTGTTGAATATGGTTGAGGATTGA
Protein Sequence
MSDDEVMRFEITDYDLDNEFNPNRTRRAKKEHQIYGVWAKDSDEEDNEDNVRQRSRKPKDFTAPIGFVAGGVQQAGKKKEENKEIESSDASTSRPKFADSSDEDEQNAPDASETAGIRRQGQGMKSVNLGGNVGTWERHTKGIGAKLLLKMGYQPGMGLGKDLQGISAPVEATVRRGRGAIGAYGPEKAAQKAKKEEELRRLKEKGDEKETKEKNYNWKKSHKGRYFYRDAADVIQEGKPTMHTITSNELSRVPVIDMTGKEKRVLSGYHALRAAAPRFEHEPRRKCDNFAAPQLVHNLELMVECCEQDIIQNARELQTAEDEIVVLERDLEECNIKLSEQDKVICKVEGILERVEMLNKPDVSLERAHDVLAELKDSYAVEYEMFGLGTIAGNIVSPLLSSLLATWEPLSAPDEHIPTFLKWRKLLTEEAYNSLLWQHFVPQLTAAAETWNPRMPTPMLRAVSAWQAAAPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGNLTSPAWLARACVARCVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDMAGEALASSVYPLIRSRLAAALSAWHPADGSARGVLRAWRGAWGPALQAMLHQHIVPKLDHCLQHAPLELVGRENTAWLWCVEWLELLGAATVATLAARALLPRWLGALAAWLNTNPPHATVLNSYTDFKKMFPEEVLKEPAVRDAFRKALDMMNRSADIDSVEPPPPPRFTMPEPKETSRITEVLASATQAKSFSELLETRCIEKGITFVPIAGKTREGRPLYKIGDLQCYVIRNVIMFSNDSGRSFNPISMDKLLNMVED
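
The protein sequence can be cross-checked against the coding sequence by translating codons. The following is a minimal sketch that assumes the standard genetic code (this record does not say which table was used). The `translate` helper is a hypothetical name. It strips whitespace and upper-cases the input, so the line breaks and the lower-case run in the coding sequence above do not affect the result. Only the first seven and last six codons of the coding sequence are translated here.

```python
# Standard genetic code with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore case
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First seven codons and last six codons of the coding sequence above.
head = translate("ATGTCGGACGACGAAGTAATG")
tail = translate("AATATGGTTGAGGATTGA")
print(head, tail)  # MSDDEVM NMVED*
```

The translated ends match the start (MSDDEVM) and end (NMVED) of the protein sequence, with the final TGA codon read as the stop.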

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-