Aoxy007700.1
Basic Information
- Insect
- Allophyes oxyacanthae
- Gene Symbol
- sip1
- Assembly
- GCA_932294395.1
- Location
- CAKOAO010000094.1:1915248-1924991[-]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 3.5e-34 4.3e-30 105.7 2.3 1 170 367 517 367 525 0.95 2 15 1.5e-10 1.8e-06 28.2 0.8 111 170 522 575 517 580 0.92 3 15 1.2e-10 1.5e-06 28.5 1.0 111 170 580 633 574 641 0.90 4 15 1.8e-10 2.2e-06 27.9 0.9 111 170 638 691 634 695 0.92 5 15 1.4e-10 1.7e-06 28.3 0.7 111 170 696 749 691 754 0.92 6 15 1.1e-10 1.4e-06 28.6 0.9 111 170 754 807 748 815 0.90 7 15 2e-10 2.4e-06 27.8 1.3 112 170 813 865 809 872 0.92 8 15 1.3e-10 1.6e-06 28.3 0.7 111 170 870 923 864 928 0.92 9 15 1.9e-10 2.3e-06 27.9 1.3 111 170 928 981 924 988 0.91 10 15 1.7e-10 2.1e-06 28.0 1.4 111 170 986 1039 982 1047 0.91 11 15 1.7e-10 2.1e-06 28.0 0.9 111 170 1044 1097 1040 1102 0.92 12 15 1.7e-10 2.1e-06 28.0 1.4 111 170 1102 1155 1098 1163 0.91 13 15 1.4e-10 1.7e-06 28.3 0.6 111 170 1160 1213 1155 1217 0.92 14 15 1.7e-10 2.1e-06 28.0 1.0 112 170 1219 1271 1215 1277 0.92 15 15 9.6e-34 1.2e-29 104.3 1.1 111 275 1276 1435 1272 1435 0.95
Sequence Information
- Coding Sequence
- atgtcggaCGATGAAGTAATTCGGTTTGAAATCACCGACTATGATTTGGACAACGAATTCAATCCAAACAGAAACAGAAGAGCTAAGAAGGAACAGCAAATTTacgGTGTATGGGCTAAAGACAGTGATGAAGAGGACAATGAGGATAATGTCAGACAAAGAACTCGCAAGCCTAAAGACTTCTCCGCACCTATTGGGTTTGTGGCCGGTGGTGTGCAACAAGCAGGGAAGAAGAAAGAGGAAATTAAGGGTAAAGAAGTGGAATCCGCCGAAGCTTCTACCTCCCGTCCAAAGTTTGCCGACAGCTCAGATGAGGATGAACAGAACGCCCCAGATGCAAGTGAAACAGCGGGCATCAGGAGACAGGGGCAAGGCATGAGGCCCGCCAACCTTGGAGGCAATGTGGGGACTTGGGAGAGGCATACTAAAGGTATTGGAGCCAAGCTGCTATTGCAGATGGGCTACCAACCAGGAAGGGGTTTGGGGAAGGACCTGCAGGGTATCTCGGCGCCCGTAGAGGCCACGGTCAGGAGAGGAAGAGGTGCTATTGGAGCCTATGGTCCAGAGAAAGCAGCTcAAAAAGCCAAAAGAGAAGAGGAGCAGCGTCGTCTGAAAGAGAAGGAAGATGAGAAAGGGGGAAAGGAGAAGAATTACAACTGGAAGAAGTCTCATAAGGGAAGATACTTCTACAGAGATGCCGCTGACGTCATACAAGAGGGTAAACCCACGATGCATACTATTACCAGCAACGAACTATCCCGTGTCCCTGTGATCGACATGACGGGCAAAGAGAAGCGAGTGCTGAGCGGGTACCACGCACTTCGCGCCGCCGCTCCACGCTTCGAGCACGAGCCGCGCCGCAAGTGCGACAACTTCGCGGCTCCGCAACTGGTACACAACTTGGAGCTGATGGTGGAGTGCTGTGAACAGGACATAATTCAAAACGCTCGCGAGCTCCAATCAGCGGAAGACGAAATAGTCGTTCTGGAGAGAGACCTAGAAGAATGCAGCTTAAAGTTAGAAGACCAAGACAAAGTGATCAGCAAGGTGCAAGGCATACTGCAACGAGTGGAGCTGTTGAATAAACCGGATGTGTCGCTGGAGAGAGCGCATGATGTGTTGGCGGAACTAAAGGAATCCTACCCCCTAGAATACGAGATGTTCGGCCTGGGCACGATAGCAGGCAACATAGTGAGTCCTCTCCTCAGCTCCCTGCTAGCCACGTGGGAGCCCCTGCAGGCGCCCGAAGAGCCTATCCCCACCTTCGTGAAGTGGAGGAAGCTGCTTACTGAAGAAGCTTACAATAGTTTGCTGTGGCAACACTTTGTGCCTCAACTTACTGCGGCTGCTGAAACATGGAACCCTCGCATCCCGGGCCCGATGCAGCGCGCGGTGTGCGCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGTCGGTACCCGCACACATTGTTACCGTGGCAGGCGGCGTGCCCGGCGTGGCTGGCGCGCGCGTGCGTGGCGCGCTGCGTCGTGCCGCGCGTGCTGGCCGCCGCGCGCGCCTGGGACCCCACCCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCCTGGCACGACATGGCAGGAGAAGCGCTGGCCAGCTCGGTGTACCCGCTGATCCGCTCGCggctggcggcggcgctggcggcGTGGCACCCGGCGGACGGGTCGGCGCGCGGCGTGCTGGGCGCGTGGCGCGCGGCGTGGGGGCCCGCGCTCACCAACATGCTGCACCAGCACATCGTGCCCAAGCTCGACCACTGCCTCACGCACGCGCCGCTCGAGCTCGTGGGCAGGGAGAACAcGGCCTGGCTATGGTGCGTAGAATGGCTGGAACTGCTCGGCGCTGCAACTGTAGCAGCCATAGCTGCGCGCGCATTACTGCCGCGCTGGCTAGCTGCATTAGCGGCCTGGCTCAACACCAACCCGCCGCACGCTACTGTGCTGAACTCTTATACCGATTTCAAGAAAATGTTCCCCGAGGAAGTCCTTAAAGAGCCAGCGGTGCGTGACGCCTTCCGCAAGGCATTGGACATGATGAACAGGAGCGCAGATCTGGACGCAGTAGAACCGCCTCCTCCACCTCGTTTCTCCATGCCAGAACCTAAAGAAACCTCCCGCATCGCCGAAGTAATAGCTTCAGTCACTCAGGCGAAGAGCTTCTCAGAGCTACTGGAGACGAGGTGCATCGAAAAAGGAATCACATTTGTTCCAATCGCTGGAAAAACCAGGGAAGGTAGACCCCTTTATAAGATTGGGGAGCTTCAATGTTACGTCATCAGGAATGTGATCATGTATTCCAACGATAGTGGTCGGACGTTTAGCCCTATCAGCATGGATAAGCTGCTGAATATGGTggaagaataa
- Protein Sequence
- MSDDEVIRFEITDYDLDNEFNPNRNRRAKKEQQIYGVWAKDSDEEDNEDNVRQRTRKPKDFSAPIGFVAGGVQQAGKKKEEIKGKEVESAEASTSRPKFADSSDEDEQNAPDASETAGIRRQGQGMRPANLGGNVGTWERHTKGIGAKLLLQMGYQPGRGLGKDLQGISAPVEATVRRGRGAIGAYGPEKAAQKAKREEEQRRLKEKEDEKGGKEKNYNWKKSHKGRYFYRDAADVIQEGKPTMHTITSNELSRVPVIDMTGKEKRVLSGYHALRAAAPRFEHEPRRKCDNFAAPQLVHNLELMVECCEQDIIQNARELQSAEDEIVVLERDLEECSLKLEDQDKVISKVQGILQRVELLNKPDVSLERAHDVLAELKESYPLEYEMFGLGTIAGNIVSPLLSSLLATWEPLQAPEEPIPTFVKWRKLLTEEAYNSLLWQHFVPQLTAAAETWNPRIPGPMQRAVCAWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGRYPHTLLPWQAACPAWLARACVARCVVPRVLAAARAWDPTHDTQPLHHWVLPWHDMAGEALASSVYPLIRSRLAAALAAWHPADGSARGVLGAWRAAWGPALTNMLHQHIVPKLDHCLTHAPLELVGRENTAWLWCVEWLELLGAATVAAIAARALLPRWLAALAAWLNTNPPHATVLNSYTDFKKMFPEEVLKEPAVRDAFRKALDMMNRSADLDAVEPPPPPRFSMPEPKETSRIAEVIASVTQAKSFSELLETRCIEKGITFVPIAGKTREGRPLYKIGELQCYVIRNVIMYSNDSGRTFSPISMDKLLNMVEE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -