Dcae019334.1
Basic Information
- Insect
- Diloba caeruleocephala
- Gene Symbol
- sip1
- Assembly
- GCA_947459985.1
- Location
- OX381624.1:7110000-7121191[-]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 2.3e-55 3.1e-51 175.1 3.5 2 275 366 621 365 621 0.96
Sequence Information
- Coding Sequence
- ATGTCGGACGACGAAGTAATTCGGTTTGAAATTACAGACTATGATTTGGACAATGAATTCAATCCGAACCGGAACAGAAGAGCTAAAAAGGAGCAACAGATTTACGGTGTATGGGCTAAAGATAGTGATGAAGATGACAACGAGGACAATGTCAGACAAAGATCCCGTAAACCTAAAGATTTCTCAGCACCTATTGGATTTGTGGCTGGTGGTGTGCAACAAGCTGGCAAAAAGAAGGAAGAAAGTAAAGAAATAAAACCATCCGAAGCATCCGCCTCCAAACCAAAGTTTGCTGACAGCTCAGATGAGGATGAACAAAATGCGCCAGATGCAAGTGAAACAGCAGGCATCAGGAGACAAGGGCAAGGCATGCGGCCTGCCAACTTAGGGGGCAATGTGGGGAATTGGGAGAGACATACTAAGGGTATCGGAGCCAAGTTGTTACTACAGATGGGTTACCAACCAGGCAGGGGTTTAGGTAAAGATCTGCAGGGCATCTCAGCACCTGTGGAGGCCACCGTCAGGAGAGGAAGAGGAGCTATTGGTGCCTATGGTCCTGAAAAAGCAGCTCAAAAAGCCAAAAGAGAAGAGGATCTACGTCGTCTTAAAGAGAAAGAAGAAGAAAAAGAGACACAAGAGAAAACCTACAACTGGAAGAAATCTCACAAGGGGAGATACTTCTACCGGGATGCTGCTGACGTCATACAAGAGGGTAAACCTACGATGCATACTATTACAAGCAATGAGCTGTCCCGCGTGCCTGTAATCGACATGACGGGCAAGGAAAAGCGAGTGCTTAGTGGTTACCACGCTCTACGTGCCGCCGCGCCGCGCTTCGAACATGAGCCGAGACGAAAGTGCGACAACTTTGCAGCGCCACAACTTGTACATAACTTGGAACTGATGGTGGAGTGCTGTGAACAGGACATAATCCAAAACGCTCGGGAACTCCAAACAGCAGAAGACGAGATAGTAGTCCTAGAGAGGGACCTGGAGGAATGCAACATTAAGCTGCAGGAGCAAGACGACGTGATATTCAAAGTTCGCGGTATATTAGAACGGGTAGAGATGCTGAACAAACCGGAAGTGTCGTTGGAAAAAGCGTATGACGTGTTGGCTGAATTGAAGGAATCCTACCCACTAGAATACGAGATGTTCAGCCTTGGGACTATAGCTGGCAACGTCGTGAGTCCACTTTTCAGCTCACTTCTCGCCACCTGGGAGCCTTTACAAGCACCTGATGAGCCTATACCTACCTTCCTGAAGTGGAGGAAGCTGCTCACAGAAGAAGCTTACAATAATCTTCTGTGGCAGCACTTTGTACCTCAACTTACCGCGGCCGCTGAGGCGTGGAACCCTCGCGTGCCGGGCGCGATGGTGGAGGCAGCGCGCGCGTGGGCGGCTGCTGGGCCCGAGTGGCTGGCGCGTGCAGGTGTGGCGCGTGCCGTGCTGCCGCGCGTGCTGGCGGCCGTGCGAGCCTGGGACCCCACGCACGACACGCAGCCGCTGCACCACTGGGTGCTGCCGTGGCACGACCTGGCAGGTGAAGCACTGGCCAGCTCTGTATACCCGTTGATCCGCTCGCGGCTGGCAGCAGCACTGTCGGCCTGGCACCCGGCCGACGGCTCAGCGCGTGGTGTCCTGGGTGCCTGGCGCGTCGCCTGGGGGCCGGCGCTCACCAACATGCTGCACCAGCACATCGTGCCCAAGCTCGACCACTGCCTGCAGCACGCGCCACTCGAGCTCGTCGGAAGGGAGAACACGGCGTGGCTGTGGTGTGTAGAATGGCTGGAATTGCTAGGCGCGGCAACTGTTGCCGCTATAGCTGCGCGTGCACTATTACCGCGCTGGCTAGCTGCACTGGCAGCCTGGCTCAACACCAACCCACCCCACGCAACAGTGCTCAACTCTTATACAGACTTCAAGAAAATGTTCCCGGAAGAAGTTTTGAAAGAGCCAGCGGTACGTGACGCCTTCCGCAAGGCATTAGACATGATGAACCGCAGCGCAGACCTAGATTCAGTGGAACCACCTCCTCCACCACGCTTCACTATATCAGAACATAAGGAAACGTCTCGAATCGCTGAAGCAATTGCTTCAGCTACACAAGCAAAGAGCTTCTCAGAGTTACTCGAAACCAGATGCATAGAGAAAGGGATTACCTTTGTACCTATAGCTGGAAAAACTAGGGAAGGCCGGCCGTTGTATAAGATTGGCGACCTTCAGTGTTATGTAATAAGGAATGTGATCATGTTTTCCGATGACAGCGGTAGGACGTTCAGCCCTATCAGTATGGATAAGTTGCTAAATATGGTGGAAGAATAA
- Protein Sequence
- MSDDEVIRFEITDYDLDNEFNPNRNRRAKKEQQIYGVWAKDSDEDDNEDNVRQRSRKPKDFSAPIGFVAGGVQQAGKKKEESKEIKPSEASASKPKFADSSDEDEQNAPDASETAGIRRQGQGMRPANLGGNVGNWERHTKGIGAKLLLQMGYQPGRGLGKDLQGISAPVEATVRRGRGAIGAYGPEKAAQKAKREEDLRRLKEKEEEKETQEKTYNWKKSHKGRYFYRDAADVIQEGKPTMHTITSNELSRVPVIDMTGKEKRVLSGYHALRAAAPRFEHEPRRKCDNFAAPQLVHNLELMVECCEQDIIQNARELQTAEDEIVVLERDLEECNIKLQEQDDVIFKVRGILERVEMLNKPEVSLEKAYDVLAELKESYPLEYEMFSLGTIAGNVVSPLFSSLLATWEPLQAPDEPIPTFLKWRKLLTEEAYNNLLWQHFVPQLTAAAEAWNPRVPGAMVEAARAWAAAGPEWLARAGVARAVLPRVLAAVRAWDPTHDTQPLHHWVLPWHDLAGEALASSVYPLIRSRLAAALSAWHPADGSARGVLGAWRVAWGPALTNMLHQHIVPKLDHCLQHAPLELVGRENTAWLWCVEWLELLGAATVAAIAARALLPRWLAALAAWLNTNPPHATVLNSYTDFKKMFPEEVLKEPAVRDAFRKALDMMNRSADLDSVEPPPPPRFTISEHKETSRIAEAIASATQAKSFSELLETRCIEKGITFVPIAGKTREGRPLYKIGDLQCYVIRNVIMFSDDSGRTFSPISMDKLLNMVEE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00345826; iTF_01437063; iTF_00001192; iTF_00850762; iTF_00850761; iTF_00017355; iTF_00018181; iTF_01362444; iTF_01363122; iTF_01361641; iTF_00187181; iTF_01117230; iTF_01119353; iTF_00111679; iTF_01173294; iTF_01312238; iTF_01316367; iTF_01317308; iTF_00112512; iTF_00114117; iTF_00274447; iTF_01027345; iTF_01062819; iTF_01532033; iTF_01063779; iTF_01340131; iTF_00036741; iTF_00809158; iTF_00711880; iTF_01085256; iTF_00301193; iTF_01064682; iTF_00831243; iTF_01533068; iTF_01538766; iTF_00928722; iTF_00685459; iTF_01061909; iTF_00177145; iTF_01425081; iTF_01084284; iTF_00907902; iTF_01533951; iTF_01487715; iTF_00040836; iTF_00041793; iTF_01029286; iTF_00810087; iTF_00771945; iTF_00973802; iTF_00300235; iTF_00850763; iTF_01230603; iTF_00851820; iTF_00951855; iTF_00185328; iTF_00186195; iTF_00039832; iTF_00821848; iTF_01028312; iTF_00447161; iTF_00445196; iTF_00446161; iTF_01031162; iTF_01338778; iTF_00888284; iTF_00952732; iTF_00042653; iTF_00043519; iTF_00383677; iTF_01192734; iTF_01342235; iTF_01026208; iTF_00869650; iTF_01491953; iTF_01179506; iTF_01180327; iTF_00273619; iTF_01221590; iTF_01285557; iTF_01260121; iTF_01264634; iTF_01076518; iTF_00784411; iTF_00783500; iTF_01118331; iTF_00425404; iTF_00709255; iTF_01425949; iTF_00931423; iTF_00932568; iTF_01073631; iTF_00622016; iTF_00761696; iTF_00794624;
- 90% Identity
- iTF_00907064;
- 80% Identity
- -