Cpal008018.1
Basic Information
- Insect
- Carterocephalus palaemon
- Gene Symbol
- -
- Assembly
- GCA_944567795.1
- Location
- CALYMS010000102.1:2171389-2182691[-]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 13 2.1e-19 7e-16 58.8 1.3 77 163 190 270 176 275 0.92 2 13 0.00047 1.5 8.4 0.1 132 163 274 300 269 305 0.87 3 13 0.00047 1.5 8.4 0.1 132 163 304 330 299 335 0.87 4 13 0.00049 1.6 8.4 0.1 132 163 334 360 329 364 0.87 5 13 0.00053 1.7 8.3 0.1 132 162 364 389 359 392 0.87 6 13 0.00047 1.5 8.5 0.1 132 163 394 420 389 425 0.87 7 13 0.00047 1.5 8.5 0.1 132 163 424 450 419 455 0.87 8 13 0.00046 1.5 8.5 0.1 132 163 454 480 449 485 0.87 9 13 0.00046 1.5 8.5 0.1 132 163 484 510 479 515 0.87 10 13 0.00047 1.5 8.5 0.1 132 163 514 540 509 545 0.87 11 13 0.00047 1.5 8.4 0.1 132 163 544 570 539 575 0.87 12 13 0.00045 1.5 8.5 0.1 132 163 574 600 569 606 0.87 13 13 1.3e-31 4.1e-28 98.9 1.4 132 275 604 743 599 743 0.95
Sequence Information
- Coding Sequence
- ATGTTCGGCCTGGGCACCATAGGCGGCAACGTGGCGGCGCGTGCTCGCCGAGGAGGCCTACAGCAGCCTGCTGTGGCAGCACTTCCTGCCCAGGATCACCGCCGCGGCCGAGTGAGTCAAACTAACCACCCGCCGGAACACGAGATGTTCGGCCTGGGCACCATAGGCGGCATCGTGGCGGCGCGTGCTCGCCGAGGAGGCCTACAGCAGCCTGCTGTGGCAGCACTTCCTGCCGAGGATCACCGCCGCGGCCGAGTGAGTCAAACTAACCACCCGCTGGAACACGAGATGTTCGGCCTGGGCACCATAGGCGGCAACGTGGCGGCGCGTGCTCGCCGAGGAGGCCTACGACAGCCTGCTGTGGCAGCACTTCCTGCCGAGGATCACCGCCGCGGCCGAGTGAGTCAGACTAACCACCCGCTGGAATACGAGATGTTCGGCCTGGGCACCATAGGCGGCAACGTGGCGGCGCGTGCTCGCCGAGGAGGCCTACAACAGCCTGCTGTGGCAGCACTTCCTGCCGAGGATCACCGCCGCGGCCGAGCGGCAACGTGGCGGCGCGTGCTCGCCGAGGAGGCCTACAACAGCCTGCTGTGGCAGCACTTCCTGCCCAGGATCACCGCCGCGGCCGAATCGTGGAACCCCCGCGCTCCGGAGCCGATGCTGGCCGTGTTCGAGGCGTGGGCGGACGCGGCGCCGCGCTGGCTGCTCgaggcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGgcggtggcggcgcgcgcgctcgccccgcgcctgctggccgccgTGCGCGCGTGGGACCCCACCGCGGACACGCAGCCGCTGCACCACTGGGTGCTGCCGTGGCACCCGCTCGCCGGCACAGCCCTGAACACGTCAGTGTACCCGCTGATCCGGTCCCGGCTAGCCGCGGCGCTGTCCGCGTGGCACCCGGCCGACGCCTCAGcccgcccactgctgggcgcgtgGCGGCACGCGTGGGGCGCGGCGCTGACGGCGCTGCTGCACCACCACGTGCTGCCCAAGCTGGAGCACTGCTTGCAGCACGCGCCGCTCGAGCTAGTCGGCAGGGAGAACACTGCGTGGCTGTGGTGCGTGGAGTGGATCGAGCTGGTCGGCGCGCCTACTATcggtgcgctggcggcgcgcgcgctgctgccgcgctggctggccgcgctggccgcgtggctcaacacgggcccgccgcacgccaccgtgCTCGCCTCCTACGCCGACTTCAAGAAAATGTTCCCGGAAGAGGTGCTGAAAGAGCCGGGCGTTCGCGACGCGTTCAGAAAAGCCCTCGACATGATGAACCGAAGCACCGACATCGATTCCGTCgagccgccgcccccgccgcgctTCACGCCCGTCGACACCAAGGAAACATCCAAGATATCCGACGTCTTAGCCAGTATTACACAGCACAAATCCTTCTCGGAGATTCTAGAGTCTAGATGTATTGAGCGCGGGATAACCTTTGTTCCAATCGCAGGCAAGAGCAGAGAGGGCCGGCCGCTGTACAAGATAGGACAACACCAGTGCTACGTTATACGGAACGTTATTATGTATTCGGACGACGCTGGGCGGACGTTTTCGCCCATAGGATTGGATAGGTTGTTAAATTTGGTTGAAGAATGA
- Protein Sequence
- MFGLGTIGGNVAARARRGGLQQPAVAALPAQDHRRGRVSQTNHPPEHEMFGLGTIGGIVAARARRGGLQQPAVAALPAEDHRRGRVSQTNHPLEHEMFGLGTIGGNVAARARRGGLRQPAVAALPAEDHRRGRVSQTNHPLEYEMFGLGTIGGNVAARARRGGLQQPAVAALPAEDHRRGRAATWRRVLAEEAYNSLLWQHFLPRITAAAESWNPRAPEPMLAVFEAWADAAPRWLLEAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWAVAARALAPRLLAAVRAWDPTADTQPLHHWVLPWHPLAGTALNTSVYPLIRSRLAAALSAWHPADASARPLLGAWRHAWGAALTALLHHHVLPKLEHCLQHAPLELVGRENTAWLWCVEWIELVGAPTIGALAARALLPRWLAALAAWLNTGPPHATVLASYADFKKMFPEEVLKEPGVRDAFRKALDMMNRSTDIDSVEPPPPPRFTPVDTKETSKISDVLASITQHKSFSEILESRCIERGITFVPIAGKSREGRPLYKIGQHQCYVIRNVIMYSDDAGRTFSPIGLDRLLNLVEE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -