Agri023651.1
Basic Information
- Insect
- Achroia grisella
- Gene Symbol
- PAXBP1
- Assembly
- GCA_030625045.1
- Location
- JAULSB010000467.1:7455341-7467965[+]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 10 0.0034 32 4.6 0.0 4 40 268 304 265 307 0.86 2 10 0.0069 66 3.6 0.1 12 40 301 329 300 332 0.87 3 10 0.0058 55 3.9 0.1 12 40 326 354 323 357 0.88 4 10 0.006 58 3.8 0.1 12 40 351 379 349 382 0.88 5 10 0.006 58 3.8 0.1 12 40 376 404 374 407 0.88 6 10 0.0061 58 3.8 0.1 12 40 401 429 399 432 0.88 7 10 0.0068 65 3.6 0.1 12 40 426 454 425 458 0.87 8 10 0.006 58 3.8 0.1 12 40 451 479 449 482 0.88 9 10 0.0061 58 3.8 0.1 12 40 476 504 474 507 0.88 10 10 1.9e-12 1.8e-08 35.0 1.5 12 147 526 642 523 657 0.83
Sequence Information
- Coding Sequence
- ATGTTTCCATTTGACTGGGTCAGTGTTTttatgataactatgactgcaAGCACAGCGAGATGCGAGCGCGACCGAGAGATCCTTATCGATCCTCACTCATCTCGAGCCCGGCGCACGTCGACCATTAATCTAACGGGGCCCAATGACGATAGATCGTGCCGAACATGTCACGTTCTCCGATCGTTATCAGGGCGAGCTCATCGTTATTGCGGTCGCAGGCTGGAGGAGCTGCAGATGGACCGGGACGAGACGCGCGCGCGCCGCGACGAGGTGGTGCGGCGGCTGCAGGCGGCGGCCGCGGCGCGCCGCACGCGGGCCGCGCGCCGCGCCGGGCTCGACGCCGCCTACCGCCGCGCGCAGGCCGCCCGCGGCTACCTCACTGACCTCGTCGAGTGTCTCGACGAGAAGATGCCGGCGCTGCAGGCGCTGGAGGCACGCGCGTTAGCGCTGCATCGGCGGCGCTGCGAGTTCCTGCAGGAGAGGCGCCGCGCTGACGTCCGCGACCAGGCGCAGGACGCGCTCGCGCTTGCCGcaCGGCCCGGCGCCGCGAAGCCCGTGGATTCAGAGGAGAAGATACGTCGCGCGGCCGAGCGAGAGGGCAGGAGGAGAGCGAGACGGCTGcagcgcgcggcggcggcggccgcggccggcGCCACCCCGCGACACCGCGACGGAGACTCCAGCGACGACGACCTCCCGCCCGCAGAGCACCAGCACTACCAGCAGGAGAGAGAGACGATCAAGCAGATGTCGTCGTCGCTGTTTGCGGACGCGCTGCCGGCGTGGCGCAGCGTGGCCGGCGTCTGCGCACGTCTCGCACGCTTCCGGCGACGCTCACGAGCACTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTTTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTGCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCGCACTCCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGGTGAGCGCGCTCTACTGCGACGCGTACGTCGCACACTCCCTGCCGCAGCTGCTCGCGCCGTACGTGCGCCACCAGCTTATATTATGGAATCCGTTAGCTGACGAAGATAATGAGGACTACGAGAAAATGGACTGGTACAAATGTCTGATGATGTACGGAGTCCGGGCCGAGCGGCTGTCTAGTGAGTCTGAGCAGTCGGATGATGAGGAGGAAGCAGAGGATGTTACGGTTACAGAGACCTCGGTGCGGGAAGATCCTGACCTGTACCTAGTGCCGACCATCATCAACAAACTCGTACTGCCTAAGCTTACAGAGCTGGTGGAGCAGGCGTGGGACCCGATGTGTGTGCGCGCGTGCGTCCGGCTGCGGCACCTCGTACTGCGCGCCGCCGGcctgccgccgccgcccgccgccggacCGGACCCCGCGCCCGGCTCCGGCCCCGGCTCCGGCCCCGGCCCCGGCTCCGGCCCGGACACCGCCGCGCCGGCGCCTCCGTTACGACGGCTAGCGCTCGCCGTGCGCGCGCGGCTCGCAGCCACGATCAACGCAGATGTCTTCTTACCCACGCTGCCTCCACAAGCGATGGAGGGTGCGGGCGGTCTGTTCTGGCGGCGGTGTGCTGGCGGCGCCGTGCGGCTGCTGCGCGGGACGCTGGCGctcgccgcgccgcccgccctgCTGCGTGCCGACGCGCTCGTGCTCGCCCTGCTCGAGACGGTGTGctgcgcggcgggcgcggctgctgcgggcgggggcgcgggcgcgggggcggtggcggcggcgggggcggcggccgCGCTGGCTGCGTCGCTGCCACGTGCCGGCCAGCTGCGAGCACGTGCGCTGCCGCGACTGGCCGCGCTCGCACAACACGTGCTGCAATACCTGCACTCGGACAATCCCATGCACCTaaaagCACTGGAACAGGCCAGAAGTTTAATAGCAGAAGCTCGCGCGTTAGAATGA
- Protein Sequence
- MFPFDWVSVFMITMTASTARCERDREILIDPHSSRARRTSTINLTGPNDDRSCRTCHVLRSLSGRAHRYCGRRLEELQMDRDETRARRDEVVRRLQAAAAARRTRAARRAGLDAAYRRAQAARGYLTDLVECLDEKMPALQALEARALALHRRRCEFLQERRRADVRDQAQDALALAARPGAAKPVDSEEKIRRAAEREGRRRARRLQRAAAAAAAGATPRHRDGDSSDDDLPPAEHQHYQQERETIKQMSSSLFADALPAWRSVAGVCARLARFRRRSRALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHCLPQLLAPYVRHQVSALYCDAYVAHSLPQLLAPYVRHQVSALYCDAYVAHSLPQLLAPYVRHQLILWNPLADEDNEDYEKMDWYKCLMMYGVRAERLSSESEQSDDEEEAEDVTVTETSVREDPDLYLVPTIINKLVLPKLTELVEQAWDPMCVRACVRLRHLVLRAAGLPPPPAAGPDPAPGSGPGSGPGPGSGPDTAAPAPPLRRLALAVRARLAATINADVFLPTLPPQAMEGAGGLFWRRCAGGAVRLLRGTLALAAPPALLRADALVLALLETVCCAAGAAAAGGGAGAGAVAAAGAAAALAASLPRAGQLRARALPRLAALAQHVLQYLHSDNPMHLKALEQARSLIAEARALE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -