Lfig011242.1
Basic Information
- Insect
- Lasioglossum figueresi
- Gene Symbol
- PAXBP1
- Assembly
- GCA_028455805.1
- Location
- CM052201.1:2348677-2357346[-]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 1.5e-29 1.5e-25 90.5 2.6 1 272 541 751 541 753 0.92
Sequence Information
- Coding Sequence
- ATGTTAGGGACAATGTCATTATTCAATAAACCGAAAAGGAATATCCGTCGTCGACATTTCAACGACGATGACGAAGATAATGAAAACAGGATGGAAGTTGAGGATGCACAACCAGTCAAATCTAAAACAAAGAAGAAGGATAAGCCGAAGCAGACGCTCCTTAGTTTTGGAGAGGAGCTGGAAGAAGCGGATGATGGTGAAGTCTTCAAAGTGAAGAAATCATCACGGAGCAAAAAATTGATGAAACAGCTAGATCACGAGAGGAGAAAAAAGAAAGGTGAAGAGAAAATGCAAGTGGACTCTGAGCAAGCAAACATGTCCATTAAACAGGAAAATGATTTAGAAATAAAGACAGATGATCTAGTAGTTAAAATAAAAAATACAGGACCATTAATCTTAAATGGACGAGCTGCGCTGGCAGCTGGAAAAGATGACTATATGTCAGATGAAGAAGATGATAAACTACATGGTCATAAATTCATTAGAAATACAGACAGAGCTGACACAATGAAAATTCTTCTTGAAAGTGGTTGTATACCTGATGCGGCAATGATTCATGCAGCTAGAAAGCGTAGACAGAAGGCAAGAGAGCTGGGGACTGATTACATTCCTATAGAAGAGCAAAGCGATGATAAGGGGAAGTCAAGACTTATTAGGGAAGAAGATCATGACAGAAGTGATGACGATGATTCTCAAGATCGTATTGATATGACTGTTAATACTGAAGCTCGAGATAAAGAAAAAAGAAGAGAAGCATTTCTTGCATCACAAGCACCAATAAAACTTTCGGACGACGAAAGCGAACATGAAATTGAGGAAGAAGAGTGGGAAGCTCAGCAAATAAGGAAAGGTGTAACAGGCGCACAGATTGCAGCGGTACAACAGGATTCCATGATGCAGCAGCAGTTCTCACTGGGTATGAATGTGAATCAGATAATGGGAAGTGGGGTACCTTTAGAAATAATGATGCCAGCTCCACCACCTCCGCCAACAATTCAGCCACCAGATCCAACAAAAATAGTACCAGTTTCTCCTCAGGAAGTTCTCAATAAGCTACGTGCAAGATCGGACAGTTTAAAAGAAGTACATCGGCGTCACCAATCAGATCAGGACCGTTTAGAAGAAGAATTAGGACAAGCTAGTAAAGAACTGGAAGATGGCGAAATTCGCGCACCGCAGCTTGCACAGCGCTTCACATATTACCAAGAGTTGCGTGGGTATGTCACTGACTTGGTAGAGTGTCTTGATGAGAAGCTGCCTCTGGTTGTTGGATTGGAGCAACGTTGGTTAGATCTTTATAGTGAACGCGCGACTGAACTAATGGAGAGACGGCGGCAAGACACAAGAGATCAAGCAGAAGAAATTACAACAGCTGCAAGAGGCCAGCCTATACGGAGGGGACCAGAAGTTGAAGTTCGTAAGCGTCGAGAAACTGAAAGAGAAGGTAGAAGAGCTCGTCGAAGAAGAGCTAGAGAATCTACATTGCCAAAACATATTGACGGAATGTCTAGTGACGATGAAGTTACAGAACAGCAAAATCTTGCTTTTAAGCAAACGAAAGATGAAATCGACAATGAAAGTAAAGAAATTTTCCATGATGTAAGGGACGAGTACTGTACCTTACGAGGAATACTATCGAAGTTGGAATCTTGGAGAGAAAGAGATAGAGATGCTTACAAGGAGGCTTACGTTTCTTTATGTATACCTAAAATCATATCTCCTATTATTAGATGGCAATTGCTGACATGGAATCCAATTATGGAAAGCGCCGATATAGAAAGAACAAAATGGTACAACACATTGTTGTTATATGCACTAGATAATAAAGAAACCGAAGAATCGCTTAAAAGGGATCCGGATGTTAGATTAGTACCATCAACAGTGGAAAAAATTGTCTTACCTAAATTAACGTCTATAGTTGAAAAAATATGGGATCCCATGTCTACGTCACAAACATTACGACTTGTCGGCACAGTAAATCGTCTTATCAGAGAGTATCCTAATTTAAACGACACAAGTAAACAATTAGAAGCATTATTTAATGCTATATTAGAAAAAATTAAAGCAGCAGTGGAGAATGATGTTTTTATACCAATTTTTCCAAAACAAGTTTTGGATACTAAACATCAATTCTTTCAAAGACAATTTTCAATGGCTGTTAAACTTTTGAGAAATTTACTTAGTTGGCAAGGTCTTCTCGGCGACACACAATTAAAAAACCTGGCGCTTGGCTCGTTGTTAAATAGATATCTTTTGGCTGGTTTGAAGGTTTCTGTTCCAACTGATGCTTTATTTAAAGCAAATATGGTAATGAGTACATTGCCTCGCGCATGGTTGCAAGGTGAAACAATAGAACACCTGAGAATGTTTGCTAGCCTTATTCAACAGCTTAGTGAACAATTAGATCAAGCCAATCCAGCGCATAATGAAGCCTGGGAATACTCGAAGTCCATCCTGAAAATAATCAAGCCTTTGTAA
- Protein Sequence
- MLGTMSLFNKPKRNIRRRHFNDDDEDNENRMEVEDAQPVKSKTKKKDKPKQTLLSFGEELEEADDGEVFKVKKSSRSKKLMKQLDHERRKKKGEEKMQVDSEQANMSIKQENDLEIKTDDLVVKIKNTGPLILNGRAALAAGKDDYMSDEEDDKLHGHKFIRNTDRADTMKILLESGCIPDAAMIHAARKRRQKARELGTDYIPIEEQSDDKGKSRLIREEDHDRSDDDDSQDRIDMTVNTEARDKEKRREAFLASQAPIKLSDDESEHEIEEEEWEAQQIRKGVTGAQIAAVQQDSMMQQQFSLGMNVNQIMGSGVPLEIMMPAPPPPPTIQPPDPTKIVPVSPQEVLNKLRARSDSLKEVHRRHQSDQDRLEEELGQASKELEDGEIRAPQLAQRFTYYQELRGYVTDLVECLDEKLPLVVGLEQRWLDLYSERATELMERRRQDTRDQAEEITTAARGQPIRRGPEVEVRKRRETEREGRRARRRRARESTLPKHIDGMSSDDEVTEQQNLAFKQTKDEIDNESKEIFHDVRDEYCTLRGILSKLESWRERDRDAYKEAYVSLCIPKIISPIIRWQLLTWNPIMESADIERTKWYNTLLLYALDNKETEESLKRDPDVRLVPSTVEKIVLPKLTSIVEKIWDPMSTSQTLRLVGTVNRLIREYPNLNDTSKQLEALFNAILEKIKAAVENDVFIPIFPKQVLDTKHQFFQRQFSMAVKLLRNLLSWQGLLGDTQLKNLALGSLLNRYLLAGLKVSVPTDALFKANMVMSTLPRAWLQGETIEHLRMFASLIQQLSEQLDQANPAHNEAWEYSKSILKIIKPL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01123058;
- 90% Identity
- iTF_00863950; iTF_00141223; iTF_01420929; iTF_00684162; iTF_00360899; iTF_00860387; iTF_00863267; iTF_00865316; iTF_00873796; iTF_00963844; iTF_00219109; iTF_00224549; iTF_00229350; iTF_00024615; iTF_00216391; iTF_00227271; iTF_00183944; iTF_00221257; iTF_00215018; iTF_00230026; iTF_00233226; iTF_00231979; iTF_01071019; iTF_01418347; iTF_00217069; iTF_01066253; iTF_00773806; iTF_01169166; iTF_00225232; iTF_00217745; iTF_00760977; iTF_00861137; iTF_01420273; iTF_00183282; iTF_00360210; iTF_00676018; iTF_00983602; iTF_01123058; iTF_01418994; iTF_00219795; iTF_00221907; iTF_00231315; iTF_00625491; iTF_00088820; iTF_00762548; iTF_00087157; iTF_00214389; iTF_01419638; iTF_00215704; iTF_00225912; iTF_00306264; iTF_00232601; iTF_00763225; iTF_00227964; iTF_00961820; iTF_00142461; iTF_00141822; iTF_01122430; iTF_00230687; iTF_01065567; iTF_00220576; iTF_00226592; iTF_00862560; iTF_01232994; iTF_01234341; iTF_01232397; iTF_01235023; iTF_01512182; iTF_01512826; iTF_01513471; iTF_00866005; iTF_00866699; iTF_00633625; iTF_00391288; iTF_00864634; iTF_00733952;
- 80% Identity
- -