Basic Information

Gene Symbol
PAXBP1
Assembly
GCA_941918865.2
Location
CALNXB020000680.1:10691-33341[-]
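
The location string above appears to follow a `contig:start-end[strand]` layout. A minimal sketch of how such a string could be split into its parts is given here; the `Location` type, the regular expression, and the helper names are hypothetical and not part of this record, and the length calculation assumes 1-based inclusive coordinates, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start-end[strand]' string."""
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: contig name, colon, start, hyphen, end, strand in brackets.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CALNXB020000680.1:10691-33341[-]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

Under that coordinate assumption the locus spans 22651 bases on the minus strand. The span includes any introns, so it is not expected to match the coding sequence length.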

Transcription Factor Domain

TF Family
GCFC
Domain
GCFC domain
PFAM
PF07842
TF Group
Unclassified Structure
Description
This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  18   4.3e-05      0.38   10.9   0.0         3      40       522     559       520     561  0.92
 2  18    0.0057        51    3.9   0.0        18      40       563     585       560     588  0.89
 3  18    0.0054        48    4.0   0.0        18      40       589     611       584     613  0.89
 4  18    0.0059        52    3.8   0.0        18      40       615     637       612     639  0.89
 5  18    0.0054        48    4.0   0.0        18      40       641     663       636     665  0.89
 6  18    0.0051        45    4.0   0.0        18      40       667     689       662     693  0.89
 7  18    0.0055        48    4.0   0.0        18      40       693     715       689     719  0.90
 8  18    0.0059        52    3.8   0.0        18      40       719     741       716     743  0.89
 9  18    0.0049        43    4.1   0.0        18      40       745     767       739     770  0.89
10  18    0.0051        45    4.0   0.0        18      40       771     793       766     797  0.89
11  18    0.0059        52    3.8   0.0        18      40       797     819       794     821  0.89
12  18    0.0054        48    4.0   0.0        18      40       823     845       818     847  0.89
13  18    0.0054        48    4.0   0.0        18      40       849     871       844     873  0.89
14  18    0.0059        52    3.8   0.0        18      40       875     897       872     899  0.89
15  18    0.0052        46    4.0   0.0        18      40       901     923       896     926  0.89
16  18    0.0068        60    3.6   0.0        18      39       927     948       923     952  0.90
17  18    0.0065        58    3.7   0.0        19      40       954     975       951     977  0.90
18  18   2.6e-11   2.3e-07   31.2   0.0        18     190       979    1130       976    1139  0.70
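
The per-domain rows above can be read programmatically. The sketch below parses whitespace-separated rows in the column order given in the table header and keeps the hits whose i-Evalue falls under a cutoff. The `DomainHit` class, the helper names, and the 0.01 cutoff are illustrative assumptions, not values taken from this record or from HMMER defaults. Only three of the eighteen rows are included to keep the example short.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """Hypothetical record for one row of the per-domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated data row (13 fields expected)."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# Rows 1, 2 and 18 copied from the table above.
rows = [
    "1 18 4.3e-05 0.38 10.9 0.0 3 40 522 559 520 561 0.92",
    "2 18 0.0057 51 3.9 0.0 18 40 563 585 560 588 0.89",
    "18 18 2.6e-11 2.3e-07 31.2 0.0 18 190 979 1130 976 1139 0.70",
]
hits = [parse_row(row) for row in rows]

I_EVALUE_CUTOFF = 0.01  # illustrative threshold, an assumption of this sketch
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

for h in significant:
    ali_len = h.ali_to - h.ali_from + 1
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to} ({ali_len} aa)")
```

With this cutoff only domain 18 is retained from the three sample rows. The other two are short partial matches with much weaker scores.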

Sequence Information

Coding Sequence
ATGTCATTGTTTCGGAAGCCGAAGAAGATTCAGCGTCGGGTATTTTGTGCTGATGATGACGAAGATGGTGAGCCCGAGGCACCTCCGCCACCAGTCATCAGCCAAGAACGGGAGAAGAAAGAAAAGGAACAGAAGAAACCTGCTCTGTTGAGTTTCGCTGACGAAGAAGAAGAAGTGGAGGTGTTTAAAGTCAAGAAGTCTTCTCAAAGTAAGCGCTTAGCGAAGAAGCGTGAGAAAGAGAAGCTGAAACGTGATGTGAGAACCGACGGTGATAATAACAAATATGACACTATTATTATTGAGGACAAGAGCAACGACACAGACTTCATCAACGACAAACCGAAACCAAAGAAGAAGGTGTCACTAGAGGGGTTAATCCTGTCGGGTCGAGAGGCGCTAGCGGCCGACGGGGCGGGGGACGTGTCCGATGAGAGTGCACCCGAAGAGGGGGACGCGGACGGGGACGACAGGGGGTTCCACCGGTACCGCGCCGAGTCCGTGAGGGCCGCACTGTTGGGCGCGCCGGGTCGCATTCCCGACGCAGCGCTCATCCACGCCGCACGCAAGACCAGACAGCAGGCTCGCGAGTTAGGCGGCGGAGGCGCCGGCGCAGATTACATTCCCGTGAAGTCTGCTGAGTCCGCCCCGGGATCCCGGCTAGTTCGGGACGATGGGTCCGGTGATGATGAGGAGGAAGGGCGAATTCATGTCCGGGGACTGGACTTGCCTAGTGATAAGCCTAAACGTGGCACAGCAGCCGTCTCCGAAGAAGAGTTGGACAGCGAATCGGATGAATGGGTGGAACAACAGCTGCAGAAGGCGCGGCCCGTCCTCGCGGATCTGACTGGTGAAGCGCCTATAGAAGTGAACCCATTCTCGGTGTTGTTGCCGCCGCCTCAGCTGTCGGCTCCCGAACACCTGCGACCTCTCACGACTGGCACGCAGCCTCCCGCGACTGCCAACGAGCTGGTGGACGCGCTGAGGAAACGGTTAGGAGAACTCCAAGTAGAACGCGAAGCAACATCAGAAACACAGAAGGACCTGCGCGAGAGGTTGCTCACTGCGGCCAGAATACGAGAGAGTCGAGCGTCGCGTTGTGGGGAGTTAGATGCGGCCTATAGAAGGGCGCAGGCCATTCGAGGATACCTCACAGACCTGATCGAGTGCCTTGATGAGAAGATGCCACAACTGGAGGCGTTAGAAGCTCGGGCGTTAAACTTACACAAGCGTCGCTGTGAGTTCCTTATTGAGAGGCGAAGAGCTGACCTGAGGGATCAGGCGCAAGATGTGCTTGCACCGCCTGGTCGGGTCACAAAACCAACGGACAACGAAGAGAAGACGCGTCGCGCGGCCGAGCGTGAGGGCCGGCGGAGGGCCAGACGGCTCAAGCGCGAGGCCACGGCCGCTGCCGCCGGGACCACGCTGCCGCATCGGGACGGAGACTCCTCGGATGATGAGCTGCCTCCGCATGAGATGCATCACTACACACAAGAGAGAGACGCTATCCGTCAGCAGTCAGCCTCGTTGTTCAGCGACGCGCTCCCCGCGTGGCGCAGTGTGTCGGGTGTGTGCAAGCGTCTCGCGCGCTGGAGAGCTCGCGCCGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCT
GTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAGGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGGTAAGTGCCGACCTGTACACGGACGCTTATGTAGCCGACTGTCTGCCCAAGCTACTGGCGCCTTACGTCCGACATGAGCTAATATTGTGGAACCCGCTGGCAGACGAAGACAACGAAGATTACGAGAGAATGGATTGGTACAAATGTTTGATGATGTACGGCGTCCGTACAGACCGCGGTGCAGACTCATCCTCGTCGGGGTCGGAAGGCGAGGCGGAGCCGCTGCCGGTCACGGACAGCTCCGTGCGAGAGGATCCCGACCTGCTGCTGGTGCCTAGCATCATCAGCAGGGTGGTGCTGCCTTGTCTCACAGAGCTGGTGAGCGTGGCGTGGGACCCGCTGTCGGTGCGCTCGTGCACGCGCCTGCGCGGCCTGCTGCTGCGCGCGGCCGGGCTGCCCGCCTGCGCCTGCGCAGTGCGGCGCCTGGCGGCCGCGCTGCGCGTGCGCCTGTCGCAGGCCTTAGGCGCTGATGTGTTCCTGCCTGCGCTGCCGCCTCAGGTAATGGAAGGTCCGGGCGGTGCGTTCTGGCGACGTTGCCTGGGCGCGGGCGTACGCTTACTGCGCGCCACGCTGTCGCTCACGGGCCCGCCTGCGCTCTTGTATGCTGACCCACTCGTACTGTCACTTATAGAAACGCTCTGTACGGGCGCGGGCGCAGCGGGCGGTCCGTACGTGGCGCACGCGGCGGCCGCGCTGGGCGCCACCCTGCCGCGCGCCGGCTCGCTGCGCAGGCGAGCGCGAGCGCGACTGGCCGCGCTCGCCACGCTAGCGCTGTCCCGGCTCGACACTGATAACCCTATGCATTTGAAAGCTCTAGAACAAGCGAGAGCAGTGATCGCAGAGGCACGTGCGGTTGAATGA
Protein Sequence
MSLFRKPKKIQRRVFCADDDEDGEPEAPPPPVISQEREKKEKEQKKPALLSFADEEEEVEVFKVKKSSQSKRLAKKREKEKLKRDVRTDGDNNKYDTIIIEDKSNDTDFINDKPKPKKKVSLEGLILSGREALAADGAGDVSDESAPEEGDADGDDRGFHRYRAESVRAALLGAPGRIPDAALIHAARKTRQQARELGGGGAGADYIPVKSAESAPGSRLVRDDGSGDDEEEGRIHVRGLDLPSDKPKRGTAAVSEEELDSESDEWVEQQLQKARPVLADLTGEAPIEVNPFSVLLPPPQLSAPEHLRPLTTGTQPPATANELVDALRKRLGELQVEREATSETQKDLRERLLTAARIRESRASRCGELDAAYRRAQAIRGYLTDLIECLDEKMPQLEALEARALNLHKRRCEFLIERRRADLRDQAQDVLAPPGRVTKPTDNEEKTRRAAEREGRRRARRLKREATAAAAGTTLPHRDGDSSDDELPPHEMHHYTQERDAIRQQSASLFSDALPAWRSVSGVCKRLARWRARAADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHEVGADLYTDAYVADCLPKLLAPYVRHEVSADLYTDAYVADCLPKLLAPYVRHELILWNPLADEDNEDYERMDWYKCLMMYGVRTDRGADSSSSGSEGEAEPLPVTDSSVREDPDLLLVPSIISRVVLPCLTELVSVAWDPLSVRSCTRLRGLLLRAAGLPACACAVRRLAAALRVRLSQALGADVFLPALPPQVMEGPGGAFWRRCLGAGVRLLRATLSLTGPPALLYADPLVLSLIETLCTGAGAAGGPYVAHAAAALGATLPRAGSLRRRARARLAALATLALSRLDTDNPMHLKALEQARAVIAEARAVE
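
The relationship between the coding sequence and the protein sequence above can be spot-checked by translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The function and constant names are hypothetical. Whitespace and line breaks in the input are removed before translation, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (assumed), in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1 up to the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE.get(codon)
        if aa is None:
            raise ValueError(f"unexpected codon {codon!r} at position {i}")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last six codons of the coding sequence above.
print(translate("ATGTCATTGTTTCGGAAGCCGAAGAAGATT"))  # prints MSLFRKPKKI
print(translate("GCACGTGCGGTTGAATGA"))              # prints ARAVE
```

Both fragments agree with the start and end of the listed protein sequence. Passing the full coding sequence to `translate` would allow the complete comparison.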

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-