Rfor005968.1
Basic Information
- Insect
- Rhodophaea formosa
- Gene Symbol
- PAXBP1
- Assembly
- GCA_963082605.1
- Location
- OY720207.1:20158184-20177106[+]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 35 5e-06 0.056 13.9 0.0 3 40 200 237 198 240 0.92 2 35 4.5e-06 0.05 14.1 0.0 4 40 241 277 239 280 0.94 3 35 4.5e-06 0.05 14.1 0.0 4 40 281 317 279 320 0.94 4 35 4.5e-06 0.05 14.1 0.0 4 40 321 357 319 360 0.94 5 35 4.5e-06 0.05 14.1 0.0 4 40 361 397 359 400 0.94 6 35 4.4e-06 0.049 14.1 0.0 4 40 401 437 399 441 0.94 7 35 4.4e-06 0.049 14.1 0.0 4 40 441 477 438 480 0.94 8 35 4.4e-06 0.049 14.1 0.0 4 40 481 517 478 520 0.94 9 35 5.7e-06 0.064 13.7 0.0 4 39 521 556 518 558 0.94 10 35 4.7e-06 0.053 14.0 0.0 4 40 561 597 559 599 0.94 11 35 4.5e-06 0.05 14.1 0.0 4 40 601 637 599 640 0.94 12 35 4.5e-06 0.05 14.1 0.0 4 40 641 677 639 680 0.94 13 35 4.4e-06 0.049 14.1 0.0 4 40 681 717 678 720 0.94 14 35 4.7e-06 0.053 14.0 0.0 4 40 721 757 719 759 0.94 15 35 4.5e-06 0.05 14.1 0.0 4 40 761 797 759 800 0.94 16 35 4.6e-06 0.051 14.1 0.0 4 40 801 837 798 839 0.94 17 35 4.4e-06 0.049 14.1 0.0 4 40 841 877 838 880 0.94 18 35 4.4e-06 0.049 14.1 0.0 4 40 881 917 878 920 0.94 19 35 4.4e-06 0.049 14.1 0.0 4 40 921 957 918 960 0.94 20 35 4.5e-06 0.05 14.1 0.0 4 40 961 997 959 1000 0.94 21 35 4.6e-06 0.051 14.1 0.0 4 40 1001 1037 998 1039 0.94 22 35 4.4e-06 0.049 14.1 0.0 4 40 1041 1077 1038 1080 0.94 23 35 4.5e-06 0.05 14.1 0.0 4 40 1081 1117 1079 1120 0.94 24 35 4.4e-06 0.049 14.1 0.0 4 40 1121 1157 1118 1160 0.94 25 35 4.6e-06 0.051 14.1 0.0 4 40 1161 1197 1158 1199 0.94 26 35 4.7e-06 0.053 14.0 0.0 4 40 1201 1237 1199 1239 0.94 27 35 4.5e-06 0.05 14.1 0.0 4 40 1241 1277 1239 1280 0.94 28 35 4.5e-06 0.05 14.1 0.0 4 40 1281 1317 1279 1320 0.94 29 35 4.4e-06 0.049 14.1 0.0 4 40 1321 1357 1318 1360 0.94 30 35 4.5e-06 0.05 14.1 0.0 4 40 1361 1397 1359 1400 0.94 31 35 4.3e-06 0.048 14.1 0.0 4 40 1401 1437 1398 1441 0.94 32 35 4.3e-06 0.048 14.1 0.0 4 40 1441 1477 1438 1481 0.94 33 35 4.5e-06 0.05 14.1 0.0 4 40 1481 1517 1479 1520 0.94 34 35 4.6e-06 0.051 14.1 0.0 4 40 1521 1557 1518 1559 0.94 35 35 1.1e-16 1.2e-12 48.9 0.5 4 190 1561 1726 1559 1735 0.69
Sequence Information
- Coding Sequence
- ATGAAGCCGCGTTTAAGACTTCAAGAGCAACAACAAGAGCGCGAGAAGACAGCTGCTCGCGTGGAAGAGATACGTTCCCGCGTTCTCAATGCGGCGAGGATCCGAGAATCTCGCGCTGCGAGGACCTCGGCGCTGGACGCCGCGTACAGACGAGCGCAGGCCGCGCGCGGGTACCTCACTGACCTCGTGGAGTGTCTGGATGAGAAGATGCCGCAACTGGAAGCGCTGGAGGCTCGTGCGCTGGCTTTGCACCGGCGGCGCTGCGAGTTCCTCACGGAGCGGCGCCGCGCCGACGTGCGCGACCAGGCGCACGACGTGCTCGCACTCGCAGCTCGACCCGGAGCTGCGAAGCCAGTAGACTCGGAAGAGAAAATTCGTCGCGTGGCCGAGCGGGAAGGGCGACGACGCGCGAGGAGGTTACAACGCGACGCTGCGGCCGCCAACGCCGGCACTGCTGTGAGGCATCGGGACGGGGACTCTAGTGATGATGAATTACCTCCCACGGAATTACAGCATTATAAACAGGAGAGAGAGTCGCTGCGCTCGCAGTCGGCGGCGCTGTTCGCGGACGCGCTCCCGGCGTGGCGCGGCGTGCGCGGCGTGTGCGCGCGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGGGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTACTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTACTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTACTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTACTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTACTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGGTGAGTGTTACTCTACAGCGACGCCTGCAGCGCTGGCGGCGACACGACCCGCGCCTCTACAGCGACGCCTACGTCGCGCACTGCCTGCCCAAGCTGCTGGCGCCCTACGTCCGCCACCAGCTCATATTATGGAACCCGCTGGCAGACGAGGACAATGAGGACTACGAAAGAATGGACTGGTACAAATGTGTGATGATGTACGGCGTCCGGGCAGAACGGCTTTCTAGTGAATCGGAACAAGAAGAGGAGGAGGACGAGTCGCCCCCCAGCGTGTCGGAGCGAGCGGTGCGAGACGACCCCGACCTCATGCTGGTGCCCGCACTCGTTTCTAGGGTCGTGTTGCCAAAACTTACAGAGATAGTGGAAAACGCATGGGACCCCGTATGCGTGCGGGCGTGCGTCAGACTCCGCCAACTCTTGGTTCGCGCCGCCAACGTGCCCGGCGCCTCCAATGCCTTGCGGAAACTCGCGGCTGCAGCCCGGACGAGGCTGCAGGCCGCCTTGAACGCTGATGTGTTCCTGCCTGCTCTGCCTCCGCAGATAATGGAAGGCGCAGGCGGCGCGTTCTGGCGTCGCTGTCTGGGCTCGGGCGTGCGCTTACTGCGCGCCGCGCTGGCGCTGTCCGCCCCGCCCGCGCTGCTGCGCGCCGACCCCGTCGTGCTCGCGCTCGTCGAGACCCTCAGCTGCGCTGCGGGCGCTGCGCCCGGACCGCAGGTCGCTGCTGCAGCCGCTGCGTTGGCTGCGACACTGCCGCGGAGTGGGGAACTTAGGGCAGCGGCCTTGAAGAGGCTGGCTGCTCTGGCGAAGCTGGCTATGACCAGGCTGCAGAGCGACAATCCTATGCATCTAAAAGCGCTGGAGCAAGCGCGAGCAGTGATAGCAGAAGCCAAGGCGATGGAATGA
- Protein Sequence
- MKPRLRLQEQQQEREKTAARVEEIRSRVLNAARIRESRAARTSALDAAYRRAQAARGYLTDLVECLDEKMPQLEALEARALALHRRRCEFLTERRRADVRDQAHDVLALAARPGAAKPVDSEEKIRRVAEREGRRRARRLQRDAAAANAGTAVRHRDGDSSDDELPPTELQHYKQERESLRSQSAALFADALPAWRGVRGVCARLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVGVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQVSVTLQRRLQRWRRHDPRLYSDAYVAHCLPKLLAPYVRHQLILWNPLADEDNEDYERMDWYKCVMMYGVRAERLSSESEQEEEEDESPPSVSERAVRDDPDLMLVPALVSRVVLPKLTEIVENAWDPVCVRACVRLRQLLVRAANVPGASNALRKLAAAARTRLQAALNADVFLPALPPQIMEGAGGAFWRRCLGSGVRLLRAALALSAPPALLRADPVVLALVETLSCAAGAAPGPQVAAAAAALAATLPRSGELRAAALKRLAALAKLAMTRLQSDNPMHLKALEQARAVIAEAKAME
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -