Nrev014187.2
Basic Information
- Insect
- Nycteola revayana
- Gene Symbol
- PAXBP1
- Assembly
- GCA_947037095.2
- Location
- OX344845.2:20957571-20974985[-]
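
The location string above packs the sequence accession, coordinates, and strand into one field. A minimal sketch of splitting it apart follows, assuming the layout is `seqid:start-end[strand]` and that coordinates are 1-based and inclusive (the record does not state either convention). The function name `parse_location` is hypothetical.

```python
import re

# Assumed layout: seqid:start-end[strand], e.g. OX344845.2:20957571-20974985[-]
LOCATION_PATTERN = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(location):
    """Split a location string into seqid, start, end, strand and span."""
    match = LOCATION_PATTERN.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"Unrecognised location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "seqid": match.group("seqid"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Span in bp, assuming 1-based inclusive coordinates.
        "span": end - start + 1,
    }

loc = parse_location("OX344845.2:20957571-20974985[-]")
print(loc)
```

Under those assumptions the gene spans 17,415 bp on the minus strand of OX344845.2.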
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  27     0.033     3e+02    1.8   0.0         8      39       198     229       162     234  0.65
 2  27     0.055     5e+02    1.1   0.0         9      40       251     282       245     286  0.87
 3  27     0.055     5e+02    1.1   0.0         9      40       303     334       297     338  0.87
 4  27     0.055     5e+02    1.1   0.0         9      40       355     386       349     390  0.87
 5  27     0.055     5e+02    1.1   0.0         9      40       407     438       401     442  0.87
 6  27     0.055     5e+02    1.1   0.0         9      40       459     490       453     494  0.87
 7  27     0.055     5e+02    1.1   0.0         9      40       511     542       505     546  0.87
 8  27     0.055     5e+02    1.1   0.0         9      40       563     594       557     598  0.87
 9  27     0.055     5e+02    1.1   0.0         9      40       615     646       609     650  0.87
10  27     0.055     5e+02    1.1   0.0         9      40       667     698       661     702  0.87
11  27     0.055     5e+02    1.1   0.0         9      40       719     750       713     754  0.87
12  27     0.055     5e+02    1.1   0.0         9      40       771     802       765     806  0.87
13  27     0.055     5e+02    1.1   0.0         9      40       823     854       817     858  0.87
14  27     0.055     5e+02    1.1   0.0         9      40       875     906       869     910  0.87
15  27     0.055     5e+02    1.1   0.0         9      40       927     958       921     962  0.87
16  27     0.055     5e+02    1.1   0.0         9      40       979    1010       973    1014  0.87
17  27     0.055     5e+02    1.1   0.0         9      40      1031    1062      1025    1066  0.87
18  27     0.055     5e+02    1.1   0.0         9      40      1083    1114      1077    1118  0.87
19  27     0.055     5e+02    1.1   0.0         9      40      1135    1166      1129    1170  0.87
20  27     0.055     5e+02    1.1   0.0         9      40      1187    1218      1181    1222  0.87
21  27     0.055     5e+02    1.1   0.0         9      40      1239    1270      1233    1274  0.87
22  27     0.055     5e+02    1.1   0.0         9      40      1291    1322      1285    1326  0.87
23  27     0.055     5e+02    1.1   0.0         9      40      1343    1374      1337    1378  0.87
24  27     0.055     5e+02    1.1   0.0         9      40      1395    1426      1389    1430  0.87
25  27     0.055     5e+02    1.1   0.0         9      40      1447    1478      1441    1482  0.87
26  27     0.055     5e+02    1.1   0.0         9      40      1499    1530      1493    1534  0.87
27  27   5.1e-11   4.6e-07   30.7   0.0         9     190      1551    1710      1545    1723  0.72
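
Most of the 27 domain hits reported above have i-Evalues far above 1 and scores near zero, so only one of them looks like a meaningful GCFC match. A minimal sketch of filtering such hits follows. The `DomainHit` type, the `significant` helper, and the `1e-3` cutoff are illustrative assumptions, not values taken from this record or from HMMER defaults. Only three of the rows are copied here for brevity.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # domain number from the "#" column
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein

# Three rows copied from the Hmmscan Out table (domains 1, 2 and 27).
hits = [
    DomainHit(1, 3e+02, 1.8, 198, 229),
    DomainHit(2, 5e+02, 1.1, 251, 282),
    DomainHit(27, 4.6e-07, 30.7, 1551, 1710),
]

# Hypothetical cutoff chosen for illustration only.
I_EVALUE_CUTOFF = 1e-3

def significant(domain_hits, cutoff=I_EVALUE_CUTOFF):
    """Keep hits whose independent E-value is at or below the cutoff."""
    return [h for h in domain_hits if h.i_evalue <= cutoff]

kept = significant(hits)
for h in kept:
    aligned_length = h.ali_to - h.ali_from + 1
    print(f"domain {h.index}: i-Evalue={h.i_evalue:g}, "
          f"score={h.score}, aligned residues={aligned_length}")
```

With this cutoff only domain 27 survives, covering residues 1551 to 1710 of the protein.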
Sequence Information
- Coding Sequence
- ATGGAAGAGTTACAAATCGACCGTCGAAAGACAGCAGAAACCAAAGAGGAGCTTCAGACCCGTCTGCTGTCTTCAGCCCGAGTGAGGGAGAGCAGGGGAGCGCGGTGTGGCGAGTTGGACGCTGCGTACAAACGAGCGCAGACCATACGCGGCTTCCTCACTGATCTCATCGAGTGTTTGGATGAAAAGATGCCGCAACTAGAAGCTTTGGAGAGTCGCGCTCTAGCGTTACACAAACGTCGGTGCGAGTTCTTAGTGGAACGACGACGCGCCGACCTCAGGGACCAGGCGCAACATGTACTCTCACTAGGTAAACCTGGTTCGAAACCAGTGGAAAGCGAAGAGAAGACCCGTCGTGCAGCAGAACGCGAGGGTCGGCGACGAGCTCGTCGCCTGGCGCGAGCCGCCGCAGCCGCCGCCGCGGGGGCCGCGCAGCCCTCGCACCGGGACGGAGACTCCAGCGATGATGAGCTGCCTCCCGCTGAACTGCACCACTTTAATAGCGAGAGAGACTCGATCCGTGCGGAGGCGGCGGCGCTGTTCTCGGACGCGCTGCCGGCGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTG
GCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCG
CAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGGTACCTTGTACTAGACATTGTTCTCGGACGTGGCGCAGCGCGTGGGGCGTGTGCGCGCGCGTGGCACGCTGGCGCACGCGCGCCGGCGCGCTGTACAACGACGCTTACGTGCCGCACTACCTGCCCAAGCTGCTGGCCCCCTACGTCAGGCACCAGCTTATTCTATGGAACCCACTCGCTGACGAAGACAACGAGGACTATGAGAAAATGGATTGGTACAAATGTCTGATGATGTACGGCGTCCGAACCGAACATCTTTCTGGCGAGTCGGACGAGTCTGATGACGCGTCTCCCCCGCCCCCCCTCAGCGAGCGGGGGGTGCGGGACGATCCGGACCTCATGTTGGTGCCCACGGTGCTCAGCAAAGTACTGCTACCAAAGGTTACAGAGCTGGTGGAGGTGTCGTGGGACCCGGTGTCGGTGCGCGCGTGCGTGCGCCTGCGCCGCGTGCTGGAGCGCGCGGGGGCGGTGCCGGGGGGCGGGGGGGCGCTGCGCCGCCTGGCCGCCGCCGCGCGCGCGCGCCTGCAGCAGGCGCTGGCTGCTGACGTGTTCCTGCCTGCACTGCCTCCCGCACTGCTGGAAGGTGCGGGTGGTCAATTTTGGCGACGTTGTCTGGGCGCGGGGGTGAGGTTGCTTCGCGCCGCGCTGTCCTTCTCCGCGCCACCCCCGTTCCTCAGGGCGGACCCGCTTGTACTGCAACTTATCGAGACGCTGTGcacgggcgcgggcgcggcgccgggcgcgcACATGTGCTCGGCGGCGCACGCGCTGGCCGAGACGCTCCCGCGCGGGGGCGAGCTGCGCGCGCGCGCCCTCGCGCGCATTGCGGCCCTCGCCACGCTCGCGCTGCAGCGACTCCACACCGACAACCCGCTACACTTGTGA
- Protein Sequence
- MEELQIDRRKTAETKEELQTRLLSSARVRESRGARCGELDAAYKRAQTIRGFLTDLIECLDEKMPQLEALESRALALHKRRCEFLVERRRADLRDQAQHVLSLGKPGSKPVESEEKTRRAAEREGRRRARRLARAAAAAAAGAAQPSHRDGDSSDDELPPAELHHFNSERDSIRAEAAALFSDALPAWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQVPCTRHCSRTWRSAWGVCARVARWRTRAGALYNDAYVPHYLPKLLAPYVRHQLILWNPLADEDNEDYEKMDWYKCLMMYGVRTEHLSGESDESDDASPPPPLSERGVRDDPDLMLVPTVLSKVLLPKVTELVEVSWDPVSVRACVRLRRVLERAGAVPGGGGALRRLAAAARARLQQALAADVFLPALPPALLEGAGGQFWRRCLGAGVRLLRAALSFSAPPPFLRADPLVLQLIETLCTGAGAAPGAHMCSAAHALAETLPRGGELRARALARIAALATLALQRLHTDNPLHL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -