Hvit016222.1
Basic Information
- Insect
- Horisme vitalbata
- Gene Symbol
- PAXBP1
- Assembly
- GCA_951804965.1
- Location
- OX638096.1:3899571-3932008[-]
Transcription Factor Domain
- TF Family
- GCFC
- Domain
- GCFC domain
- PFAM
- PF07842
- TF Group
- Unclassified Structure
- Description
- This entry describes a domain found in a number of GC-rich sequence DNA-binding factor proteins and homologues [4, 5], as well as in a number of other proteins including Tuftelin-interacting protein 11 [1]. While the function of the domain is unknown, some of the proteins it is found in are reported to be involved in pre-mRNA splicing [1, 2]. This domain is also found in Sip1, a septin interacting protein [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 12 0.0011 18 5.7 0.0 6 41 823 858 819 862 0.92 2 12 0.0011 18 5.7 0.0 6 41 945 980 941 984 0.92 3 12 0.0011 18 5.7 0.0 6 41 1067 1102 1063 1106 0.92 4 12 0.0011 18 5.7 0.0 6 41 1189 1224 1185 1228 0.92 5 12 0.0011 18 5.7 0.0 6 41 1311 1346 1307 1350 0.92 6 12 0.0011 18 5.7 0.0 6 41 1433 1468 1429 1472 0.92 7 12 0.0011 18 5.7 0.0 6 41 1555 1590 1551 1594 0.92 8 12 0.0011 18 5.7 0.0 6 41 1677 1712 1673 1716 0.92 9 12 0.0011 18 5.7 0.0 6 41 1799 1834 1795 1838 0.92 10 12 0.0011 18 5.7 0.0 6 41 1921 1956 1917 1960 0.92 11 12 0.001 17 5.8 0.0 6 41 2043 2078 2039 2082 0.92 12 12 2.5e-16 4.2e-12 47.1 0.1 6 190 2165 2327 2161 2336 0.83
Sequence Information
- Coding Sequence
- ATGTTTCGGAAGCCAAAAAAGATCCAGAGGCGCGTTTTCTGtgctgatgacgatgacgacGGCGAACCGGAGGCCCCACCGCCGCCAATCATCAGCCAGGACTCTCTGGAAAGGAAAGAAAAGAAGGAGCCTAGGCCTGTGAAGAGTGTGGCACTGCTGAGTTTCGCTGATGAGGAGGAAGATTGCGAAGAATTCAAGGTTAAAAAGTCATCGCAGAGCAAGCGGCTGTCCAAACGCAGAGATAAGGAAAGAAAGAGATTGCCCGATGGTGATAGTAACAGATATGACAGTAACGGCCACGAGGAATACTCACAAACAGAGGATGCAACATCGAAGCCGAAGAAGAAAGTGACTCTGGAGGGGTTGATTCTATCCGGGCGGGAGGCTCTAGCGGCGGATGGGGCCGGCGACGTGTCTGACGAGGAGGGGGAGGAGGGGGATGATAGGGGGTTCCACCGGTACCGCGCGGAGTCGGTCAGGGCTGCGCTGGCGGGCGCGCCGGGACACATTCCCGACGCCGCGCTCATCCACGCCGCACGCAAGACCAGACAGCAGGCTCGTGAGCTCGGAGACTTTGTGCCGATACAATCTGAGCCAGCGGCCGGGTCGCGGCTCGTGCGGGAAGGCGATGACTCGGGCGATGACGAGGAGGATCGCATACAAGTGAGGGGCCTCGAAATGCCCAGCGATAAGCCACAGCGTGGCACGGCGGCCATCGACccagaagaagaagacaacAGCGAGGCGGAGGAGTGGGCGGAGCAACAGATGCAGAAGGCCATGCCTGCTATTGTTGATATCACAGGTGAGAACTCGATCGAGTTGAATCCGTTCGCGGTGGCGCCGCCCCCGCCGCGCATGTCGGTAGCGCCACACCTGCGCCCGCTGCCCCCCGCCACATCGGCGCACCAACTGGTGGCAGCACTCACCGAGCGGCTTGAAGAGCTGCAACTGGAGCGTATCAACACAGCGAATAAGAGGCAAGCGCTGCGCGAACAGGTGTTGGAGCTGGCGCGCGTGCGTGAAAGTCGCGCGGCGAGACGCGCCGAGTTAGACGCCGCGTATCGGCGCGCGCAGACGGCCAGAGGGTATCTCACGGATCTCATCGAGTGTCTGGACGAGAAGATGCCGCAGCTGGAAGCGCTGGAGGCGCGAGCGTTGGCGCTGCACCGGCGCAGACACGAGTTCCTGGTGGAGCGACGCCGCGCTGACGTGCGGGACCAGGCGCAGGACGTGCTGGCATTGGCAGGTCGTCCGGGCAGCGTGCGGGCTCCGGACGGGGCCGACAAGGTGCGGCGCACGGCGGAGCGCGAGGGGCGGCGCCGCGCGCGGCGACTGCGCCGGGAGGCAGCCGGGGGCGCGCGCTCGCATCGCGACGGAGACTCCAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGCGCTCACAGAGAGAGGTACCGCGCTGAGGACGGGGCGGAGCGCGAGGGGCGGGGGGGCGCGCGCTCGCATCGCGACGGAGACTCCAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGCGCTCACAGAGAGAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGCGCTCACAGAGAGAGGTACCGCGCTGAGGACGGGGCGGAGCGCGAGGGGCGGGGGGGCGCGCGCTCGCATCGCGACGGAGACTCCAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGCGCTCACAGAGAGAGGTACCGCGCTGAGGACGGGGCGGAGCGCGAGGGGCGGGGGGGCGCGCGCTCGCATCGCGACGGAGACTCCAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGCGCTCACAGAGAGAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGCGCTCACAGAGAGAGGTACCGCGCTGAGGACGGGGCGGAGCGCGAGGGGCGGGGGGGCGCGCGCTCGCATCGCGACGGAGACTCCAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGCGCTCACAGAGAGAGGTACCGCGCTGAGGACGGGGCGGAGCGCGAGGGGCGGGGGGGCGCGCGCTCGCATCGCGACGGAGACTCCAGCGACGACGAGCTGCCGCCGCAGGAGCACCACCACGTGCTCACAGAGAGAGCGAGCATCCTCTCGGCGTCGGAGGCGTTGTTTTCGGACGCGCTGGGCGCGTACGGCAGTGCGCGCGGAGTGTGCGCACGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCTCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCTCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCTCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGGTACGTTCTCGTACGGCAGCGCGCGGCTGGCGACGTGGCGACGCCGCGACCGCGCCGCCTACTGCGACGCACACCTGCCCGCCGCGCTGCCCAAGCTGCTGGCGCCCTACGTCAGGCATCAGCTCATTTTATGGAACCCTCTTGCTGATCAGGACAACGAGGATTACGAGAAAATGGATTGGTACAAATGCCTGATGATGTACGGCGTCCTTAGCGAGGCGTCGGAGTCGTCCTCAGACTCGGAGAGCGAgagcgccgcgccgccgccgctcagCGAGCTCGCCGTCAGAGAAGACCCCGACCTCATGCTGGTACCCACCATCTTGAATAAGGTTGTACTGCCCAAGATTACAGAGGTGGTGGAGCAGGCGTGGGACCCGGTGTGCGTGCGCGCGTGCGTGCGGCTGCGGCAGGTGCTGACGCGCGCCGCCGCCGTGCCGGGCGCCGCGCGCGCCCTGCGCCcgctcgccgccgccgccacgcGCCGCCTGGCCCTGGCGCTGCACGCCGACGTCTTCCTGCCGACCCTCCCGCCCGCTGTGATGGAAGGTCCGGGCGGGTCGTTCTGGCGACGCTGCGTGGGCGGCGGCGTGCGGCTGCTGCGCGGCGCGCTCGCGCTCACCTCGCCGCCCGCCATCTTGAACAACGACGCCAACGTGCTCGCACTCATCGAGTCGCTGAGTACTGGTGCGGGCGCGGCGCCGGGCGCGTGCGTGTcccgcgcggcggcggcgctggCGGACACGCTGCCGCGCGGCGGCGCTCTGCGGCGCGCCGCTCTGCGCCGCCTGGCGCGACTCGCCGCGCTCGCGCTCTCGCGCCTCGACGCAGACAACCCGCTGCATCTgaaagCCATCGAGCAGGCGCGGGCTGTGCTGGCGGAGGCAAATTCAAAGGAGTGA
- Protein Sequence
- MFRKPKKIQRRVFCADDDDDGEPEAPPPPIISQDSLERKEKKEPRPVKSVALLSFADEEEDCEEFKVKKSSQSKRLSKRRDKERKRLPDGDSNRYDSNGHEEYSQTEDATSKPKKKVTLEGLILSGREALAADGAGDVSDEEGEEGDDRGFHRYRAESVRAALAGAPGHIPDAALIHAARKTRQQARELGDFVPIQSEPAAGSRLVREGDDSGDDEEDRIQVRGLEMPSDKPQRGTAAIDPEEEDNSEAEEWAEQQMQKAMPAIVDITGENSIELNPFAVAPPPPRMSVAPHLRPLPPATSAHQLVAALTERLEELQLERINTANKRQALREQVLELARVRESRAARRAELDAAYRRAQTARGYLTDLIECLDEKMPQLEALEARALALHRRRHEFLVERRRADVRDQAQDVLALAGRPGSVRAPDGADKVRRTAEREGRRRARRLRREAAGGARSHRDGDSSDDELPPQEHHHALTERGTALRTGRSARGGGARARIATETPATTSCRRRSTTTRSQRERRRAAAAGAPPRAHRERYRAEDGAEREGRGGARSHRDGDSSDDELPPQEHHHALTERGTALRTGRSARGGGARARIATETPATTSCRRRSTTTRSQRERRRAAAAGAPPRAHRERYRAEDGAEREGRGGARSHRDGDSSDDELPPQEHHHALTERGTALRTGRSARGGGARARIATETPATTSCRRRSTTTCSQRERASSRRRRRCFRTRWARTAVRAECAHGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSSRLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQVRSRTAARGWRRGDAATAPPTATHTCPPRCPSCWRPTSGIRYVLVRQRAAGDVATPRPRRLLRRTPARRAAQAAGALRQASGTFSYGSARLATWRRRDRAAYCDAHLPAALPKLLAPYVRHQLILWNPLADQDNEDYEKMDWYKCLMMYGVLSEASESSSDSESESAAPPPLSELAVREDPDLMLVPTILNKVVLPKITEVVEQAWDPVCVRACVRLRQVLTRAAAVPGAARALRPLAAAATRRLALALHADVFLPTLPPAVMEGPGGSFWRRCVGGGVRLLRGALALTSPPAILNNDANVLALIESLSTGAGAAPGACVSRAAAALADTLPRGGALRRAALRRLARLAALALSRLDADNPLHLKAIEQARAVLAEANSKE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -