Hcon020557.1
Basic Information
- Insect
- Hirtodrosophila confusa
- Gene Symbol
- Ubp1
- Assembly
- GCA_035043065.1
- Location
- JAWNNI010000242.1:3320355-3334306[-]
Transcription Factor Domain
- TF Family
- CP2
- Domain
- CP2 domain
- PFAM
- PF04516
- TF Group
- Beta-Scaffold Factors
- Description
- This family represents a conserved region in the CP2 transcription factor family.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 2 3.2e-53 2.6e-49 167.9 0.7 20 190 410 577 392 585 0.90 2 2 4.2e-13 3.5e-09 36.8 1.0 188 222 622 656 616 657 0.92
Sequence Information
- Coding Sequence
- ATGGCGCTTTCGTTTCTATCGCAAAATTCCGGCCTGTTGGATTTACAAAGCATATTTGATCCACAATATTCAttgcaacaacatcaacaacaagaacaacatttaccatcacaacaacaacaacaacaagaagaacaactaCAAATATCACGCACATCAACGAAATTTCATTTGAGCATTTTCAACGATTTCGACCAAATGGAATTCAATAACAACTTGAGTCGAAACCATAATCAATATcagaataataacaatattaataataatatcagccacaataccaacaacaataacaacaacattcatACGCACCAGACCAACGGTGAAAATCTTAATCAGATCCAAAATCGTAATTTCATCAGCGGctatcatcatcagcatatTGGATCGGATTATGAGCAAGTGATTAACTTTGTTGACTCACCACCGAACTCAGAGGAATCTTGGACAGACGCACAGTCGAAGGATTCACCAGGACCGCAGATAATCGACGTAAGGACAATTTACTCAAACAGTGGCTCACGCAAAAGACGAATGGATTGGGACTCATTGGATATTGGTCAAAGTGAAAATTCGCCAACAACACAAACTGGCGACCTACCCAATAAGGTGGTACATCAGGAGAAGGATAAACACAAGCGTGAAAAGCACTCAGGTCGTAGCAGCTGGAGCGACGATATAGGCTTTGATCTGAACGCAGAGTTTAATAGCAACTCGTATTTGAACAATGAAAACTTTCTATCGTTCTCCCCGAGCCTGACGGCACTGAAACAGGAGCCGCAAACCGAGCAGCTCAAGCCGAATACAAAGCTACCCTTGGatagtggcagcaacaacaccaacaactcCGCCGGCGTGGTCAACATTGGCAAGATAGACAAGTCGCCATTGGGCGACGCCAATCACTCGCCACAGCACGCCGGGCAGGCGCAGGACGGGGCAGGAGGCGCTACTGGAGCGGGAGGCACAGGAGCAGCGGCTGGcggcaacatcagcagcagcagcaaacatgAGTTGAACACTGGCATCATTTGTGGCTGTGGCTCGCCGCAAGGCTCGCCTGCCGCAACGGAATTTGAACTAAATGGCAATGCCAATGGAAATGCAGCATCCGCAGAGAAAACCagagctgcagctggcaatGAGACCTATGCACAGGCGCCACGCTCTggactgcagcagcagctgagcatTGTCGAAGCGGCCAAAATAGAGCCCAACTCGTCTGGCAGCGCGGCACACGCAGAGGATCACAAGTTTCAGTACATTTTGGCAGCGGCCACCTCAATTGCAACAAAGAACAATGAGGAGACATTGACCTATCTGAATCAGGGTCAAAGCTATGAGAtcaaattgaagaaaattgGTGATTTATCTTTCTATCGTGATAAGATTTTGAAGAGCGTTATTAAAATCTGTTTCCATGAGCGTCGATTGCAGTTTATGGAACGCGAACAGATGCAACAATGGCAAGCATCGCGTCCTGGAGATCGCATCATTGAAGTGGATGTGCCGCTATCTTATGGATTGTGCCATGTGTCGCAGCCATTGAGTTCAAATTCATTGAACactgttgaaatattttgggatCCATTGAAGGAGGTCGGTGTTTACATCAAGGTCAATTGCATTTCAACTGAATTTACACCAAAGAAGCACGGTGGAGAAAAGGGTGTACCATTTCGACTACAAATTGAAAcgtatattgaaaataataatacaaatagcagcaacagcagctcaagcagcagcagcagcagcaacagtagcaGCGGGGGCACATCACCCGACACACCCGAGAATCGTACGTCCAATGGCAACGCCAGCAGCGGCTCCATCGCCGGACTGGCCGCACTCAATGGCAAACAGGCGGTGCATGCAGCTGCCTGTCAGATTAAGGTTTTCAAGCTAAAAGGCGCTGATCGCAAGCATAAACAAGATCgtgaaaaaatacaaaagcgTCCACAATCTGAGCAGGATAAGTTTCAGCCCAGCTACGAGTGCACCATTATGAATGACATATCATTGGATTTGATAATGCCAGCGACAACTACAGGCTGTTACAGCCCCGAATATATGAAATTGTGGCCAAATTCGCCGGTGCATATACCAAAATATGATGGGATGCTACCATTTGCCAGCAGCGCATCACCAGCGACCAGCAGCAGCCCCATTGCGATCAATTCAGTGACATCAACAAATTCGCCAACATTGAAACTAATGGATGCCACAAATATGGTCTCGCCGCAGCATGTGCCAGCGGATATGGATGATTATAATCAGAACATAATGCCGGAATCAACGCCCGCACAAGTGACACAATGGCTGACCAATCATCGTCTGACGGCCTACCTCACCACTTTTGCCCATTTCTCGGGCTCGGATATTATGCGCATGTCAAAGGAGGATTTAATACAAATCTGTGGACTTGCCGATGGCATACgtatgtttaatattttgcgCGCCAAAACTATTACGCCACGTTTGACACTCTATGCCAGCATGGATGGCTGCAGCTACAATGCCATCTATTTGCTGTCCAACACTGCCAAGGAGTTGCAGCAGAAGCTCTACAAGCTGCCCGGTTTCTATGAGTTCATGGCTAAGGGCGGCTCTGCTGGCGCGTTAGAGAATGGAGGCGTGGCAGCCGCTGCAGCCGCGGCGGCCGCAGCGCTTTACAATAATTGGGGCATGCATTCAAAGTACTCCGGCAGTGGCTCCAACATCTTCAACGAGGTTAACAAGAGTTGCGTGTACATTTCGGGGCCCTCGGGCGTTCATGTCAGCGTCACCGATGAGGTGCTCAACAATGAGATCAAGGATGGCAGCCTCTATGCGCTGGATGTGCAGGGCGGCAAAgttgttttgaaattgataAACAAGCAGGATAACAACTGA
- Protein Sequence
- MALSFLSQNSGLLDLQSIFDPQYSLQQHQQQEQHLPSQQQQQQEEQLQISRTSTKFHLSIFNDFDQMEFNNNLSRNHNQYQNNNNINNNISHNTNNNNNNIHTHQTNGENLNQIQNRNFISGYHHQHIGSDYEQVINFVDSPPNSEESWTDAQSKDSPGPQIIDVRTIYSNSGSRKRRMDWDSLDIGQSENSPTTQTGDLPNKVVHQEKDKHKREKHSGRSSWSDDIGFDLNAEFNSNSYLNNENFLSFSPSLTALKQEPQTEQLKPNTKLPLDSGSNNTNNSAGVVNIGKIDKSPLGDANHSPQHAGQAQDGAGGATGAGGTGAAAGGNISSSSKHELNTGIICGCGSPQGSPAATEFELNGNANGNAASAEKTRAAAGNETYAQAPRSGLQQQLSIVEAAKIEPNSSGSAAHAEDHKFQYILAAATSIATKNNEETLTYLNQGQSYEIKLKKIGDLSFYRDKILKSVIKICFHERRLQFMEREQMQQWQASRPGDRIIEVDVPLSYGLCHVSQPLSSNSLNTVEIFWDPLKEVGVYIKVNCISTEFTPKKHGGEKGVPFRLQIETYIENNNTNSSNSSSSSSSSSNSSSGGTSPDTPENRTSNGNASSGSIAGLAALNGKQAVHAAACQIKVFKLKGADRKHKQDREKIQKRPQSEQDKFQPSYECTIMNDISLDLIMPATTTGCYSPEYMKLWPNSPVHIPKYDGMLPFASSASPATSSSPIAINSVTSTNSPTLKLMDATNMVSPQHVPADMDDYNQNIMPESTPAQVTQWLTNHRLTAYLTTFAHFSGSDIMRMSKEDLIQICGLADGIRMFNILRAKTITPRLTLYASMDGCSYNAIYLLSNTAKELQQKLYKLPGFYEFMAKGGSAGALENGGVAAAAAAAAAALYNNWGMHSKYSGSGSNIFNEVNKSCVYISGPSGVHVSVTDEVLNNEIKDGSLYALDVQGGKVVLKLINKQDNN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00519681; iTF_00562945; iTF_00595009; iTF_00901250; iTF_00476387; iTF_00559255; iTF_00557667; iTF_00608203; iTF_00560803; iTF_00589315; iTF_00802595; iTF_00567301; iTF_00474982; iTF_00526342; iTF_00532134; iTF_00831910; iTF_00610410; iTF_00518923; iTF_00537996; iTF_00550700; iTF_00584940; iTF_00582029; iTF_01326928; iTF_00528561; iTF_00530033; iTF_00583441; iTF_00599345; iTF_00804945; iTF_00553522; iTF_00562233; iTF_00472126; iTF_00804146; iTF_00490547; iTF_00568019; iTF_00509582; iTF_00547124; iTF_01555887; iTF_00516018; iTF_00601525; iTF_00508086; iTF_01570006; iTF_00805731; iTF_00537271; iTF_00492730; iTF_00615963; iTF_01320955; iTF_01327654; iTF_00506621; iTF_00575337; iTF_00530717; iTF_00616672; iTF_00597917; iTF_00588628; iTF_00522718; iTF_00585726; iTF_01549342; iTF_00613951; iTF_00520391; iTF_00508864;
- 90% Identity
- iTF_00804945;
- 80% Identity
- -