Dpic013938.1
Basic Information
- Insect
- Drosophila picticornis
- Gene Symbol
- UBP1
- Assembly
- GCA_035043845.1
- Location
- JAWNMR010000493.1:17507937-17520218[+]
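The Location string packs contig, coordinates, and strand into one field. A minimal sketch of how it could be split is shown here; the function name `parse_location` and the `span` field are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for strings of the form "<contig>:<start>-<end>[<strand>]".
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into contig, start, end, strand, and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "contig": match["contig"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("JAWNMR010000493.1:17507937-17520218[+]")
print(loc)
```

Under that assumption the locus spans 12,282 bp on the forward strand.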
Transcription Factor Domain
- TF Family
- CP2
- Domain
- CP2 domain
- PFAM
- PF04516
- TF Group
- Beta-Scaffold Factors
- Description
- This family represents a conserved region in the CP2 transcription factor family.
- Hmmscan Out
-
#  of  c-Evalue  i-Evalue  score  bias  hmm coord from  hmm coord to  ali coord from  ali coord to  env coord from  env coord to  acc
1  3   0.25      2.1e+03   -2.3   4.9   92              132           26              67            16              120           0.73
2  3   5.7e-53   4.7e-49   166.5  0.6   17              189           418             587           401             599           0.88
3  3   3.6e-13   3e-09     36.4   0.8   188             221           636             669           626             671           0.89
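The three domain hits above can be read into records and filtered by significance. The following is a rough sketch only: the `DomainHit` type, the `parse_hits` helper, and the i-Evalue cutoff of 1e-3 are illustrative assumptions, not values taken from this record or from the database's pipeline.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


# The three rows reported for PF04516 in this record.
ROWS = """
1 3 0.25 2.1e+03 -2.3 4.9 92 132 26 67 16 120 0.73
2 3 5.7e-53 4.7e-49 166.5 0.6 17 189 418 587 401 599 0.88
3 3 3.6e-13 3e-09 36.4 0.8 188 221 636 669 626 671 0.89
"""


def parse_hits(text):
    """Turn whitespace-separated hit rows into DomainHit records."""
    hits = []
    for line in text.strip().splitlines():
        f = line.split()
        hits.append(
            DomainHit(
                int(f[0]), int(f[1]),
                float(f[2]), float(f[3]), float(f[4]), float(f[5]),
                *map(int, f[6:12]),
                float(f[12]),
            )
        )
    return hits


I_EVALUE_CUTOFF = 1e-3  # assumed threshold, chosen only for illustration

hits = parse_hits(ROWS)
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
regions = [(h.ali_from, h.ali_to) for h in significant]
print(regions)
```

With this cutoff the first hit (i-Evalue 2.1e+03) is dropped, leaving the aligned regions at protein positions 418-587 and 636-669.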
Sequence Information
- Coding Sequence
- ATGGCGCTTTCGTTTTTATCGCAAAGTTCCGGCCTTTTGGATTTAAAAAGCATATTTGATCCACAATATTCattacaacaacatcaacaagaaaatcaacatcagcaacatcaacaacagcaacaacaacagcagcagcaactttatttgccatcacaacaacaacatcaacaacaacaacattatcaAATATCACACATATCAACGAAATTTGATTTGAGCATTTTCAACGATTTCGATCAAATggaattcaataataatttgagtCGAAACCATAATCAGTACcagaataacaataacaataacaataacaataacaataacgaaaATAACAGTAGcatcaacaataataacaaccacaataacgACAACGGTGAACATTTGAATCAGGTCCAAAATCGCCATTTCATGAGCGgctatcatcatcagcataTTGGGTCGGATTATGAGCAAGTGATTAACTTTGTTGACTCACCACCGACCTCTGAGGAATCTTGGACAGACGCACAGTCCAAGGATTCACCAGGACCGCATATTATCGACGTACGGACAATTTACTCAGAGAGTGGTTCACGCAAAAGACGAATGGATTGGGACTCATTGGATATTGGCCAAAGTGAAAATTCGCCGACAACACAATCGGGCGACATACCCAATAAGGTAGCACATCTCGAGAAAGATAAACACAAACGTGAAAAGCATTCAGGTCGCAGCAGCTGGAGCGACGATATTGGCTTCGATCTGAACGCCGATTTTAATAGCAACTCATATTTGAACAATGAAAACTTCTTGGCCTTCTCGCCGAGCTTAACGGCATTAAAACAGGAGCCGCAAACGGAGCAGCTGAAACCGACTACAAAGCTACCCTTGGACAGCAGCTCTGCGGGAGCTGTGCACAATATTGGCAAGATCGACAAGTCGTCATTGGGCGATGCCAATCATTCACCACAGCGCCCGGCCCAATCCCAGGAAGTGGTTGGCGCTGTTGCAGGAACAGTAACAGCTGctggcaacagtggcagcgAAAGCGGTGGCAAGCATGAGTTGAACTCCGCAATTATATGCGGCTGTGGCTCACCACAAAGCTCGCCCGCCTCAACGGAATTTGAACTGAAGGACAGTGCCACCAATAGAAATGCAGCTGCATCTGACAAAACCATTGCTTCATCTGGCAACGAGGCTTTTGCCCAGGCGCCACGCTccgagctgcagcagcaggtgaGCGGTGTTGAAGCGGCCAAAATTGAGCCAAGCTCATCCAGTGGTGCATCACATGCCGAGGATCACAAATTTCAGTACATTCTGGCAGCGGCCACCTCAATTGCGACAAAGAACAATGAGGAGACTTTGACCTATCTCAATCAGGGTCAAAGCTATGAGatcaaattgaagaaaattgGTGATTTATCATTCTATCGCGATAAGATTCTGAAGAGCGTTATCAAAATCTGTTTCCATGAGCGTCGATTGCAGTTCATGGAACGCGAACAGATGCAACAATGGCAAGCATCGCGTCCTGGTGATCGCATCATTGAGGTGGATGTGCCATTGTCGTATGGTTTGTGCCACGTGTCGCAGCCTTTGAGTTCGAGTGCATTAAACACTGTGGAGATCTTTTGGGATCCATTCAAGGAGGTCGGCGTCTACGTCAAGGTCAATTGCATTTCAACAGAGTTCACACCAAAGAAGCACGGTGGTGAAAAGGGTGTTCCGTTTCGTCTACAAATTGAAACTTATATAGAAAACTCACCagcaaatagcagcaacagtaacaccagtagcaacagcagcagtgtcAGCGGTGCCAGTGTCAataacaacgccaacaacaacactagcAACACAactatcaacaacaacaacacattagGTGGCACATCGGACAGCCGTGCCGCACTCAATGGCAAACAGGCGGTGCATGCAGCTGCTTGTCAGATTAAGGTCTTCAAGCTAAAAGGCGCCGATCGCAAGCACAAACAGGATCGAGAAAAAATACAG
AAGCGGCCAACATCTGAGCAGGATAAATTCCAGCCCAGCTACGAATGCACCATAATGAATGATATATCATTGGACCTGATAACGCCAACCACCACAACAGGCTGCTACAGTCCCGAGTATATGAAACTGTGGCCGAATTCGCCGGTGCATGTACCAAAATACGATGGAATGCTTCCATTTGCGAGCAGCGCATCACCAGCGGCCAGCAGCAGTCCCATTGCGATCAATTCAGTGACATCAACAAATTCGCCAACATTGAAGCTAATGGATGCCACAAATATGGTATCGCCGCCACATGTGCCAGCAGATATGGAAGACTATAATCAGAACATAATGCCGGACTCAACACCGGCACAAGTGACACAATGGCTGACCAATCATCGTCTGACGGCCTACTTAGCCACATTTACTCATTTCTCGGGCGCGGACATTATGCGCATGTCCAAGGAGGATCTTATACAAATCTGTGGACTTGCCGATGGCATACGTATGTTCAATATTTTACGCGCCAAAACGATTGCCCCACGTCTGACTCTCTATGCGAGCATGGATGGCTACAGCTACAATGCGATCTACTTGCTCTCCAACACGGCCAAGGAGTTGCAGCAGAAGCTGTACAAGTTGCCGGGCTTTTACGAGTTCATGGCGAAGGGGGGATCTGGTGGCGGCTTGGAGAATGGCGGTGttgcagctgcggctgctgctgctgcggcgctCTACAACAATTTTGGCATGCATTCAAAATACTCCGGCAGCGGCTCGAACATATTCAATGAGGTTAACAAGAGTTGCGTGTACATTTCGGGTCCATCGGGCATACTTGTCAGCGTCACCGACGAGGTGCTCAACAATGAGATCAAGGATGGCAGTCTCTATTCACTGGATGTGCAGGGCGGCAAAGTTGTCTTGAAATTTATCAATAAGCAGGATAACAATTAA
- Protein Sequence
- MALSFLSQSSGLLDLKSIFDPQYSLQQHQQENQHQQHQQQQQQQQQQLYLPSQQQHQQQQHYQISHISTKFDLSIFNDFDQMEFNNNLSRNHNQYQNNNNNNNNNNNNENNSSINNNNNHNNDNGEHLNQVQNRHFMSGYHHQHIGSDYEQVINFVDSPPTSEESWTDAQSKDSPGPHIIDVRTIYSESGSRKRRMDWDSLDIGQSENSPTTQSGDIPNKVAHLEKDKHKREKHSGRSSWSDDIGFDLNADFNSNSYLNNENFLAFSPSLTALKQEPQTEQLKPTTKLPLDSSSAGAVHNIGKIDKSSLGDANHSPQRPAQSQEVVGAVAGTVTAAGNSGSESGGKHELNSAIICGCGSPQSSPASTEFELKDSATNRNAAASDKTIASSGNEAFAQAPRSELQQQVSGVEAAKIEPSSSSGASHAEDHKFQYILAAATSIATKNNEETLTYLNQGQSYEIKLKKIGDLSFYRDKILKSVIKICFHERRLQFMEREQMQQWQASRPGDRIIEVDVPLSYGLCHVSQPLSSSALNTVEIFWDPFKEVGVYVKVNCISTEFTPKKHGGEKGVPFRLQIETYIENSPANSSNSNTSSNSSSVSGASVNNNANNNTSNTTINNNNTLGGTSDSRAALNGKQAVHAAACQIKVFKLKGADRKHKQDREKIQKRPTSEQDKFQPSYECTIMNDISLDLITPTTTTGCYSPEYMKLWPNSPVHVPKYDGMLPFASSASPAASSSPIAINSVTSTNSPTLKLMDATNMVSPPHVPADMEDYNQNIMPDSTPAQVTQWLTNHRLTAYLATFTHFSGADIMRMSKEDLIQICGLADGIRMFNILRAKTIAPRLTLYASMDGYSYNAIYLLSNTAKELQQKLYKLPGFYEFMAKGGSGGGLENGGVAAAAAAAAALYNNFGMHSKYSGSGSNIFNEVNKSCVYISGPSGILVSVTDEVLNNEIKDGSLYSLDVQGGKVVLKFINKQDNN
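The relationship between the coding sequence and the protein sequence can be spot-checked by translating codons. The sketch below assumes the standard nuclear genetic code and ignores letter case; the `translate` helper is hypothetical, and only the first and last few codons of the coding sequence are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()  # case is ignored; the record mixes upper and lower case
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the coding sequence above.
cds_start = "ATGGCGCTTTCGTTTTTATCG"
cds_end = "GATAACAATTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MALSFLS` and `DNN`, matching the beginning and end of the protein sequence listed above.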
Similar Transcription Factors
Clusters of similar sequences, computed with MMseqs2
- 100% Identity
- iTF_00497147;
- 90% Identity
- iTF_00597121; iTF_00595714; iTF_00496355; iTF_00500068; iTF_00535076; iTF_00582724; iTF_00552070; iTF_00521927; iTF_00524884; iTF_00498632; iTF_00481994; iTF_00566552; iTF_00559993; iTF_00527808; iTF_00609643; iTF_00543505; iTF_00619604; iTF_00548566; iTF_00521112; iTF_00592856; iTF_00494873; iTF_00497147; iTF_00511035; iTF_00501568; iTF_00513222; iTF_00570161; iTF_00556978; iTF_00485624; iTF_00499352; iTF_00542202; iTF_00497913; iTF_00576828; iTF_00518207; iTF_00552799; iTF_00564416; iTF_00615963;
- 80% Identity
- -