Basic Information

Gene Symbol
Ubp1_1
Assembly
GCA_018904025.1
Location
JAEIFQ010000344.1:11075758-11085726[-]

Transcription Factor Domain

TF Family
CP2
Domain
CP2 domain
PFAM
PF04516
TF Group
Beta-Scaffold Factors
Description
This family represents a conserved region in the CP2 transcription factor family.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 3 1.2 1e+04 -4.6 2.1 9 9 77 77 26 118 0.55
2 3 2.7e-33 2.2e-29 102.2 0.1 25 150 422 542 401 543 0.92
3 3 1e-12 8.3e-09 34.9 2.4 188 222 540 574 539 575 0.93

Sequence Information

Coding Sequence
ATGGCGCTTTCGTTTCTATCACAAAATTCCGGCCTCTTGGATTTACATAGCATATTTGATCCACAATATTCATTGCAACAACATCAACAACAACAACAATTACATTTACCATCACAACAACAACAACAATTGCAACAAACATCACGCACGTTAACGAAATTCGATATAAACATTTTTAACGATTTCGACCAAATGGAATTCAACAACAATTTGAGTCGAAACCACAATCAATATCAGAATAATAACAACAATAACACTATCAATAATAATAATAATAGTAACCACAATACCAACAACAACAACACCACCAACAACAACAACAACATACATACGCAGCAAAACAACGGTGAAAATCTTAATCAGATCCAAAATCGTCACTTTATCAGCGGCTATCATCATCAGCATATTGGATCGGATTATGAGCAAGTGATTAACTTTGTTGACTCACCACCTAATTCAGAGGAATCTTGGACAGACGCACAATCAAAGGATTCGCCCGGACCTCAGATAATCGACGTACGGACAATTTACTCCAACAGTGGTTCACGCAAAAGACGAATGGATTGGGACTCATTGGATATCGGTCAAAGTGAAAATTCGCCGACAACACAAGCTGGCGACTTACCCAATAAGGTGGCACATCAGCAGGAGAAGGATAAACACAAGCGTGAAAAGCATTCAGGTCGCAGCAGCTGGAGCGACGATATAGGCTTCGATCTGAACGCTGAGTTTAATAGCAACTCATATTTGAACAATGAAAACTTTCTATCGTTCTCCCCGAGCCTGACGGCACTGAAACAGGAGCCGCAGACAGATCAGCTCAAGCCGAATCCAAAGCTACCGCTGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCGCGATTGTGAACATTGGCAAGATTGACAAATCACCACTGGGTGAGGCCAATCATTCACCACAACGTGCCGGACAGCAGGAGTCAGGATCGGCTGGAGCAACAGGTGGCAATAGTGGCAAACATGAGTTGAACTCCGGCAATATATGCGGCTGTGGCTCGCCACAAGGCTCTCCAGCCGCAACGGACTTTGAACTGAACAACGGCAATGCCAATGGAAATGCTGCAGCAGGCGGCGACAAGAGCAGAGCTGCAGCAGCCAATGAGCCCTTTGCACAGGCGGCACGCTCTGGACTGCAGCAGCAGCTGAGCGTTGTCGAGGCGGCCAAAATAGAGCCCAGCTCCTTGGGCGGTGCAGCCCATGTGGAGGATCACAAATTTCAATACATTTTGGCAGCAGCCACCTCAATTGCAACGAAGAACAATGAGGAGACTCTGACCTATCTGAATCAGGGTCAAAGTTATGAGATCAAATTGAAGAAAATTGGCGATTTATCTTTCTATCGTGATAAGATTTTGAAGAGTGTTATCAAAATCTGTTTCCATGAGCGTCGATTGCAGTTCATGGAACGCGAACAGATGCAACAATGGCAAGCTTCGCGTCCTGGCGATCGCATCATTGAGGTAGATGTGCCACTCTCCTATGGGCTGTGCCATGTGTCGCAGCCATTGAACTCGAATGCATTGAACACTGTCGAGATATTCTGGGATCCATTGAAGGAGGCGGTGCATGCAGCTGCCTGTCAGATTAAGGTATTCAAGCTGAAAGGCGCCGATCGTAAGCACAAGCAGGATCGCGAGAAGATTCAAAAGCGTCCACAGTCCGAGCAGGAGAAGTTCCAGCCCAGCTACGAATGCACCATCATGAATGATATATCATTGGATCTGATAATGCCAGCCACCACCAGCACTGGCTGCTACAGTCCCGAGTATATGAAACTGTGGCCCAATTCGCCGGTTCATATACCAAAATATGATGGGATGCTACCGTTCGCAAGCAGCGCATCTCCGGCGACAAGCAGCAGCCCCATTGCGATCAATTCAGTGACATCAACAAATTCGCCAACATTGAAACTAATGGATGCCACGAATATGGTATCGCCGCAGCATGTGCCAGCGGATATGGATGATTATAATCAGAACATAATGCCGGAATCAACGCCCGCACAAGTGACACAATGGCTGACCAATCATCGTCTGACGGCCTACCTCAACACGTTTGCCCATTTCTCGGGAGCCGATATTATGCGCATGTCGAAGGAGGATCTTATACAGATCTGTGGTCTTGCCGATGGCATTCGCATGTTTAATATTTTGCGCGCCAAAACAATTGCGCCGCGTTTGACACTCTACGCCAGCATGGACGGCTGCAGCTTTAATGCTATCTATTTGTTGTCCAATACGGCCAAGGAACTACAGCAGAAGATCTACAAGTTGCCTGGTTTCTATGAGTTCATGGCTAAGGGGGGCGCCACGGGTGTTTTGGAGAATGGCAGCGTATCTGCGGCAGCAGCAGCAGCAGCGGCAGCCGCTGCACTCTACAATAATTGGGGCATGCACTCAAAGTACTCGGGCAGCGGCTCGAACATCTTTAACGATGTGACCAACAAGAGTTCTGTGTACATGTCGGGACCATCTGGTGTGCATGTCAGTGTCTCCGACGAGGTGCTCAACAACGAGATCAAGGACGGCAGCCTCTATGCTCTGGATGTGCAGAGTGGCAAAATTATATTGAAATTGATCAATAAGCAGGATAACAATTGA
Protein Sequence
MALSFLSQNSGLLDLHSIFDPQYSLQQHQQQQQLHLPSQQQQQLQQTSRTLTKFDINIFNDFDQMEFNNNLSRNHNQYQNNNNNNTINNNNNSNHNTNNNNTTNNNNNIHTQQNNGENLNQIQNRHFISGYHHQHIGSDYEQVINFVDSPPNSEESWTDAQSKDSPGPQIIDVRTIYSNSGSRKRRMDWDSLDIGQSENSPTTQAGDLPNKVAHQQEKDKHKREKHSGRSSWSDDIGFDLNAEFNSNSYLNNENFLSFSPSLTALKQEPQTDQLKPNPKLPLXXXXXXXXXXXXXXXXXXXAIVNIGKIDKSPLGEANHSPQRAGQQESGSAGATGGNSGKHELNSGNICGCGSPQGSPAATDFELNNGNANGNAAAGGDKSRAAAANEPFAQAARSGLQQQLSVVEAAKIEPSSLGGAAHVEDHKFQYILAAATSIATKNNEETLTYLNQGQSYEIKLKKIGDLSFYRDKILKSVIKICFHERRLQFMEREQMQQWQASRPGDRIIEVDVPLSYGLCHVSQPLNSNALNTVEIFWDPLKEAVHAAACQIKVFKLKGADRKHKQDREKIQKRPQSEQEKFQPSYECTIMNDISLDLIMPATTSTGCYSPEYMKLWPNSPVHIPKYDGMLPFASSASPATSSSPIAINSVTSTNSPTLKLMDATNMVSPQHVPADMDDYNQNIMPESTPAQVTQWLTNHRLTAYLNTFAHFSGADIMRMSKEDLIQICGLADGIRMFNILRAKTIAPRLTLYASMDGCSFNAIYLLSNTAKELQQKIYKLPGFYEFMAKGGATGVLENGSVSAAAAAAAAAAALYNNWGMHSKYSGSGSNIFNDVTNKSSVYMSGPSGVHVSVSDEVLNNEIKDGSLYALDVQSGKIILKLINKQDNN

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2