Basic Information

Gene Symbol
DIP2
Assembly
GCA_947623375.1
Location
OX392498.1:56293962-56314274[-]
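
The Location field packs contig accession, coordinates, and strand into one string. A minimal sketch of splitting it into parts, assuming the layout is contig:start-end[strand] and that coordinates are 1-based and inclusive (the record does not state the coordinate convention). The names `Locus` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], strand is "+" or "-".
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a location string into contig, start, end, and strand."""
    m = _LOCATION_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(m["contig"], int(m["start"]), int(m["end"]), m["strand"])


locus = parse_location("OX392498.1:56293962-56314274[-]")
# Genomic span in bases, valid only under the 1-based inclusive assumption.
span = locus.end - locus.start + 1
```

The span covers the whole locus on the contig, introns included, so it is not expected to match the length of the coding sequence.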

Transcription Factor Domain

TF Family
CBF
Domain
CBF_beta domain
PFAM
PF02312
TF Group
Beta-Scaffold Factors
Description
Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances the DNA-binding ability of the alpha subunit in vitro, and has been shown to have a structure related to the OB fold [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  5   0.0067    1.2e+02   3.2    0.0   84        110     495       521     477       526     0.85
2  5   0.0018    33        5.1    0.0   80        110     550       580     534       586     0.80
3  5   0.0024    44        4.7    0.0   81        110     610       639     593       645     0.81
4  5   0.0014    26        5.4    0.0   80        110     668       698     652       704     0.80
5  5   0.0015    27        5.4    0.0   80        110     727       757     711       762     0.80
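
The per-domain rows above can be read into typed records for sorting or filtering. A minimal sketch, assuming each row has exactly 13 whitespace-separated fields in the order given by the header. `DomainHit` and `parse_hit` are hypothetical names introduced here for illustration, not part of HMMER.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # "#": domain number
    total: int       # "of": number of domains reported
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *(int(x) for x in f[6:12]),
        float(f[12]),
    )


# Rows copied from the Hmmscan Out field of this record.
ROWS = """
1 5 0.0067 1.2e+02 3.2 0.0 84 110 495 521 477 526 0.85
2 5 0.0018 33 5.1 0.0 80 110 550 580 534 586 0.80
3 5 0.0024 44 4.7 0.0 81 110 610 639 593 645 0.81
4 5 0.0014 26 5.4 0.0 80 110 668 698 652 704 0.80
5 5 0.0015 27 5.4 0.0 80 110 727 757 711 762 0.80
"""

hits = [parse_hit(line) for line in ROWS.strip().splitlines()]

# Domain with the lowest independent E-value among the reported hits.
best = min(hits, key=lambda h: h.i_evalue)
```

Selecting by `i_evalue` is one possible ranking. The record itself does not state which threshold, if any, was used to accept these domains.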

Sequence Information

Coding Sequence
ATGATATTGGAGTGCCGAGAAATGTGGTCATGGAAGCGTGCGAGGGCACACGCGCTCCGCTCCCCGCCGGGCAGGACCAACCGACGACTGACGCGCAACGAGAGTCGGTACCACTCCGAGGTCAGACAGGAGGCGGTGCAGCAGGCGCTAGCGGAGATGCAAAACAGGCCCAAGCCGTCTCTGCCCATGCCGTCGAAGAGAACATCCATGATGGCCAAGAGTCCCGACAGAGAGAGACACGACCTCTCATCAAGTTCAGACGAGGACTCCTGCGCCGGTGACACCGACCTGCCGCCACCGCCTGAGCTGGGGTCTCCCCCCGCGCGGCGGCCCGCCGCCCCGCACCCCCGCTCGCACCCGCACCCCTTGCCGCAGCCCCCGCACCTCCGCGAGAGGGAGAGGGAGCGGGAGCGCGAGCGAGAGAAGAAACATGTGAGGGAGAGGACAGAAATTGATCTGTCGGAGATGACACATTTGCCAGCATACATCCAACCAGACGTGACCCACACGTCGGCGGGCGCCCGCCGCGGCGGCGCACTCGTGGCAGACCGCGTGGGGTGCTACCAGTCTGATGACACCGGCACCGGCACTGGACGATGGAAGGTGTCAGCCAAAATCCAGCAGCTTCTCAACACACTGAAGAGACCAAAGCGTCGTCCACTGCCAGAGTTCTACGAGGACGACGACATCGAGCTAGAGATCGCAGCGAACCCCAAGGACCCGAACGCGCCCAAGCCCGAGGGCGGGACCATGACCCCCGCCGTCGGTGAACAGCTGGTGGTCCCAGCTGGATTGCCAAGAAACTTGGAGGCGGCGCTACAGAGATACGGCACGGCGTCGTTCAAAGCGAACGTGGCCACGGTCTTGGACCCCAACGGGAAGTTGAGCAACTCGCTTACATACGGCAAACTGCTAAGCCGCTCCCTCAAAATCGCCTTCGCGTTGCTCAACAAGTCCTTCACATCAAAGAGCAGCAGCGGCGGTCCTTTAACCGGCGACACGTCCATCAAGCCAGGGGACAGAGTAGCCCTAGTGTACCCCAACAACGATCCCATAAACTTCATGTGCGCGTTCTATGGCTGCCTGCAAGCTGGTGTAGTTCCTGTGCCCATAGAAGTGCCTCTGACGAGGAGGGACGCGGGGCTTCAGCAGGTTGGGTTCCTGCTGGGATCCTGTTCGATACAGTACGCGCTGACGTCGGATGCGTGTCTGAAAGGTCTCCCAAAGACGTCCTCGGGAGACGTGGTGTCCTTCCGCGGTTGGCCCTCGCTGCTGTGGGTCTCCACCGAGAAGCTGCCCCGCCCCCCGCGCGACTGGATCCCGCCCCCCCGCCCCGCGGACGACAGCCCAGCGCACATCGAGCACACGTCCGCCGTCGACGGGTCCGCCATGGGCGTCATCGTTACACGCTCGTCAATGCTGTCGCACTGCCGAATGCTGTCGGTGGCGTGCAACTACACGGAGGGCGAGCACATGGTGTGCGTGCTGGATTTCAAGCGCGAGACGGGGCTGTGGCACGCCGTGCTAGCCAGCGTGCTAAACGGCATGCACGTCATCTTCATTCCGTACGCGCTGATGAAGGTCAGCCCGGCCTCGTGGATGCACATGATCACCAAGTACAGGTGGGTCGCTATTGTGTCTAATGCTCGAACACATAGTATGCTGGACTTCAAGCGCGAGACGGGGCTGTGGCACGCCGTGCTAGCCAGAGTGCTAAACGGCATGCACGTCATCTTCATTCCGTACGCGCTGATGAAGGTCAGCCCGGCCTCGTGGATGCACATGATCACCAAGTACAGGTGGGTCGCTATTGTGTCTAATGCTCGAACACATAGTCTGCTGGACTTCAAGCGCGAGACGGGGCTGTGGCACGCCGTGCTAGCCAGCGTGCTAAACGGCATGCACGTCATCTTCATTCCGTACGCGCTGATGAAGGTCAGCCCGGCCTCGTGGATGCACATGATCACCAAGTACAGGTGGGTCGCTATTGTGTCTAATGC
TCGAACACATAGTATGCTGGATTTCAAGCGCGAGACGGGGCTGTGGCACGCCGTGCTAGCCAGCGTGCTAAACGGCATGCACGTCATCTTCATTCCGTACGCGCTGATGAAGGTCAGCCCGGCCTCGTGGATGCACATGATCACCAAGTACAGGTGGGTCGCTATTGTGTCTAATGCTCGAACACATAGTATGCTGGACTTCAAGCGCGAGACGGGGCTGTGGCACGCCGTGCTAGCCAGCGTGCTAAACGGCATGCACGTCATCTTCATTCCGTACGCGCTGATGAAGGTCAGCCCGGCCTCGTGGATGCACATGATCACCAAGTACAGGTGGGTCGCTATTGTGTCTAATGCTCGAACACATAGTATGCTGGACTTCAAGCGCGAGACGGGGCTGTGGTACGCCAGCATGCTGAACGATATGCATGTGATACGCTAA
Protein Sequence
MILECREMWSWKRARAHALRSPPGRTNRRLTRNESRYHSEVRQEAVQQALAEMQNRPKPSLPMPSKRTSMMAKSPDRERHDLSSSSDEDSCAGDTDLPPPPELGSPPARRPAAPHPRSHPHPLPQPPHLREREREREREREKKHVRERTEIDLSEMTHLPAYIQPDVTHTSAGARRGGALVADRVGCYQSDDTGTGTGRWKVSAKIQQLLNTLKRPKRRPLPEFYEDDDIELEIAANPKDPNAPKPEGGTMTPAVGEQLVVPAGLPRNLEAALQRYGTASFKANVATVLDPNGKLSNSLTYGKLLSRSLKIAFALLNKSFTSKSSSGGPLTGDTSIKPGDRVALVYPNNDPINFMCAFYGCLQAGVVPVPIEVPLTRRDAGLQQVGFLLGSCSIQYALTSDACLKGLPKTSSGDVVSFRGWPSLLWVSTEKLPRPPRDWIPPPRPADDSPAHIEHTSAVDGSAMGVIVTRSSMLSHCRMLSVACNYTEGEHMVCVLDFKRETGLWHAVLASVLNGMHVIFIPYALMKVSPASWMHMITKYRWVAIVSNARTHSMLDFKRETGLWHAVLARVLNGMHVIFIPYALMKVSPASWMHMITKYRWVAIVSNARTHSLLDFKRETGLWHAVLASVLNGMHVIFIPYALMKVSPASWMHMITKYRWVAIVSNARTHSMLDFKRETGLWHAVLASVLNGMHVIFIPYALMKVSPASWMHMITKYRWVAIVSNARTHSMLDFKRETGLWHAVLASVLNGMHVIFIPYALMKVSPASWMHMITKYRWVAIVSNARTHSMLDFKRETGLWYASMLNDMHVIR

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-