Basic Information

Gene Symbol
pol
Assembly
GCA_933228835.1
Location
CAKOGE010000001.1:2128572-2141120[-]
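
The location string packs the contig accession, the coordinate range, and the strand into one token. The sketch below shows one way to split it apart. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Pattern for strings such as "CAKOGE010000001.1:2128572-2141120[-]".
LOCATION_RE = re.compile(
    r"(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(location):
    """Split a 'contig:start-end[strand]' string into its parts (hypothetical helper)."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    return {
        "contig": match.group("contig"),
        "start": start,
        "end": end,
        "strand": match.group("strand"),
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }

loc = parse_location("CAKOGE010000001.1:2128572-2141120[-]")
print(loc["contig"], loc["strand"], loc["span"])
```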

Transcription Factor Domain

TF Family
DACH
Domain
DACH domain
PFAM
AnimalTFDB
TF Group
Unclassified Structure
Description
This family of proteins includes transcription factors involved in the regulation of organogenesis. Members appear to regulate SIX1, SIX6, and possibly SIX5, influencing myogenesis and precursor cell proliferation in myoblasts. They act as corepressors or coactivators in these processes, depending on their interactions with other proteins such as EYA3, CREBBP, NCOR1, TBL1, HDAC1, and HDAC3. They are also implicated in the repression of cyclin-dependent kinase inhibitor genes, including at the p27Kip1 promoter, which is key in cell cycle regulation. Some family members inhibit TGF-beta signaling through interactions with SMAD4. The family is characterized by a conserved DNA-binding domain, known as the DACHbox-N or DD1 domain, which is structurally similar to the forkhead/winged-helix domain.
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   2   3.3e-13   3.7e-09   36.7   2.9   103       159     915       972     908       984     0.84
2   2   0.25      2.8e+03   -2.0   0.1   91        133     1033      1047    1028      1060    0.56
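
To make the two domain rows above easier to compare, here is a minimal sketch that parses them into named fields and keeps only hits under an independent E-value cutoff. The field names, the `parse_hit` helper, and the 0.01 cutoff are illustrative assumptions, not settings taken from this record.

```python
# Column names for one hmmscan per-domain row, in the order shown in the table.
FIELDS = [
    "idx", "of", "c_evalue", "i_evalue", "score", "bias",
    "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to", "acc",
]
INT_FIELDS = {"idx", "of", "hmm_from", "hmm_to", "ali_from", "ali_to", "env_from", "env_to"}

def parse_hit(row):
    """Turn one whitespace-separated domain row into a dict (hypothetical helper)."""
    values = row.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} columns, got {len(values)}")
    return {
        name: int(raw) if name in INT_FIELDS else float(raw)
        for name, raw in zip(FIELDS, values)
    }

rows = [
    "1 2 3.3e-13 3.7e-09 36.7 2.9 103 159 915 972 908 984 0.84",
    "2 2 0.25 2.8e+03 -2.0 0.1 91 133 1033 1047 1028 1060 0.56",
]
hits = [parse_hit(row) for row in rows]

# Assumed cutoff for illustration only; choose a threshold suited to the analysis.
MAX_I_EVALUE = 0.01
significant = [hit for hit in hits if hit["i_evalue"] <= MAX_I_EVALUE]

for hit in significant:
    # Length of the aligned region on the protein, inclusive of both ends.
    ali_len = hit["ali_to"] - hit["ali_from"] + 1
    print(hit["idx"], hit["ali_from"], hit["ali_to"], ali_len)
```

Under this assumed cutoff only the first domain is retained. The second row has a negative bit score and an independent E-value in the thousands.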

Sequence Information

Coding Sequence
ATGACGAAAATCTGTTTCTGCGGCGAATCAAAAAGATTCTCCCTGCCATATTTATGTGCACACCAAGTAGATTTTCTCTTAGTGCAAGAGACTTTCTTAAAACCTAGCAGACGCGATCCGAAAATAGCTAACTTTAATATGATTAGAAACGATAGAATCACCGCGAAAGGCGGCACTGCTATCTACTACAGAAGGTCTCTCCACTGTACTCCTCTCGACCCTCCCGCCCTTACGAATATGGAAGTCACTATTTGTCAGGTGAGTATGACCGGCCATCCGACGGTCATACTAGCATCAGTCTACCTACCTCCAACCAAAGATCTACTCGAGAGTGATCTTAAGGCACTCCTCTCCCTAGGAGACTCCGTCATTCTGGCAGGAGATTTTAACGCCAAGCACCAAAGCTGGAATTGTCACTCCCCTAACCAAAGAGGACGCCAACTAGAGAGACTGACCAACCGCCCAGATCTCAATTTTATAGTCGTAGCCCCTGACACACCGACTCGTTACCCTATGTCGGACTCGACCACAGACAGACCGGACATACTCGACATAGCGCTCTTGAAGAACATCGCCTTACAGGTGTGTCCCTTGAGAGTGCTGTCCGAGCTATGTTCAGACCACCGCCCAGTCCTGATCCAGCTAGGACCTCCGCCCACTACTAACCCTCCGACTAAAACAATCGTAGACTTCAAGAAGCTTTCCGAGCATCTTAAGTCACTCGATTCAGTGCACATAGCTAAAATTCCTGACAAAATAGAGACTCTAGATCAAGCACATGCAGCTTCACTGCATCTCACTGATCACCTCCGCGACGCTCTCAGTGTTTGTTCTCGGGAAATGCCAAACGGATTTATCCGTCAACAGCTCCCAGACGACGTGAGAGAACTCATTTCGATTAAAAATGCAGCTCGTAGGGCCCATGACGCTTGTCCGACCGCTGACAACCGGAGGGAACTATGGCGAGTCCAGAGGGAGGTGAAGGCTCGCATAGCAGAACTCCGGGACGAACAATGGGACAGGAAGCTGAGCGAGCTCTCGCCATCTCACGTCGCCTACTACAGACTGGCGCGAGCTCTCAAAGCCGACCCTGTCTCGGCCACCCCCCCGCTCATACGTCCCAACCAACCACCCGCCTTTGAGGATATCGATAAGGCCGAATGCTTAGCCGACAGCTTGGAAGCTCAATGCTCCCCAAGCACCACATCAATTGATAGGTCCCACGTAGAGCTAGTCGATAAAGGCTTAGCTTCCCTCCTCTCTTCTGCCCCTGGAGGCGACCCAATCACCCCCACGACCTGTGGCGAAGTTCGTCAGATCATCAAGGACCTACACGCTAAGAAAGCTCCCGGTCCCGACTCGATTAACAATAAAACTCTTAAGTTACTCCCCGCTCAAATAATAAACCTCCTGGTTGTTATTTTCAACACCCTTATGATGGGATGCTCCTTCCCCGACCAGTGGAAGGAAGCAACAGTTATAGGTATCCCGAAGCCGGGCAAACCTAGAAACCTCCCTACTAGTTATCGTCCCATTAGTTTGCTCAACACCCTTGGCAAGGTGTATGAGAAAGTAATACTCAATCGCCTTAGGGCCGCCGTTGAGGAGAAGAAACTCCTCAACGACGAACAGTTTGGCTTTAGAGCCAAACACTCTTGTATTCACCAAGCGCACCGCCTCACGGAGCACATATATAGTAATTTCAATCGCTTTCAAAGGAAAGGCATCCCTACAGGAGCCCTCTTTTTCGACGTAGCGAAAGCCTTCGACAAGGTATGGCACGCCGGCTTGTTATACAAGTTACACCACCTAGGCGTGCCAGAGAGGCTCGTACGCCTGCTACGAGACTATCTCACAAACCGTACTTTCCGCTATAGGCTAGACGGGACCTTGTCCTCCCCCAGACCTATCAGAGCAGGAGTCCCTCAGGGCTCGGTTCTCTCCCCCCTTCTGTACGCGCTGTACACAAGCGACATACCCAAGTCCCCCAATGTTAGTAT
AGCCCAGTTTGCGGACGACACCGCTCTCTATACATCCGATAAGAACCCAATCGTAGTCAGAGCCCGACTCCAGAAAGCGGTAATCAGCCTAGGCCGTTGGTTCCGCCAATGGAGGATAGAGGTCAATCCGGATAAAAGCGCAGCAGTGCTTTTTTCTAGGAAAGGCCAAAAATCCATGCCCAAGCATAAACGCGACATGCTCGTGATCACCCTCTATGGGCGTCATATTCCGTGGCAAGATAAAGCCAAATACTTAGGTGTAACCTTCGATAGAACGATGAGCTTCGCCGCACACATACGCAGAGTCCGTAACAAGGCTCGGTACGTGCTCGGTCGTCTCTACACGATGATCTGCGCTAAGAGCAAGCTGTCCCTCAGACACAAGGTCACATTATATAAGACATGCATTCGTCCAATCATGTCTTACGCGTGTGTAGTCTTTGCACACCTCCCCCCCTCCGCTTATAACAGCCTCCAGGTAGTCCAAAACAAGTTTATGCGCATGGCCACGGACGCTCCGTGGTTCATGCGTAATGTCGACCTCCATAGGGACCTCCAACTCCCCACAATAGCTCAACATTTTAAGCAGCTTTCCAAATCCTATTTTGAGAAAGCTGCCAAACACCCCAACCCACTAGTGGTTGAGGCCTCAAACTACTTAACCGATCGTAACGACCCTCCAGACAAAAGGCGCCCTAAGCACGTCCTTAACGACCCTGACGACCAGATTACGACAGACAACGCACCCTATCAGAAGAGATTAAAGAAAGAAAGGAAACAGCGGCAACAGATTCAAGAACAACTGGATTTGGAGCTAAAACGACGGCAGAAGATAGAAGAGGCACTAAAGCAGTCAGGTGCGCCCGGTGAAATTCTCAGAATAGTAACTgAGAATTTAACACCACCGTCCCAAGAAAATCGCGAACGTGAGAATGGTACGGAGAGCAAACCCCCAAGCACTGAACCCCCCACCTCGTCGCCGCCATTCCAGCGGGACCCGCCACGCACGCCCGACAAGCCGCAGTGGAACTACCCTCCGCCACCTGTCGACATAATGAGTGGAGGAGCTGCTTTTTGGCAGAACTACTCTGAATCCCTGGCGCAAGAGTTGGAGATGGAACGCAAGTCCCGCCAGCAAGCCATGGAGCGTGACGTGAAGAGCCCGCTGTCGGACCGCGCCAGCTACTACAAGAACTCGGTGCTGTTCAGTTCGGCCACTTAG
Protein Sequence
MTKICFCGESKRFSLPYLCAHQVDFLLVQETFLKPSRRDPKIANFNMIRNDRITAKGGTAIYYRRSLHCTPLDPPALTNMEVTICQVSMTGHPTVILASVYLPPTKDLLESDLKALLSLGDSVILAGDFNAKHQSWNCHSPNQRGRQLERLTNRPDLNFIVVAPDTPTRYPMSDSTTDRPDILDIALLKNIALQVCPLRVLSELCSDHRPVLIQLGPPPTTNPPTKTIVDFKKLSEHLKSLDSVHIAKIPDKIETLDQAHAASLHLTDHLRDALSVCSREMPNGFIRQQLPDDVRELISIKNAARRAHDACPTADNRRELWRVQREVKARIAELRDEQWDRKLSELSPSHVAYYRLARALKADPVSATPPLIRPNQPPAFEDIDKAECLADSLEAQCSPSTTSIDRSHVELVDKGLASLLSSAPGGDPITPTTCGEVRQIIKDLHAKKAPGPDSINNKTLKLLPAQIINLLVVIFNTLMMGCSFPDQWKEATVIGIPKPGKPRNLPTSYRPISLLNTLGKVYEKVILNRLRAAVEEKKLLNDEQFGFRAKHSCIHQAHRLTEHIYSNFNRFQRKGIPTGALFFDVAKAFDKVWHAGLLYKLHHLGVPERLVRLLRDYLTNRTFRYRLDGTLSSPRPIRAGVPQGSVLSPLLYALYTSDIPKSPNVSIAQFADDTALYTSDKNPIVVRARLQKAVISLGRWFRQWRIEVNPDKSAAVLFSRKGQKSMPKHKRDMLVITLYGRHIPWQDKAKYLGVTFDRTMSFAAHIRRVRNKARYVLGRLYTMICAKSKLSLRHKVTLYKTCIRPIMSYACVVFAHLPPSAYNSLQVVQNKFMRMATDAPWFMRNVDLHRDLQLPTIAQHFKQLSKSYFEKAAKHPNPLVVEASNYLTDRNDPPDKRRPKHVLNDPDDQITTDNAPYQKRLKKERKQRQQIQEQLDLELKRRQKIEEALKQSGAPGEILRIVTENLTPPSQENRERENGTESKPPSTEPPTSSPPFQRDPPRTPDKPQWNYPPPPVDIMSGGAAFWQNYSESLAQELEMERKSRQQAMERDVKSPLSDRASYYKNSVLFSSAT
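
The protein sequence should follow from the coding sequence by codon-wise translation. The sketch below illustrates that check on the first and last codons of the record, assuming the standard genetic code (the record does not say which translation table was used). The `translate` helper is hypothetical. It strips whitespace and upper-cases the input so that line wraps and lowercase bases in the coding sequence do not interfere.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence up to the first stop codon (hypothetical helper)."""
    # Remove line breaks or spaces and normalize case before reading codons.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons and last three codons of the coding sequence above.
print(translate("ATGACGAAAATCTGTTTCTGC"))
print(translate("GCCACTTAG"))
```

Passing the full coding sequence to the same function and comparing the result with the listed protein sequence is one way to confirm that the two fields agree.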

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00801710;
90% Identity
iTF_00801710;
80% Identity
-