Basic Information

Gene Symbol
snpc-4
Assembly
GCA_944452905.1
Location
CALYCC010000002.1:294517-297285[-]
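
The location string above packs the contig, coordinates, and strand into one field. A minimal sketch of splitting it apart follows. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced here, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "contig:start-end[strand]" (format inferred
# from this record; other records may differ).
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CALYCC010000002.1:294517-297285[-]")
print(loc.contig, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the span for this entry works out to 2769 bases on the minus strand.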

Transcription Factor Domain

TF Family
MYB
Domain
Myb_DNA-binding domain
PFAM
PF00249
TF Group
Helix-turn-helix
Description
This family contains the DNA-binding domains of Myb proteins, as well as the SANT domain family [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  5   0.058     1.4e+02   3.7    0.0   14        43      246       280     238       283     0.77
2  5   7e-11     1.7e-07   32.2   0.1   1         45      289       335     289       336     0.97
3  5   5.8e-10   1.4e-06   29.3   0.1   3         44      346       391     344       393     0.96
4  5   7e-10     1.7e-06   29.0   0.0   1         45      399       444     399       445     0.92
5  5   2.3e-12   5.5e-09   36.9   0.3   1         41      449       489     449       490     0.98
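
The per-domain rows above can be read programmatically to separate strong Myb-domain matches from weak ones. The sketch below parses the five rows from this record and keeps hits under an i-Evalue cutoff. The `DomainHit` class, the helper names, and the cutoff of 1e-3 are assumptions made for illustration, not values taken from the database.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # domain number within the protein
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein


# Rows copied from the hmmscan output in this record (13 columns each).
ROWS = """\
1 5 0.058 1.4e+02 3.7 0.0 14 43 246 280 238 283 0.77
2 5 7e-11 1.7e-07 32.2 0.1 1 45 289 335 289 336 0.97
3 5 5.8e-10 1.4e-06 29.3 0.1 3 44 346 391 344 393 0.96
4 5 7e-10 1.7e-06 29.0 0.0 1 45 399 444 399 445 0.92
5 5 2.3e-12 5.5e-09 36.9 0.3 1 41 449 489 449 490 0.98
"""


def parse_rows(text: str) -> list[DomainHit]:
    """Parse whitespace-separated domain rows, skipping malformed lines."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(
            DomainHit(
                index=int(fields[0]),
                i_evalue=float(fields[3]),
                score=float(fields[4]),
                ali_from=int(fields[8]),
                ali_to=int(fields[9]),
            )
        )
    return hits


def significant(hits: list[DomainHit], max_i_evalue: float = 1e-3) -> list[DomainHit]:
    """Keep hits at or below the cutoff (1e-3 is an assumed, illustrative value)."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


hits = parse_rows(ROWS)
kept = significant(hits)
for h in kept:
    print(h.index, h.ali_from, h.ali_to, h.i_evalue)
```

With this cutoff, domain 1 (i-Evalue 1.4e+02) is dropped and domains 2 through 5 are kept, covering protein positions 289 to 489.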

Sequence Information

Coding Sequence
ATGAGCGAATCAGAGGACGAAGACGTTATCCTCGGCGACATAAAAGCGTTGCAGGCAGTTTTAACCCAGAAACAAACAGCCAGAGTTAAATTAGAGGTGAAATCGGAAGATAGAATTCCGGATGAACCTAACCTATGCGAACCGTCAACATCTAAAACCCGCGATATTTGCTCTCCAACGTTGAATACGGTTGAAAATTTGGAGGATTTAGATGCAGAAGAAAGCGACAATGAATGTTTATCCGAAATCGAAGAAGACGATGATAATATAGAGGATTTATCCGACTCCGAAGCTGCGTTAAATTTGAATAAAAAAATCATCAGTCTTTTATCGACAGCCCAAAATCGGCTTCGTCTATTACTGAAAGAATGCAAAAGACGTCAAGCAATGATAGATCAAAAAATACGTGAAAAAACTTCAGACCCACTGACCAGAACGCTGCTGACAATAGCTGGAATGCCTTATTTTAAAGACAAGCAGCATTTTCCAGCGCCAAAAAACGAAGATGCTAAATTAAAGGCGAGCCGTGGAGAGTTACAGATTATTAAACTGCCTTGTCCTCTGCGGTGGACATTGAAGGATAGAAATATATTATGGAGAGCAATTGCATGGGATGCTACAAACAATTGTCAAGAAGAAGCAAAGAGAGATACTGCAGAGGATCCTCTGAGGCTTGAGCAAGTAGGGCCATACAGGGAGAAAAAATTGCCCGAAGAAATGCTTAGGAATATAAAACATTTGGTTGGTCCGTTAGGAAGCAGGGAATTTGACTGGTTGAAAATAGCGGCGACAGATTTCAACGGAAGACATTCTGCGAACGAGTGCAGGGCGATGTGGAACGTATTTTTACATCCTGATATAAATAAGGGGAAATGGAAAAAGACAGAGGATTTCGAGCTCAAATCACTGGCAGAGGAGTACAACTTTCAAAATTGGGAAGCGGTCGCTAGTAGGTTGGGAACTAACAGAAGTGGATATCAATGCTTTATTAGATTTAATACAAATTTTAAGAACAATATGTTCAGCAGAAGTAGTTCGTGGTCTAAAGAAGAAGACCAGAAACTTACGGACACTGTAAACGCCTTGAGAACTGGGAATTTCATTCCCTGGGGAGAAGTAGCTAAGGTTATGGGAAATAGGAGGAAACAGCAGATTTATATGCGCTGGAATTACAGCCTAGCACCTAACTTGAAAAAGGGTAGATTCACTGAGGAAGAGGATAGGCTGCTGCTTGAAGGAGTTGCCAAGTTTGGCTTTAACTTTACTAAAATTTCTGTCCTCCTACTGCCCCAGAGAAATACCGCCCAGCTAAACGATCACTACAGAACTCTTATGAATGAGAAAAAGAACAAATGGAACTGCGAGGACGACATGAAACTGGTCAAGCTGTTCGATAGGTTTGGAAACAACTGGTCCGCTATTGCGAAGGAATTTTCAAACAAATCTAGAGTGCAAGTCAGACATCGGCATACTGCGATTATACGATATCTAAGCAGAGGCCTTTCTATAAGGACTATACCTAGGTGGGGGATTAAAACTGAAGAGGAAGATACTTCGGAAGATATATTCACCCGCGGAGAACGGATGCTCGATCAGATGAGCGAGATGGCGAGAATGCAGCAGAGGGCAAGGGAGGGGACAAGGAGAGAGCAGATGCCTAAGATAGACCAGGATCTCAGGGATTATTTCAAAGCGATATATCAGTCTTCCTCCAAGCCCGGGAGACACAGGAAATATTACTCTGTTGAAGAGCTTGACGAAATGACTCAGAAGCTTAACGTCATTTTAAATATTTTTAAAGCTGATCTTGACATTCCTGACGATTTAGAGGAGCAAACATTCTTGACGGAGAAGGACAAGCAGCTGCTGACTTCATTGAAAGAATACTCAAGAAGCGAAAGCTTGGAATCCGAGGAACGACCAAAATACATTGAATACGTCAGGCGGAAGATGTTCGGATCTTCAATGCCGGCGACTGGTGAAGTTCGTTTCATACCGCC
GTTACCATTCAATGGAAAAGTTCAGACAACTAAGAAAAAAAAACTTGTGGGCATTAATTATTCTCCCAGTGGAGATAAATGTCTCGCCGAAATACCCCAAGAACTTTGTACATCAAAGACAATTGTATCGTTGATCGGCGGTTGGGAAATTGAAGTAGGATTTCAAAATATGGCAAAGATATTTATTCCAAAGACTGAACAGTTCGCAGAAGCTAATACTGGCAACAGAGATAGTTTCATCGAAACTCTTGACCCGAATGCTCCAAGTACTTCCGGGATTGTTAACAATGGAAGTGGGAACTCTGGACAGGAAGTTTTATCAAGCCCACATCAATCATCTCTTCAAATAATCAGAAGTCTGTCTAGATATTATCCGGAAGTTTTGATACCACCAAACTATACAACATTGCTGGGTTTTCGCGGTTTATTATTATCCAAACATATCCTTGAGGCAGAGGGTCCGGGACGAGGAGAATTCGAAGAAGAAGAGGAAAGAGAAGAAGAAGAAGAATGTCCGATAACGCCGGAGGGAGAAAAAGCTTTGGAATTATTCGAGGAACGTCTAGTTAAACTCTTCAAATTTCCTATAGAATTGTCAGAAATAGCACCGCCTCTGTTACACGTATTACAAAATGCAGACTTGGGTGATACTGACGATGACGTTTGCGGTAAGAAGAGAAAAGCAAGTAAAACAAACTCCGAACCAGCAGCTAAGAGAAAAGCAAAATTCGAAAAAGTCAATGTGAAAGAAAATCACGAGGCTCAATAA
Protein Sequence
MSESEDEDVILGDIKALQAVLTQKQTARVKLEVKSEDRIPDEPNLCEPSTSKTRDICSPTLNTVENLEDLDAEESDNECLSEIEEDDDNIEDLSDSEAALNLNKKIISLLSTAQNRLRLLLKECKRRQAMIDQKIREKTSDPLTRTLLTIAGMPYFKDKQHFPAPKNEDAKLKASRGELQIIKLPCPLRWTLKDRNILWRAIAWDATNNCQEEAKRDTAEDPLRLEQVGPYREKKLPEEMLRNIKHLVGPLGSREFDWLKIAATDFNGRHSANECRAMWNVFLHPDINKGKWKKTEDFELKSLAEEYNFQNWEAVASRLGTNRSGYQCFIRFNTNFKNNMFSRSSSWSKEEDQKLTDTVNALRTGNFIPWGEVAKVMGNRRKQQIYMRWNYSLAPNLKKGRFTEEEDRLLLEGVAKFGFNFTKISVLLLPQRNTAQLNDHYRTLMNEKKNKWNCEDDMKLVKLFDRFGNNWSAIAKEFSNKSRVQVRHRHTAIIRYLSRGLSIRTIPRWGIKTEEEDTSEDIFTRGERMLDQMSEMARMQQRAREGTRREQMPKIDQDLRDYFKAIYQSSSKPGRHRKYYSVEELDEMTQKLNVILNIFKADLDIPDDLEEQTFLTEKDKQLLTSLKEYSRSESLESEERPKYIEYVRRKMFGSSMPATGEVRFIPPLPFNGKVQTTKKKKLVGINYSPSGDKCLAEIPQELCTSKTIVSLIGGWEIEVGFQNMAKIFIPKTEQFAEANTGNRDSFIETLDPNAPSTSGIVNNGSGNSGQEVLSSPHQSSLQIIRSLSRYYPEVLIPPNYTTLLGFRGLLLSKHILEAEGPGRGEFEEEEEREEEEECPITPEGEKALELFEERLVKLFKFPIELSEIAPPLLHVLQNADLGDTDDDVCGKKRKASKTNSEPAAKRKAKFEKVNVKENHEAQ
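
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. As a hedged illustration, the sketch below translates the first 27 bases and the last 9 bases of the coding sequence. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly, and `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing unexpected characters are rendered as 'X'.
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases and last 9 bases of the coding sequence in this record.
print(translate("ATGAGCGAATCAGAGGACGAAGACGTT"))  # MSESEDEDV
print(translate("GCTCAATAA"))                    # AQ
```

Both fragments agree with the start (MSESEDEDV) and end (AQ) of the protein sequence listed above.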

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2

100% Identity
iTF_01414102
90% Identity
-
80% Identity
-