Basic Information

Gene Symbol
Su(H)_1
Assembly
GCA_933228905.1
Location
CAKOGJ010001721.1:33852-58233[+]
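
The location string above can be split into contig, start, end, and strand. The following is a minimal sketch, assuming the layout `contig:start-end[strand]` (inferred from this record only) and 1-based, inclusive coordinates. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for "contig:start-end[strand]"; the layout is inferred from this record only.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into its contig, coordinates, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in base pairs, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CAKOGJ010001721.1:33852-58233[+]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption, the gene spans 24382 bp on the forward strand of the contig.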

Transcription Factor Domain

TF Family
CSL
Domain
BTD domain
PFAM
PF09270
TF Group
Beta-Scaffold Factors
Description
Members of this family of DNA-binding domains adopt a beta-trefoil fold, that is, a capped beta-barrel with internal pseudo-threefold symmetry. In the DNA-binding protein LAG-1, this domain is also the site of mutually exclusive interactions with NotchIC (and the viral protein EBNA2) and with co-repressors (SMRT/N-CoR and CIR) [1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1  2   0.75      1.1e+05   -3.8   0.0   64        85      523       544     523       557     0.66
2  2   3e-67     4.5e-62   210.6  0.0   1         123     700       822     700       822     0.99
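
The two domain rows above can be read programmatically to separate the confident BTD hit from the weak one. This is a minimal sketch, assuming each row carries the 13 whitespace-separated fields listed in the header. The `DomainHit` class, both functions, and the i-Evalue cutoff of 1e-3 are illustrative assumptions, not values taken from this record or from HMMER defaults.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DomainHit:
    index: int
    of: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated domain row (13 fields expected)."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}")
    return DomainHit(
        int(f[0]), int(f[1]), float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


def significant_hits(hits: List[DomainHit], max_i_evalue: float = 1e-3) -> List[DomainHit]:
    """Keep hits whose independent E-value is at or below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue <= max_i_evalue]


rows = [
    "1 2 0.75 1.1e+05 -3.8 0.0 64 85 523 544 523 557 0.66",
    "2 2 3e-67 4.5e-62 210.6 0.0 1 123 700 822 700 822 0.99",
]
hits = [parse_domain_row(r) for r in rows]
for h in significant_hits(hits):
    # Aligned length on the protein, inclusive of both end positions.
    print(h.index, h.ali_from, h.ali_to, h.ali_to - h.ali_from + 1)
```

With this cutoff only the second row is kept: it aligns HMM positions 1 to 123 to protein residues 700 to 822, while the first row (i-Evalue 1.1e+05, negative score) is discarded.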

Sequence Information

Coding Sequence
ATGGACCATTCTGGCATGAAGAGTCCTCCAACAACAGAGAGGAAACTACTGCAACGCAGAGCTGACCTAAAACCTGTATCGCCACAACATCTTTCTTCTTTTTTTTCCCGCGGTCCAATTCGTCGTGAAGTGGTGGAGTCGTCGTCCGACATGCATATGATACCGGACCCTCTGGGTGGCACGCCCCACGATGCCATCAACGGGGAGCACCAAGAAACCCCGCAGCGGCACCAGTACAACAATTCGGCCGGGGTGTTGGCGAGGAGTAACCACGGCAACCACTCGAGGGGAGTCAACTTGGTCAGCAGCGGGTATTTGATGTCGGGATCGGGTGGCCCGCATTGCGGAGGGGTGCAGATGCCTCCGTCTCCCGTCAGGACCCCGTCCCCACGGCCCATCGAGCAGTTCGGCCGGCGTTTGTCTCCGTACGGATCCTGCGGCCGGGCCAATTCCCCCCCCCCCCCCACTTTCCCATTCCCTCCGTCAGCCACGAACGAGAACGTGCCAATGCCACTCCGGAGAAAGGGAAAGAAAGCTTCGCTCCTAATGAGAAGCGACAAGGGGTGCCACTCAAAACCGGGGACGGTGGAGTTGACTCAGGAGTTTGGTAGCGACGGCTTTCGCCCCGATTCACTGGGCGTCGAAGACGGGGGTCGGAGTTCGCTCGACCGTTGGCTGATCGGGTCGGGGGTGGATAGGAAAGTACTTATGCCTGGTAAGGTGTCTGGAGCATGCCCGGAGACGCGTGGGTTTTCCGGCCCTGGGGTGAGAAATCGTTGGTGCCGGTCCTCGGCCCAACCTTCTACGTTCGCCTTAGAGCCAACAGGGAAATTTTCACCTGGCTGCAGTGTCGTCCGCCTCTCCAGCTCCACACCTTATGGCTGCTTTCAATCTGTCCTCCTCGCTTCTCCATCCTCTGCGTCTGCTTGCGTCGCGTCGCTCTCCTTCGTTCAAGTCGTCTCTCTTTCCGTCTGCCGTCTTTCACTCTGGAGTCTGAGGTCTGGGGCGAGCTCACTGGGGAACCCCTTTTTATATCGTCGGTCAATGCGACCGGGGCAACGAGAGGAAAGAGTGTCGGGGATCGGGATGGTGTGGTGGCTTGAGTACTGGCCTCCCATCCCAAGGGCCTGGGTTCGATTCCCGGCAATGGTGGACTTTTCAGTAGCAAAGCGACTTCGTCCAATCACAACTGAGCCGGCGAAGCTACTTCGTCTTATCTCAATGGAGGATAGTACATCTGACTCTGCCAATAGGAGCAGTTACGACAACTTCTGTGCGAGCAGACCTTCCGAGAATCCCCCTCCACTCCCTTCCCTTCCCATAAGAGCTGTCAAAGACGGGAGGAAAATGATGTCAATGTTATCAGAGCAGCTTTTTTGTTCCTTGTTGCGTCCGGGGAAGGACCAGAGGCTTACCCGTGAGGCGATGAAGAAATACTTGCGCGAGAGGAGCGACATGGTGGTCATAATCCTTCACGCAAAGGTCGCGCAGAAGTCGTACGGCACGGAGAAGCGATTCTTCTGCCCTCCGCCGTGCATCTACCTGTACGGGGACGGGTGGCGGTTGAAGCAGGAGGCACTGCTGAAGGCCGGGGAGTCTGAGCAGTCATCGCAGCTCTGCGCATTCATCGGGATTGGCAATTCGGACCAGGACATGCAACAGTTGGACCTCAACGGGAAGGTGCGTTCTCTAAAGCCCTGTACACACGGCGCCTTTGATGGGGGGGACACAGCTGACTCTGCAAATAGGAGCAGTTGTGACAGGATCTATGCGAGCAGACCTTCCGAAAACCCCCCTCCACTCCCTTCCCTTCCCCTAAAAGCAGTCAAGGACGGGAGGAAACAATACTGCGCTGCCAAGACGCTGTACATATCCGACTCGGACAAGAGGAAACACTTCATGCTCTCTGTCAAGATGTTCTACGGCAACGGCCACGACATCGGAGTGTTCTACAGTAAGCGCATCAAAGTGATCTCGAAGCCTTCCAAGAAGAAACAGAG
TCTCAAGAATGCAGATCTGTGCATTGCGAGCGGAACCAAAGTTGCCCTCTTCAACCGACTGCGGTCTCAAACGGTCAGCACAAGGTATCTTCACGTGGAGAACGGCAACTTCCACGCAAGCTCGACGCAGTGGGGAGCCTTCACCATACACTTGCTGGATGACAACGAGTCTGAGTCGGAAGAGTTTGCGGTGCGCGATGGCTACATCCACTACGGCAGCACCATCAAGCTGGTGTGCAGCGTGACGGGGATGGCGCTCCCACGGCTGGTGATCCGGAAGGTGGACAAGCAGATGGCGCTGCTGGATGCGGACGATCCAGTCTCACAGCTGCACAAGTGCGCCTTCTACCTCAAGGACATGGAGAGGATGTACCTCTGCCTCTCCCAAGAGAGGATCATCCAATTCCAGGCGACGCCCTGTCCCAAGGATCCAAACCGCGAGATGATAAACGACGGAGCATCCTGGACCATCATCAGTACCGACAAGGCAGAGTACCAGTTCTACGAAGGCATGGGACCAATACGCACGACAATTACTCCTGTCCCGATCGTCCACAGTTTACATGGGCGGATTCAGCATCAAATTTGGGGGGGGGTCATGACCTGGGTCCGGGGAGTATGGAATTCCCCCCCCTCCCCCCTCACCCCCCCCCTCCCCGGGAGAGAGAGGCATTCTCCCGCGTAA
Protein Sequence
MDHSGMKSPPTTERKLLQRRADLKPVSPQHLSSFFSRGPIRREVVESSSDMHMIPDPLGGTPHDAINGEHQETPQRHQYNNSAGVLARSNHGNHSRGVNLVSSGYLMSGSGGPHCGGVQMPPSPVRTPSPRPIEQFGRRLSPYGSCGRANSPPPPTFPFPPSATNENVPMPLRRKGKKASLLMRSDKGCHSKPGTVELTQEFGSDGFRPDSLGVEDGGRSSLDRWLIGSGVDRKVLMPGKVSGACPETRGFSGPGVRNRWCRSSAQPSTFALEPTGKFSPGCSVVRLSSSTPYGCFQSVLLASPSSASACVASLSFVQVVSLSVCRLSLWSLRSGASSLGNPFLYRRSMRPGQREERVSGIGMVWWLEYWPPIPRAWVRFPAMVDFSVAKRLRPITTEPAKLLRLISMEDSTSDSANRSSYDNFCASRPSENPPPLPSLPIRAVKDGRKMMSMLSEQLFCSLLRPGKDQRLTREAMKKYLRERSDMVVIILHAKVAQKSYGTEKRFFCPPPCIYLYGDGWRLKQEALLKAGESEQSSQLCAFIGIGNSDQDMQQLDLNGKVRSLKPCTHGAFDGGDTADSANRSSCDRIYASRPSENPPPLPSLPLKAVKDGRKQYCAAKTLYISDSDKRKHFMLSVKMFYGNGHDIGVFYSKRIKVISKPSKKKQSLKNADLCIASGTKVALFNRLRSQTVSTRYLHVENGNFHASSTQWGAFTIHLLDDNESESEEFAVRDGYIHYGSTIKLVCSVTGMALPRLVIRKVDKQMALLDADDPVSQLHKCAFYLKDMERMYLCLSQERIIQFQATPCPKDPNREMINDGASWTIISTDKAEYQFYEGMGPIRTTITPVPIVHSLHGRIQHQIWGGVMTWVRGVWNSPPSPLTPPLPGRERHSPA
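
The coding sequence and the protein sequence above can be checked against each other by translating the former. The following is a minimal sketch, assuming the standard nuclear genetic code and a sequence containing only A, C, G, and T. The `translate` function and `CODON_TABLE` name are hypothetical; whitespace (including line breaks inside the sequence) is ignored.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order at each position.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the coding sequence listed in this record.
print(translate("ATGGACCATTCTGGCATGAAGAGTCCT"))
```

Applied to the first nine codons, the sketch yields MDHSGMKSP, which matches the start of the listed protein. The final codons of the coding sequence (CAT TCT CCC GCG TAA) likewise give HSPA followed by a stop, matching the protein's C-terminus.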

Similar Transcription Factors

Clustering by sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-