Doo028178.1
Basic Information
- Insect
- Dicycla oo
- Gene Symbol
- NFAT5
- Assembly
- GCA_948252095.1
- Location
- OX411712.1:3531626-3553422[-]
Transcription Factor Domain
- TF Family
- RHD
- Domain
- RHD domain
- PFAM
- PF00554
- TF Group
- Beta-Scaffold Factors
- Description
- Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal DNA-binding domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See PF16179) that functions as a dimerisation domain [1-2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 1.5e-29 2e-25 90.8 0.0 1 168 258 434 258 435 0.87
Sequence Information
- Coding Sequence
- ATGAAAATGACTATGTCCACGGCCCCGACGATGAGCGCACGGGTACACAGGAAAGTGATGAGGGCGCCGCATAAGCGGGCCCACCCGGGTAAGACGCAGCACCCGGGCAAAATGGTGCATGCAGGCAAAATTGCCCACCACCCGGGCAAACTGGCACATTTAGGCAAATTCGGGAAACTCGCGCACTACGCTCACAGTCACGCACTGAGGCCGCCCGAGCCATGTGAGAACAGTAACGACAGCGGCCTAGGACCCGACGCCGTCAACAGATTATCGGACACTTTGGAGGAGTGGGAACACCCGGAGTTGAAACGACAACGTGGGGACTCTGAAGCGGCGGTTAAGCTAGAATGCGACGACGCAAACGACGCGTACGCATACCGCGCGCCGGGCGCGGCTGAGCCCGCGTCTGCGCCCGCGCCGCCCGACGCCCCCGCCTCTACGCATGCGCCGGCCCTACCCGCTCCCGCAATGAGAGCGGGTTGCAGTATATCTAGTCCAAACATAGCCGAAACTCCCGGCAAGAGATTCCAGGATCCCTACAGATCGTGCGGGAAACTGGGCAGCGGGGGTAAACTGGCCAACAAGTACCCTGGTGGGGGGAAATTAGTTGGTGCCGGTGGGAAATTAGGGGGCAAACTAGGGGGGAAGTACGGAGGTAAATTGGCACAAGCTATGGCGAGAAGAAGGGCGCTAGCAGCCATGACGCAAGGTGGCCGACCAGGGTTGGCGGCGCCACTGTCGAGCAGGTCGAGAGATGGCACGGTGGAGCTACAGATACTATGCCAGCCTGAGACACAACACAGAGCGAGATATCAGACTGAAGGCAGCAGAGGCGCTGTCAAAGACAATTCAGGGAATGGTTTCCCGACAGTTAAACTAGTCGGTTACGACAAACCGGCTGTGCTGCAGGTATTCATAGGCACAGACACAGGTAGAGTGGCGCCGCACATGTTCTACCAAGCATGTCGGGTGTCCGGCAAGAATTCCACGCCTTGCAAGGAGAGGAAGGACGACGGCACGGTCATCATCGAGATCGACCTGGAGCCGGCCAAGAACTGGCAGGTCACTTGTGACTGTGTTGGAATACTCAAGGAGCGCAACGTAGACGTAGAACACAGATTCGGCGAGTCACTTGGCGGTACGGGTGGCGCTGGCACAGTGAGCGCTCCAGCTGGCGCGGCGCACATCTCACGCGGCAAGAAGAAGTCCACGCGCTGCCGCATGGTGTTCCGCACCGACATCGTCAACGCGGCCGGCCACGTGGAGACGCTGCAAGTGTGCTCCACGCAGATTATATGCACACAACCACCAGGCGTCCCCGAAGTGTGTCGCAAGTCCCTAGTATCCTGCCCTGCCTCCGGAGGCCTGGAGCTGTACCTCCTCGGCAAGAACTTCCTGAAGGAGACGCGCGTGGTGTTCTGCCAGCGACAGGACGGCCGCACGCACTGGGAAGAGGAGGTGGCTCCGGACAAGGAGTTCTTACAGCAGACTCACTTAGTGTGCTGCGTCCCGCCATACCTCCGGCACGACATAACGGAGACAGTGGCTGTGCAGCTGTTCGTGCGCTCCGGCGGCAAGGCGTCCGAGCCGCACTGCTTCTACTACACGCCGCCCGCGCTCGACACCGGACCGCTGCACTCTTCCAGACATCATACTCAAGGTGGCGACGAGGCGCGGCCTGTGCTGCTGTGGGGAGGGAGCATGATGCCGCCGCCCTCGCTGCCCGTGCGCAGGCCTTCCATCATTGTGCCAGACCCGCACTCCCCGCAAGGACTTAAAAGCGAGGTCGCAGACGAATCCAGCCAGCATTCGTTGGCAGATGGCGGTGAGTCGTCGTGTGGAGCGGACGGACCTGATAAGCCTATGAACATGTCTGATGAACTAGACATTGTGGACCTCAGGCTCAAGTCTGAGATGTCTCGGGACTCGCAGAGCCAGGTCGGTTTCGTAAGCGGGTACGAGAGCATGAAACTCTCTCCTCCCACTCCTGCGCAGAGCGCGCCTCCCTCCCCCCTCGCGACCTTCACGCAACAACTGCAGGCCATACAGAACCAAGTGCAGACTGACAAGATGGTAGAAAGCGTCACCGCCGCCATCTTCAACAACGCGGACAACGCGCCGCAGATCTACCCCACCATCATGCAGCCCATTGACCCCATGAGACAGCTCATCTCATCTAAGAGTGTGGATCCTTTGGAGACAGACATGAAACTCAACCCTGAACTCATGATTACTGACAATACCATGCAGCAAAACGAGCAAAGACTGATAGTGTACAACCAGATGCAGAGCAGACAGGAGGAGAGCTTCAACCCTTTTGGAGCCATCACTAAGCTGGAGATGGGGACTAGTCAGATGGAGCAACGTCTGGCGCAGCAGTCTGCGCACATGGAGGCGCTAGTCGAAGATGCTATGAAGGCCACGGCCGCCATACTTCCTGACAACACCAAGCTCGATGAGCTGGTGAACTCGCGAGTGGACGACCACCTTTCTGCGTCCACCTCGCCTTCGGCCGCCAGCCACGCGTCCGACATCCTGCTGAGCCCCAACGCGGCCGTGCCCGGCAGGAACTCCGCCACGGAGCTCCTCCCCGCCATGCCGACGTCGACCATGTCCCCCGACGTGATCCTCAACCCGCAAGTGTCCCCCAGCATGCTCTGCGACAACTCGCAGCGCATCGTCATGCCCTCGGGCCCCTCCCAGGACGAACTCATGATGATGCAAGACCAGGTGCAACTGACGTCCGTGAAGACCCCGCCCGCAGCCGTCAAGTCAATGATACTGAACGCGGCGGCTGAGATCCTCACGTCAGACACGACCATGAACGCTTTAGTCACGTCAGCGATCAACACCGCTAATATTTTGAGTACCGAGAGTGCAGCAGGATCGGGAGGTATGCAGACGGCCGACTCTGCGCAACCGCCCGCGGAACCCGCTGGTGAAACTCCTCTAGTAGCCATGTCTCAAGCTGTATCTCAGGCTGTGACTCAGGCCGTATCTCAAGCTGTCTCCCAAGCCGTGTCCCAGGCCGTGACCCAGGAGATGACGGCTCCGGTACAAGGACTCACAGACATGAGCGACCAAGACCTACTCTCGTACATTAACCCCAGCACCTTCGATCAGGATTACGTGAATTTAGTGAAACTTCAGAATTGGACAGCCTTCTACAGCAACAACTGCGTCGGTAATACCTGTTGA
- Protein Sequence
- MKMTMSTAPTMSARVHRKVMRAPHKRAHPGKTQHPGKMVHAGKIAHHPGKLAHLGKFGKLAHYAHSHALRPPEPCENSNDSGLGPDAVNRLSDTLEEWEHPELKRQRGDSEAAVKLECDDANDAYAYRAPGAAEPASAPAPPDAPASTHAPALPAPAMRAGCSISSPNIAETPGKRFQDPYRSCGKLGSGGKLANKYPGGGKLVGAGGKLGGKLGGKYGGKLAQAMARRRALAAMTQGGRPGLAAPLSSRSRDGTVELQILCQPETQHRARYQTEGSRGAVKDNSGNGFPTVKLVGYDKPAVLQVFIGTDTGRVAPHMFYQACRVSGKNSTPCKERKDDGTVIIEIDLEPAKNWQVTCDCVGILKERNVDVEHRFGESLGGTGGAGTVSAPAGAAHISRGKKKSTRCRMVFRTDIVNAAGHVETLQVCSTQIICTQPPGVPEVCRKSLVSCPASGGLELYLLGKNFLKETRVVFCQRQDGRTHWEEEVAPDKEFLQQTHLVCCVPPYLRHDITETVAVQLFVRSGGKASEPHCFYYTPPALDTGPLHSSRHHTQGGDEARPVLLWGGSMMPPPSLPVRRPSIIVPDPHSPQGLKSEVADESSQHSLADGGESSCGADGPDKPMNMSDELDIVDLRLKSEMSRDSQSQVGFVSGYESMKLSPPTPAQSAPPSPLATFTQQLQAIQNQVQTDKMVESVTAAIFNNADNAPQIYPTIMQPIDPMRQLISSKSVDPLETDMKLNPELMITDNTMQQNEQRLIVYNQMQSRQEESFNPFGAITKLEMGTSQMEQRLAQQSAHMEALVEDAMKATAAILPDNTKLDELVNSRVDDHLSASTSPSAASHASDILLSPNAAVPGRNSATELLPAMPTSTMSPDVILNPQVSPSMLCDNSQRIVMPSGPSQDELMMMQDQVQLTSVKTPPAAVKSMILNAAAEILTSDTTMNALVTSAINTANILSTESAAGSGGMQTADSAQPPAEPAGETPLVAMSQAVSQAVTQAVSQAVSQAVSQAVTQEMTAPVQGLTDMSDQDLLSYINPSTFDQDYVNLVKLQNWTAFYSNNCVGNTC
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00808874;
- 90% Identity
- iTF_00851489; iTF_01092835; iTF_00120261; iTF_00147072; iTF_01246598; iTF_01192391; iTF_00924361; iTF_01230281; iTF_00036394; iTF_00726056; iTF_00037548; iTF_00111370; iTF_00274219; iTF_01116939; iTF_00039564; iTF_00122124; iTF_00906829; iTF_00809829; iTF_00808874; iTF_00273383; iTF_00041550; iTF_00042403; iTF_01118030; iTF_00071184; iTF_01538512; iTF_00043249; iTF_01093809; iTF_01439665; iTF_00123122; iTF_00124005; iTF_00040563; iTF_00887987; iTF_01029923; iTF_01027068; iTF_00973492; iTF_00425141; iTF_00622581; iTF_00745397; iTF_00757909; iTF_00363799; iTF_01440759; iTF_00830936; iTF_01030904; iTF_00850362; iTF_00905882; iTF_01028987; iTF_01025744; iTF_00374941; iTF_00771645; iTF_00951565; iTF_01028030; iTF_01119050; iTF_00907648;
- 80% Identity
- -