Datr013570.2
Basic Information
- Insect
- Drosophila atroscutellata
- Gene Symbol
- Nfat5
- Assembly
- GCA_035041605.1
- Location
- JAWNLA010000108.1:22266803-22286096[+]
Transcription Factor Domain
- TF Family
- RHD
- Domain
- RHD domain
- PFAM
- PF00554
- TF Group
- Beta-Scaffold Factors
- Description
- Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal DNA-binding domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See PF16179) that functions as a dimerisation domain [1-2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 6 3.8 1.1e+04 -3.9 1.1 110 131 228 249 216 258 0.57 2 6 0.96 2.7e+03 -1.9 0.7 103 152 361 395 340 408 0.43 3 6 7.4e-36 2.1e-32 112.3 1.4 1 168 525 684 525 685 0.97 4 6 1.9 5.5e+03 -2.9 0.0 48 70 784 806 777 820 0.80 5 6 2.3 6.5e+03 -3.1 0.5 103 129 1002 1028 994 1036 0.51 6 6 4.1 1.2e+04 -4.0 2.7 124 128 1287 1291 1248 1322 0.55
Sequence Information
- Coding Sequence
- atgcgCTTCACCTACAATCAGTATAAATATTACGAGTCCGGCTATCGTATTCCCTCCAAAATGCATAGTCTGGGCCATCATCATAATGGCAATGGTACTGCTGgtaaaatcaacagcaacaacagcaacaacaacaacaacaacagcaacacagcaaacagcaacaacaatggcagtgGCCATCATCACTACAATCatcaccacaacaacaacaacagcaacaaccacaataattccaatacaaatgcatacCAACATAATTACAAAAGCAATTTTGGGATGCGCATGACGATATCGACAAACTCGACAATGAGTCCTCGCATCCATCGCAAAGGCTTTCGCATACCATCGAAGCGGCAGACGGGCAAGGGATTGCCCGGTAAATTGCACACGATTGCCAGGACGGGGCCCGGGAAGCTGGTGCCGGGCAAGCGAATACCACCGCGACCGCATCCGCCGCCCTGTGACAATTCGAATGACAGCGGCTTAGGTTTCGATCAGCATACGGAGCTGCGAAATGGTGccgtaacagcaacaactgttgctgctgtcgatgccagcaacaatggtccaagtagcagcaacagcaatcagaTCACAAATCTGGTGATAAATAGCGACTTGACCAACACtgttgccgcagcagcagcagccgcagctgcgGCGGTCGCCAGCAACAccttgcagcagcatcatcaacagcagcagcaacagcagcatgtacagctgcagcaacagcaacagcaacatcagccgcagcaacagcaacatccacCGCAGCAGCATTTAATACGCGCGATACCCGTTTCACGatcGCGTCATTCCAAGAAAATGCCCAACTACCTGGATGAAATCGATTCGGACTCATCGATATCAACATTTGTGGATGAATTGGCCACAGTTCACAGCAACGATGAGGATACTGGCTTTGGCACCGACAGCAATcaaacaatggcaacggttgcagcaacaacaacagcaacagccacatcgtcatcgtcatcatcctcGTCAACGCCAACGCTGAGCTCAGAACTGTTTGGGGGCATCTTCAATACGAATGGCATTGCtgtacaacagcagcagcagcagcagcaccaacagcaacattaccagcagcagcagcaatttcatTATCCTGGATTACATttgcagctacagcagcaacaacaacaacagcagcagcagcagcatcaccagcaacatgagagcagcagcagttgcagcagcaacagtagcaacagcagcagcaatggcaatggcctTGGTCGTGCTGGTGGCAAGCGCAGGAAATACGAGTGCAATCAGGTGGAGCTCGACAACGATGATGCATGCAGCGAGGATGAGTTTATACGCAAaattgccacagttgctgccGATAATAGTGGCGtcagcagcgacgtcagcgCCGGCACCAAAATCACCAACAAAATCCATTGCAGGATCGGCAGCAGTCATCACAGCAGTGAGCCAAAATTTATTGGAACGCGCACCGTGACGCGTGTGGCGAACAAGAGGCAACCGACGACGCCACTGAATAGCATCGCCAGCTCCAATGACGGCCATGTCCAACTGGAGATTGTGTCGCAGCCGGAGCAACAGCATCGCGCTCGCTACCAGACCGAGGGCAGCCGCGGCGCCGTCAAGGATCGCAGCGGCAACGGCTTTCCCATTGTCCGGCTAACCGGCTACGTTAAGTCCGCCGTGCTGCAGGTGTTCATTGGCACCGACATTGGACGCGTTGTGCCGCACATGTTCTATCAGGCGTGCAAGGTGGCCGGCAAGAATTCGACGCAGTGCAACGAGAAGAAGGTCGATGGCACCATGGTCATCGAGATTGACTTCAAGCCCGAAACGGACATGACCGTCACCTGCGATTGCGTTGGCATACTCAAGGAACGCAACGTCGATGTGGAGCACCGTTTCCCCGAGCATCTGGCgcagaagaacaagaagaagtcaACACGTTGTCGCATGGTATTCCGCACTCAGCTGACAAAAGATGATGGCAGCACCGAAACGTTGCAGGTCTGCTCCAATCCCATCATCTGCAcTCAGCCACCGGGCGTGCCAGAGATATGCAAGAAATCATTGAACTCCTGTCCCGTTGATGGCGGCTTGGAGTTGTTCATCATTGGCAAGAACTTTTTGAAGGACACCCATGTTGTGTTCCAGGAGACGTACGACAGCGTCAATGGCGATGATCCCGCAACGGAAATAGCCGTGCGCCAGCAGCTTATCGGTGGCACCGCTGCTCTCTGGGAGCAGAGCGTGCTGCCCGACAAGGAGTATTTGCATCAGACGCATTTGATCTGCACAGTGCCGCCGTATTTGCATCAGAATGTGCTCAAGCCGGTCAGCGTGCAGGTGTCCATTATATCCAGTGGCAAGAAGAGCGAGCCGCACACCTTCACCTATACACCCAAGGGCAGCTATACGACACTAGCGGCGGCCAGCACGTTAAGTAGCACAGTGCTTAGCCAagATGTTAGCAGTTTCATGGACACGAGCAGTGCCGCATCCGCTGGCAACGCAGTGTGGACGAATGGTGGCAGCAATGCCGCCACTGCAGCGAACGTTGTCGAGACGAAGCATGAGATTGATTTGGGCATGATGCCGCCACCGATAACCACACAGATACCGATGGGATTACGTCGATCCTCGCTGTCTGGCCCCACACCGATGATCAccgatcagcagcagcagcaacagcagcagcagcagctggtgcaTCTTAGCGCTGAGGCGGCACTGAAAAGTGAACTGCTCGATGAGAATAGTTCGCATAGTCCGCTCACCGCACAGGACAACCACTCGCCGGAGGCGCTCATCAATCCATGCAGTCCCAATGGTCTGCAATATCATGCACACTATGCACGCAAGAACAGTCTGGATGCGATGATGTACGACCAGTCGAATAGTATGCCCGGTTTTCCGGTTGCTGCCACAACGACAGTTGAACTGGATCCCGCCgctgttgccgtcgccgttgAGCTGGCCGTCAAGAATGAGATTGTGAAGCATGtggtgcagcagcatcaacagcatcagtcaccggcacagcagcagcagcagcagcacaagttCATTGATGATCTAACCAAATCCACAACCGTTGTCAATAGCAATGGGACAAGTGAGCCGGCGATCTTCACCACCGCAGCGGTGATTGATCATGCCCTAACCGATATACTCCAACAGAAGGTGGGCGTTTTGTCGCACACATCTGCAACATCTGCGCCAAGTGTGCTGAAGCGCAGCCTTTCGATCAGTAGCAACAATTCAAATAGCTCCCATTCGAGCAGCGACACACAATCGCCAAACAGTTCGCCACTGACGCAGGACATCATATTGAACTCGGAGCCGGCAGCAGCACTTGGCGCTGGCTTGCAGCAATTGGCAACTGGGCCCGCTGTTGCTGGGGGAGGCGTGAACAGTGCCAATGGCTCACTGTCCACCGACATCATTATGAATCCAGCTGTCTCACCATCGACCATACTCTGTTCGGCAAATGGGGCAGCGACTGCCGTTGTGCCCAACATATTGGCGCCACATCAGGTGACGATGGCCAATTCGATATTGAATGACATTGCCATGCAGGCGGAGCCAACGCAACAAGATGCAGCTGTCGCTGCGCTGGCATTGAGCAACATTATGATGAGTCCACCAAATGCGGATGTCAACGATACGCTGCCGCCGACACCAGTTGTTATGCAACCGgaagtggcagccacagcaacatcGACGGCAGTCAGCAACATGATTATTAAAGCAGCCGCCGATTTTATAACCAcacaggagcaggaacagcatcattatcatcatcaacatcaacatcagcatcagcaacaccaacaacaccaacaacaccagcagcaacatcaacaacactTGCCACATCCACAcgctcagcaacagcagcagcagcagcaggccggCAATGCGGCTGCAACCACAAATGATCCACTTGTTAATCTACTGCTAAATCATACAACAACCACAGAAACGGTTGCTGTagaagcggcagcagcggctgctgctgctgccgccgccgctgcagctgTCTTTCCCACGCTGTCCGCTGTTCATTTGCCCGTTGTGCCACCCACACCACAAGAATCTCTCATTGTCGCCCTCGCCACGGAGAATGCGCTGCAGAAATCGGTGGCCACCGCGGCCGTAACCACAAATGGTGCCGTCATGACCCAACAGGCATCGGCACCCAGCAGCGCCGGCAGCATTCTGCCGGCGGCCGTTGGAgccgtcgctgctgccgctgctgttgctgtgcagCCACCGATACCTCAGGAGCTGACCACAATGTCCGATCAGGATCTAATCAGTTACATCAATCCAAGCACCTTCGATCAGCGTAAGTTCTTAGTCTCTTGCCCCTCAACGATTTCAAAttga
- Protein Sequence
- MRFTYNQYKYYESGYRIPSKMHSLGHHHNGNGTAGKINSNNSNNNNNNSNTANSNNNGSGHHHYNHHHNNNNSNNHNNSNTNAYQHNYKSNFGMRMTISTNSTMSPRIHRKGFRIPSKRQTGKGLPGKLHTIARTGPGKLVPGKRIPPRPHPPPCDNSNDSGLGFDQHTELRNGAVTATTVAAVDASNNGPSSSNSNQITNLVINSDLTNTVAAAAAAAAAAVASNTLQQHHQQQQQQQHVQLQQQQQQHQPQQQQHPPQQHLIRAIPVSRSRHSKKMPNYLDEIDSDSSISTFVDELATVHSNDEDTGFGTDSNQTMATVAATTTATATSSSSSSSSTPTLSSELFGGIFNTNGIAVQQQQQQQHQQQHYQQQQQFHYPGLHLQLQQQQQQQQQQQHHQQHESSSSCSSNSSNSSSNGNGLGRAGGKRRKYECNQVELDNDDACSEDEFIRKIATVAADNSGVSSDVSAGTKITNKIHCRIGSSHHSSEPKFIGTRTVTRVANKRQPTTPLNSIASSNDGHVQLEIVSQPEQQHRARYQTEGSRGAVKDRSGNGFPIVRLTGYVKSAVLQVFIGTDIGRVVPHMFYQACKVAGKNSTQCNEKKVDGTMVIEIDFKPETDMTVTCDCVGILKERNVDVEHRFPEHLAQKNKKKSTRCRMVFRTQLTKDDGSTETLQVCSNPIICTQPPGVPEICKKSLNSCPVDGGLELFIIGKNFLKDTHVVFQETYDSVNGDDPATEIAVRQQLIGGTAALWEQSVLPDKEYLHQTHLICTVPPYLHQNVLKPVSVQVSIISSGKKSEPHTFTYTPKGSYTTLAAASTLSSTVLSQDVSSFMDTSSAASAGNAVWTNGGSNAATAANVVETKHEIDLGMMPPPITTQIPMGLRRSSLSGPTPMITDQQQQQQQQQQLVHLSAEAALKSELLDENSSHSPLTAQDNHSPEALINPCSPNGLQYHAHYARKNSLDAMMYDQSNSMPGFPVAATTTVELDPAAVAVAVELAVKNEIVKHVVQQHQQHQSPAQQQQQQHKFIDDLTKSTTVVNSNGTSEPAIFTTAAVIDHALTDILQQKVGVLSHTSATSAPSVLKRSLSISSNNSNSSHSSSDTQSPNSSPLTQDIILNSEPAAALGAGLQQLATGPAVAGGGVNSANGSLSTDIIMNPAVSPSTILCSANGAATAVVPNILAPHQVTMANSILNDIAMQAEPTQQDAAVAALALSNIMMSPPNADVNDTLPPTPVVMQPEVAATATSTAVSNMIIKAAADFITTQEQEQHHYHHQHQHQHQQHQQHQQHQQQHQQHLPHPHAQQQQQQQQAGNAAATTNDPLVNLLLNHTTTTETVAVEAAAAAAAAAAAAAAVFPTLSAVHLPVVPPTPQESLIVALATENALQKSVATAAVTTNGAVMTQQASAPSSAGSILPAAVGAVAAAAAVAVQPPIPQELTTMSDQDLISYINPSTFDQRKFLVSCPSTISN
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00485633; iTF_00499361; iTF_00615972; iTF_00513231; iTF_00560005; iTF_00521121; iTF_00566561; iTF_00494882; iTF_00497156; iTF_00576837; iTF_01320965; iTF_00501577; iTF_00597130; iTF_00496364; iTF_00498641; iTF_01320244; iTF_00619613; iTF_00518216; iTF_00542212; iTF_00527817; iTF_00564425; iTF_00500077; iTF_00582733; iTF_00592865; iTF_00497924; iTF_00595723; iTF_00521936; iTF_00576117; iTF_00570170; iTF_00552079; iTF_00573817; iTF_00558429; iTF_00543514; iTF_00511044; iTF_00535085; iTF_01326938; iTF_01325436; iTF_00516765; iTF_00524894; iTF_00548575;
- 90% Identity
- iTF_00521121;
- 80% Identity
- -