Hgut008508.1
Basic Information
- Insect
- Hypselothyrea guttata
- Gene Symbol
- Nfat5
- Assembly
- GCA_035045125.1
- Location
- JAWNNJ010001577.1:755958-776499[+]
Transcription Factor Domain
- TF Family
- RHD
- Domain
- RHD domain
- PFAM
- PF00554
- TF Group
- Beta-Scaffold Factors
- Description
- Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal DNA-binding domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See PF16179) that functions as a dimerisation domain [1-2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 6 0.014 65 3.5 0.2 87 132 299 344 284 364 0.76 2 6 0.19 8.7e+02 -0.2 2.3 109 137 422 451 395 476 0.57 3 6 3.7e-36 1.7e-32 112.7 0.9 1 168 506 665 506 666 0.97 4 6 0.25 1.2e+03 -0.6 0.0 48 73 765 790 754 811 0.79 5 6 0.95 4.4e+03 -2.5 0.0 103 122 966 985 891 1002 0.55 6 6 0.86 3.9e+03 -2.3 0.4 112 122 1238 1247 1205 1270 0.53
Sequence Information
- Coding Sequence
- ATGCGTTTCACCTACAATCAGTATAAATATTACGAGTCCGGCTATCGTATTCCATCCAAAATGCACAATCTGACCCATCATCACggcggcaatggcaacggcagtGCCGGCAAGAtgaacaccaacaacaataacaacaacaataacaacaacaataacaccaacaacaacaacaatcatcattacaacaatagcaacaacaacaacaacagcaactcaaACACAAATGCGTACCAACATCATTACAAAAGCAATTTTGGCATGCGGATGACCATGTCGACAAACTCAACCATGAGTCCGCGAATACATCGGAAGGGCTTTCGCATACCGTCGAAGCGACAGCCGGGCAAGGGACTGCCGGGAAAGCTGCACACCATAGTCAGGACGGGGCCTGGGAAGTTGGTGCCCGGCAAGCGAATACCACAGCGACCGCATCCGCCGCCCTGCGACAATTCGAATGACAGCGGCCTGGGCTTCGATCAGCATACGGAGGTGCGCAACGGCacgccagcaactgttgccgatgtgcggagcagcagcagcagcaacaacagcagcaacagcagcagcagcggcagcagcaatggtCTGATTAACAGCGCTCTGACCAacacagttgctgctgcggcggcggcagcagctgctgctgttgccagcaACACAttgcagcagcaccagcagcaacatcagcagcagcagcatttaatACGGGCGATACCAGTATCAAGaTCGCGCCATGCCAAAAAAATGCTGAACTATCTGGAAGAGCTAGAAACGGACGCAGGACTTTTAACGCCAACCAGCAATGATGAGGACACGGGCTTTGGCACCGACAACAGTcagacggcagcagcagcaacagcaacagcagcagcagcaacaacaatatccaGCGAGCTCTTTGGCGGCATATTCAATACAGGCGgagttgcatgccacaagaCGAACGATGTGGGCAGCTTtatgctgcaacagcagcagcagcagctgcaacagcagcaacaacaacagcagcagcagcaactctatcagcagcaacagctgctgcagcagcaacagtttcaGTATCCCggcttgcagttgcagctgcaacaagagagcagcagcggcagcagcagcagcagtagcagcagcaatggtGTTGCCACTCGCACTGGTGGCAAGAATCTGAAGCGCAAGAAACTCGAGTGCAATCAGGTGGAGCTGGACAACGACGATGCATGCAGCGAGGATGAGTTCATACGCAAGATTGCCACAGcggaacagcagcaacagcagcagcagcagcagcagcagcaacaacaacagcagcaacagcagcagcagcaacagcagcagcaagttaGCAGTGCCACATGCATCTTGTCGGTGGAGACGCTGCCAcatcgcagcagcaacagaaacaacaacgaGTCAAAATTCATTGGCAATCGCACCGTCACCCGCGTGGCCAATAAGAGGCAGCCCACCACGCCCCTGAACAGCATCGCCAGCTCGAATGATGGCCAGGTGCAGCTGGAGATTGTCTCACAGCCGGAGCAACAGCATCGGGCGCGTTATCAGACCGAGGGCAGTCGCGGTGCCGTCAAGGATCGGAGTGGCAACGGATTTCCCATTGTGCGGTTGAACGGCTTCGATAAGGCTGCCGTGTTGCAGGTATTCATTGGCACCGATATTGGTCGTGTGGCGCCACACATGTTCTATCAGGCTTGCAAGGTCGCTGGCAAGAATTCCACACAGTGCAACGAGAAGAAGGTCGATGGCACCATGGTCATCGAGATCGATTTTAAGCCAGAGACCGAAATGACCATAACCTGTGATTGCGTTGGCATACTCAAGGAACGCAACGTCGATGTCGAACATCGCTTCCCGGAACATCTGGCCCAGAAGAATAAGAAGAAATCCACACGTTGTCGCATGGTCTTTCGCACCCAGTTAACCCGTGATGACGGCAGCACGGAGACCCTGCAGGTCTGCTCCAATCCCATCATTTGCACTCAACCACCAGGCGTTCCGGAGATCTGTAAAAAATCGTTGAACTCTTGTCCAGTGGATGGTGGCCTAGAGCTATTCATAATTGGCAAAAACTTTCTGAAGGATACTCATGTGGTGTTCCAGGAGACCTACGACAGTGTTAATGCCGATGATCCAGCAAACGAAATAGCGGTGCGACAGCAATTGATCGGCGGTACCTCAGCATTATGGGAGCAAAGCGTGTTGCCCGATAAGGAGTATTTGCATCAGACGCATCTGATCTGCACCGTGCCCCCGTATCTGCATCAGAATCTGTTGAAGCCGGTCGCCGTGCAGGTGTCGATCGTGTCCAGTGGCAAGAAGAGCGAGCCGCACAACTTCACCTACACCCCCAAGGGCAGCTATACAACGCTGGCGGCGGCCAGCACGTTAAGTAGCACAATGCACAGCCaagATGTCGGCAGCTTCATGGACACCAGCAATGGTCCAAGCGGCAACAACCCGAACTGggcagccgctgccgctgcccccgtgcccctgcccctgcccctgcccctgccactgtCCGTGACGGTGGAGACGAAGCATGAGCTGGACTCGGGCATGATGCCACCACCGATAACCACACAGATACCGATGGGCCTGCGTCGCTCCTCGCTGTCCGGCAGCACGCCGATGATTACGGATCAGCAGCTGGTGATCGGAATGAGCGCGGAGGCGGCACTGAAGAGCGAACTGCTGGACGAGAATAGTTCGCACAGTCCGCACTCGCCGGAGGCGTATCACGGACACTACGGCCGGAAGGGCAGCCTGGATGCGATGATGTACGATCAGACGAACAGTCTGCCCGGTTTTCCGGTGCCAACGGCAGCCGTCGATCTGGATCCGGCCGCTGTGGCCGTTGCCGTCGAGTTGGCGGTCAAAAATGAGATTGTGAAGCATGTGgtgcaacagcatcagcagcatcatgCCTCGACGTCggcgccgcagcagcagcagcaacagaccgCCGCCAATGTGCACAAGTTCATCGATGATCTGACCAAATCCACCTCCGTGGTGAATAGCAATGGCACCAGCGAGCCGGCGCTGTTCACCAATGCCGCTGTCATCGACCATGCCCTCACCGATATACTGCAGCAAAAGGTGGGCGTGCTGACGCATCCGTCCGCCAAGCGCAGCCTCTCGCTGAGCAGCAGCTCCGCATCCAATAGCTCGCTATCCGGCAGCGACAGCTCTCCGAACAGTTCCCCCCTCACCCAGGACATCATACTCAACTCGGAGCCGGCAGCGGCATTGGGCGCCCTGGGCGGCGGGGGCGTGCCAACGGCGACGTCTGCCACGCTGTCCACGGACATCATCATGAATCCGGCGGTATCGCCGTCAACCATCTTGTGCTCCGCCAATGGCGCTGCCACGGCGGTGGTACCCAACATCCTGGCACCGCATCAGGTGACCATGGCCAATTCCATATTGAACGACATCGCCATGCAGGCGGAGCCCACGCAGCAGGACGCCGCCGTCGCTGCCTTGGCATTGAGCAACATCATGATGAGCCCACCGAATGCCAACGCTGGCGTTGTGGGCGTGGTGGGCGTCGATGTGGCCTTGCCACCCACGCCGGCCGTCATGCAGCCGGAGGTGGCAGCCACCGCCCAATCGACGGCGGTCAGCAACATGATCATCAAGGCGGCTGCCGATTTTATCACCAcacaggagcaggagcagcatcactatcatcaacagcagcagcatcagcagcagcagcaacaatcgcagcagcaacaggaacaggcaacagttgctgccaccgATCCGTTGGTCAATCTCCTGCTCAACCACTCGGCCAACCTGCCATCCGAAACTGTGGCCGTTGAggccgttgctgttgccggtTTTCCGTCGCTATCGGGCGTACATTTGCCGGTGGTGCCGCCAGCACCACAAGAATCTCTCATCGTCGCTCTGGCCACGGAGAATGCGCTGCAGAAGTCGGTGGCAACCGCAGCGGTTACCACAAATGGCGCCGTCATGACCCAACAGGCATCGGCGCCCAGCAGCGCCGGCAGCATCATACCCGCTGCGGTAGGGGCGGtggcagccgctgccgctgtggCGGTGCAACCACCGATACCACAGGAGCTGACCACCATGTCGGATCAGGACCTGATCAGCTACATCAATCCCAGCACCTTTGATCAGCGTAAGTTCACAACCGTTGCATAA
- Protein Sequence
- MRFTYNQYKYYESGYRIPSKMHNLTHHHGGNGNGSAGKMNTNNNNNNNNNNNNTNNNNNHHYNNSNNNNNSNSNTNAYQHHYKSNFGMRMTMSTNSTMSPRIHRKGFRIPSKRQPGKGLPGKLHTIVRTGPGKLVPGKRIPQRPHPPPCDNSNDSGLGFDQHTEVRNGTPATVADVRSSSSSNNSSNSSSSGSSNGLINSALTNTVAAAAAAAAAAVASNTLQQHQQQHQQQQHLIRAIPVSRSRHAKKMLNYLEELETDAGLLTPTSNDEDTGFGTDNSQTAAAATATAAAATTISSELFGGIFNTGGVACHKTNDVGSFMLQQQQQQLQQQQQQQQQQQLYQQQQLLQQQQFQYPGLQLQLQQESSSGSSSSSSSSNGVATRTGGKNLKRKKLECNQVELDNDDACSEDEFIRKIATAEQQQQQQQQQQQQQQQQQQQQQQQQQQVSSATCILSVETLPHRSSNRNNNESKFIGNRTVTRVANKRQPTTPLNSIASSNDGQVQLEIVSQPEQQHRARYQTEGSRGAVKDRSGNGFPIVRLNGFDKAAVLQVFIGTDIGRVAPHMFYQACKVAGKNSTQCNEKKVDGTMVIEIDFKPETEMTITCDCVGILKERNVDVEHRFPEHLAQKNKKKSTRCRMVFRTQLTRDDGSTETLQVCSNPIICTQPPGVPEICKKSLNSCPVDGGLELFIIGKNFLKDTHVVFQETYDSVNADDPANEIAVRQQLIGGTSALWEQSVLPDKEYLHQTHLICTVPPYLHQNLLKPVAVQVSIVSSGKKSEPHNFTYTPKGSYTTLAAASTLSSTMHSQDVGSFMDTSNGPSGNNPNWAAAAAAPVPLPLPLPLPLSVTVETKHELDSGMMPPPITTQIPMGLRRSSLSGSTPMITDQQLVIGMSAEAALKSELLDENSSHSPHSPEAYHGHYGRKGSLDAMMYDQTNSLPGFPVPTAAVDLDPAAVAVAVELAVKNEIVKHVVQQHQQHHASTSAPQQQQQQTAANVHKFIDDLTKSTSVVNSNGTSEPALFTNAAVIDHALTDILQQKVGVLTHPSAKRSLSLSSSSASNSSLSGSDSSPNSSPLTQDIILNSEPAAALGALGGGGVPTATSATLSTDIIMNPAVSPSTILCSANGAATAVVPNILAPHQVTMANSILNDIAMQAEPTQQDAAVAALALSNIMMSPPNANAGVVGVVGVDVALPPTPAVMQPEVAATAQSTAVSNMIIKAAADFITTQEQEQHHYHQQQQHQQQQQQSQQQQEQATVAATDPLVNLLLNHSANLPSETVAVEAVAVAGFPSLSGVHLPVVPPAPQESLIVALATENALQKSVATAAVTTNGAVMTQQASAPSSAGSIIPAAVGAVAAAAAVAVQPPIPQELTTMSDQDLISYINPSTFDQRKFTTVA
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -