Acep005721.1
Basic Information
- Insect
- Atta cephalotes
- Gene Symbol
- Rel
- Assembly
- GCA_000143395.2
- Location
- NW:1516073-1523567[-]
Transcription Factor Domain
- TF Family
- RHD
- Domain
- RHD domain
- PFAM
- PF00554
- TF Group
- Beta-Scaffold Factors
- Description
- Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal DNA-binding domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See PF16179) that functions as a dimerisation domain [1-2].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 1 2.8e-40 9.6e-37 125.7 0.4 1 168 131 321 131 322 0.91
Sequence Information
- Coding Sequence
- ATGCTGTTGACGCATTCAGGAATTAAAGGCAGTCGATTATGCGCGCCTGGTGAACCATCACGCAACTCAACGCATTCCTCGAGCCGCATCTTGTTGTACGAAGTGAAGAGAATCGCCGGAGAGATGCCGGACGCgtacgaaaatattaatatggcGATGCGCAGCATATTGACGACTGGTGTTATTGGAGATAATGAGGACCCGTACTTCCAGTTCAACGATTTAACATCAATGGGTAGTACAGATATTCTAAGTCCATTCAGTAGTTCTGGATCTCAAGGAAACATCAGCATGACATCGCCTATGTCGACGTCCTCGTCACCTATGCATATCACagatgattatattaatctaacaAATGCAGGAGTAATGATACTTGGTGAACCATATCTCAGTATCATGGTGCAGCCGATGGAGAAATTTAGATTTCGATACAAATCCGAAATGGTGGGAACCCATGGAAGCTTGCTCGGTGTATATACTGGAcgtaaacgaaataaaaataacgtacCTACAGTCAAGttacataattattcgGATAATGCGATCATACGTTGCACATTAGTTACAACTGACGAAGACCTGAGAATACCGCATGCTCATAGACTAGTGAGACGAGTAGATAGTGCGGATATAGGGGATCCACATTATATGGAAGTTTCGtctcaaaatgattttacagCCGAATTTGTTGGTATGGGCATAATACATACGGCGAGAAGACATGTTAAAGACGAAATAGTACGAAAATTGCGCGAAGAAGCTCTAGAGAAACTGAAACGTTCAAATATAAACGCTACTCTCAATCTTCGTGATGATGCGCAGATTAAATCTGATGCTGAGCAATatcaaaaaactataaatctaAATAGTGTTTGCCTTTGTTTTCAAGGATTTATCAAAGATCAACATAATATTATGAGACCGATTACAGCAGCTGTTTATAGCAATCCAGTCAATAATTTGaaaagtgCACTAACTGGAGAACTTAAGATATGCAGAATTGACAAATTCACTAGCAGCTGTGAAGGTGGCGAAGAGGTATTTATACTTGTggaaaaagtttcaaaaaaaaatataaagataaagttCTTTGAACTCGACGATGATGACACTGAAATTTGGATGGATTATGGTCGTTTTTCGGAATTAGATGTCCATCATCAATATGCCATGGTATTTCGAACGCCACCATACAGAGATCGAAATATCACTTCTCCAAAGGAAGTTTTCATACAACTAGAACGTCCTTCAGATTCATATTGTTCAGAACCAATCAAGTTTACATATAAACCAACTGAGCGTATGAtaggtAGGAAGAGAACGCGAGTCTCTCATTCAAATAGTGCGGAATTGACCCAagcattaacatttaataatgatatgcTTTTAACAAATACACCATTAAGTTCCACCACAAGTTCGCCTTCAAGCAACGACAGTGCCGAAATatctaaagaaataaaaaagatgttagACGATAGATGTTCTTCTAGTGAGTTTCGTGATTTTGTCGACCATATAAATTTAGACTTGTATGAGAAATTGTTAAACCAAGGCGGCGAAGATAAGTTAACTTATGATAGTGCGCCGAGTAAAAAGgACAAATTGTtcgcaaaaaatgttattattgatacgataaagaatatgaaaatgaaGCCCCATGaagtaaaagatattattaaaatatcattcaaaGACAGAACAACTTACGGAGATACGCCATTGCACTGTGCGCTCCGATATGGACAAAAGAACAATGTAAAACGTATCTTAATGCTTATGAGCACTCTAAATACAGATGCTGAAGAATTGGTGAACATTCGAAATAGTTCTGGAAAAACTCCGTTGCATTATGCTGCTTCACAAGATCAACCAGAAATTATACGAGCATTGCTCATGCTTGGAGCAGATCCTAATATAACAGATCATTATGGACAAATGCCATTACATAGAgctgtaaaatTTCCTGAAGCAGAAGGAAGCATTGATGTTTTATTAGCTGAAAAAGATGTCAACATCGAAGCAAATACAGACTGTGGATGGTCACCTTTGCAATTAGCTGCCCAAGCTGGCTCGTATCATGCAGTATGTTCTCTCATTAAAGCTGGCGCAGATGTAAATAATACTGACATGACCTATGGAAGAACTGCCTTACATATAGCAGTCGAAGGAGGTCACAAAGACATTGTTGcatttctgttaaaaaatACCAAAATAGATGTAAACAAGAAGAACTTTAGCGGAAATACAGCATTGCACACCGCAATAGTTACACCAGGGACAAAAGCCAAAGAGATATGTGCTCTTCTAGTGAAATACGGTGCTGATCCGCACATAAGGAATTATAATCGTGAATCAAATAATgTTGACGGAGAACAGatacaagatataaaaatcgaaGTGCATTCAGAAGATGAAAATATGGAAGAATGCAATGGGCAATCATCTTTTGACTTAGCATTAAATAAACCAGACatTTTACAACTCGTCTCCGGTCAAAatgatatgttaataaataacacgataaaagaagaaaatataaacgatACACAAAAAGTTTGGATGGATATAGATCAAGAAAAACATTTGGCAAGCATTTTAGACGAGACAAAAGGATGGAAAAAATTGGCGCATTATTTCGACTATAAGTACTTAATGAATATCTTTAAACAGACATCCAGTTCGTCATTACTCCTCTTAAATTATATCGCTatACAAACTAATACATCTCTCAGtgatttgagaaatatattgcaaaatgtaGGTGAAGAACAAGCTGCAACATACatgaatcaaattttatctatgAAACATGAAACGTGa
- Protein Sequence
- MLLTHSGIKGSRLCAPGEPSRNSTHSSSRILLYEVKRIAGEMPDAYENINMAMRSILTTGVIGDNEDPYFQFNDLTSMGSTDILSPFSSSGSQGNISMTSPMSTSSSPMHITDDYINLTNAGVMILGEPYLSIMVQPMEKFRFRYKSEMVGTHGSLLGVYTGRKRNKNNVPTVKLHNYSDNAIIRCTLVTTDEDLRIPHAHRLVRRVDSADIGDPHYMEVSSQNDFTAEFVGMGIIHTARRHVKDEIVRKLREEALEKLKRSNINATLNLRDDAQIKSDAEQYQKTINLNSVCLCFQGFIKDQHNIMRPITAAVYSNPVNNLKSALTGELKICRIDKFTSSCEGGEEVFILVEKVSKKNIKIKFFELDDDDTEIWMDYGRFSELDVHHQYAMVFRTPPYRDRNITSPKEVFIQLERPSDSYCSEPIKFTYKPTERMIGRKRTRVSHSNSAELTQALTFNNDMLLTNTPLSSTTSSPSSNDSAEISKEIKKMLDDRCSSSEFRDFVDHINLDLYEKLLNQGGEDKLTYDSAPSKKDKLFAKNVIIDTIKNMKMKPHEVKDIIKISFKDRTTYGDTPLHCALRYGQKNNVKRILMLMSTLNTDAEELVNIRNSSGKTPLHYAASQDQPEIIRALLMLGADPNITDHYGQMPLHRAVKFPEAEGSIDVLLAEKDVNIEANTDCGWSPLQLAAQAGSYHAVCSLIKAGADVNNTDMTYGRTALHIAVEGGHKDIVAFLLKNTKIDVNKKNFSGNTALHTAIVTPGTKAKEICALLVKYGADPHIRNYNRESNNVDGEQIQDIKIEVHSEDENMEECNGQSSFDLALNKPDILQLVSGQNDMLINNTIKEENINDTQKVWMDIDQEKHLASILDETKGWKKLAHYFDYKYLMNIFKQTSSSSLLLLNYIAIQTNTSLSDLRNILQNVGEEQAATYMNQILSMKHET
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00015169; iTF_00016473; iTF_00128737; iTF_00126474; iTF_00127220; iTF_00127957; iTF_00129500; iTF_01015585; iTF_01476708; iTF_01477352; iTF_01355087; iTF_00015828; iTF_00014505; iTF_01261647; iTF_00181752; iTF_00417338; iTF_00385051; iTF_01475963;
- 90% Identity
- iTF_01476708;
- 80% Identity
- -