Amod010821.1
Basic Information
- Insect
- Allygus modestus
- Gene Symbol
- -
- Assembly
- GCA_963675035.1
- Location
- OY776111.1:108716150-108718369[-]
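The location string above packs the sequence accession, start, end and strand into one field. A minimal sketch of how it could be split apart follows; the `Location` type and `parse_location` function are hypothetical names introduced here, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for one parsed location field."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings of the form "<seqid>:<start>-<end>[<strand>]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location field into accession, coordinates and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        seqid=match.group("seqid"),
        start=int(match.group("start")),
        end=int(match.group("end")),
        strand=match.group("strand"),
    )


loc = parse_location("OY776111.1:108716150-108718369[-]")
print(loc.seqid, loc.strand, loc.span)
```

Under the inclusive-coordinate assumption, the span of this entry is 2220 bp on the minus strand.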
Transcription Factor Domain
- TF Family
- RFX
- Domain
- RFX domain
- PFAM
- PF02257
- TF Group
- Basic Domains group
- Description
- RFX is a regulatory factor that binds to the X-box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or a homodimer [1]. RFX recognizes X-boxes (DNA of the sequence 5'-GTNRCC(0-3N)RGYAAC-3', where N is any nucleotide, R is a purine and Y is a pyrimidine) using a highly conserved 76-residue DNA-binding domain (DBD) [2].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  17     0.031   9.4e+02    3.2   0.0        19      34        29      44        20      57  0.84
 2  17      0.03   8.9e+02    3.2   0.0        18      34        71      87        62     100  0.84
 3  17     0.035   1.1e+03    3.0   0.0        19      34       115     130       106     142  0.84
 4  17     0.034     1e+03    3.0   0.0        19      34       158     173       149     186  0.84
 5  17     0.034     1e+03    3.0   0.0        19      34       201     216       192     229  0.84
 6  17     0.013   3.9e+02    4.4   0.0        18      34       243     259       234     272  0.82
 7  17     0.013   3.9e+02    4.4   0.0        18      34       286     302       277     315  0.82
 8  17      0.03   8.9e+02    3.2   0.0        18      34       329     345       320     358  0.84
 9  17     0.031   9.4e+02    3.2   0.0        19      34       373     388       364     401  0.84
10  17      0.03   8.8e+02    3.2   0.0        19      35       416     432       407     445  0.83
11  17     0.034     1e+03    3.1   0.0        19      34       459     474       450     486  0.85
12  17      0.03     9e+02    3.2   0.0        19      35       502     518       493     533  0.82
13  17     0.031   9.4e+02    3.2   0.0        19      34       545     560       536     573  0.84
14  17      0.03   8.9e+02    3.2   0.0        18      34       587     603       578     616  0.84
15  17     0.035   1.1e+03    3.0   0.0        19      34       631     646       622     658  0.84
16  17     0.034     1e+03    3.0   0.0        19      34       674     689       665     702  0.84
17  17     0.046   1.4e+03    2.6   0.0        19      34       717     732       708     736  0.86
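The X-box consensus quoted in the description, 5'-GTNRCC(0-3N)RGYAAC-3', can be turned into a regular expression by expanding each ambiguity code. The sketch below is only an illustration of that expansion: `iupac_to_regex` and `find_xboxes` are hypothetical helper names, only the three ambiguity codes defined in the description (N, R, Y) are handled, and only the forward strand is scanned.

```python
import re

# Ambiguity codes as defined in the description:
# N = any nucleotide, R = purine (A/G), Y = pyrimidine (C/T).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "[ACGT]", "R": "[AG]", "Y": "[CT]",
}


def iupac_to_regex(motif: str) -> str:
    """Expand a motif written with A/C/G/T/N/R/Y into a regex fragment."""
    return "".join(IUPAC[base] for base in motif.upper())


# Two half-sites separated by a spacer of 0 to 3 arbitrary nucleotides.
XBOX_RE = re.compile(
    iupac_to_regex("GTNRCC") + "[ACGT]{0,3}" + iupac_to_regex("RGYAAC")
)


def find_xboxes(sequence: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for forward-strand matches."""
    return [(m.start(), m.group()) for m in XBOX_RE.finditer(sequence.upper())]


# Made-up test sequence: GTTGCC + 2-nt spacer (AT) + GGCAAC, with flanks.
print(find_xboxes("AAGTTGCCATGGCAACTT"))
```

Scanning the reverse complement as well would be needed for a strand-independent search; that step is left out to keep the sketch short.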
Sequence Information
- Coding Sequence
- ATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGATGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACGTCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGGCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGAGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACGTATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGAGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACGTATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGATGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACGTCCAGAAGTGAAATACATTCCAATTATATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCTGCAGCCACAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAAGATGATCTGCCGGGGTCCGCAGCTGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGGTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGATGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACGTCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGGCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAG
CTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGTAG
- Protein Sequence
- MLLQDDLPESAAAVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPMSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPEAAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEELELFASRSEIHSNYVSTCMLLQDDLPESAAAVAAVPLSQLPEELELFASRSEIHSNYVSTCMLLQDDLPESAAAVAAVPMSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAATVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPGSAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAGAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPMSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPEAAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLP
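The coding and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and `translate` is a hypothetical helper, not part of any pipeline described here. Whitespace is stripped first, so a sequence wrapped over several lines is handled.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with unexpected characters are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the coding sequence in this record.
print(translate("ATGTTGTTGCAGGATGATCTGCCG"))
```

The first eight codons translate to MLLQDDLP, which agrees with the start of the listed protein sequence; the final three codons of the record (CTG CCG TAG) give LP followed by a stop.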
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -