Basic Information

Gene Symbol
-
Assembly
GCA_963675035.1
Location
OY776111.1:108716150-108718369[-]

Transcription Factor Domain

TF Family
RFX
Domain
RFX domain
PFAM
PF02257
TF Group
Basic Domians group
Description
RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer [1]. It recognize X-boxes (DNA of the sequence 5'-GTNRCC(0-3N)RGYAAC-3', where N is any nucleotide, R is a purine and Y is a pyrimidine) using a highly conserved 76-residue DNA-binding domain (DBD) [2].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 17 0.031 9.4e+02 3.2 0.0 19 34 29 44 20 57 0.84
2 17 0.03 8.9e+02 3.2 0.0 18 34 71 87 62 100 0.84
3 17 0.035 1.1e+03 3.0 0.0 19 34 115 130 106 142 0.84
4 17 0.034 1e+03 3.0 0.0 19 34 158 173 149 186 0.84
5 17 0.034 1e+03 3.0 0.0 19 34 201 216 192 229 0.84
6 17 0.013 3.9e+02 4.4 0.0 18 34 243 259 234 272 0.82
7 17 0.013 3.9e+02 4.4 0.0 18 34 286 302 277 315 0.82
8 17 0.03 8.9e+02 3.2 0.0 18 34 329 345 320 358 0.84
9 17 0.031 9.4e+02 3.2 0.0 19 34 373 388 364 401 0.84
10 17 0.03 8.8e+02 3.2 0.0 19 35 416 432 407 445 0.83
11 17 0.034 1e+03 3.1 0.0 19 34 459 474 450 486 0.85
12 17 0.03 9e+02 3.2 0.0 19 35 502 518 493 533 0.82
13 17 0.031 9.4e+02 3.2 0.0 19 34 545 560 536 573 0.84
14 17 0.03 8.9e+02 3.2 0.0 18 34 587 603 578 616 0.84
15 17 0.035 1.1e+03 3.0 0.0 19 34 631 646 622 658 0.84
16 17 0.034 1e+03 3.0 0.0 19 34 674 689 665 702 0.84
17 17 0.046 1.4e+03 2.6 0.0 19 34 717 732 708 736 0.86

Sequence Information

Coding Sequence
ATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGATGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACGTCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGGCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGAGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACGTATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGAGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACGTATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGATGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACGTCCAGAAGTGAAATACATTCCAATTATATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCTGCAGCCACAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAAGATGATCTGCCGGGGTCCGCAGCTGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGGTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGATGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTACGTCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGGCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGGAGTCCGCAGCCGCAGTTGCTGCCGTTCCGCTGAGCCAGCTCCCAGAAGTGTTAGAATTGTTTGCATCCAGAAGTGAAATACATTCCAATTACATATCGACGTGTATGTTGTTGCAGGATGATCTGCCGTAG
Protein Sequence
MLLQDDLPESAAAVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPMSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPEAAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEELELFASRSEIHSNYVSTCMLLQDDLPESAAAVAAVPLSQLPEELELFASRSEIHSNYVSTCMLLQDDLPESAAAVAAVPMSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAATVAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPGSAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAGAAVPLSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPMSQLPEVLELFTSRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPEAAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLPESAAAVAAVPLSQLPEVLELFASRSEIHSNYISTCMLLQDDLP

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-