Pcor006232.2
Basic Information
- Insect
- Pandemis corylana
- Gene Symbol
- rfx3
- Assembly
- GCA_949127965.1
- Location
- OX421854.1:14578030-14602067[+]
Transcription Factor Domain
- TF Family
- RFX
- Domain
- RFX domain
- PFAM
- PF02257
- TF Group
- Basic Domains group
- Description
- RFX is a regulatory factor that binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer [1]. It recognizes X-boxes (DNA of the sequence 5'-GTNRCC(0-3N)RGYAAC-3', where N is any nucleotide, R is a purine and Y is a pyrimidine) using a highly conserved 76-residue DNA-binding domain (DBD) [2].
- Hmmscan Out
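| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 16 | 0.46 | 4.4e+03 | -1.0 | 0.0 | 3 | 16 | 127 | 140 | 126 | 143 | 0.86 |
| 2 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 146 | 166 | 144 | 168 | 0.90 |
| 3 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 169 | 189 | 167 | 191 | 0.90 |
| 4 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 192 | 212 | 190 | 214 | 0.90 |
| 5 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 215 | 235 | 213 | 237 | 0.90 |
| 6 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 238 | 258 | 236 | 260 | 0.90 |
| 7 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 261 | 281 | 259 | 283 | 0.90 |
| 8 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 284 | 304 | 282 | 306 | 0.90 |
| 9 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 307 | 327 | 305 | 329 | 0.90 |
| 10 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 330 | 350 | 328 | 352 | 0.90 |
| 11 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 353 | 373 | 351 | 375 | 0.90 |
| 12 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 376 | 396 | 374 | 398 | 0.90 |
| 13 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 399 | 419 | 397 | 421 | 0.90 |
| 14 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 422 | 442 | 420 | 444 | 0.90 |
| 15 | 16 | 0.017 | 1.6e+02 | 3.7 | 0.0 | 3 | 23 | 445 | 465 | 443 | 467 | 0.90 |
| 16 | 16 | 1.6e-35 | 1.6e-31 | 109.4 | 0.1 | 5 | 78 | 470 | 543 | 466 | 544 | 0.97 |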
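The X-box consensus quoted in the Description, 5'-GTNRCC(0-3N)RGYAAC-3', can be expressed as a regular expression by expanding the IUPAC codes (N = any base, R = A or G, Y = C or T). The following is a minimal sketch, not part of this record: the names `consensus_to_regex`, `XBOX`, and `find_xboxes` are hypothetical, and the scan covers only the forward strand of the sequence it is given. A match indicates agreement with the consensus only, not evidence of RFX binding.

```python
import re

# Subset of IUPAC nucleotide codes needed for the X-box consensus.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}


def consensus_to_regex(half1, spacer_min, spacer_max, half2):
    """Build a regex for two IUPAC half-sites separated by a variable spacer."""
    left = "".join(IUPAC[base] for base in half1)
    right = "".join(IUPAC[base] for base in half2)
    pattern = f"{left}[ACGT]{{{spacer_min},{spacer_max}}}{right}"
    # IGNORECASE so that soft-masked (lowercase) sequence is also scanned.
    return re.compile(pattern, re.IGNORECASE)


# 5'-GTNRCC(0-3N)RGYAAC-3' as given in the Description.
XBOX = consensus_to_regex("GTNRCC", 0, 3, "RGYAAC")


def find_xboxes(seq):
    """Return (0-based start, matched text) for non-overlapping forward-strand matches."""
    return [(m.start(), m.group()) for m in XBOX.finditer(seq)]
```

To cover the reverse strand as well, the same function could be applied to the reverse complement of the input sequence.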
Sequence Information
- Coding Sequence
- ATGGTAAGCCCTAGAGAGAGAAGCGAGGTCCGATATTGTTCTGGGAAGTCGGATTCGGGCCGAGTTGCCCTGGAGACCTCTCGCGGTATGGAGGTGCCTCGATACCGGTCCTACTACCCTTgcgAGGGCAGCAATGGCGTCAACTATCACGTGCAGTATGTGGAGCCCCAAGAAATCTATACCCAAGGCCATGAGACTCACATGGAGACTCTCCGCTCGTACCCGGTGTACGGCGTGGCGACGGTCAACGCGAGCGAGCCGCCGGCCTCGTCGTGGAGCGCGGACGAGTTCTCGTACAACGTGGTGGTCGGCAACGACGACGCGCCCGCctcccccgcgcccgcccccgccacgccgccgcagccgcgcaTGCCGcccgccaccgtgcagtggctgctcgacaactacgagaccgccgacgcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgccccgccaccgtgcagtggctgctcgacaactacgagaccgccgacggtgagtcgctagcccgcgcccgcccccgccaccgtgcatggctgctcgacaactacgagaccgccgacgGCGTGTCGCTGCCTCGCTCCACGCTCTACGCGCACTACTTGCGGCACTGCGCCGCGCACCGGCTCGACCCGGTCAACGCGGCGTCCTTCGGCAAGCTCATCCGTTCGGTGTTCGTCGgcctgcgcacgcgccgcctcGGCACCCGCGGCAACTCCAAGTATCACTACTATGGGATACGCGCCAAGGCTGGCCAGGGTCAGCAAGAGACTGAAAACATGGACGAGAAGAGTGATGTGCAGCATGATGATATGCAGGAACCGCGAGACCGCGAGTCGGAACCGACGACGCCGACGGGCGGCGTGCCCGGCCTCGCTCACAGGCAATATCTAGGCACAGggtctgcgccggcgccgcccgcgctgccgccggccgccgcgcacgACCTGCCGCCCGCCGCGCTCACCGCCTTGCAGGATCATCACAGGTCCCACGGCGTGGAGTTCCTCGAAGCGGTGGCAGCCCTAGACATCGGCGCCGTGGAGCGCGCGCGGCGCAACTTctggcgccggccgccgcccgcgctgtgCCGCCGCGTGCTGGCCCGCCTGGCCGCC
AGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGTACTGCTGCAGCCGCATGCTGCCCGCCTGGCCGCCAGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGTACTGCTGCAGCCGCATGCTGCCCGCCTGGCCGCCAGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGTACTGCTGCAGCCGCATGCTGCCCGCCTGGCCGCCAGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGTACTGCTGCAGCCGCATGCTGCCCGCCTGGCCGCCAGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGTACTGCTGCAGCCGCATGCTGCCCGCCTGGCCGCCAGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGTACTGCTGCAGCCGCATGCTGCCCGCCTGGCCGCCAGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGTACTGCTGCAGCCGCATGCTGCCCGCCTGGCCGCCAGGAGGGACGTGGCCGCCTGGCTGCGCCGCGCAGACCTGGAGCTGTACCAGCGCGCCGTGGAGCTGCTGCTGCCCGACGTGCTGAGGCCCATACCGCCGCAGCTCACGCAGGCGATCCGCAACTTCGCGAAGAGCTTGGAGTCGGCGCTGGCGTCAGGGTCGGGctgcgcgccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGG
CCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGGTGGGACACATGCAATAGCcgagccggcggcggcggcgcgggcgcaggcgtcggccgccgccgcgctcgccgccgcgctgcgccgctaCACCAGCCTCAACCACCTCGCGCAGGCCGCGCGCGCCGTGCTCGCCAACCAGCACCAGATACAGCAGATGCTGTCGGACCTGAACCGCGTAGACTTTAGGGTGGTGCGCGAGCAGGCCGCCTGGGCTTGCTCCTGCGGTAGTGCCGCCACCGCGCACCGCCTCGAGGCTGACTTCAAGGCGACACTGGGGCGCGGCGCGTCGCTGGAGCAGTGGGCGGCGTGGCTGGAGGGCTGCGtgcgcggcgcgctggcggcgcacGAGCGCCGGCCCGACTacacggcgcgcgcgcgccgcctgctgCTCGACTGGTCCTTCTACTCGTCGCTCGTCATACGCGAGCTGACGCTGCGGTCGGCGGCGTCGTTCGGCTCGTTCCACCTGATCCGGCTGCTGTACGACGAGTACGTGTCGTACCTGATCGAGCGCCGCGTGGCCGCGCACCACCGCGCGCCTCCCATCGCTGTCATGCAGCGGCCGCCGGAGGAGGAGGACGACGTGCCGGAGGAGCCGCCGCGCGACGACGACGACGAGCCCGACTGGGACTGGGGCGACGACGACGACATCGACGACGAGCCCGCGCCCTGCAAGAAGCCCAAGGGCGAGTAG
- Protein Sequence
- MVSPRERSEVRYCSGKSDSGRVALETSRGMEVPRYRSYYPCEGSNGVNYHVQYVEPQEIYTQGHETHMETLRSYPVYGVATVNASEPPASSWSADEFSYNVVVGNDDAPASPAPAPATPPQPRMPPATVQWLLDNYETADARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPATVQWLLDNYETADGESLARARPRHRAWLLDNYETADGVSLPRSTLYAHYLRHCAAHRLDPVNAASFGKLIRSVFVGLRTRRLGTRGNSKYHYYGIRAKAGQGQQETENMDEKSDVQHDDMQEPRDRESEPTTPTGGVPGLAHRQYLGTGSAPAPPALPPAAAHDLPPAALTALQDHHRSHGVEFLEAVAALDIGAVERARRNFWRRPPPALCRRVLARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQVLLQPHAARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQVLLQPHAARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQVLLQPHAARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQVLLQPHAARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQVLLQPHAARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQVLLQPHAARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQVLLQPHAARLAARRDVAAWLRRADLELYQRAVELLLPDVLRPIPPQLTQAIRNFAKSLESALASGSGCAPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQPQPPRAGRARRARQPAPDTAGGTHAIAEPAAAARAQASAAAALAAALRRYTSLNHLAQAARAVLANQHQIQQMLSDLNRVDFRVVREQAAWACSCGSAATAHRLEADFKATLGRGASLEQWAAWLEGCVRGALAAHERRPDYTARARRLLLDWSFYSSLVIRELTLRSAASFGSFHLIRLLYDEYVSYLIERRVAAHHRAPPIAVMQRPPEEEDDVPEEPPRDDDDEPDWDWGDDDDIDDEPAPCKKPKGE
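One way to relate the Coding Sequence to the Protein Sequence above is to translate it with the standard genetic code. The sketch below is an illustration under stated assumptions, not the method used to build this record: it assumes the standard codon table (NCBI translation table 1), reads in frame from the first base, ignores whitespace and letter case (the coding sequence contains lowercase stretches), and stops at the first stop codon. The name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces, and normalise soft-masked lowercase bases.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing non-ACGT characters are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For example, the first six codons of the coding sequence, `ATGGTAAGCCCTAGAGAG`, translate to `MVSPRE`, which matches the start of the protein sequence. Because the function stops at the first in-frame stop codon, a translation shorter than the listed protein would point to a frame or sequence discrepancy worth checking by hand.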
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -