Basic Information

Gene Symbol
-
Assembly
GCA_963576445.1
Location
OY754895.1:4905117-4912496[-]
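
The Location field packs the sequence accession, start, end, and strand into one string. The sketch below shows one way to split it apart; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Pattern for 'seqid:start-end[strand]', inferred from this record's Location field.
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)

def parse_location(text):
    """Split a location string into accession, coordinates, and strand."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }

loc = parse_location("OY754895.1:4905117-4912496[-]")
print(loc)
```

Under that assumption the gene spans 7380 bases on the minus strand of OY754895.1.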

Transcription Factor Domain

TF Family
IRF
Domain
IRF domain
PFAM
PF00605
TF Group
Helix-turn-helix
Description
This family of transcription factors is important in regulating interferons in response to viral infection and in regulating interferon-inducible genes. Three of the five conserved tryptophan residues bind to DNA.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 18 0.05 1.3e+03 0.2 0.0 52 88 234 270 209 278 0.79
2 18 0.028 7.2e+02 1.0 0.1 60 102 269 313 253 316 0.78
3 18 0.21 5.3e+03 -1.7 0.1 55 97 350 392 348 397 0.76
4 18 0.051 1.3e+03 0.2 0.1 60 88 458 486 449 498 0.72
5 18 0.00017 4.5 8.1 0.4 58 100 572 614 540 616 0.78
6 18 0.00079 20 6.0 0.1 55 100 627 672 622 677 0.90
7 18 0.00034 8.6 7.2 0.1 55 100 685 730 679 731 0.90
8 18 0.00085 22 5.9 0.2 56 100 715 759 711 762 0.85
9 18 0.028 7.3e+02 1.0 0.1 55 98 772 815 767 818 0.79
10 18 0.00035 8.8 7.2 0.3 54 100 858 904 854 932 0.91
11 18 0.045 1.1e+03 0.4 0.1 55 100 946 991 941 993 0.87
12 18 0.0036 91 3.9 0.1 55 100 1004 1049 999 1053 0.82
13 18 0.00039 10 7.0 0.2 55 100 1091 1136 1062 1138 0.80
14 18 0.0025 63 4.4 0.2 56 100 1150 1194 1145 1196 0.83
15 18 0.39 1e+04 -2.6 0.0 56 100 1208 1252 1205 1257 0.79
16 18 0.0023 59 4.5 0.1 55 100 1265 1310 1260 1312 0.83
17 18 0.017 4.4e+02 1.7 0.1 55 100 1352 1397 1323 1402 0.64
18 18 0.04 1e+03 0.6 0.0 56 87 1382 1413 1378 1424 0.76
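
Each row of the domain table above holds thirteen whitespace-separated values. The sketch below reads a few rows copied from the table into named fields and filters them by independent E-value. The `DomainHit` type, the `parse_row` helper, and the cutoff of 10 are illustrative assumptions, not part of this record or of HMMER itself.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    index: int       # domain number ("#")
    total: int       # number of domains reported ("of")
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # mean posterior accuracy of the alignment

def parse_row(line):
    """Convert one whitespace-separated table row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )

# A subset of rows copied verbatim from the table above.
ROWS = """\
5 18 0.00017 4.5 8.1 0.4 58 100 572 614 540 616 0.78
6 18 0.00079 20 6.0 0.1 55 100 627 672 622 677 0.90
7 18 0.00034 8.6 7.2 0.1 55 100 685 730 679 731 0.90
10 18 0.00035 8.8 7.2 0.3 54 100 858 904 854 932 0.91
15 18 0.39 1e+04 -2.6 0.0 56 100 1208 1252 1205 1257 0.79
"""

hits = [parse_row(line) for line in ROWS.splitlines()]

I_EVALUE_CUTOFF = 10.0  # illustrative threshold only
kept = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]
best = max(hits, key=lambda h: h.score)

for h in kept:
    print(h.index, h.i_evalue, h.score, f"{h.ali_from}-{h.ali_to}")
print("highest score:", best.index, best.score)
```

Filtering on i-Evalue rather than c-Evalue is a choice made for this example; a different column or cutoff may suit other analyses better.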

Sequence Information

Coding Sequence
ATGTCACAAGAGTTACGCCGCTGCGGCATCATCACCGGCACGCTGAACTGCTACTGTCTGCCGTGCGACAGGTTCCTTGAGAACAGGCGAGAGGCATGCAGCCACGTCGGGACTTCTTCGCATTGGGAAAAACTGAGCGAGGTTCCTTATTTACCCAGGTTTCAGGGTCATTATATTAGAAAGtttgaaaCTGGCTATTTCTGCGAACTTTGCAACCGATTGATTCCAACTGCTGTACGAGTTGATCTACACACCTCCAAAGACGAACACATACAAAATAGAGGCAGATATTTACTAAAATCATTACGTTATGAAGTTACGGCATTTGATAACATTCCCATTGACGACCGCGCGTGGCATGGCCTGTTGGACAACTACTGTTGTGTGTGTGACATTGAGATCGAAGATGAAGTTagccataaaaaaaataattctcatTTGTTGAATTTAACACAGAAACCTATCAAATTTGGAATAAATGATGACATTTACCGacaaattaataatttattcatacagtGTCTCACGTGTAACAAATTACTTCCTGTAAATGAGATTGACGAACATTTTGATAATGCTGAGCACGTTGCTGGTAACTCAAAAAGCAAAATAGAAGCAAATCACCAACAACAAAACACTCCTGAGAAGAATACCAGTATTAACGATAATAACAAAACCGATACAACAGCACTCAATTATTCCGATAACAAGGGTGAACATGGTAACTCGTCTAATATTGAAGAactttatacatataaaaatatgtttcatacTGCTCTCGATAATTCCGAAGGCATGGTTAAACATAACTCATCTCATATTGTAAGACCTGAGATGTCTAAATCTATGCTTCATGCAGCATTCAACGATTCTGATAGCAAGAATGAACATGATAACGCATCTGATATTGAAGAAGTTGAGccatttaaaactatttattcaGTACTTAATGAGTCCGATAGCAAGTATGAATATGACGACTCATCTTATATTAAACTTGAGCCATTTCATTCTATGCTTTATTCAGAGCTCGATGAATTCGGGAGTAAGgataaatatgacaatttatctAATATTGAAGAACTTGAGTCATTTAAATCTATGCTTCATTCAGTATTCAATGAGTCTgatagcaaaaataaatatgacatcTTATCTAAAATTGAAGAACATGAGAAGTTTAATATTATGCTTCATTCACCACACAATGAGTTCGATTGCAATGATGAACTTGACACCTCATCTAGTATTGAAGCACTTGAGCCATTTAAAACTTTACTTGAGTCAGCACTTAATGAGTCAGGTAGCAAGGATGTATATGATAACTCATCCAACAAACTCGATGAGTCCGATAGCAAGAATGAACGTGACAACTCATCTAAAATTGAAGAGCTTGAGCCATTTAAAACTACGCTTCAGTCAGCACTCAATGAGTCCAATAGCAAGGCTGAACATGAAATCacatataatattaaagaaCTTGAGCCATTTGAAACAATACTTCATTCAGAACTCAATAAGCCCGATAGCAAGAATAAACATGTCAACTCATCTAAAATTGAAAAGTTTGAGCCATTTAAATCTATACTTCATTCAGTAGTCAATGAGTCTAGTAGCAACGATGAAAATAACAAGTCTTGTAATACTGAAGAACATGAGCCATTTGAAACTATACTTCATTCAGAACTCAATGAGTCCGATAGCAAGTATGAACATGATAACTCATCTAAAATTGAAGACCTTGAGCCATTTAAAACTAAGTTTCATTCAGCACTCAATGAGTCCAATAGCAAGGATGAACATGACAACTCATCTAACATTGAGGAGCTTGAGCCAtttaaaactacgcttcattcaGAAGTCAATCAGTCCAGTAGCAAGGATGAATATGACAAGTCATCCAATATTGAAGAGCTTGAGCCAtttaaaactacgcttcattcaGCACTCAATGAGTCCAGTGGCAAGAATGAACATGACAACTCATCTAATATTGAGGA
GCTTGAGCCAtttaaaactacgcttcattcaACACTCAATCAGTCCAGTAGCAAGGATGAATATGACAACTCATCTAATATTGAGGAGCTTGAGCCAtttaaaactacgcttcattcaGCACTCAATGAGTCCAGTAGCAAGAATGAACATGAAAAATCTTCTAATATTGAGGAGCTTGAGCCAtttaaaactacgcttcattcaGCACTCAATGAGTCCAGTAGCAAGAATGAACATGACAACTCATCTAATATTCAGGAGCTTGAGCCATTTAAAACTGCGCTTCATTCAGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCAtttaaaactacgcttcattcaACACTCAATCAGTCCAGTAGCAAGGATGAATATGACAACTCATCTAATATTGAGGAGCTTGAGCCATTTGAAACTACGCTTCATTCAGCACCCAATGAGTCCAGTAGCAAGAATGAACATGAAAAATCTTCTAATATTGAGAAGCTTGAGCCATTTATAACTACGCTTCATTCAGCACCCAATCAGTCCAGTAGCAAGGGTAAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCAtttaaaactacgcttcattcaGCACTCAATGAGTCCAGTAGCAAGAATGAACATGACAACTCATCTAATATTAAGGAGCTTAAGCCAtttaaaactacgcttcattcaACACTCAATCAGTCCAGTAGCAAGGATGAATATGACAACTCATCTAATATTGAGGAGCTTGAGCCATTTGAAACTACGCTTCATTCAGCACCCAATGAGTCCAGTAGCAAGGGTGAACATGACAACTCATCTAATATTGAGGAGCTCGAGCCAtttaaaactacgcttcattcaGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCATTTAAAACTGCGCTTCATTCAGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGACCCAtttaaaactacgcttcattcaGCACTCAATGAGTCCAGTAGCAAGAATGAACATGAAAAATCATGTAATATTGAGGAGCTTGAGCCATTTAAAACTGCGCTTTATTCAGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCTAtttaaaactacgcttcattcaGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCATTTAAAACTACGTTTCATTCAGCACTCAATGAGTCCAGTAGCAAGAATGAACATGACAACTCATCTAATATTGAGGAGCTCGAGCCAtttaaaactacgcttcattcaGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCAtttaaaactacgcttcattcgGCACTCAATGAGTCCAGTAGCAAGAATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCACttaaaactacgcttcattcaGCACCCAATGAGTTCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGACCTTGAGCTATTTAAAATTACGCTTCATTCAGCACCCAATGAGTCCAGTAGCAAGGGTGAACATGACAACTCATCTAATTTTGAGGAGCTCGAGCCAtttaaaactacgcttcattcaGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCAtttaaaactacgcttcattcaGCACTCAATGAGTCCAGTAGCAAGAATGAACATGACAACTCATCTAATATTGAGGAGCTTGAGCCACttaaaactacgcttcattcaGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGG
ACCTTGAGCTAtttaaaactacgcttcattcaGCACCCAATGAGTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGGAGCTTGAACCATTTGAAACTACGCTTCATTCAGCACTAAATGACTCCAGTAGCAAGGATGAACATGACAACTCATCTAATATTGAGAAGCTTGAGCCAtttaaaactacgcttcattcaGCACTAAATGAGTCCAGTAGCAAAGATGAATATAACAAGTTGTGTAATTTTAAAGAACTTGAGCCATCTGAAACTATACTTCACTCGGAAATCAACAAGTCCGATAGCAAGAATGAACATGACAACTTATCTAAACTTGAAGAATTTGGGTTATCTAAAACTATGCTTTATGCAGGGCTTTTTGATTCAGATAGCCAGGATAAACATGAAAACTCATCTAATATTAGAAAACTTGAGTTACCAGACCCTTCGCTTTATGCAGCAATGTACCAAAGCATTGATATTAATTGTGAGACAGAAAGTATGTTTTGTAGCGATTGTAGGCAAGACGTTATGCTTTACCATGATGATATTAAGAAGCATATTGAGAAACATGTAAATATTGGATCATCGACTTCGACAGTAGAATTTAACTTTGACGTAAATGAAATGCTGAATTTTCATTTTACTAAAAATGACAACACAGATGATAATGTACCAAATGACGTGACAGAAAACTCTATTACGGAGGACCAGCCTTGCTACCAAAGTAATAGTAGCTTTGAAAGCGCTTTGGTCGAAACCAATAATGAAGGACTTAGCTCTAGTGATCCTCATCAATTCGcaaaagataataatttgaCTTATAACACACATACAGGTAACGCATTTTGTCGCATCTGTCAAGCAAGACTGCCATCTTCTTTAAAATCTATGAAAGAACATGTACAGGGTGCGAACCATAAAAGAAGACTTTCAGCTGCAGGCGCTGCagcagataaaataaaaaacaagaactATCAAACTCCGCAAACTCTGCTATCAGTTATACGTAGTTTGAAGGTTGTAAATTCAGACTTTGGTACtggttatattattaataacattttttatataactacgGCAAGCTTCTTTTTGCTCGTAGATATGGGTAAATGTGGACTTAAATGTTTGTTTTGCGGAtcaattttaatcaaaactGGAAAACCTCTCGAAAAACACGTTGAAACTTGTTCTCTTCCATTGGGATCAAAACTGCGAATTGTTACATCTAAGAAAGATGAATTTATTAGAGAGatTGCACCGAAGAAATTCCATTGCGGATATTGTCACGTGGTCGTGAACAGTTGGAGCAAGATGGAAATTCACTTGCAGACCGCCGAGCATGACTATGCAAAGACTCTAGCAAATGTTCGTCTTGGTGTATTATCTCGTAAAGAACCACTATACACTTTAACCTTTAACGATGACTAA
Protein Sequence
MSQELRRCGIITGTLNCYCLPCDRFLENRREACSHVGTSSHWEKLSEVPYLPRFQGHYIRKFETGYFCELCNRLIPTAVRVDLHTSKDEHIQNRGRYLLKSLRYEVTAFDNIPIDDRAWHGLLDNYCCVCDIEIEDEVSHKKNNSHLLNLTQKPIKFGINDDIYRQINNLFIQCLTCNKLLPVNEIDEHFDNAEHVAGNSKSKIEANHQQQNTPEKNTSINDNNKTDTTALNYSDNKGEHGNSSNIEELYTYKNMFHTALDNSEGMVKHNSSHIVRPEMSKSMLHAAFNDSDSKNEHDNASDIEEVEPFKTIYSVLNESDSKYEYDDSSYIKLEPFHSMLYSELDEFGSKDKYDNLSNIEELESFKSMLHSVFNESDSKNKYDILSKIEEHEKFNIMLHSPHNEFDCNDELDTSSSIEALEPFKTLLESALNESGSKDVYDNSSNKLDESDSKNERDNSSKIEELEPFKTTLQSALNESNSKAEHEITYNIKELEPFETILHSELNKPDSKNKHVNSSKIEKFEPFKSILHSVVNESSSNDENNKSCNTEEHEPFETILHSELNESDSKYEHDNSSKIEDLEPFKTKFHSALNESNSKDEHDNSSNIEELEPFKTTLHSEVNQSSSKDEYDKSSNIEELEPFKTTLHSALNESSGKNEHDNSSNIEELEPFKTTLHSTLNQSSSKDEYDNSSNIEELEPFKTTLHSALNESSSKNEHEKSSNIEELEPFKTTLHSALNESSSKNEHDNSSNIQELEPFKTALHSAPNESSSKDEHDNSSNIEELEPFKTTLHSTLNQSSSKDEYDNSSNIEELEPFETTLHSAPNESSSKNEHEKSSNIEKLEPFITTLHSAPNQSSSKGKHDNSSNIEELEPFKTTLHSALNESSSKNEHDNSSNIKELKPFKTTLHSTLNQSSSKDEYDNSSNIEELEPFETTLHSAPNESSSKGEHDNSSNIEELEPFKTTLHSAPNESSSKDEHDNSSNIEELEPFKTALHSAPNESSSKDEHDNSSNIEELDPFKTTLHSALNESSSKNEHEKSCNIEELEPFKTALYSAPNESSSKDEHDNSSNIEELELFKTTLHSAPNESSSKDEHDNSSNIEELEPFKTTFHSALNESSSKNEHDNSSNIEELEPFKTTLHSAPNESSSKDEHDNSSNIEELEPFKTTLHSALNESSSKNEHDNSSNIEELEPLKTTLHSAPNEFSSKDEHDNSSNIEDLELFKITLHSAPNESSSKGEHDNSSNFEELEPFKTTLHSAPNESSSKDEHDNSSNIEELEPFKTTLHSALNESSSKNEHDNSSNIEELEPLKTTLHSAPNESSSKDEHDNSSNIEDLELFKTTLHSAPNESSSKDEHDNSSNIEELEPFETTLHSALNDSSSKDEHDNSSNIEKLEPFKTTLHSALNESSSKDEYNKLCNFKELEPSETILHSEINKSDSKNEHDNLSKLEEFGLSKTMLYAGLFDSDSQDKHENSSNIRKLELPDPSLYAAMYQSIDINCETESMFCSDCRQDVMLYHDDIKKHIEKHVNIGSSTSTVEFNFDVNEMLNFHFTKNDNTDDNVPNDVTENSITEDQPCYQSNSSFESALVETNNEGLSSSDPHQFAKDNNLTYNTHTGNAFCRICQARLPSSLKSMKEHVQGANHKRRLSAAGAAADKIKNKNYQTPQTLLSVIRSLKVVNSDFGTGYIINNIFYITTASFFLLVDMGKCGLKCLFCGSILIKTGKPLEKHVETCSLPLGSKLRIVTSKKDEFIREIAPKKFHCGYCHVVVNSWSKMEIHLQTAEHDYAKTLANVRLGVLSRKEPLYTLTFNDD
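
The two sequences above can be cross-checked by translating the coding sequence and comparing the result with the protein sequence. The sketch below does this for the first and last few codons only, assuming the standard genetic code; the `translate` helper is hypothetical, and the input is upper-cased first because the coding sequence contains lowercase stretches.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()  # lowercase letters in the record are treated as ordinary bases
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 and last 15 bases of the Coding Sequence shown above.
cds_start = "ATGTCACAAGAGTTACGCCGCTGCGGC"
cds_end = "TTTAACGATGACTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Both fragments agree with the corresponding ends of the protein sequence listed above; running the same function over the full coding sequence would extend the check to every residue.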

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-