Basic Information

Gene Symbol
-
Assembly
GCA_947579665.1
Location
OX388346.1:18966501-19084822[+]
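
The location string above packs the sequence accession, start, end and strand into one field. A minimal Python sketch for splitting it apart follows; the `Location` class and `parse_location` function are hypothetical names introduced for illustration, and the span length assumes 1-based, inclusive coordinates, which the record does not state.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Location:
    """Hypothetical container for a 'seqid:start-end[strand]' field."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as 'OX388346.1:18966501-19084822[+]'.
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Parse a location field into its four components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location format: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("OX388346.1:18966501-19084822[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```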

Transcription Factor Domain

TF Family
ARID
Domain
ARID domain
PFAM
PF01388
TF Group
Helix-turn-helix
Description
This domain is known as ARID (AT-Rich Interaction Domain) [2] and is also called the BRIGHT domain [1].
Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  20      0.13   1.2e+03    2.6   0.0        17      51        73     107        64     116  0.81
 2  20       0.2   1.9e+03    2.0   0.0        21      51       111     141       102     163  0.77
 3  20      0.11   9.9e+02    2.9   0.0        10      51       171     209       166     232  0.77
 4  20      0.36   3.3e+03    1.2   0.0        12      51       207     243       196     255  0.78
 5  20      0.15   1.4e+03    2.4   0.0        20      50       246     276       225     284  0.79
 6  20      0.17   1.6e+03    2.3   0.0        12      51       275     311       271     333  0.76
 7  20      0.38   3.6e+03    1.1   0.0        20      51       314     345       294     362  0.79
 8  20      0.13   1.2e+03    2.6   0.0        21      51       349     379       340     401  0.78
 9  20     0.034   3.2e+02    4.5   0.0        13      50       412     446       374     469  0.72
10  20     0.037   3.4e+02    4.4   0.0        11      51       478     515       463     537  0.79
11  20     0.067   6.3e+02    3.5   0.0        18      51       547     583       527     605  0.70
12  20      0.11     1e+03    2.9   0.0        25      51       625     651       602     689  0.64
13  20      0.36   3.4e+03    1.2   0.0        21      51       689     719       670     741  0.78
14  20      0.41   3.9e+03    1.0   0.0        21      51       723     753       703     762  0.78
15  20     0.038   3.6e+02    4.3   0.0        20      51       754     787       736     809  0.79
16  20      0.15   1.4e+03    2.4   0.0        10      51       817     855       811     893  0.77
17  20       1.7   1.6e+04   -1.0   0.0        21      56       966    1000       925    1008  0.73
18  20    0.0056        53    7.0   0.0        13      79      1142    1202      1097    1204  0.68
19  20      0.12   1.1e+03    2.7   0.0        13      51      1319    1354      1302    1377  0.76
20  20      0.41   3.8e+03    1.0   0.0        21      51      1358    1388      1349    1410  0.77
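
Each row of the domain table above holds 13 whitespace-separated values in the order given by its header. The following Python sketch shows one way such rows might be parsed and ranked; `DomainHit` and `parse_hit` are hypothetical names, only three rows are copied in for brevity, and the 0.01 i-Evalue cutoff is an arbitrary illustration rather than a threshold used by this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one row of the domain table."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(line: str) -> DomainHit:
    """Parse one whitespace-separated data row (not the header)."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Rows 9, 17 and 18 copied from the table above.
rows = [
    "9 20 0.034 3.2e+02 4.5 0.0 13 50 412 446 374 469 0.72",
    "17 20 1.7 1.6e+04 -1.0 0.0 21 56 966 1000 925 1008 0.73",
    "18 20 0.0056 53 7.0 0.0 13 79 1142 1202 1097 1204 0.68",
]
hits = [parse_hit(row) for row in rows]

# Rank by independent E-value (smaller means stronger evidence).
best = min(hits, key=lambda hit: hit.i_evalue)

# Illustrative cutoff only; choose a threshold suited to the analysis.
significant = [hit for hit in hits if hit.i_evalue <= 0.01]

print(best.index, best.i_evalue, best.score, len(significant))
```

For the three rows used here, the lowest i-Evalue belongs to domain 18, and none of them passes the illustrative 0.01 cutoff.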

Sequence Information

Coding Sequence
ATGTTAGGATCCCTCGGTTGGAACTCTCTTAGTGATAGAAGACGTGAATATCGACGGAAGCTTTTTGATAAATTTATCTCTACCGATTTCGGGGAGGAGATAACAGATATTGTCATCCCAGTTATATACCTACGCGACCTTCGAGAGAAAACAAAAAAGCGATACAGGGAAGTAGTTGCCAGGACGGATAGCTACTTTTGGTCATTTTTTCCGCGTAAAACAACATCAGAACAGTTCTCAAGCATGGAAGAAGAAAGATTGGATGCACAGATGCTGCAGAAATACATCTGGACCAGAGGAGGAATTCAAAATACCATGAAAACAACATCTGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGTACATGTGCTGCAGAGATACATCTGGTCCAGAGATGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTTTCAAGAATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATCTGGTGCAGAGATGAAATTCAAAATACCATGAAAACAACATCTGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGTACATGTGCTGCAGAGATACATCTGGTCCAGAGATGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTTTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTTTGGATGCACAGATGCTGCAGAGATATATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACCGTTCACAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGCGATACATCTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTTTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTTTGGATGCACAGATGCTGCAGAGATATATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGGGATTGGATGCACAGATGCTGCAAAAATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAAATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATTTGGTCCAGAGGGGTAATTCAAAATACCATGAAAAAAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAAAAATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGATGAGAGATTGGATGCACAGATGCTGCAGAGATACATGTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAACATCAGAACCGTATTCAAGAATGGAAGAAGAGAGATTGGATGCACAGATTCTGCAGAGATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTTTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAAGGGGAAATCAAAATACCATGAAAACAGCATCAGAACCGTTTTCAAGAATGGAAGAAGAGAGATTGGATGCACAGATTCTGCAGAGATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGC
ACAGATGCTGCAGAGATACATCTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAGCATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAGCATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAACATCAGAACAGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAAATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTTTCAAGAATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATCTGGTGCAGAGATGAAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGTAGGGAAGAAGAGAGATTGGATGCACAAATGCTGCAGAGATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAACAGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATTTGGTCCAATGGGAAACAACATCTGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGTACATAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTTTGGATGCACAAATGCTGCAGAGATACATTTGGTCCAGAGGGAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCGCAGATGCTGCAGAGTTACATCTGGTCCAGAGGGCGAATTGAAAATACCATGAGAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATTTGGTCCAGAGGGAAACAACATCAGAACAGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATTTGGTCCAATGGGAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAGGGGGAATTTAAAATATCATGAAAACAGCATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATAAATCTGGTCCAGAGGGGGAATTTAAAATATCATGAAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGTTACATTTGGTCCAGAGGGAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAACATCAGAAAGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAAATACATCTGGTCCAGAGGGGGAATTCAAAATACCATGAAAACAATATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTGGATGCACAAATGCTGCAGAGATACATCTGAAACAACATCAGAACGGTTCTCAAGAATGGAAGAAGAGAGATGGGATGCACAGATGCTGCAGAGATACAATTGGTCCAGAGGGTACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAAATAGATCTGGTCCAGAGGGGAAATTCAAAATACCATGAAAACAACATCAGAACAGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATTTGGTCCAATGGGAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGATTGGATGCGCAGATGCTGCAGAGTTACATCTGGTCCAGAGGGCGAATTCAAAATACCATGAAAACAACATCTGAACGGTTCTCAAGCATGGAAGAAGAGAGAT
TGGATGTACATGTGCTGCAGAGATACATCTGGTCCAGAGATGGAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGTTTGGATGCACAGATGCTACAGAGATACATCTGGTCCAGAGGGGGAAATCAAAATACCATGAAAACAGCATCAGAACGGTTCTCAAGCATGGAAGAAGAGAGGTTGGATGCACAGATGCTGCAGAGATACATTTGGTCCAGAGGGGTAATTCAAAATACCATGAAAACAACATCAGAACGGTTCTCAAGCATGCAAGAAGAGAGATTGGATGCACAGATGCTGCAGAGATACATCTGGTCCAGAGATGAAATTCAAAATACCATGAATATTATTTGA
Protein Sequence
MLGSLGWNSLSDRRREYRRKLFDKFISTDFGEEITDIVIPVIYLRDLREKTKKRYREVVARTDSYFWSFFPRKTTSEQFSSMEEERLDAQMLQKYIWTRGGIQNTMKTTSERFSSMEEERLDVHVLQRYIWSRDGIQNTMKTTSERFSRMEEERLDAQMLQRYIWCRDEIQNTMKTTSERFSSMEEERLDVHVLQRYIWSRDGIQNTMKTTSERFSSMEEESLDAQMLQRYIWSRGGNQNTMKTTSERFSSMEEESLDAQMLQRYIWSRGGIQNTMKTTSEPFTSMEEERLDAQMLQRYIWSRGGNQNTMKTTSERFSSMEEESLDAQMLQRYIWSRGGNQNTMKTTSERFSSMEEESLDAQMLQRYIWSRGGIQNTMKTTSERFSSMEEEGLDAQMLQKYIWSRGGIQNTMKTTSERFSSMEEERLDAQMLQKYIWSRGGIQNTMKTTSERFSSMEEERLDAQMLQRYIWSRGVIQNTMKKTSERFSSMEEERLDAQMLQKYIWSRGGIQNTMKTTSERFSSMEDERLDAQMLQRYMWSRGGNQNTMKTTSEPYSRMEEERLDAQILQRYIWSRGGIQNTMKTTSERFSSMEEESLDAQMLQRYIWSRRGNQNTMKTASEPFSRMEEERLDAQILQRYIWSRGGIQNTMKTTSERFSSMEEERLDAQMLQRYIWSRGGNQNTMKTASERFSSMEEERLDAQMLQRYIWSRGGNQNTMKTASERFSSMEEERLDAQMLQRYIWSRGGNQNTMKTTSEQFSSMEEERLDAQMLQKYIWSRGGIQNTMKTTSERFSRMEEERLDAQMLQRYIWCRDEIQNTMKTTSERFSSREEERLDAQMLQRYIWSRGGIQNTMKTTSEQFSSMEEERLDAQMLQRYIWSNGKQHLNGSQAWKKRDWMYINNIRTVLKHGRREFGCTNAAEIHLVQRETTSERFSSMEEERLDAQMLQSYIWSRGRIENTMRTTSERFSSMEEERLDAQMLQRYIWSRGKQHQNSSQAWKKRDWMHRCCRDTFGPMGNNIRTVLKHGRREIGCTDAAEIHLVQRGNLKYHENSIRTVLKHGRREIGCTDAAEINLVQRGNLKYHETVLKHGRREIGCTDAAELHLVQRETTSERFSSMEEERLDAQMLQRYIWSRGGIQNTMKTTSERFSSMEEERLDAQMLQKYIWSRGGIQNTMKTISERFSSMEEESGCTNAAEIHLKQHQNGSQEWKKRDGMHRCCRDTIGPEGTVLKHGRREIGCTDAAEIDLVQRGNSKYHENNIRTVLKHGRREIGCTDAAEIHLVQWETTSERFSSMEEERLDAQMLQSYIWSRGRIQNTMKTTSERFSSMEEERLDVHVLQRYIWSRDGIQNTMKTTSERFSSMEEESLDAQMLQRYIWSRGGNQNTMKTASERFSSMEEERLDAQMLQRYIWSRGVIQNTMKTTSERFSSMQEERLDAQMLQRYIWSRDEIQNTMNII
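
One way to check that a coding sequence and a protein sequence agree is to translate the former and compare. The Python sketch below assumes the standard genetic code (the record does not say which table was used) and joins wrapped lines before translating; `translate` is a hypothetical helper, and the short inputs are the first and last codons of the coding sequence above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; a trailing stop codon is dropped."""
    # Remove line breaks and spaces so wrapped sequences are joined.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[pos:pos + 3], "X")  # 'X' for ambiguous codons
        for pos in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


start_of_protein = translate("ATGTTAGGATCC\nCTCGGTTGG")
end_of_protein = translate("ATGAATATTATTTGA")
print(start_of_protein, end_of_protein)
```

Passing the full coding sequence to `translate` and comparing the result with the protein sequence would flag any internal stop codon (`*`) or frame problem.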

Similar Transcription Factors

Sequences clustered by similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-