Basic Information

Insect: Brachylomia viminalis
Gene Symbol: -
Assembly: GCA_937001565.2
Location: CAKZJP020000273.1:33447-34529[+]
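
The Location field packs contig, coordinates, and strand into one string. The sketch below shows one way to split it, assuming the coordinates are 1-based and inclusive (a common convention, but not stated in this record). The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Pattern for strings such as "CAKZJP020000273.1:33447-34529[+]".
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'contig:start-end[strand]' string into its parts."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        contig=match["contig"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


def span_length(loc: Location) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CAKZJP020000273.1:33447-34529[+]")
print(loc.contig, loc.strand, span_length(loc))
```

Under that assumption the span is 1083 nucleotides, that is, 361 codons.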

Transcription Factor Domain

TF Family: PAX
Domain: PAX domain
PFAM: PF00292
TF Group: Helix-turn-helix
Description: The paired domain, a DNA-binding domain of ~126 amino acids, is found in eukaryotic transcription regulatory proteins involved in embryogenesis. Initially identified in Drosophila’s paired (prd) protein, it typically resides in the N-terminal region and may be followed by an octapeptide, a homeodomain, or a Pro-Ser-Thr-rich C terminus. Paired domain proteins act as transcription repressors or activators, with DNA-binding specificity mediated by three subdomains. Crystal structures reveal a bipartite DNA-binding paired domain: an N-terminal subdomain (PAI) and a C-terminal subdomain (RED), joined by a flexible linker. Both subdomains contain a helix-turn-helix motif that binds the major groove of DNA, while the linker may bind the minor groove. Variations in domain usage across Pax proteins and isoforms determine sequence specificity.
Hmmscan Out:

#	of	c-Evalue	i-Evalue	score	bias	hmm coord from	hmm coord to	ali coord from	ali coord to	env coord from	env coord to	acc
1	9	0.37	9.5e+02	1.3	0.0	42	56	14	28	2	44	0.78
2	9	0.099	2.5e+02	3.2	0.0	42	56	50	64	41	80	0.82
3	9	0.11	2.7e+02	3.1	0.0	42	56	86	100	80	115	0.82
4	9	0.097	2.5e+02	3.2	0.0	42	56	122	136	113	153	0.82
5	9	0.11	2.7e+02	3.1	0.0	42	56	158	172	153	189	0.82
6	9	0.1	2.6e+02	3.1	0.0	42	56	194	208	188	224	0.82
7	9	0.11	2.7e+02	3.1	0.0	42	56	230	244	224	259	0.82
8	9	0.097	2.5e+02	3.2	0.0	42	56	266	280	257	297	0.82
9	9	0.073	1.9e+02	3.6	0.0	42	56	302	316	291	341	0.82
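
The per-domain hits can be read programmatically. The following is a minimal sketch, not part of the original annotation pipeline. It parses the nine rows, filters them with an independent E-value cutoff, and measures the spacing between alignment starts. The cutoff of 0.01 and the names `DomainHit` and `parse_hits` are assumptions chosen for illustration. The column order follows the hmmscan header above, including the bias column.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Rows copied from the hmmscan output in this record (13 columns:
# #, of, c-Evalue, i-Evalue, score, bias, hmm from/to, ali from/to,
# env from/to, acc).
ROWS = """\
1 9 0.37 9.5e+02 1.3 0.0 42 56 14 28 2 44 0.78
2 9 0.099 2.5e+02 3.2 0.0 42 56 50 64 41 80 0.82
3 9 0.11 2.7e+02 3.1 0.0 42 56 86 100 80 115 0.82
4 9 0.097 2.5e+02 3.2 0.0 42 56 122 136 113 153 0.82
5 9 0.11 2.7e+02 3.1 0.0 42 56 158 172 153 189 0.82
6 9 0.1 2.6e+02 3.1 0.0 42 56 194 208 188 224 0.82
7 9 0.11 2.7e+02 3.1 0.0 42 56 230 244 224 259 0.82
8 9 0.097 2.5e+02 3.2 0.0 42 56 266 280 257 297 0.82
9 9 0.073 1.9e+02 3.6 0.0 42 56 302 316 291 341 0.78
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated hmmscan domain rows into DomainHit records."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue  # skip blank or malformed lines
        hits.append(DomainHit(int(f[0]), float(f[2]), float(f[3]),
                              float(f[4]), int(f[8]), int(f[9])))
    return hits


I_EVALUE_CUTOFF = 0.01  # ASSUMED threshold, not taken from this record

hits = parse_hits(ROWS)
significant = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]
spacings = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]

print(len(hits), len(significant), sorted(set(spacings)))
```

With this cutoff no hit passes, since every i-Evalue in the table is above 1. The alignment starts are evenly spaced, which matches the repeated segment visible in the protein sequence.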

Sequence Information

Coding Sequence: ATGCATTCCGTTGGTCGTGCAAGGATACTTCCACGAGCTCGAGCACGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCATACTTCCACGAGCTCCGAGTGGAGCACGGCTGCACGGCGCGCATgctacgccgcgcgccgcccgcaTACTGTGACGCAACTCAAATCAAATCTCGCTATTCGCACGCACACGAAAATCGCGTTTCGTGCTACCTACATCAGCACAATTTGGCAATTGGCAACATCCATTCAaggtttcttttgttttaa
Protein Sequence: MHSVGRARILPRARARVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAYFHELRVEHGCTARMLRRAPPAYCDATQIKSRYSHAHENRVSCYLHQHNLAIGNIHSRFLLF
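
The relationship between the two sequence fields can be checked with a short translation routine. This is a sketch under stated assumptions: it uses the standard genetic code, reads in frame from the first base, and uppercases the input so that the lowercase stretches in the coding sequence are translated like any other bases. `translate` is a hypothetical helper, not a function from the database's tooling.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Lowercase bases are uppercased first; unknown codons become 'X'.
    """
    seq = cds.upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 18 nucleotides of the coding sequence in this record.
print(translate("ATGCATTCCGTTGGTCGTGCAAGG"))
print(translate("aggtttcttttgttttaa"))
```

The two fragments translate to the first eight and last five residues of the listed protein sequence (MHSVGRAR and RFLLF). Passing the full coding sequence to `translate` extends the same check to the whole record.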

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity: -
90% Identity: -
80% Identity: -