Basic Information

Gene Symbol
-
Assembly
GCA_032445375.1
Location
CM063642.1:4575795-4578230[-]

Transcription Factor Domain

TF Family
GTF2I
Domain
GTF2I domain
PFAM
PF02946
TF Group
Other Alpha-Helix Group
Description
This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain [2, 1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 21 0.003 3.2e+02 4.2 0.0 32 59 23 50 12 58 0.86
2 21 0.0024 2.6e+02 4.5 0.0 32 64 57 89 47 98 0.82
3 21 0.0011 1.2e+02 5.6 0.0 32 66 108 142 95 151 0.79
4 21 0.005 5.3e+02 3.5 0.0 32 58 125 151 115 162 0.76
5 21 0.0045 4.8e+02 3.7 0.1 32 60 159 187 148 200 0.81
6 21 0.0079 8.4e+02 2.9 0.0 32 58 210 236 199 241 0.85
7 21 0.0014 1.5e+02 5.3 0.1 32 65 244 277 237 285 0.78
8 21 0.0091 9.7e+02 2.7 0.0 32 58 295 321 288 326 0.87
9 21 0.0088 9.3e+02 2.7 0.0 32 58 329 355 322 362 0.87
10 21 0.0012 1.3e+02 5.5 0.0 32 66 363 397 353 405 0.78
11 21 0.0012 1.3e+02 5.5 0.0 32 65 414 447 401 455 0.79
12 21 0.0048 5.2e+02 3.6 0.0 32 57 431 456 420 466 0.74
13 21 0.0013 1.4e+02 5.4 0.0 32 65 465 498 456 506 0.79
14 21 0.0055 5.8e+02 3.4 0.0 32 59 516 543 505 557 0.82
15 21 0.011 1.1e+03 2.5 0.0 32 57 550 575 544 579 0.88
16 21 0.0011 1.2e+02 5.6 0.1 32 66 567 601 554 609 0.76
17 21 0.0038 4.1e+02 3.9 0.0 32 59 584 611 574 619 0.76
18 21 0.0014 1.5e+02 5.3 0.1 32 65 618 651 612 659 0.78
19 21 0.0016 1.7e+02 5.1 0.1 33 65 653 685 650 693 0.76
20 21 0.0038 4.1e+02 3.9 0.0 32 59 669 696 659 704 0.76
21 21 0.0039 4.1e+02 3.9 0.1 33 63 687 717 682 726 0.76

Sequence Information

Coding Sequence
ATGACAGAAATTCTGCGCAACTCAATGATGTTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGTTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACTTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGTACCTCATCTCCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGTTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGTTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGTTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGCACCCCATCACCACCCCATGATGTTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCAACCCATGATGCTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGCTGTACCCCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGTCTCCCGTGTGTTGTACCTCATCACCACCCCATGATGCTGGACGTCATAGGGCTCCCGTGTGTTGTACCCCATCACCACCCCATGATGCTGGACGGTTCGTGCCGGAGAGCGAGTGGCGCTGGTGTCTGTGCCAGCATCTCGCCGGCCGGCGTCGCTCCGCCGCAAGGATCACGTTCCCGCGAGTCACGACCAGAAATACCTGTTCCCCGCCGTGCCAGGGGGAGGAGGGGCGGCCAGCCAGTGTTGCCAAATTACACGCGCGCGAGAAAACAAGAGAGCGCCGGATCGCCAGCTCGCACCGCGAGGCCGCGGCTACCGACAGCAAACAAGCAAAGTGGCGCTAAGGGCTAG
Protein Sequence
MTEILRNSMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHHPMMLDFIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVVPHLHPMMLDVIGLPCVAPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVAPHHHPMMLDVIGLPCAVPHHQPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCAVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVVPHHHPMMLDVIGLPCVVPHHHPMMLDGSCRRASGAGVCASISPAGVAPPQGSRSRESRPEIPVPRRARGRRGGQPVLPNYTRARKQESAGSPARTARPRLPTANKQSGAKG

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-