Basic Information

Gene Symbol
-
Assembly
GCA_963583965.1
Location
CAUZXB010000025.1:3829292-3841393[-]

Transcription Factor Domain

TF Family
GTF2I
Domain
GTF2I domain
PFAM
PF02946
TF Group
Other Alpha-Helix Group
Description
This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain [2, 1].
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 15 0.38 9.3e+03 -2.5 0.0 27 63 192 230 191 234 0.75
2 15 0.0083 2.1e+02 2.8 0.0 27 44 277 294 270 308 0.93
3 15 0.16 3.9e+03 -1.3 0.0 27 64 367 406 364 411 0.83
4 15 0.38 9.3e+03 -2.5 0.0 27 63 546 584 545 588 0.75
5 15 0.0083 2.1e+02 2.8 0.0 27 44 631 648 624 662 0.93
6 15 0.16 4e+03 -1.3 0.0 27 64 721 760 719 765 0.83
7 15 0.38 9.3e+03 -2.5 0.0 27 63 900 938 899 942 0.75
8 15 0.061 1.5e+03 0.0 0.0 27 44 985 1002 982 1015 0.92
9 15 0.38 9.3e+03 -2.5 0.0 27 63 1211 1249 1210 1253 0.75
10 15 0.0083 2.1e+02 2.8 0.0 27 44 1296 1313 1289 1327 0.93
11 15 0.16 4e+03 -1.3 0.0 27 64 1386 1425 1384 1430 0.83
12 15 0.38 9.3e+03 -2.5 0.0 27 63 1565 1603 1564 1607 0.75
13 15 0.061 1.5e+03 0.0 0.0 27 44 1650 1667 1647 1680 0.92
14 15 0.37 9.2e+03 -2.5 0.0 27 63 1782 1820 1781 1824 0.75
15 15 0.93 2.3e+04 -3.8 0.0 27 42 1914 1929 1913 1946 0.88

Sequence Information

Coding Sequence
atgacactcttaaggcagggcaccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctggtaaacgacgggagtaactatgactttcttaaggcagggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactccaagggtcttgggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaaagtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggccccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgttggcacactccaagggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaactcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccaatgtctgctggtaacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaaagtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggccccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgttggcacactccaagggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccaatgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgggcaccaatgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaaagtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggccccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgttggcacactccaagggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccaatgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaaggaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtctacggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcagcgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcaggggaccagtgtctgctggtgacgggattgacgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaa
Protein Sequence
MTLLRQGTSVCWSRDCRPRLTRVGPLQGSSDPCACPLTFSAPNVSVKKVKKALVNDGSNYDFLKAGHLCLLVTGLTTQTYPSWPTPRVLGPMRLPPVIQCSECQREEIQESAGHQCLLVTGLTTPTYPSWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVKVKKFKKARVNGGSNYDTLKAGPQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAVGTLQGSSDPCACPPTFSVLNVNVKKLKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVKVKKFKKARVNGGSNYDTLKAGPQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAVGTLQGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVKVKKFKKARVNGGSNYDTLKAGPQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAVGTLQGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKEFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSTDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGSWDPCACPPTFSALNVSVKKFKKARVNGGSNYDSLKAGDQCLLVTGLTTPTYPSWPTPRVFGPMRLPPDIQCSECQREEIQESAGKRRE

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-