Basic Information

Gene Symbol
-
Assembly
GCA_028673005.1
Location
CM054112.1:16600374-16610016[+]
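
The Location value packs the sequence accession, the coordinates, and the strand into one string. Below is a minimal Python sketch of splitting it apart. The helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str


# Pattern for strings shaped like "CM054112.1:16600374-16610016[+]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a 'seqid:start-end[strand]' string into its parts (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CM054112.1:16600374-16610016[+]")
# Span length under the ASSUMPTION of 1-based, inclusive coordinates.
span = loc.end - loc.start + 1
print(loc.seqid, loc.start, loc.end, loc.strand, span)
```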

Transcription Factor Domain

TF Family
GTF2I
Domain
GTF2I domain
PFAM
PF02946
TF Group
Other Alpha-Helix Group
Description
This region of sequence similarity is found up to six times in a variety of proteins, including GTF2I. It has been suggested that this may be a DNA-binding domain [2, 1].
Hmmscan Out
#  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to  acc
1  9   0.027     4.9e+02   3.9    0.0   37        56      222       242     212       253     0.80
2  9   0.033     6e+02     3.6    0.0   37        54      448       465     438       478     0.82
3  9   0.027     4.9e+02   3.9    0.0   37        56      723       743     713       754     0.80
4  9   0.027     4.9e+02   3.9    0.0   37        56      998       1018    988       1029    0.80
5  9   0.027     4.9e+02   3.9    0.0   37        56      1273      1293    1263      1304    0.80
6  9   0.027     4.9e+02   3.9    0.0   37        56      1548      1568    1538      1579    0.80
7  9   0.027     4.9e+02   3.9    0.0   37        56      1823      1843    1813      1854    0.80
8  9   0.027     4.9e+02   3.9    0.0   37        56      2098      2118    2088      2129    0.80
9  9   0.027     4.9e+02   3.9    0.0   37        56      2373      2393    2363      2404    0.80
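
Each row of the Hmmscan Out table carries 13 whitespace-separated values in the column order given by its header. The following Python sketch shows one way to read the rows into typed records and to measure the spacing between consecutive alignment starts. The class name `DomainHit`, the field names, and the helper `parse_hit` are illustrative choices, not part of any HMMER API.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # "#" column: rank of this domain hit
    total: int       # "of" column: number of domain hits reported
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_hit(row: str) -> DomainHit:
    """Parse one whitespace-separated table row (hypothetical helper)."""
    f = row.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {row!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        *map(int, f[6:12]),
        float(f[12]),
    )


# Rows copied from the table above.
ROWS = """\
1 9 0.027 4.9e+02 3.9 0.0 37 56 222 242 212 253 0.80
2 9 0.033 6e+02 3.6 0.0 37 54 448 465 438 478 0.82
3 9 0.027 4.9e+02 3.9 0.0 37 56 723 743 713 754 0.80
4 9 0.027 4.9e+02 3.9 0.0 37 56 998 1018 988 1029 0.80
5 9 0.027 4.9e+02 3.9 0.0 37 56 1273 1293 1263 1304 0.80
6 9 0.027 4.9e+02 3.9 0.0 37 56 1548 1568 1538 1579 0.80
7 9 0.027 4.9e+02 3.9 0.0 37 56 1823 1843 1813 1854 0.80
8 9 0.027 4.9e+02 3.9 0.0 37 56 2098 2118 2088 2129 0.80
9 9 0.027 4.9e+02 3.9 0.0 37 56 2373 2393 2363 2404 0.80
"""

hits = [parse_hit(line) for line in ROWS.splitlines()]
# Distance between the alignment starts of consecutive hits.
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
print(spacing)
```

In this record the first gap is 226 residues and every later gap is 275 residues.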

Sequence Information

Coding Sequence
atgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactatccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactatccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatcagtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgtgtca
cctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactatccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactatccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagccggccccggcgctcgaccacgcggacgctccccggcgcttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcaccactcgcgacaggggagaattccccccatacctcaggcgtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactacccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccga
ggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagtcggccccggcgctcgaccacgcggacgctccccggcgcttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactacccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactatccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgtccgctctctgccccgccgaacgagcccctcgccgcaaccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaa
atttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactatccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccccacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgtccgctctctgccccgccgaacgagcccctcgccgcaaccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgtgtcacctaaggtcgggctcttggtggcaccgaccaaatatttggcgcatccgtggccgagctaaaaaggcagtcgcgccgaaggaggtgcggaagggcggtcacccatccaagtactatccgggccctacgatgcttaacttcggtgatcggacgagaaccggtgttttcatcgtggtatggtcgttgccgagagatttaagccttcgacgccctaagacacattacaccttatcggcgatccgtcgatatttccatacccgcgccgcctcaaccgagaccaaaaccgacttgcgaagtccgtggccgaatttaaaatatgccgccatccgcatgacctctcccgtcgctccgaggcagtgtgactctccgaatttcaattgtccccccacggatccgcctcttcggacggccagcggcacctcgcccccctccccggcgaaatcgccacacgtgttttcgaaggatccccgaattcggccgccgactaccatcccacccccgcgacgactgcccgctctctgccccgccgaacgagcccctcgccgcccccgacaagtcggccccggcgctcgaccacgcggacgctccccggcacttggcgaaaattcgcctccctcgggcctatacggctccaataaaccgtcaggaggtcttccccttatcgctgtccaggggtttgctttcttttttttaacggtgaaaatttcatctatggcgggaacctttatttgcccgggtcgttcggccccaactacgtgtgggtggcgtaaatgccctcctcacccctcgcgacaggggagaattccccccatacctcaggcgttttcttcgggggggataacgtcgcggcttagggacacggtttccgagtag
Protein Sequence
MTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGCHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAISRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQAGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHHSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLSALCPAERAPRRNRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVK
ISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLSALCPAERAPRRNRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQACHLRSGSWWHRPNIWRIRGRAKKAVAPKEVRKGGHPSKYYPGPTMLNFGDRTRTGVFIVVWSLPRDLSLRRPKTHYTLSAIRRYFHTRAASTETKTDLRSPWPNLKYAAIRMTSPVAPRQCDSPNFNCPPTDPPLRTASGTSPPSPAKSPHVFSKDPRIRPPTTIPPPRRLPALCPAERAPRRPRQVGPGARPRGRSPALGENSPPSGLYGSNKPSGGLPLIAVQGFAFFFLTVKISSMAGTFICPGRSAPTTCGWRKCPPHPSRQGRIPPIPQAFSSGGITSRLRDTVSE
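
The Protein Sequence can be checked against the Coding Sequence by translating codons in frame. The Python sketch below assumes the standard genetic code, which this record does not state. The `translate` helper is hypothetical, and only the first 24 and the last 12 nucleotides of the coding sequence are used as input.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate in frame 0, stopping at the first stop codon (hypothetical helper)."""
    cds = cds.lower()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases map to "X".
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "atgacctctcccgtcgctccgagg"  # first 24 nt of the coding sequence
cds_end = "gtttccgagtag"                # last 12 nt, ending in the stop codon tag

print(translate(cds_start))  # MTSPVAPR
print(translate(cds_end))    # VSE
```

Both results agree with the start (MTSPVAPR) and the end (VSE) of the protein sequence listed above.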

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01402114; iTF_01402112
90% Identity
-
80% Identity
-