Nfer000428.1
Basic Information
- Insect
- Nomada ferruginata
- Gene Symbol
- -
- Assembly
- GCA_963583965.1
- Location
- CAUZXB010000025.1:3829292-3841393[-]
Transcription Factor Domain
- TF Family
- GTF2I
- Domain
- GTF2I domain
- PFAM
- PF02946
- TF Group
- Other Alpha-Helix Group
- Description
- This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain [2, 1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 15 0.38 9.3e+03 -2.5 0.0 27 63 192 230 191 234 0.75 2 15 0.0083 2.1e+02 2.8 0.0 27 44 277 294 270 308 0.93 3 15 0.16 3.9e+03 -1.3 0.0 27 64 367 406 364 411 0.83 4 15 0.38 9.3e+03 -2.5 0.0 27 63 546 584 545 588 0.75 5 15 0.0083 2.1e+02 2.8 0.0 27 44 631 648 624 662 0.93 6 15 0.16 4e+03 -1.3 0.0 27 64 721 760 719 765 0.83 7 15 0.38 9.3e+03 -2.5 0.0 27 63 900 938 899 942 0.75 8 15 0.061 1.5e+03 0.0 0.0 27 44 985 1002 982 1015 0.92 9 15 0.38 9.3e+03 -2.5 0.0 27 63 1211 1249 1210 1253 0.75 10 15 0.0083 2.1e+02 2.8 0.0 27 44 1296 1313 1289 1327 0.93 11 15 0.16 4e+03 -1.3 0.0 27 64 1386 1425 1384 1430 0.83 12 15 0.38 9.3e+03 -2.5 0.0 27 63 1565 1603 1564 1607 0.75 13 15 0.061 1.5e+03 0.0 0.0 27 44 1650 1667 1647 1680 0.92 14 15 0.37 9.2e+03 -2.5 0.0 27 63 1782 1820 1781 1824 0.75 15 15 0.93 2.3e+04 -3.8 0.0 27 42 1914 1929 1913 1946 0.88
Sequence Information
- Coding Sequence
- atgacactcttaaggcagggcaccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctggtaaacgacgggagtaactatgactttcttaaggcagggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactccaagggtcttgggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaaagtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggccccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgttggcacactccaagggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaactcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccaatgtctgctggtaacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaaagtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggccccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgttggcacactccaagggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccaatgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgggcaccaatgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaaagtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggccccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgttggcacactccaagggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccaatgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggacttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcagggcaccagtgtctgctggtcacgggattgacgaccccgacttacccgagatggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcaacgtgaaggaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtcttcggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccagtgtctgctggtcacgggattgccgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcatgccctctgacattcagtgctccgaatgtcagcgtgaagaaagtcaagaaagcgctgggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggtctacggacccatgcgcctgccccccgacattcagtgttctgaatgtcaacgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgacactcttaaggcagggcaccagtgtctgctggtgacgggattgacgtccccgacttacccgagttggctcactccaagggtcttcggacccatgcgcctgccccccgtcattcagtgctctgaatgtcaacgtgaagaaattcaagaaagcgcggggcacctgtgtctgctggtcacgggattgacgacccagacttacccgagttggcccactcaaagggtcttgggacccatgcgcctaccccccgacgttcagtgctctgaatgtcaacgtgaagaaattcaagtaagcgcggggtcttgggacccatgcgcctgccccccgacgttcagtgctctgaatgtcagcgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaactatgactctcttaaggcaggggaccagtgtctgctggtgacgggattgacgaccccgacttacccgagttggcccactccaagggtcttcggacccatgcgcctgccccctgacattcagtgctccgaatgtcagcgtgaagaaattcaagaaagcgcgggtaaacggcgggagtaa
- Protein Sequence
- MTLLRQGTSVCWSRDCRPRLTRVGPLQGSSDPCACPLTFSAPNVSVKKVKKALVNDGSNYDFLKAGHLCLLVTGLTTQTYPSWPTPRVLGPMRLPPVIQCSECQREEIQESAGHQCLLVTGLTTPTYPSWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVKVKKFKKARVNGGSNYDTLKAGPQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAVGTLQGSSDPCACPPTFSVLNVNVKKLKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVKVKKFKKARVNGGSNYDTLKAGPQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAVGTLQGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKKFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVKVKKFKKARVNGGSNYDTLKAGPQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAVGTLQGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRDFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGHQCLLVTGLTTPTYPRWPTPRVFGPMRLPPDIQCSECQREEIQESAGSWDPCACPPTFSALNVNVKEFKKARVNGGSNYDSLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSSDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLPTPTYPSWPTPRVFGPMRMPSDIQCSECQREESQESAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGSTDPCACPPTFSVLNVNVKKFKKARVNGGSNYDTLKAGHQCLLVTGLTSPTYPSWLTPRVFGPMRLPPVIQCSECQREEIQESAGHLCLLVTGLTTQTYPSWPTQRVLGPMRLPPDVQCSECQREEIQVSAGSWDPCACPPTFSALNVSVKKFKKARVNGGSNYDSLKAGDQCLLVTGLTTPTYPSWPTPRVFGPMRLPPDIQCSECQREEIQESAGKRRE
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -