Stip003357.1
Basic Information
- Insect
- Synanthedon tipuliformis
- Gene Symbol
- -
- Assembly
- GCA_947623395.1
- Location
- OX392435.1:147788-153590[+]
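
The Location field packs the sequence accession, start, end, and strand into one string. A minimal sketch of how it could be split into parts follows. The `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    seqid: str
    start: int
    end: int
    strand: str


# Layout taken from the record: <accession>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into accession, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("OX392435.1:147788-153590[+]")
# Assumption: coordinates are 1-based and inclusive.
span = loc.end - loc.start + 1
print(loc, span)
```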
Transcription Factor Domain
- TF Family
- zf-GAGA
- Domain
- zf-GAGA domain
- PFAM
- PF09237
- TF Group
- Zinc-Coordinating Group
- Description
- Members of this family bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence [1].
- Hmmscan Out
-
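| # | of | c-Evalue | i-Evalue | score | bias | hmm from | hmm to | ali from | ali to | env from | env to | acc |
|---|----|----------|----------|-------|------|----------|--------|----------|--------|----------|--------|-----|
| 1 | 22 | 0.052 | 1.9e+02 | 2.6 | 0.1 | 23 | 44 | 576 | 597 | 573 | 603 | 0.90 |
| 2 | 22 | 0.081 | 2.9e+02 | 1.9 | 0.0 | 20 | 49 | 629 | 658 | 626 | 664 | 0.89 |
| 3 | 22 | 0.085 | 3.1e+02 | 1.9 | 0.0 | 27 | 49 | 667 | 689 | 661 | 691 | 0.91 |
| 4 | 22 | 0.049 | 1.8e+02 | 2.6 | 0.0 | 27 | 49 | 695 | 717 | 692 | 723 | 0.88 |
| 5 | 22 | 2.8 | 1e+04 | -3.0 | 0.0 | 27 | 48 | 726 | 747 | 725 | 751 | 0.86 |
| 6 | 22 | 0.053 | 1.9e+02 | 2.5 | 0.0 | 27 | 49 | 754 | 776 | 750 | 782 | 0.88 |
| 7 | 22 | 1.5 | 5.5e+03 | -2.1 | 0.0 | 27 | 48 | 785 | 806 | 783 | 808 | 0.89 |
| 8 | 22 | 0.0021 | 7.7 | 7.0 | 0.1 | 27 | 49 | 844 | 866 | 838 | 871 | 0.88 |
| 9 | 22 | 0.91 | 3.3e+03 | -1.4 | 0.0 | 24 | 49 | 872 | 897 | 869 | 902 | 0.82 |
| 10 | 22 | 0.0028 | 10 | 6.6 | 0.0 | 24 | 49 | 903 | 928 | 898 | 931 | 0.93 |
| 11 | 22 | 0.062 | 2.2e+02 | 2.3 | 0.0 | 24 | 48 | 934 | 958 | 932 | 961 | 0.88 |
| 12 | 22 | 0.25 | 9e+02 | 0.4 | 0.0 | 27 | 49 | 965 | 987 | 963 | 992 | 0.88 |
| 13 | 22 | 0.01 | 37 | 4.8 | 0.0 | 24 | 49 | 1024 | 1049 | 1016 | 1054 | 0.87 |
| 14 | 22 | 0.013 | 46 | 4.5 | 0.0 | 24 | 49 | 1055 | 1080 | 1051 | 1085 | 0.87 |
| 15 | 22 | 0.012 | 43 | 4.6 | 0.0 | 24 | 49 | 1086 | 1111 | 1083 | 1117 | 0.87 |
| 16 | 22 | 0.58 | 2.1e+03 | -0.8 | 0.0 | 27 | 48 | 1120 | 1141 | 1118 | 1143 | 0.88 |
| 17 | 22 | 0.36 | 1.3e+03 | -0.1 | 0.0 | 27 | 48 | 1148 | 1169 | 1146 | 1172 | 0.89 |
| 18 | 22 | 0.16 | 5.9e+02 | 1.0 | 0.0 | 24 | 48 | 1176 | 1200 | 1174 | 1203 | 0.87 |
| 19 | 22 | 0.14 | 5.1e+02 | 1.2 | 0.0 | 27 | 49 | 1207 | 1229 | 1205 | 1234 | 0.88 |
| 20 | 22 | 0.22 | 8e+02 | 0.5 | 0.0 | 24 | 45 | 1266 | 1287 | 1262 | 1294 | 0.87 |
| 21 | 22 | 0.024 | 87 | 3.6 | 0.4 | 26 | 49 | 1296 | 1319 | 1289 | 1325 | 0.86 |
| 22 | 22 | 1.8 | 6.6e+03 | -2.4 | 0.0 | 24 | 48 | 1325 | 1349 | 1322 | 1352 | 0.83 |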
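
Each hmmscan row carries thirteen whitespace-separated fields: domain index, domain count, conditional and independent E-values, score, bias, the HMM, alignment, and envelope coordinate pairs, and the alignment accuracy. The sketch below shows one way such rows could be loaded and ranked. The `DomainHit` class and `parse_row` helper are hypothetical names, and the four sample rows are copied from the output above (domains 5, 8, 10, and 13).

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """Hypothetical record for one per-domain hmmscan row."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(line: str) -> DomainHit:
    """Parse one whitespace-separated row with exactly 13 fields."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]),
        float(fields[4]), float(fields[5]),
        *(int(value) for value in fields[6:12]),
        float(fields[12]),
    )


# Rows copied from the Hmmscan Out field (domains 5, 8, 10 and 13).
ROWS = """\
5 22 2.8 1e+04 -3.0 0.0 27 48 726 747 725 751 0.86
8 22 0.0021 7.7 7.0 0.1 27 49 844 866 838 871 0.88
10 22 0.0028 10 6.6 0.0 24 49 903 928 898 931 0.93
13 22 0.01 37 4.8 0.0 24 49 1024 1049 1016 1054 0.87
"""

hits = [parse_row(line) for line in ROWS.splitlines() if line.strip()]

# Rank by independent E-value (lower is stronger).
best = min(hits, key=lambda hit: hit.i_evalue)
# Domains whose bit score is above zero.
positive = [hit.index for hit in hits if hit.score > 0]
print(best.index, best.i_evalue, positive)
```

In the full table, domain 8 has the lowest i-Evalue (7.7), so every individual domain hit is weak when taken alone.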
Sequence Information
- Coding Sequence
- ATGGCTGAAACATTAAACCCAACACCTTTTTGTAGATGTTGTCACAAGAATGGCGATTTCAAGTGTTTATTTTCCGACTTCATTAACGAGAATGAGATTGAAAATTATTCGCAGATGTTGTACGATACATTTGGAATCTTATTTGTACCCTCCATAGATGACAAAATCTATACAATATGCGATGAGTGTATTGAAAAACTACGTATCTCAACGGAGTTCAAGAATTGTGTCCTCGCCTGTAAAAAAAATGCTTGTTGGCGAGGGCACTGTCGTGCTCGACTTGACGGAGGCACTGCAGAGACCCCAGATATGAAACAAGAGACCTCCGAAGATGAAGACAGTGACATGGATATAATAAATGAGGGTTCAGACGAAGATTGCAATCAGGAGGTCGAAGTTGAAGACAGCGGAAACAAATCTGATGAAACCTGGAGTGAATCTGAAACGTACTTCATAGAAGAAGCGTTATTAGCAGACGACATCAAAGTTGAAACAGAAGAGATGGTCAACAATGAAGACTCTGGAATAAATAAGGTTGAAGATATCACAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATATCACAAATGAAGACTCAAGTATAAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCGGAAATGAAGACTCAAGTATGAATAAGGTTTTAGATGAAGATTCCAATGAAGATGTCAGAAACGAAGACTCAAGTATAAATAAGGTTTCAGATGAAGAGTTCAATGAAGAGTGTGAAATGAAATCAGATGAAACCTTTAATGATGCTGGTGTCAGTTTGAGAAAAGCAAGCGACATACATGAGTATAAATGCAACATTTGCAATAAAACTTTCAAACGCAAGGTGACCCTAGAAAATCACATCAGAACTCACGATGGCTCTAAACGCCAAACACGTGCAGCAATATTGTTGAATAAGAGAGAGAACGCGAAACACCATCAGTTGAAGGTTCACATGGACAGACAGACACGTGAGTGCCAAGAGTGCGGCCGGAGATTCACAGAAAAGTCAAGCTTACTCAGGCATTATCAAAACATACACGTGGGTGGAAAAAAATTCACAGAAAAGAGAAAATTACTACGGCATTGTCGAGAGGTGCAAAATATGGATGTAAAAAAGTACAGTTGCCGGTTTTGCGATAAAAGACTTGCGACTAAACAGGCATTGGTTCTACATGAAAGGACACACACGGGAGTAAAATTGTATGTGTGCAAAATATGCGATAAACCGTTTCATAGTATAAGGGGTTTACTGGATCATAATGCAGTGCATTTGGAATATAATTATAAGCCTTTCACTTGTAACGTGTGTGATAAAGCATTCAAAAAGAAAAATACTTTAAACAGACACATCTTGACGCATACTAGGATAAAATCTTTTCAATGCAGTGTCTGTAGCAAGAGTTTTGCACGTAAAGATCATTTTAATAATCACATAAAGGTCCATACGAGCGAAAGTCCTTACAAATGCGGCACTTGCAAGAAGACATTTAAACACAAAAAGAGTTTAAGCGAGCACATGTCGGTTGTACATTTGGGCTTTAAACCAACGCTCTTTGAA
TGTGACATTTGTAAAAAGACATTTAAATACAAAAGTGATTTAAGGAAGCACATGTCGGTTGTACATTTGGGCTTTAAGCTTGACTGCAACATTTGCAAAAAGACATTTAAATACAAAAGTGATTTAAGGAAGCACATGTCGGTTGTACATTTGGGCTTTAAGCCAACGCTCTTTGAATGTGACATTTGCAAAAAGACATTTAAACACAAAAAGAGTTTAAGCGAGCACATGTCGGCTGTACATTTGGGCTTTAAGCTTGACTGCAACATTTGCAAAGAGACATTTAAATACAAAAGTGATTTAAGGAAGCACATGTCGGTTGTACATTTGGGCTTTAAACCAACGCTCTTTGAATGTGATATTTGTAAAAAGACATTTAAATCCAAACAGTATTTAACGAAGCACATGTCGGTTTTACATTTGGGCTTTAAGCTTGACTACAACATTTGCAAAAAGACATTTAAACACAAAAAGAGTTTAGGCGAGCACATGTCGGTTGTACATTTGGGCTTTAAGCCAACGCTCTTTGAGTGTGATATTTGCAAAAAGACATTTAAATACAAAAGGAATTTAAGAGAGCACATGTCGACTGTACATTTGAGCTTTAAACCAACGCCCTTTGAGTGTGATATTTACAAAAAGACATTTAAATACAAAAGGAATTTAAGCGAGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCCCTCTGAGTGTGACATTTGTAAAAAGACATTTAAACACAAAAACAGTTTAAGGAAGCACATGTCGGCTGTACATTTCGGCTTTAAGCTAACCCCCTTTGAGTGTGATATTTGCAAAAAGACATTTAAATATAAACATAATTTAAGCAAGCACTTGTTGGCTGTACATTTGGGCTTTAAGTTTGACTGCAATATTTGCAAGAAGACATTCAAATACAAAGACAGTTTAAGGAAGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCTCTTTGAGTATGATATTTGCAAAAAGGCATTTAAATATAAAAAGAATTTAAGCGAGCACATGTCGGCTGTACATTTGAGCTTTAAGCCAACGCCCTTTGAGTGTGATATTTGCAAAAAAACATTTAAATACAAAAGGAATTTAAGCGAGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCCCTTTGAGTGTGATATTTGCAAAAAGACATTTAAATACAAAAGGAATTTAAGCGAGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCCCTTTGAGTGTGATATTTGCAAAAAGACATTTAAATACAAAAGGAATTTAAGCGAGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCTCTTTGAGTGTGACATTTGTAAAAAGACATTTAAATATAAACATAATTTAAGCGAGCACTTGTTGGCTGTACATTTGGGCTTTAAGTTTGACTGCAATATTTGCAAGAAGACATTCAAATACAAAGACAGTTTAAGGAAGCACATGTCGGCTGTACATTTGGGCTTTAAGCTAACCCCCTTTGAGTGTGATATTTGCAAAAAGACATTTAAATATAAACATAATTTAAGCGAGCACTTGTTGGCTGTACATTTGGGCTTTAAGTTTGACTGCAATATTTGCAAGAAGACATTCAAATACAAACACAGTTTAAGGAAGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCCCTTTGAGTATGATATTTGCAAAAAAACATTTAAATATAAAAAGAATTTAAGCGAGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCCCTTTGAGTGTGATATTTGCAAAAAGACATTTAAATATAAAAAGAATTTAAGCGAGCACATGTTGGCTGTACATTCGGGCTTTAAGTGTGACTGCAACATTTGCATGAAGACATTTAAATACAAAAGGAGTTTATGCAGGCACATGTCGGCTGTACATTTGGGCTTTAAGCCAACGCCCCTTGAGTGTGACATTTGCAAAAA
GACATTTAAATCCGAACAGTATTTAACTGAGCACATGTCGGCTGTACATTTGGGCTTTAAAGGATTCACATAA
- Protein Sequence
- MAETLNPTPFCRCCHKNGDFKCLFSDFINENEIENYSQMLYDTFGILFVPSIDDKIYTICDECIEKLRISTEFKNCVLACKKNACWRGHCRARLDGGTAETPDMKQETSEDEDSDMDIINEGSDEDCNQEVEVEDSGNKSDETWSESETYFIEEALLADDIKVETEEMVNNEDSGINKVEDITNEDSSMNKVLDEDSNEDVRNEDSSMNKVLDEDSNEDVRNEDSSMNKVLDEDSNEDVRNEDSSMNKVLDEDSNEDVRNEDSSMNKVLDEDSNEDVRNEDSSMNKVLDEDSNEDVRNEDSSMNKVLDEDSNEDITNEDSSINKVLDEDSNEDVRNEDSSMNKVLDEDSNEDVGNEDSSMNKVLDEDSNEDVRNEDSSINKVSDEEFNEECEMKSDETFNDAGVSLRKASDIHEYKCNICNKTFKRKVTLENHIRTHDGSKRQTRAAILLNKRENAKHHQLKVHMDRQTRECQECGRRFTEKSSLLRHYQNIHVGGKKFTEKRKLLRHCREVQNMDVKKYSCRFCDKRLATKQALVLHERTHTGVKLYVCKICDKPFHSIRGLLDHNAVHLEYNYKPFTCNVCDKAFKKKNTLNRHILTHTRIKSFQCSVCSKSFARKDHFNNHIKVHTSESPYKCGTCKKTFKHKKSLSEHMSVVHLGFKPTLFECDICKKTFKYKSDLRKHMSVVHLGFKLDCNICKKTFKYKSDLRKHMSVVHLGFKPTLFECDICKKTFKHKKSLSEHMSAVHLGFKLDCNICKETFKYKSDLRKHMSVVHLGFKPTLFECDICKKTFKSKQYLTKHMSVLHLGFKLDYNICKKTFKHKKSLGEHMSVVHLGFKPTLFECDICKKTFKYKRNLREHMSTVHLSFKPTPFECDIYKKTFKYKRNLSEHMSAVHLGFKPTPSECDICKKTFKHKNSLRKHMSAVHFGFKLTPFECDICKKTFKYKHNLSKHLLAVHLGFKFDCNICKKTFKYKDSLRKHMSAVHLGFKPTLFEYDICKKAFKYKKNLSEHMSAVHLSFKPTPFECDICKKTFKYKRNLSEHMSAVHLGFKPTPFECDICKKTFKYKRNLSEHMSAVHLGFKPTPFECDICKKTFKYKRNLSEHMSAVHLGFKPTLFECDICKKTFKYKHNLSEHLLAVHLGFKFDCNICKKTFKYKDSLRKHMSAVHLGFKLTPFECDICKKTFKYKHNLSEHLLAVHLGFKFDCNICKKTFKYKHSLRKHMSAVHLGFKPTPFEYDICKKTFKYKKNLSEHMSAVHLGFKPTPFECDICKKTFKYKKNLSEHMLAVHSGFKCDCNICMKTFKYKRSLCRHMSAVHLGFKPTPLECDICKKTFKSEQYLTEHMSAVHLGFKGFT
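
The protein sequence contains many tandem repeats with the cysteine and histidine spacing expected of a Cys2-His2 zinc finger. The sketch below is a rough illustration of how such units could be located. It assumes the widely used simplified consensus C-x(2,4)-C-x(12)-H-x(3,5)-H, which is not part of this record and is far less sensitive than the profile HMM search reported above. `FRAGMENT` is a short stretch copied from the protein sequence, and `find_c2h2` is a hypothetical helper.

```python
import re

# Assumed simplified C2H2 consensus: C-x(2,4)-C-x(12)-H-x(3,5)-H.
C2H2_RE = re.compile(r"C.{2,4}C.{12}H.{3,5}H")

# Two tandem repeat units copied from the protein sequence in this record.
FRAGMENT = "FECDICKKTFKYKSDLRKHMSVVHLGFKLDCNICKKTFKYKSDLRKHMSVVHLGFKPTLFE"


def find_c2h2(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for non-overlapping matches."""
    return [(m.start() + 1, m.group()) for m in C2H2_RE.finditer(protein)]


fingers = find_c2h2(FRAGMENT)
for start, text in fingers:
    print(start, text)
```

Running the same function over the full protein sequence would give a rough count of candidate fingers, which can be compared with the 22 domain envelopes reported by hmmscan.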
Similar Transcription Factors
Sequences clustered by similarity with MMseqs2 at three identity thresholds
- 100% Identity
- iTF_01388686;
- 90% Identity
- iTF_01388686;
- 80% Identity
- iTF_01388686;