Phyg050574.1
Basic Information
- Insect
- Pseudolycoriella hygida
- Gene Symbol
- -
- Assembly
- GCA_029228625.1
- Location
- WJQU01004293.1:1-10431[+]
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 13 0.008 32 5.8 0.0 23 32 20 29 14 33 0.86 2 13 0.0045 18 6.6 0.1 23 34 70 81 64 83 0.83 3 13 0.0055 22 6.3 0.0 23 33 120 130 114 133 0.84 4 13 7.3 3e+04 -3.7 0.1 23 31 170 178 170 179 0.64 5 13 0.0036 15 6.9 0.0 23 33 220 230 214 233 0.86 6 13 0.01 41 5.5 0.0 23 32 270 279 264 280 0.87 7 13 0.0065 26 6.1 0.1 23 32 320 329 314 332 0.86 8 13 0.0013 5.5 8.3 0.0 23 33 436 446 430 449 0.84 9 13 0.0045 18 6.6 0.1 23 34 486 497 480 499 0.83 10 13 0.0055 22 6.3 0.0 23 33 536 546 530 549 0.84 11 13 7.3 3e+04 -3.7 0.1 23 31 586 594 586 595 0.64 12 13 0.011 45 5.4 0.0 23 31 636 644 630 646 0.88 13 13 0.0065 26 6.1 0.1 23 32 686 695 680 698 0.86
Sequence Information
- Coding Sequence
- AGACTTGCCGCCAGTCATCTTGTTTCCACTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGACCACAACAATTGGCGAGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACAAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATTACACCATTTGGCGAATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCACTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATCACAACAATTGGCGAGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACATGCGGCCAATTATTTTGTTTCCTGTTTACTTATAGACGAAGAGCCTTTCAAATGTGACGACTGCCAAATGACTTTCAAACGATCGGACTATTTGGCGATTCATAAGCGCACCCATTCTGGTAAGAAGTTAATTTTACAATTGCAGAGACTTGCCTCCAGTCATCTTATTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAATGATTGCGGAATGACATTCAAACTATCACACGCTTTGGCGAATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAAAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGATCAGTCTATTTGGCGAGTCATAAGCTCACCCATTCTGGTAAGAAGTTAATTTTACAACAGCAGAGATTTGTCGCCTGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGATCACGCTATTTGGCGGATCATAAGCTCACCCATTCTGTCAAGGACGAGGAAAAAGCAGAACCTTGTGTTCCCGGAGCAGCGGTAAACCTTACAGTATCAATGATAGTCTTCGAGGACGGCCTACTTCAAAACGTAGAAGTCAAAATTAAATCAGAAGAACCTTTAGATATTTCATTCACAGAGGAAAATTTACTCACCGAGGACATTTCACTCAACGAGGACAATTCACTCCGGTGTACAATCTGCCAAAAGTCATTTCATAACAAACGCAATTTGAGAAGGCATAAATTAACCCATTTAGACGAAAAGCCTTTCAAATGTAACGACTGCGGATTGACATTTAAACGACCACATCAATTGCGGAGTCATAAGCTCGCCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACAAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATTACACCATTTGACGAATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAGAGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTTAAACGATCACAACAATTGGCTTGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATCTTACAATTGCAGAGACTTGCGGCCAATTATTTTGTTTCCTGTTTACTTATAGATGAAGAGCCTTTCAAATGTGACGACTGCCAAATGACTTTCAAACGATCGGACTATTTGGCGTATCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAAGGACTTGCCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGACCAGTCTATTTGGCGAGTCATAAGCTCACCCATTCTGGTAAGGAGTTAATTTTACAATTGCAGAGACTTGTCGCCAGTCATCTTGTTTCCTCTTTACTTATAGACGAAAAGCCTTTCAAATGTAACGACTGCGGAATGACATTCAAACGATCACGCTATTTGGCGGATCATAAGCTCACCCATTCTG
- Protein Sequence
- RLAASHLVSTLLIDEKPFKCNDCGMTFKRPQQLASHKLTHSGKELILQLQRLAASHLVSSLLIDKKPFKCNDCGMTFKRLHHLANHKLTHSGKELILQLQRLAASHLVSTLLIDEKPFKCNDCGMTFKRSQQLASHKLTHSGKELILQLQRHAANYFVSCLLIDEEPFKCDDCQMTFKRSDYLAIHKRTHSGKKLILQLQRLASSHLISSLLIDEKPFKCNDCGMTFKLSHALANHKLTHSGKELILQLQRLAASHLVSSLLIDEKPFKCNDCGMTFKRSVYLASHKLTHSGKKLILQQQRFVACHLVSSLLIDEKPFKCNDCGMTFKRSRYLADHKLTHSVKDEEKAEPCVPGAAVNLTVSMIVFEDGLLQNVEVKIKSEEPLDISFTEENLLTEDISLNEDNSLRCTICQKSFHNKRNLRRHKLTHLDEKPFKCNDCGLTFKRPHQLRSHKLAHSGKELILQLQRLAASHLVSSLLIDKKPFKCNDCGMTFKRLHHLTNHKLTHSGKELILQLQRLAASHLVSSLLIDEKPFKCNDCGMTFKRSQQLACHKLTHSGKELILQLQRLAANYFVSCLLIDEEPFKCDDCQMTFKRSDYLAYHKLTHSGKELILQLQGLAASHLVSSLLIDEKPFKCNDCGMTFKRPVYLASHKLTHSGKELILQLQRLVASHLVSSLLIDEKPFKCNDCGMTFKRSRYLADHKLTHS
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01266155;
- 90% Identity
- iTF_01266155;
- 80% Identity
- iTF_01266155;