Aorb014812.2
Basic Information
- Insect
- Acrocera orbiculus
- Gene Symbol
- -
- Assembly
- GCA_947359355.1
- Location
- OX375758.1:36023034-36026462[-]
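The Location field packs the sequence accession, coordinates, and strand into a single string. A minimal sketch of splitting it into parts is shown here. The `Locus` type and `parse_location` helper are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a parsed Location string."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


# Matches strings of the form "<accession>:<start>-<end>[<strand>]".
_LOCATION = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a Location string into accession, start, end, and strand."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["seqid"],
        int(match["start"]),
        int(match["end"]),
        match["strand"],
    )


loc = parse_location("OX375758.1:36023034-36026462[-]")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.span)
```

The `[-]` suffix marks the gene as lying on the reverse strand of OX375758.1.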
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion and binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However, several proteins contain only a single copy of the domain.
- Hmmscan Out
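| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 18 | 0.0035 | 7.1 | 6.2 | 0.1 | 18 | 34 | 45 | 60 | 41 | 62 | 0.83 |
| 2 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 73 | 87 | 68 | 90 | 0.78 |
| 3 | 18 | 0.0035 | 7.1 | 6.2 | 0.1 | 18 | 34 | 170 | 185 | 166 | 187 | 0.83 |
| 4 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 198 | 212 | 193 | 215 | 0.78 |
| 5 | 18 | 0.0035 | 7.1 | 6.2 | 0.1 | 18 | 34 | 295 | 310 | 291 | 312 | 0.83 |
| 6 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 323 | 337 | 318 | 340 | 0.78 |
| 7 | 18 | 0.0035 | 7.1 | 6.2 | 0.1 | 18 | 34 | 420 | 435 | 416 | 437 | 0.83 |
| 8 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 448 | 462 | 443 | 465 | 0.78 |
| 9 | 18 | 0.0035 | 7.1 | 6.2 | 0.1 | 18 | 34 | 545 | 560 | 541 | 562 | 0.83 |
| 10 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 573 | 587 | 568 | 590 | 0.78 |
| 11 | 18 | 0.009 | 18 | 4.9 | 0.1 | 18 | 34 | 670 | 685 | 666 | 687 | 0.81 |
| 12 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 698 | 712 | 693 | 715 | 0.78 |
| 13 | 18 | 0.0035 | 7.1 | 6.2 | 0.1 | 18 | 34 | 795 | 810 | 791 | 812 | 0.83 |
| 14 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 823 | 837 | 818 | 840 | 0.78 |
| 15 | 18 | 0.009 | 18 | 4.9 | 0.1 | 18 | 34 | 920 | 935 | 916 | 937 | 0.81 |
| 16 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 948 | 962 | 943 | 965 | 0.78 |
| 17 | 18 | 0.009 | 18 | 4.9 | 0.1 | 18 | 34 | 1045 | 1060 | 1041 | 1062 | 0.81 |
| 18 | 18 | 0.28 | 5.8e+02 | 0.1 | 0.0 | 18 | 33 | 1073 | 1087 | 1068 | 1090 | 0.78 |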
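The per-domain rows above can be loaded into a small structure for filtering. The following is a rough sketch only: the `DomainHit` class, the `parse_hits` helper, and the i-Evalue cutoff of 10 are illustrative assumptions, not values or tools used by this database. The row text is copied from the Hmmscan Out field.

```python
from dataclasses import dataclass

# Per-domain rows copied from the Hmmscan Out field, one domain per line.
ROWS = """\
1 18 0.0035 7.1 6.2 0.1 18 34 45 60 41 62 0.83
2 18 0.28 5.8e+02 0.1 0.0 18 33 73 87 68 90 0.78
3 18 0.0035 7.1 6.2 0.1 18 34 170 185 166 187 0.83
4 18 0.28 5.8e+02 0.1 0.0 18 33 198 212 193 215 0.78
5 18 0.0035 7.1 6.2 0.1 18 34 295 310 291 312 0.83
6 18 0.28 5.8e+02 0.1 0.0 18 33 323 337 318 340 0.78
7 18 0.0035 7.1 6.2 0.1 18 34 420 435 416 437 0.83
8 18 0.28 5.8e+02 0.1 0.0 18 33 448 462 443 465 0.78
9 18 0.0035 7.1 6.2 0.1 18 34 545 560 541 562 0.83
10 18 0.28 5.8e+02 0.1 0.0 18 33 573 587 568 590 0.78
11 18 0.009 18 4.9 0.1 18 34 670 685 666 687 0.81
12 18 0.28 5.8e+02 0.1 0.0 18 33 698 712 693 715 0.78
13 18 0.0035 7.1 6.2 0.1 18 34 795 810 791 812 0.83
14 18 0.28 5.8e+02 0.1 0.0 18 33 823 837 818 840 0.78
15 18 0.009 18 4.9 0.1 18 34 920 935 916 937 0.81
16 18 0.28 5.8e+02 0.1 0.0 18 33 948 962 943 965 0.78
17 18 0.009 18 4.9 0.1 18 34 1045 1060 1041 1062 0.81
18 18 0.28 5.8e+02 0.1 0.0 18 33 1073 1087 1068 1090 0.78
"""


@dataclass(frozen=True)
class DomainHit:
    """Hypothetical record holding a subset of the 13 columns."""
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int
    acc: float


def parse_hits(text: str) -> list[DomainHit]:
    """Parse whitespace-separated rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(DomainHit(int(f[0]), float(f[2]), float(f[3]),
                              float(f[4]), int(f[8]), int(f[9]),
                              float(f[12])))
    return hits


hits = parse_hits(ROWS)

# Illustrative cutoff only; the record does not state which threshold it uses.
I_EVALUE_CUTOFF = 10.0
stronger = [h for h in hits if h.i_evalue <= I_EVALUE_CUTOFF]

# Distance between alignment starts of consecutive odd-numbered domains.
odd_starts = [h.ali_from for h in hits if h.index % 2 == 1]
spacing = {b - a for a, b in zip(odd_starts, odd_starts[1:])}

print(len(hits), len(stronger), spacing)
```

The spacing between odd-numbered hits gives a quick check on whether the matches recur at a fixed interval along the protein, as the repeated blocks in the sequence suggest.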
Sequence Information
- Coding Sequence
- ATGAAACATGAACATGCAAGAAAAACAAGCGAGGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCTCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGTAATATATGTGGCGTTACTTTTGCTCTAAAAGGAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGGAAAATCGAAAAGCGAGGGGAATCCGTTCAATTTCCCTGCGGATAATGGAACAAAAAATAATTTTGATTACACAATTATGAAACATGACCATGCAAGAAAAACAAGCGAGGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCTCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGTAATATATGTGGCGTTACTTTTGCTCTAAAAGGAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGGAAAATCGAAAAGCGAGGGGAATCCGTTCAATTTCCCTGCGGATAATGGAACAAAAAATAATTTTGATTACACAATTATGAAACATGACCATGCAAGAAAAACAAGCGAGGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCTCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGCAATATATGTGGCGTTACTTTTGCTCTAAAAGGAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGAAAAATCGAAAAGTGAGGGGAATCAGTTCAATTTCCCTGCAGATAATGGAACAAAAATTAATTTTGATTACACAATTATGAAACATGAACATGCAAGAAAAACAAGCGAAGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCTCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGTAATATATGTGGCGTTACTTTTGCTCTAAAAGGAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGGAAAATCGAAAAGTGAGGGGAATCCGTTCAATTTCCCTGCGGATAATGGAACAAAAAATAATTTTGATTACACAATTATGAAACATGACCATGCAAGAAAAACAAGCGAGGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCTCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGCAATATATGTGGCGTTACTTTTGCTCTAAAAGGAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGAAAAATCGAAAAGTGAGGGGAATCAGTTCAATTTCCCTGCAGATAATGGAACAAAAATTAATTTTGATTACACAATTATGAAACATGAACATGCAAGAAAAACAAGCGAAGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCTCAAGGCGCATGAAAAACTCCAT
ACAGGTGAAAGGCCATATGAGTGCAATATATGTGGCGTTACTTTTGCTCTAAAAGCAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGAAAAATTGAAAAGCGAGGGGAATCAGTTCAATTTCCCTGCAGATAATGGAACAAAAATTAATTTTGATTACACAATTATGAAACATGAACATGCAAGAAAAACAAGCGAAGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCTCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGTAATATATGTGGCGTTACTTTTGCTCTAAAAGGAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGAAAAATCGAAAAGTGAGGGGAATCAGTTCAATTTCCCTGCAGATAATGGAACAAAAATTAATTTTGATTACACAATTATGAAACATGACCATGCAAGAAAAACAAGCGAGGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCCCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGCAATATATGTGGCGTTACTTTTGCTCTAAAAGCAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGAAAAATTGAAAAGCGAGGGGAATCCGTTCAATTTCCCTGCAGATAATGGAACAAAAAATAATTTTGATTACACAATTATGAAACATGACCATGCAAGAAAAACAAGCGAGGGGAGTCAGTTCAATATCACTTATGATTGCAGTATTCGCAAAAAAAGCTTTAATCGGAAAGATAACCCCAAGGCGCATGAAAAACTCCATACAGGTGAAAGGCCATATGAGTGCAATATATGTGGCGTTACTTTTGCTCTAAAAGCAAATTTAACTAAACATAAAATAATCCATACAGAAGAAAGGCCATATATTTGTAACATATGTCACGCTACTTTTCGTTCAAATTCAGGTTTACAAAGACACAAGGGTATCGTGCATAAAGAAAAATCGAAAAGCGAGGGGAATCAGTTCAATTTCCCTGCAGATAATGGAAAAAAAATTAATTTTGATTACACAATTATGAAACATGACCATGCAAGAAAAACAAGCGAGGGGAGTCAAAATAGATATTAG
- Protein Sequence
- MKHEHARKTSEGSQFNITYDCSIRKKSFNRKDNLKAHEKLHTGERPYECNICGVTFALKGNLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKGKSKSEGNPFNFPADNGTKNNFDYTIMKHDHARKTSEGSQFNITYDCSIRKKSFNRKDNLKAHEKLHTGERPYECNICGVTFALKGNLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKGKSKSEGNPFNFPADNGTKNNFDYTIMKHDHARKTSEGSQFNITYDCSIRKKSFNRKDNLKAHEKLHTGERPYECNICGVTFALKGNLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKEKSKSEGNQFNFPADNGTKINFDYTIMKHEHARKTSEGSQFNITYDCSIRKKSFNRKDNLKAHEKLHTGERPYECNICGVTFALKGNLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKGKSKSEGNPFNFPADNGTKNNFDYTIMKHDHARKTSEGSQFNITYDCSIRKKSFNRKDNLKAHEKLHTGERPYECNICGVTFALKGNLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKEKSKSEGNQFNFPADNGTKINFDYTIMKHEHARKTSEGSQFNITYDCSIRKKSFNRKDNLKAHEKLHTGERPYECNICGVTFALKANLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKEKLKSEGNQFNFPADNGTKINFDYTIMKHEHARKTSEGSQFNITYDCSIRKKSFNRKDNLKAHEKLHTGERPYECNICGVTFALKGNLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKEKSKSEGNQFNFPADNGTKINFDYTIMKHDHARKTSEGSQFNITYDCSIRKKSFNRKDNPKAHEKLHTGERPYECNICGVTFALKANLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKEKLKSEGNPFNFPADNGTKNNFDYTIMKHDHARKTSEGSQFNITYDCSIRKKSFNRKDNPKAHEKLHTGERPYECNICGVTFALKANLTKHKIIHTEERPYICNICHATFRSNSGLQRHKGIVHKEKSKSEGNQFNFPADNGKKINFDYTIMKHDHARKTSEGSQNRY
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00014403;
- 90% Identity
- iTF_00014403;
- 80% Identity
- iTF_00014403;