Kina012315.1
Basic Information
- Insect
- Kallima inachus
- Gene Symbol
- Gata5
- Assembly
- None
- Location
- HiC:6251087-6280252[+]
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 11 8 2.3e+04 -5.4 1.7 13 26 550 563 547 564 0.68 2 11 4.9 1.4e+04 -3.9 1.3 12 26 624 638 622 639 0.72 3 11 9.7e-08 0.00028 20.8 0.4 21 35 682 696 666 697 0.87 4 11 6.9e-11 2e-07 30.8 0.5 12 35 739 761 738 762 0.96 5 11 0.0019 5.5 7.1 0.1 25 35 826 836 825 837 0.89 6 11 7.5 2.2e+04 -4.5 3.0 10 26 897 913 892 914 0.76 7 11 0.00043 1.3 9.1 0.1 25 35 978 988 977 989 0.91 8 11 7.7e-16 2.3e-12 46.7 1.3 2 35 1093 1125 1093 1126 0.96 9 11 3.1e-12 9e-09 35.2 0.2 8 35 1175 1201 1172 1202 0.93 10 11 0.99 2.9e+03 -1.7 1.8 6 16 1250 1260 1249 1265 0.84 11 11 7.5 2.2e+04 -4.5 3.0 10 26 1587 1603 1582 1604 0.76
Sequence Information
- Coding Sequence
- ATGTGTAATTTGCCGCCGAAGTTCCATTCAGTGTGTCGCCTTTGTTTATCACTCTGTGGCGATAATTGCAGTGATGTTAAACTACCAATATTCGACCGTGATAAGGATAAATCACGACTTTCCGAGATGATAATGACATATTTGTCAATAATGGTATCACCGGAAGATATGCTGCCACAGGTGGTTTGTGGGAGTTGTGCACACAAACTTGATGAGTTCCATACATTTAGAGAACTGTCACACAAATCTGAGCTACTATTGGAACAATTTGTACAGTATGCAAATTCACTGACTGGTACAAAAGAGGATATCTTGAATATAACGGCGGACAAATTAGAAGAAATAATCAGGCCACTAAATGATTGCGAGTATGATGATGCGAGTAAACACAAATATTCCGAAATCGGATCTCCGGACTCTACGGAGGAAATGAAAAACTTAGAGAGTCGGCAGGCGGCAGTCACACTGCTTCAGATAAAAAACTACGATCCGTCTAAATACGCTGTCAAGACTGAGGAAAGTCCTCACATAATGTTCAACACCGGACCGAGTTTACCGCCTTCCGATAGAGCGAGAGAGGTTATGCACTGTAATGCCGTCATTGATATTATAAGCAAGGCTGTCGCGGTCGCGCAGCGAGAAAATGAAGAATCTCAAAATTTCTCGCCTAACTATTCAAGCGTCATTGACAGGACGCATGCGCCCAGTGCTAGCGAAGTGACCTACGCGCAAGAGTACCCGGATGACCAATACAGCACTTACAACCCTCCACAGAGCGCTACCAGTCCTGGTAGTAACGATGATCATGACCACAAAGAAATGGACCTCTCATTATACGGCTCAATCAAGAATGAATCACTGGAACCGAGCGAGCAGGCAGAACGTGACGTAACGGGAGGCTATTTACAAAGAACATTAACGAGCAATAAAACCCCCAGCTTCGCAGACGAGTACAAACAACATGTTTTTGGTCGAACGTGTAATAAAACAAAACAATCGCAAGAAAACTCTGTATATGAAGAATGCAGTCAAAGCAGTAGTGGTTCCGATCCAGACAGATTACAGATGGATATTTCTGAGGTGTCGCAGGACGATCCAGAAGAGACCCAGTCTGTGCCATCAGCCCAGTCCTCTCCGAAACCGCCCCACGAAAACGAAGGCGATAAGGATTCTCTATGGCAGGCTCTTCATAGACAAAACGGTCGCGGCGGAGAGGCGACGCAGCTTCTAAGGCGACTGATCAACAGCAAACACCTGGGTATGACCGTGTCGCCGCTGCGCGGTACGACGTCTCCTGTCCCTGCGCCGATGTTGCCTAACGGTGCAGTTTCACCGAACGGTGAGTGGTCGAATGCGGGTCGCGGGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGGGACTGCGCGCAGGAAACAGAGCTGTCCGGCGCGTGCGCAACCCATACACGAGCCTGCCAACGTCTGGCCCGTCGCGCACGATAACCAGGAGGGCGCGGAGAGCGCGGGCGGCGCGGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGGCGGCGCGGCGGCGGCGGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCCGGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACCCGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGCACGCACACCACCACATCTGGCGGCGCGACGCGCGCGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGAACACAACTGCGCACGCACACCACCACATCTGGCGGCGCGACGCGCGCGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGCACGCACACCACCACATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCGCGCCCCACCGCATGCGCCGCGACACATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGCGAGATGGTGTGCACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGTCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCTGCGGCCTCTACTACAAGCTGCACGGCGTGCGCGCCCCACCGCCATGCGCCGCGAACACCATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACACAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGCCGGCCGCGACCACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGTCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGCACGCACACCACCACCATCTGGCGGCGCGAACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACCACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGTCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGCGCGACGCGCGCGCGAGATGGTGTGCAACGCTGCGGCACGCACACCCCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACAACCATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAAGTCTCGGCGCAGCGGCAGCGGCGGCGGCGAGACGGGCGGCGCGGGCGGCGCAGGCGGCGCGGGGGGCGCGGGCGGCGCGGAGGCGGCGGCGGCGGAGGGGACGCGCGGCGAGGGCGCGGAGGAGGCGGTGCTGGCGGCGCTGCGGCGGCAGCTGCAGCCGCACCTGCTGGCGGCGCTGCACGCGCACTCGCCGCGCTCGCACGCGCAACCGCACTCGCACCGCTCGCAGGTGGGACGGAGCGTTTCCGAGTACGACGAGGCGCCCCTCAACCTCGTGGCGAGCCACGTGGCCGCCGAGGAGACTCACTGACTGCCTCGCGGTACGCTCGCCGATCCGAGTTCTTAGTGACTTTGCCAGTGGTCCGGAACCGTCAGCGGCGCCCGCTCGCCGACGGAGCCGCCTACTCGTGTTCCAGTTTCATAATCCGCGTTCGAAACGACGGCCAGTTACATACCATATACAATTTCGAGTTGCGCCAAAGCATCGATTGTCGAAACATTTGAGATTAATTTTTACTCTTTCGAATAGTAGTCGAAACGCAGTCACAAATGAGAAAAGATCTTATAATATAATAAGTATATTCAGAAAATTCGACTAG
- Protein Sequence
- MCNLPPKFHSVCRLCLSLCGDNCSDVKLPIFDRDKDKSRLSEMIMTYLSIMVSPEDMLPQVVCGSCAHKLDEFHTFRELSHKSELLLEQFVQYANSLTGTKEDILNITADKLEEIIRPLNDCEYDDASKHKYSEIGSPDSTEEMKNLESRQAAVTLLQIKNYDPSKYAVKTEESPHIMFNTGPSLPPSDRAREVMHCNAVIDIISKAVAVAQRENEESQNFSPNYSSVIDRTHAPSASEVTYAQEYPDDQYSTYNPPQSATSPGSNDDHDHKEMDLSLYGSIKNESLEPSEQAERDVTGGYLQRTLTSNKTPSFADEYKQHVFGRTCNKTKQSQENSVYEECSQSSSGSDPDRLQMDISEVSQDDPEETQSVPSAQSSPKPPHENEGDKDSLWQALHRQNGRGGEATQLLRRLINSKHLGMTVSPLRGTTSPVPAPMLPNGAVSPNGEWSNAGRGAAAAARRRRRGGGGGGTARRKQSCPARAQPIHEPANVWPVAHDNQEGAESAGGAGGGGGAAAAAARRRRRRRGGGGAAAAAAAARRRRTRAARWCATPAASTTSCRACRAPPPCAATPSTRAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAHAHHHIWRRDARARWCATPAASTTSCTACRAPPPCAATPSTRAAAGRDTTPSTPEHNCARTPPHLAARRAREMVCNACGLYYKLHGVPRPTAMRRDTTPSTPGTVSTLGLYYTILYSRRAATAHAHHHIWRRDARGEMVCNACGLYYKLHGVRAPPHAPRHIHTRRRRRPRHDAKHPRNSQYTRLYYTILYSRRAALRHAHHHHLAARRARRDGVQRCGLYYKLHGVPRPTAMRRDTSTRAAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPSGGATRAARWCATPAASTTSCTACRAPPPCAATPSTRAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPSGGATRARDGVHACGLYYKLHGVPRPTAMRRDTIHTRRRRRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPPSGGATRAARWCATLRPLLQAARRARPTAMRREHHPHAPPPPAATRRQAPQEHNCGTHTTTIWRRDARGEMVCNACGLYYKLHGVPRPTAMRRDTIHTRRRRPRHDAKHPRNSQYTRLYYTILYSRRAATAARHTTTIWRRDARGEMVCNACGLYYKLHGVPRPTAMRRDTIHTRRRRRPRPRRQAPQEQSVHSVYTTLYCIAVVQQLRTHTTTIWRRERARRDGVQRLRPLLQAARRAAPHRHAPRPPSTRAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPHLAARRARRDGVQRLRPLLQAARRAAPHRHAPRHHPHAPPPAATRRQAPQEQSVHSAYTTLYCIAVVQQLRHAHHHHLAARRARRDGVQRLRPLLQAARRAAPHRHAPRHHPHAPPRPRHDAKHPRNSQYTRLYYTILYSRRAATAARTPPPSGARRAREMVCNAAARTPHHLAARRARRDGVQRLRPLLQAARRAAPHRHAPRQPSTRAAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPSGGATRAARWCATPAASTTSCTACRAPPPCAATPSTRAAAGRDTTPSTPGTSLGAAAAAAARRAARAAQAARGARAARRRRRRRGRAARARRRRCWRRCGGSCSRTCWRRCTRTRRARTRNRTRTARRWDGAFPSTTRRPSTSWRATWPPRRLTDCLAVRSPIRVLSDFASGPEPSAAPARRRSRLLVFQFHNPRSKRRPVTYHIQFRVAPKHRLSKHLRLIFTLSNSSRNAVTNEKRSYNIISIFRKFD
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -