Basic Information

Gene Symbol
Gata5
Assembly
None
Location
HiC:6251087-6280252[+]

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion and binds DNA. GATA transcription factors typically contain two GATA zinc fingers, although several proteins contain only a single copy of the domain.
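The four-cysteine arrangement described above can be illustrated with a simple pattern scan. The sketch below assumes the commonly cited GATA-type spacing C-X2-C-X17-20-C-X2-C; that spacing is not stated in this record, and the Pfam HMM (PF00320) remains the authoritative model. The function name `find_fingers` and the toy peptide in the usage note are hypothetical, introduced only for illustration.

```python
import re

# Assumed spacing for a GATA-type zinc finger: C-X2-C-X17-20-C-X2-C.
# This is an approximation, not the PF00320 profile itself.
# The lookahead lets overlapping candidates be reported as well.
GATA_FINGER = re.compile(r"(?=(C.{2}C.{17,20}C.{2}C))")


def find_fingers(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, motif) for each candidate finger, 1-based inclusive."""
    hits = []
    for match in GATA_FINGER.finditer(protein):
        motif = match.group(1)
        start = match.start() + 1          # convert to 1-based coordinates
        end = match.start() + len(motif)   # inclusive end position
        hits.append((start, end, motif))
    return hits
```

For example, `find_fingers("MKCVNCGATSTPLWRRDGTGHYLCNACGLYHK")` on this toy peptide reports one candidate spanning residues 3 to 27. A regular expression of this kind is far less sensitive than a profile HMM, so it is best treated as a quick sanity check rather than a replacement for hmmscan.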
Hmmscan Out
 #  of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
 1  11         8   2.3e+04   -5.4   1.7        13      26       550     563       547     564  0.68
 2  11       4.9   1.4e+04   -3.9   1.3        12      26       624     638       622     639  0.72
 3  11   9.7e-08   0.00028   20.8   0.4        21      35       682     696       666     697  0.87
 4  11   6.9e-11     2e-07   30.8   0.5        12      35       739     761       738     762  0.96
 5  11    0.0019       5.5    7.1   0.1        25      35       826     836       825     837  0.89
 6  11       7.5   2.2e+04   -4.5   3.0        10      26       897     913       892     914  0.76
 7  11   0.00043       1.3    9.1   0.1        25      35       978     988       977     989  0.91
 8  11   7.7e-16   2.3e-12   46.7   1.3         2      35      1093    1125      1093    1126  0.96
 9  11   3.1e-12     9e-09   35.2   0.2         8      35      1175    1201      1172    1202  0.93
10  11      0.99   2.9e+03   -1.7   1.8         6      16      1250    1260      1249    1265  0.84
11  11       7.5   2.2e+04   -4.5   3.0        10      26      1587    1603      1582    1604  0.76
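The per-domain hmmscan rows above can be read programmatically to separate confident hits from noise. The following is a minimal sketch, assuming whitespace-separated rows with the 13 columns listed in the header and an independent E-value cutoff of 0.01; the cutoff, the `DomainHit` type, and the function names are illustrative assumptions, not part of this record.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int       # domain number ("#" column)
    i_evalue: float  # independent E-value
    score: float     # bit score
    ali_from: int    # alignment start on the protein
    ali_to: int      # alignment end on the protein


# The eleven rows copied from the Hmmscan Out table in this record.
ROWS = """\
1 11 8 2.3e+04 -5.4 1.7 13 26 550 563 547 564 0.68
2 11 4.9 1.4e+04 -3.9 1.3 12 26 624 638 622 639 0.72
3 11 9.7e-08 0.00028 20.8 0.4 21 35 682 696 666 697 0.87
4 11 6.9e-11 2e-07 30.8 0.5 12 35 739 761 738 762 0.96
5 11 0.0019 5.5 7.1 0.1 25 35 826 836 825 837 0.89
6 11 7.5 2.2e+04 -4.5 3.0 10 26 897 913 892 914 0.76
7 11 0.00043 1.3 9.1 0.1 25 35 978 988 977 989 0.91
8 11 7.7e-16 2.3e-12 46.7 1.3 2 35 1093 1125 1093 1126 0.96
9 11 3.1e-12 9e-09 35.2 0.2 8 35 1175 1201 1172 1202 0.93
10 11 0.99 2.9e+03 -1.7 1.8 6 16 1250 1260 1249 1265 0.84
11 11 7.5 2.2e+04 -4.5 3.0 10 26 1587 1603 1582 1604 0.76
"""


def parse_hits(text: str) -> list[DomainHit]:
    """Parse whitespace-separated domain rows; lines without 13 fields are skipped."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13:
            continue
        hits.append(DomainHit(int(fields[0]), float(fields[3]),
                              float(fields[4]), int(fields[8]), int(fields[9])))
    return hits


def significant(hits: list[DomainHit], max_i_evalue: float = 0.01) -> list[DomainHit]:
    """Keep hits whose independent E-value is at or below the assumed cutoff."""
    return [hit for hit in hits if hit.i_evalue <= max_i_evalue]
```

With the assumed cutoff, `significant(parse_hits(ROWS))` keeps domains 3, 4, 8, and 9; the remaining rows have independent E-values above 1 and negative or low bit scores.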

Sequence Information

Coding Sequence
ATGTGTAATTTGCCGCCGAAGTTCCATTCAGTGTGTCGCCTTTGTTTATCACTCTGTGGCGATAATTGCAGTGATGTTAAACTACCAATATTCGACCGTGATAAGGATAAATCACGACTTTCCGAGATGATAATGACATATTTGTCAATAATGGTATCACCGGAAGATATGCTGCCACAGGTGGTTTGTGGGAGTTGTGCACACAAACTTGATGAGTTCCATACATTTAGAGAACTGTCACACAAATCTGAGCTACTATTGGAACAATTTGTACAGTATGCAAATTCACTGACTGGTACAAAAGAGGATATCTTGAATATAACGGCGGACAAATTAGAAGAAATAATCAGGCCACTAAATGATTGCGAGTATGATGATGCGAGTAAACACAAATATTCCGAAATCGGATCTCCGGACTCTACGGAGGAAATGAAAAACTTAGAGAGTCGGCAGGCGGCAGTCACACTGCTTCAGATAAAAAACTACGATCCGTCTAAATACGCTGTCAAGACTGAGGAAAGTCCTCACATAATGTTCAACACCGGACCGAGTTTACCGCCTTCCGATAGAGCGAGAGAGGTTATGCACTGTAATGCCGTCATTGATATTATAAGCAAGGCTGTCGCGGTCGCGCAGCGAGAAAATGAAGAATCTCAAAATTTCTCGCCTAACTATTCAAGCGTCATTGACAGGACGCATGCGCCCAGTGCTAGCGAAGTGACCTACGCGCAAGAGTACCCGGATGACCAATACAGCACTTACAACCCTCCACAGAGCGCTACCAGTCCTGGTAGTAACGATGATCATGACCACAAAGAAATGGACCTCTCATTATACGGCTCAATCAAGAATGAATCACTGGAACCGAGCGAGCAGGCAGAACGTGACGTAACGGGAGGCTATTTACAAAGAACATTAACGAGCAATAAAACCCCCAGCTTCGCAGACGAGTACAAACAACATGTTTTTGGTCGAACGTGTAATAAAACAAAACAATCGCAAGAAAACTCTGTATATGAAGAATGCAGTCAAAGCAGTAGTGGTTCCGATCCAGACAGATTACAGATGGATATTTCTGAGGTGTCGCAGGACGATCCAGAAGAGACCCAGTCTGTGCCATCAGCCCAGTCCTCTCCGAAACCGCCCCACGAAAACGAAGGCGATAAGGATTCTCTATGGCAGGCTCTTCATAGACAAAACGGTCGCGGCGGAGAGGCGACGCAGCTTCTAAGGCGACTGATCAACAGCAAACACCTGGGTATGACCGTGTCGCCGCTGCGCGGTACGACGTCTCCTGTCCCTGCGCCGATGTTGCCTAACGGTGCAGTTTCACCGAACGGTGAGTGGTCGAATGCGGGTCGCGGGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGGGACTGCGCGCAGGAAACAGAGCTGTCCGGCGCGTGCGCAACCCATACACGAGCCTGCCAACGTCTGGCCCGTCGCGCACGATAACCAGGAGGGCGCGGAGAGCGCGGGCGGCGCGGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGGCGCGGCGGCGGCGGCGCGGCGGCGGCGGCGGCGGCGGCGCGGCGGCGGCGGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCCGGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACCCGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGCACGCACACCACCACATCTGGCGGCGCGACGCGCGCGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGAACACAACTG
CGCACGCACACCACCACATCTGGCGGCGCGACGCGCGCGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGCACGCACACCACCACATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCGCGCCCCACCGCATGCGCCGCGACACATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGCGAGATGGTGTGCACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGTCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCTGCGGCCTCTACTACAAGCTGCACGGCGTGCGCGCCCCACCGCCATGCGCCGCGAACACCATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACACAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGCCGGCCGCGACCACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGTCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGCACGCACACCACCACCATCTGGCGGCGCGAACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACCACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGTCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACATCTGGCGG
CGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGCGCGACGCGCGCGCGAGATGGTGTGCAACGCTGCGGCACGCACACCCCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACAACCATCCACACGCGCCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAGTCAGTACACTCGGCCTATACTACACTATACTGTATAGCCGTCGTGCAGCAACTGCGGCACGCACACCACCACCATCTGGCGGCGCGACGCGCGCGGCGAGATGGTGTGCAACGCCTGCGGCCTCTACTACAAGCTGCACGGCGTGCCGCGCCCCACCGCCATGCGCCGCGACACCATCCACACGCGCCGCCGCCGGCCGCGACACGACGCCAAGCACCCCAGGAACAAGTCTCGGCGCAGCGGCAGCGGCGGCGGCGAGACGGGCGGCGCGGGCGGCGCAGGCGGCGCGGGGGGCGCGGGCGGCGCGGAGGCGGCGGCGGCGGAGGGGACGCGCGGCGAGGGCGCGGAGGAGGCGGTGCTGGCGGCGCTGCGGCGGCAGCTGCAGCCGCACCTGCTGGCGGCGCTGCACGCGCACTCGCCGCGCTCGCACGCGCAACCGCACTCGCACCGCTCGCAGGTGGGACGGAGCGTTTCCGAGTACGACGAGGCGCCCCTCAACCTCGTGGCGAGCCACGTGGCCGCCGAGGAGACTCACTGACTGCCTCGCGGTACGCTCGCCGATCCGAGTTCTTAGTGACTTTGCCAGTGGTCCGGAACCGTCAGCGGCGCCCGCTCGCCGACGGAGCCGCCTACTCGTGTTCCAGTTTCATAATCCGCGTTCGAAACGACGGCCAGTTACATACCATATACAATTTCGAGTTGCGCCAAAGCATCGATTGTCGAAACATTTGAGATTAATTTTTACTCTTTCGAATAGTAGTCGAAACGCAGTCACAAATGAGAAAAGATCTTATAATATAATAAGTATATTCAGAAAATTCGACTAG
Protein Sequence
MCNLPPKFHSVCRLCLSLCGDNCSDVKLPIFDRDKDKSRLSEMIMTYLSIMVSPEDMLPQVVCGSCAHKLDEFHTFRELSHKSELLLEQFVQYANSLTGTKEDILNITADKLEEIIRPLNDCEYDDASKHKYSEIGSPDSTEEMKNLESRQAAVTLLQIKNYDPSKYAVKTEESPHIMFNTGPSLPPSDRAREVMHCNAVIDIISKAVAVAQRENEESQNFSPNYSSVIDRTHAPSASEVTYAQEYPDDQYSTYNPPQSATSPGSNDDHDHKEMDLSLYGSIKNESLEPSEQAERDVTGGYLQRTLTSNKTPSFADEYKQHVFGRTCNKTKQSQENSVYEECSQSSSGSDPDRLQMDISEVSQDDPEETQSVPSAQSSPKPPHENEGDKDSLWQALHRQNGRGGEATQLLRRLINSKHLGMTVSPLRGTTSPVPAPMLPNGAVSPNGEWSNAGRGAAAAARRRRRGGGGGGTARRKQSCPARAQPIHEPANVWPVAHDNQEGAESAGGAGGGGGAAAAAARRRRRRRGGGGAAAAAAAARRRRTRAARWCATPAASTTSCRACRAPPPCAATPSTRAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAHAHHHIWRRDARARWCATPAASTTSCTACRAPPPCAATPSTRAAAGRDTTPSTPEHNCARTPPHLAARRAREMVCNACGLYYKLHGVPRPTAMRRDTTPSTPGTVSTLGLYYTILYSRRAATAHAHHHIWRRDARGEMVCNACGLYYKLHGVRAPPHAPRHIHTRRRRRPRHDAKHPRNSQYTRLYYTILYSRRAALRHAHHHHLAARRARRDGVQRCGLYYKLHGVPRPTAMRRDTSTRAAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPSGGATRAARWCATPAASTTSCTACRAPPPCAATPSTRAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPSGGATRARDGVHACGLYYKLHGVPRPTAMRRDTIHTRRRRRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPPSGGATRAARWCATLRPLLQAARRARPTAMRREHHPHAPPPPAATRRQAPQEHNCGTHTTTIWRRDARGEMVCNACGLYYKLHGVPRPTAMRRDTIHTRRRRPRHDAKHPRNSQYTRLYYTILYSRRAATAARHTTTIWRRDARGEMVCNACGLYYKLHGVPRPTAMRRDTIHTRRRRRPRPRRQAPQEQSVHSVYTTLYCIAVVQQLRTHTTTIWRRERARRDGVQRLRPLLQAARRAAPHRHAPRPPSTRAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPHLAARRARRDGVQRLRPLLQAARRAAPHRHAPRHHPHAPPPAATRRQAPQEQSVHSAYTTLYCIAVVQQLRHAHHHHLAARRARRDGVQRLRPLLQAARRAAPHRHAPRHHPHAPPRPRHDAKHPRNSQYTRLYYTILYSRRAATAARTPPPSGARRAREMVCNAAARTPHHLAARRARRDGVQRLRPLLQAARRAAPHRHAPRQPSTRAAAAGRDTTPSTPGTVSTLGLYYTILYSRRAATAARTPPPSGGATRAARWCATPAASTTSCTACRAPPPCAATPSTRAAAGRDTTPSTPGTSLGAAAAAAARRAARAAQAARGARAARRRRRRRGRAARARRRRCWRRCGGSCSRTCWRRCTRTRRARTRNRTRTARRWDGAFPSTTRRPSTSWRATWPPRRLTDCLAVRSPIRVLSDFASGPEPSAAPARRRSRLLVFQFHNPRSKRRPVTYHIQFRVAPKHRLSKHLRLIFTLSNSSRNAVTNEKRSYNIISIFRKFD
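The coding and protein sequences can be cross-checked by translating the coding sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and an uppercase DNA string with no ambiguity codes; the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    A trailing partial codon is ignored. Raises KeyError on non-ACGT characters.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)
```

For instance, the first 21 bases of the coding sequence, `ATGTGTAATTTGCCGCCGAAG`, translate to `MCNLPPK`, which matches the start of the protein sequence, and the final bases `AGAAAATTCGACTAG` translate to `RKFD` followed by a stop codon, which matches its end.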

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-