Basic Information

Gene Symbol
-
Assembly
GCA_964007535.1
Location
OZ023331.1:49609596-49611294[-]

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 6 0.00014 0.79 10.5 0.0 15 28 30 43 24 48 0.84
2 6 0.00015 0.83 10.4 0.0 15 28 103 116 98 121 0.84
3 6 0.00015 0.83 10.4 0.0 15 28 176 189 171 194 0.84
4 6 0.00018 1 10.1 0.0 15 28 249 262 244 267 0.86
5 6 0.00064 3.6 8.4 0.0 15 28 274 287 268 292 0.83
6 6 0.00021 1.2 9.9 0.0 15 28 347 360 342 361 0.85

Sequence Information

Coding Sequence
ATGTGGGGACTGGGAGGGAGGAGGTTGGCCAGCGCCGCCACGAGCCAGCAGTCCCCTGCACATAGACAACCCCATCAACACCGCGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACGCCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGCGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGCGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGCGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGCGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACATAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACACGTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCGCAGTTACACCTGTGCGAAGCGTGCGGTATACCTGCACAGACAACCCCATCAACACCGTGTGGTGCCGCAAGCACAGTTACACCTGTGCGAAGCGTGCGGTATACCTGGGTGGGCATACTACATCCGGCTAGCTAGCACTCCTACTGCTAGCACTCCTACTGCTAGCACTCCTACTGCTAGCACTCCTACTGCTAGCACTCTTACTGCTAGCAAGGTGTTTGCTAGCAAGGTGTTTGCTAGGGGAACTCTAGCCGGGTTGAGGGGAGGCCAGCGCCTCTTGGAGTGTTCCTGA
Protein Sequence
MWGLGGRRLASAATSQQSPAHRQPHQHRVVPQAQLHLCEACGIPAQTTPSTPCGAASTVTPVRSVRYTCTDNPINTVWCRKHSYTCAKRAVYLHRQPHQHRVVPQAQLHLCEACGIPAQTTPSTPCGAASTVTPVRSVRYTCTDNPINTAWCRKHSYTCAKRAVYLHRQPHQHRVVPQAQLHLCEACGIPAQTTPSTPRGAASTVTPVRSVRYTCTDNPINTAWCRKHSYTCAKRAVYLHRQPHQHRVVPQAQLHLCEACGIPAHRQPHQHRVVPQAQLHVCEACGIPAQTTPSTPCGAASTVTPVRSVRYTCTDNPINTVWCRKRSYTCAKRAVYLHRQPHQHRVVPQAQLHLCEACGIPGWAYYIRLASTPTASTPTASTPTASTPTASTLTASKVFASKVFARGTLAGLRGGQRLLECS

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-