Basic Information

Gene Symbol
-
Assembly
GCA_937001565.2
Location
CAKZJP020000465.1:508618-519186[+]

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc
1 20 0.009 39 5.5 0.2 19 36 80 96 77 96 0.86
2 20 0.009 39 5.5 0.2 19 36 126 142 123 142 0.86
3 20 0.009 39 5.5 0.2 19 36 172 188 169 188 0.86
4 20 0.009 39 5.5 0.2 19 36 218 234 215 234 0.86
5 20 0.009 39 5.5 0.2 19 36 264 280 261 280 0.86
6 20 0.009 39 5.5 0.2 19 36 310 326 307 326 0.86
7 20 0.009 39 5.5 0.2 19 36 356 372 353 372 0.86
8 20 0.009 39 5.5 0.2 19 36 402 418 399 418 0.86
9 20 0.009 39 5.5 0.2 19 36 448 464 445 464 0.86
10 20 0.009 39 5.5 0.2 19 36 494 510 491 510 0.86
11 20 0.009 39 5.5 0.2 19 36 540 556 537 556 0.86
12 20 0.009 39 5.5 0.2 19 36 586 602 583 602 0.86
13 20 0.009 39 5.5 0.2 19 36 632 648 629 648 0.86
14 20 0.009 39 5.5 0.2 19 36 678 694 675 694 0.86
15 20 0.009 39 5.5 0.2 19 36 724 740 721 740 0.86
16 20 0.009 39 5.5 0.2 19 36 770 786 767 786 0.86
17 20 0.009 39 5.5 0.2 19 36 816 832 813 832 0.86
18 20 0.009 39 5.5 0.2 19 36 862 878 859 878 0.86
19 20 0.16 6.8e+02 1.6 0.0 16 27 994 1005 991 1009 0.82
20 20 1.2 5.1e+03 -1.2 0.0 22 27 1067 1072 1061 1079 0.73

Sequence Information

Coding Sequence
ATGAAATCGAAGTCGTCAAAGCTGAAGAGTGGCGATGAGAAGCCGCGCCGCAAGCACCAGAAGAGGCCGCACGCCTGCGAGTACTGCAACCAGAAGTTCCTCCACCTGAACATGCTGGAGGTGCACCGGCGCGCGCACGCGGGCGAGGCGCTGGTGCTGCGCTGTCACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAAACTTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGTACCACCGGAAGCGCCACGCGCCGGACAAGGAGTTCGTGTGCGACGTGTGCTCCAAGCGCTTCCCCGCCGCCTGCAAGCTGCACAAGCACCTCCTCACGCACCGCCGCGACGCCTTCGTGCTGCGCTACGAGTGTCCCGTCTGCGCACACATGTTCCACACGCGCTACCACGTGCACATGCACCTCAGCACGCACCAGAAGGAGGGCCTGATCTTAGAAGAGAATCGCAGCGAGATCTTGGCTATGGTTCTACAGAACGCGCGCAAGATCCCGCGCGCGGGCTGCACGCTAGCGCCCGCCGCGGCCGCGCACGAGCcgcctgcgcacgcgcacgcgccgccgcccgACGAGCGCTCGCGCGTGTGCAACATCTGCGGGGCCGTGTTCTCGCACTTCTACTACCTCGAGGAGCACCTCAAGAGCCACGGCGAGCGCATCGCCGTCGCCGACCTCGACAAGCCAGAAGATAAGAAATACATCTGTCCGATCTGTAATAAAGGCTTCAAGCTACACTACTACCTCAAACTCCACAGCTTCACGCATTCGAAGGAGAAGCCCTTCATCTGCCAACAGTGCGGGAAAGGGTTCATCACGAAAGGTAAACTGAAAAGACATTTGGAGACCCACACGGGCCTGAAGAAGTATCAGTGTCATATCTGCTACAAGTTCTTCACGCGGCCCAGCTACCTGCGCATACACGTGCGCACGATACACGGCACGCAGGACTATAACTTCAGGTTCGACAAGCGGTACGGACTCGGCTCGCTCGCTGTGTCGGCCATGACGATGTCGGATGTCAGTCAAAATAGTATataa
Protein Sequence
MKSKSSKLKSGDEKPRRKHQKRPHACEYCNQKFLHLNMLEVHRRAHAGEALVLRCHYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVYHRKRHAPDKEFVCDVCSKRFPAACKLHKHLLTHRRDAFVLRYECPVCAHMFHTRYHVHMHLSTHQKEGLILEENRSEILAMVLQNARKIPRAGCTLAPAAAAHEPPAHAHAPPPDERSRVCNICGAVFSHFYYLEEHLKSHGERIAVADLDKPEDKKYICPICNKGFKLHYYLKLHSFTHSKEKPFICQQCGKGFITKGKLKRHLETHTGLKKYQCHICYKFFTRPSYLRIHVRTIHGTQDYNFRFDKRYGLGSLAVSAMTMSDVSQNSI

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00238033;
90% Identity
iTF_00238033;
80% Identity
iTF_00238033;