Basic Information

Gene Symbol
-
Assembly
GCA_905147135.1
Location
LR990063.1:2923718-2926548[-]
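
The Location value packs a sequence accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it into fields follows; the function name `parse_location` and the 1-based inclusive reading of the coordinates are assumptions for illustration, not something this record states.

```python
import re

# Pattern for strings shaped like "LR990063.1:2923718-2926548[-]".
LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)

def parse_location(text: str) -> dict:
    """Split a location string into sequence ID, start, end, and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return {
        "seqid": match.group("seqid"),
        "start": int(match.group("start")),
        "end": int(match.group("end")),
        "strand": match.group("strand"),
    }

loc = parse_location("LR990063.1:2923718-2926548[-]")
# Span length, assuming 1-based inclusive coordinates (an assumption).
span = loc["end"] - loc["start"] + 1
```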

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion and binds DNA. GATA transcription factors contain two GATA zinc fingers; however, several proteins contain only a single copy of the domain.
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   14  0.25      9.3e+02   0.6    0.0   6         17      34        45      33        48      0.83
2   14  0.25      9.3e+02   0.6    0.0   6         17      63        74      62        77      0.83
3   14  0.25      9.3e+02   0.6    0.0   6         17      92        103     91        106     0.83
4   14  0.086     3.2e+02   2.0    0.0   6         18      121       133     120       135     0.87
5   14  0.086     3.2e+02   2.0    0.0   6         18      150       162     149       164     0.87
6   14  0.086     3.2e+02   2.0    0.0   6         18      179       191     178       193     0.87
7   14  0.086     3.2e+02   2.0    0.0   6         18      210       222     209       224     0.87
8   14  0.086     3.2e+02   2.0    0.0   6         18      239       251     238       253     0.87
9   14  0.086     3.2e+02   2.0    0.0   6         18      268       280     267       282     0.87
10  14  0.086     3.2e+02   2.0    0.0   6         18      297       309     296       311     0.87
11  14  0.086     3.2e+02   2.0    0.0   6         18      326       338     325       340     0.87
12  14  0.086     3.2e+02   2.0    0.0   6         18      357       369     356       371     0.87
13  14  0.086     3.2e+02   2.0    0.0   6         18      386       398     385       400     0.87
14  14  0.086     3.2e+02   2.0    0.0   6         18      415       427     414       429     0.87
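
Each row of the Hmmscan Out table is a whitespace-separated record of 13 fields, so it can be read into a typed structure for filtering or sorting. The sketch below assumes the column order given in this record's header (c-Evalue and i-Evalue before score and bias); the `DomainHit` class and `parse_domain_row` function are hypothetical names, not part of HMMER.

```python
from dataclasses import dataclass

@dataclass
class DomainHit:
    index: int       # "#": ordinal of this domain hit
    total: int       # "of": total number of domain hits
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float     # bit score
    bias: float      # bias correction
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # alignment accuracy

def parse_domain_row(line: str) -> DomainHit:
    """Parse one whitespace-separated data row of the table."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(f) for f in fields[6:12]),
        float(fields[12]),
    )

# Row 4 of the table in this record.
hit = parse_domain_row("4 14 0.086 3.2e+02 2.0 0.0 6 18 121 133 120 135 0.87")
aligned_length = hit.ali_to - hit.ali_from + 1
```

Parsed this way, the E-value and score columns become numeric and can be compared directly against whatever significance cutoff an analysis chooses.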

Sequence Information

Coding Sequence
ATGCTTCGTGATCTGTTCCACCTCCTCAGACTGCGCCTGGGTGAGTTCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTTCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTTCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTTCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGGTGAGTCCGGAGTGTCCGCGGCCCCGCAgcagcgcccgcccccgcccccgccggtaCTCACCGCGTCCACCAGCGCCTGGGTGA
Protein Sequence
MLRDLFHLLRLRLGEFGVSAAPQQRPPPPPPVLTASTSAWRLGEFGVSAAPQQRPPPPPPVLTASTSAWRLGEFGVSAAPQQRPPPPPPVLTASTSAWRLGEFGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWRLGESGVSAAPQQRPPPPPPVLTASTSAWVSPECPRPRSSARPRPRRYSPRPPAPG*
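
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. A minimal sketch of that translation is given below, assuming the standard genetic code and treating lowercase bases the same as uppercase (the mixed case in the coding sequence likely marks soft-masked repeats, which is an assumption here). The `translate` function is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    seq = cds.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing characters outside TCAG are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )

# First five codons of the coding sequence in this record.
prefix = translate("ATGCTTCGTGATCTG")  # "MLRDL"
```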

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-