Basic Information

Gene Symbol
-
Assembly
GCA_034769895.1
Location
CM068372.1:267810263-267828408[-]
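
The Location field packs the contig accession, coordinates, and strand into one string. A minimal sketch of splitting it into parts, assuming the layout `contig:start-end[strand]` seen in this record; the names `Locus` and `parse_location` are hypothetical and not part of any database API:

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Assumed layout: <contig>:<start>-<end>[<strand>], with strand given as + or -.
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Locus:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


locus = parse_location("CM068372.1:267810263-267828408[-]")
print(locus)
```

Whether the coordinates are 0-based or 1-based is not stated in the record, so any length derived from them should be treated with that caveat.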

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion and binds DNA. GATA transcription factors contain two GATA zinc fingers; however, several proteins contain only a single copy of the domain.
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 20 0.031 2e+03 2.5 0.0 3 15 65 77 64 82 0.88
2 20 0.029 1.9e+03 2.6 0.1 3 15 84 96 83 106 0.87
3 20 1 6.6e+04 -2.4 0.0 3 15 103 115 102 117 0.87
4 20 0.011 7.4e+02 3.9 0.0 3 15 122 134 121 137 0.89
5 20 0.0099 6.4e+02 4.1 0.0 3 16 141 154 140 163 0.83
6 20 0.01 6.7e+02 4.0 0.0 3 15 179 191 178 195 0.89
7 20 0.051 3.3e+03 1.8 0.0 3 15 198 210 197 213 0.89
8 20 0.033 2.1e+03 2.4 0.2 3 19 217 233 216 242 0.84
9 20 0.035 2.3e+03 2.3 0.0 3 15 255 267 254 270 0.89
10 20 0.032 2.1e+03 2.5 0.0 3 15 274 286 273 291 0.89
11 20 0.03 1.9e+03 2.6 0.1 3 15 293 305 292 310 0.87
12 20 0.033 2.1e+03 2.4 0.0 3 15 312 324 311 328 0.89
13 20 0.035 2.3e+03 2.3 0.0 3 15 331 343 330 348 0.88
14 20 0.13 8.7e+03 0.5 0.0 3 15 350 362 349 367 0.87
15 20 0.048 3.1e+03 1.9 0.0 3 15 369 381 368 384 0.91
16 20 0.35 2.2e+04 -0.9 0.1 3 15 388 400 387 403 0.87
17 20 0.081 5.2e+03 1.2 0.0 3 15 416 428 415 432 0.90
18 20 0.21 1.3e+04 -0.1 0.1 3 15 435 447 434 448 0.89
19 20 0.04 2.6e+03 2.1 0.0 3 15 454 466 453 471 0.90
20 20 0.028 1.8e+03 2.6 0.0 3 15 492 504 491 508 0.87
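
Each row above carries 13 whitespace-separated values: the domain index and total, the conditional and independent E-values, the bit score and bias, then the HMM, alignment, and envelope coordinates, and finally the accuracy. A minimal sketch of reading such rows into typed records, assuming this exact column order; the `DomainHit` type and its field names are hypothetical labels chosen for illustration:

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Convert one whitespace-separated domain row into a DomainHit."""
    fields = line.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {line!r}")
    # Cast each field with the type declared on the matching DomainHit attribute.
    casts = list(DomainHit.__annotations__.values())
    return DomainHit(*(cast(value) for cast, value in zip(casts, fields)))


# Two rows copied from the table above.
rows = [
    "5 20 0.0099 6.4e+02 4.1 0.0 3 16 141 154 140 163 0.83",
    "3 20 1 6.6e+04 -2.4 0.0 3 15 103 115 102 117 0.87",
]
hits: List[DomainHit] = [parse_domain_row(row) for row in rows]

# Pick the highest-scoring domain among the parsed rows.
best = max(hits, key=lambda hit: hit.score)
print(best.index, best.score, best.ali_from, best.ali_to)
```

Sorting or filtering on `i_evalue` instead of `score` works the same way, since both are plain floats on the record.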

Sequence Information

Coding Sequence
ATGTGCCAGGAACGTCTTGTTGGGCTTTCTTTAATGTCAATAGAATTTGAAGATCTTCAAGATATCGACATAAATGTTATAATAAAAGACTTTGCAGAAAAAAAGCAAGAAAAGTCATGTTCTAAGCGATATTATGAAAATCACCAAAAGCTCATCAATACTGGGTTCACAAGAGCCGCCATCGGACCAGTTAACTGCCTTGCCACTGGTACAAGCCAGGTGAGAGCAAACACCGTCATCGGACCAGTTAACTGCCTTGCTACTGGTACAAGCCAGGTGAGAGCAAACACCGTCATCGGACTAGTTAACTGCCTTGCTATTGGTACAAGCCGGGTGAGAGCAAACACCGTCACCGGACTAGTTAACTGCCTTGCTACTTTTACAAGCCAGGTGAGAGCAAACACCGTCCCAGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCAGATGAGAGCAAACACCGTCATCGGACCAGTTAACTGCCTTGCTACTGGTAAAAGCCAGGTGAGAGCAAACACAGTCCCCGGACTAGTTAATTGCTTTGCTACTGGTACAAGCCAGGTGAGAGCAAACACAGTCCCCGGACTAGTTAATTGCTTTGCTACTGGTACAAGCCGGGTGAGAGCAAACACCGTCATCGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCGGGTGAGAGAAAACACCGTCATGGGACTAGTTAATTGCCTTGCTAATGGTACAAGCCGGGTGAGAGCAAACACCGTCACCGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCAGGTGAGAGCAAACACCGTCCCCGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCAGGTGAGAGCAAACACCGTCATCAGACCAGTTAACTGCCTTGCTACTGGTACAAGCCAGGTGAGAGCAAACACCGTCACCGGACAAGTTAACTGCCTTACTACTGGCACAAGCCGGGTGAGAGCAAACATCGTCACCGGACTAGTTAACTGCCTTGCTACTGGTACAAAACAGGTGAGAGCAAACACCGTCACCGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCGAGTGAGAGCAAACACCGTCCCCGGACCAGTTAACTGCCTTGCTACTGGTACAAGCCAGGTGAGAGCAAACATCGTCATCGGACTAGTTACCTGCCTTACAACTGGTACAGGTCAGGTGAGAGCAAACACTGTCATCGACCAGGTGAGAGCAAACACCGCCATCGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCGGGTGAGAGAAAACACCGTCATCGGAGTAGTTAACTGCCTTGCTACTGGTACAAGCCGGGTGAGAGCAAACACCATCATCGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCAGGTGAGAGCAAACATCGTCCCCGGACTAGTTAACTGCCTTGCTACTGGTACAACCCGGGTGACAGCAAACACCGTCACCGGACTAGTTAACTGCCTTGCTACTGGTACAAGCCAGGTCAGAGCAAACACAGCCCCGGACTAG
Protein Sequence
MCQERLVGLSLMSIEFEDLQDIDINVIIKDFAEKKQEKSCSKRYYENHQKLINTGFTRAAIGPVNCLATGTSQVRANTVIGPVNCLATGTSQVRANTVIGLVNCLAIGTSRVRANTVTGLVNCLATFTSQVRANTVPGLVNCLATGTSQMRANTVIGPVNCLATGKSQVRANTVPGLVNCFATGTSQVRANTVPGLVNCFATGTSRVRANTVIGLVNCLATGTSRVRENTVMGLVNCLANGTSRVRANTVTGLVNCLATGTSQVRANTVPGLVNCLATGTSQVRANTVIRPVNCLATGTSQVRANTVTGQVNCLTTGTSRVRANIVTGLVNCLATGTKQVRANTVTGLVNCLATGTSRVRANTVPGPVNCLATGTSQVRANIVIGLVTCLTTGTGQVRANTVIDQVRANTAIGLVNCLATGTSRVRENTVIGVVNCLATGTSRVRANTIIGLVNCLATGTSQVRANIVPGLVNCLATGTTRVTANTVTGLVNCLATGTSQVRANTAPD
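
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. A minimal sketch of that translation, assuming the standard genetic code (the record does not state which table was used); `translate` and `CODON_TABLE` are hypothetical names, and the two short inputs are the first 27 and last 12 bases of the coding sequence above:

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons containing ambiguous bases are reported as X.
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTGCCAGGAACGTCTTGTTGGGCTT"))  # start of the coding sequence
print(translate("GCCCCGGACTAG"))                 # end of the coding sequence
```

Passing the full coding sequence to `translate` allows a direct comparison with the protein sequence listed in the record.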

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
-
90% Identity
-
80% Identity
-