Basic Information

Gene Symbol
LdOrf-130
Assembly
GCA_963082625.1
Location
OY720259.1:11939391-11963682[+]
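
The location string above packs the sequence accession, start, end and strand into one field. A minimal sketch for splitting it is shown here; the names `Locus`, `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    seqid: str   # sequence accession, e.g. "OY720259.1"
    start: int   # start coordinate as printed in the record
    end: int     # end coordinate as printed in the record
    strand: str  # "+" or "-"


def parse_location(text: str) -> Locus:
    """Parse a string shaped like 'SEQID:START-END[STRAND]'."""
    match = re.fullmatch(r"(\S+):(\d+)-(\d+)\[([+-])\]", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Locus(seqid, int(start), int(end), strand)


def span_length(locus: Locus) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return locus.end - locus.start + 1


locus = parse_location("OY720259.1:11939391-11963682[+]")
print(locus, span_length(locus))
```

Under that assumption the locus spans 24292 bp on the forward strand.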

Transcription Factor Domain

TF Family
zf-GATA
Domain
zf-GATA domain
PFAM
PF00320
TF Group
Zinc-Coordinating Group
Description
This domain uses four cysteine residues to coordinate a zinc ion and binds DNA. GATA transcription factors contain two GATA zinc fingers; however, several proteins contain only a single copy of the domain.
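
The four-cysteine arrangement described above can be searched for with a simple pattern. The sketch below assumes the commonly cited GATA-type spacing C-x2-C-x17-20-C-x2-C; this spacing is not given in this record, and the function name `find_gata_like_fingers` is hypothetical. A regular expression of this kind is only a rough screen and does not replace the profile HMM search reported under Hmmscan Out.

```python
import re

# Assumption: GATA-type fingers follow C-x2-C-x(17..20)-C-x2-C.
GATA_LIKE = re.compile(r"C.{2}C.{17,20}C.{2}C")


def find_gata_like_fingers(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, matched text) using 1-based inclusive positions.

    Only non-overlapping matches are reported, scanning left to right.
    """
    seq = "".join(protein.split()).upper()
    return [(m.start() + 1, m.end(), m.group()) for m in GATA_LIKE.finditer(seq)]


# Synthetic example, not taken from this record.
toy = "AA" + "C" + "AA" + "C" + "A" * 17 + "C" + "AA" + "C" + "AA"
print(find_gata_like_fingers(toy))
```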
Hmmscan Out
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to   acc
1   18       7.1   2.4e+04   -3.8   0.3        12      28       490     506       488     511  0.69
2   18      0.61   2.1e+03   -0.4   1.3        20      33      1697    1710      1688    1713  0.77
3   18       0.5   1.7e+03   -0.1   0.5         1      30      1753    1786      1753    1788  0.56
4   18    0.0017       5.9    7.7   0.0        13      29      1897    1913      1896    1919  0.91
5   18     0.033   1.1e+02    3.6   0.1        17      29      1930    1942      1924    1948  0.88
6   18     0.033   1.1e+02    3.6   0.1        17      29      1959    1971      1953    1977  0.88
7   18     0.033   1.1e+02    3.6   0.1        17      29      1988    2000      1982    2006  0.88
8   18     0.033   1.1e+02    3.6   0.1        17      29      2017    2029      2011    2035  0.88
9   18     0.033   1.1e+02    3.6   0.1        17      29      2046    2058      2040    2064  0.88
10  18     0.033   1.1e+02    3.6   0.1        17      29      2075    2087      2069    2093  0.88
11  18     0.018        59    4.5   0.0        16      29      2103    2116      2099    2122  0.85
12  18     0.018        59    4.5   0.0        16      29      2132    2145      2128    2151  0.85
13  18     0.018        59    4.5   0.0        16      29      2161    2174      2157    2180  0.85
14  18     0.018        59    4.5   0.0        16      29      2190    2203      2186    2209  0.85
15  18     0.018        59    4.5   0.0        16      29      2219    2232      2215    2238  0.85
16  18     0.018        59    4.5   0.0        16      29      2248    2261      2244    2267  0.85
17  18     0.016        55    4.6   0.0        16      30      2277    2291      2273    2294  0.84
18  18       6.5   2.2e+04   -3.7   0.1        23      29      2312    2318      2312    2323  0.80
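
Each data row in the table above carries 13 whitespace-separated fields: domain index, domain count, conditional and independent E-values, score, bias, HMM, alignment and envelope coordinates, and accuracy. The following sketch reads such rows into records and filters them; the class `DomainHit`, the function `parse_domain_row` and the 0.01 cut-off are illustrative assumptions, and the parser follows the layout printed in this record rather than any particular HMMER output format.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_domain_row(line: str) -> DomainHit:
    """Parse one 13-field domain row as laid out in this record."""
    f = line.split()
    if len(f) != 13:
        raise ValueError(f"expected 13 fields, got {len(f)}: {line!r}")
    return DomainHit(
        int(f[0]), int(f[1]),
        float(f[2]), float(f[3]), float(f[4]), float(f[5]),
        int(f[6]), int(f[7]), int(f[8]), int(f[9]), int(f[10]), int(f[11]),
        float(f[12]),
    )


# A few rows copied from the table above.
SAMPLE_ROWS = [
    "1 18 7.1 2.4e+04 -3.8 0.3 12 28 490 506 488 511 0.69",
    "4 18 0.0017 5.9 7.7 0.0 13 29 1897 1913 1896 1919 0.91",
    "11 18 0.018 59 4.5 0.0 16 29 2103 2116 2099 2122 0.85",
    "17 18 0.016 55 4.6 0.0 16 30 2277 2291 2273 2294 0.84",
]

hits = [parse_domain_row(row) for row in SAMPLE_ROWS]
# Illustrative threshold only; choose a cut-off suited to the analysis.
strong = [h for h in hits if h.c_evalue < 0.01]
best = max(hits, key=lambda h: h.score)
print([h.index for h in strong], best.index, best.ali_from, best.ali_to)
```

Among the sampled rows, only domain 4 passes the illustrative conditional E-value cut-off, and it is also the highest-scoring match.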

Sequence Information

Coding Sequence
ATGTCCACCACCATGGAGGATCTCTTGGCAAATCAAAGCCAAATCATCGGAATGCTGAATCAGCTCATTAGCAACTTCAAGAAGGATGGCGCTGATCGGAAAACTCAAGCATACATAAAGAAAAGGCTGAGCTCTCTAGAAACATACTGGCATGAGTTTCAGTCCAATCATATAAAATTATGCGAATTTGATGATAAATCGCATGAATATTTCACTGGTGATGCTTATAAGCGAACATCAGAATTTTACATTGAGTCACGTGCACACATGCAAAAATACTTATCCTTTGACACTAAGACAGAGGAAAAACAATTTCTGCGTCCTGCTACACCCCTAGCAGCAGCAGGGGGGCATGCGTCGCCGTGTCCCGGGACGTCGGAACCGTGGCAACAGCCACCATCGTCCGAGTTTCAAGCTCCACGCAGCAGCGGCGGGCAGGGAACCAGCAGCAAGCTGGATGAGATGCTACGAAGGCAGCAGAGTAACTTCAAAGCATTTTCTAGGACGATCAATTTAATATGCATCGACGAAATCACCAGCAGGTGGCAACTCGAAGATTCTCTAAGGTCAGTGCAAGCTCGTTGGTCTGTAGTTGACTGCCTTCATTGGGAGATTGATAGCGAGCTGTACGAAGAAAACTTCGAGTACGAGCAAAGCTATAGTAGATATGAACAacagtttaataatataaaaatatctcttaATGAGAAAATGTGGTCGCAATGTCATCAAGAAAGATCGACACCACAAATGGATATACCTACCTTCAATGGGAATTTTATGCAGTGGGTATCCTTCAAAGATTTGTTTGTAGAGTCTATACACAAGAATCAAACTCTATCGAACGCCCAAAAGATGCAGTTcctcaaaagtaaaataaaaggcGAAGCTGAAAAATTAATACAACATTTACCTATCAGCTCCGAAAACTACGAGGTAAGCTGGGAAATCCTTACACATCGTTATAATAACGTAAAATTGATTTTCAACAGTCACATGAATACCTTGACAAGTTTACCAACGTTGCAGCAGCAAACGGCAGCAGCCGTCAAAAAAATCTATGATACCACCAACGAGTGTCTAAACGCTATAAAGGCCCTAGGAGTCGACATAAGTACTTGGGACCCAATTATCGTTTACCTCATCACGAACAAACTGGATGCCGACACTCATATCGACTATATAGAGTCCTTAAAAAATCCTAGAGATTTACCTACTCTGAAAGATCTAGTAGAGTTTCTTGAGCGGAAATTTACGTCTTTGGAAGCATCCAAACGTAAAGGAGAGTCTCCAAAACCTCAGCAGCAAGAACCATCGACTAACAAAAAGACAAATTTCTTTAAAGCATTTAccagtaacaaaaaaatacaaaactccGATAGCGATGAACCTAAATGTTTAGTTTGTAACTCAACAAAACAACACGGCATATTCAATTGCGAAAAGTTCCTGGAAATGAAACCCAATATGAGGCTAAAGACAGTTGAAAGGTTGTCCATATGTAAAAGATGTCTTAGGTCCCATCGTAATAAGAAATGTATTTCAACCAAGAAATGTCGTGAGTGCGAGGGAGAACACAATACCTTATTGCACAACACAACCAACGATCAAATTTCTTCCGAACCGAGTGAAGCAAAAGGTTCTCAGCAGAGTCAGCAGTCAGGTGCACACGTTTTACAAGGAAAAACGGGTGAAATACTTCTTGCAACTGCGCTAATAAATGTCACCAAGGCCGATGGAACTCACCAAACTCTACGAGCGCTAATCGATCAGGGCTCACAAACATCCTTGATTTCAGAAAACGCAGCTCAACTTTTAGAAATCAAACGAAAAAGATGCAAAGGGATAATTTTTGGAGTAGGGGCTAAAGAAATGGTTTGCAAAGGCATAATTGACATTTCAATCTCATCTCTTCATACCAACTATGAATTGGACATCGAGGTATACATTATGAAACATGTCGTCAACAATTTGCCAAATCAAACGTTCACGAAACCATC
GTGGGCCCATATAGAGGACATTCAGTTGGCTGATCCAGAGTTTTACAGAAGTCGATCTGTGGATCTATTGTTAGGCGCAGACGTCTACTCAGAAATAATAATGAGTGGTATCATCAGAGAAAATCCGTCCATGCCAATCGCTCAGCAGACGCGTTTGGGTTGGATATTATGTGGTAGCACAAATATCACTAAATCATTCCACTGTAACGTGGTTCAAACAGACGATAGTGACCTATCTAGATTTTGGTCTATGAAAGAAGTCCCAGATCAACATAACATGTCTACAGAAGACATACAGTGCCTAAAACTGTACAAAGATACGACGGTACGCCAGGCAGATGGCCGGTATCCCGCAGCAGCAAAGGCTGTTGAAGACCAGTTCTACATGGACGACCTACTGAGTGGATCCCACAGTATAGAGTCTGCACTTGAACTAAAATCGAATCTGATCGCACTCTTAAAAGCGGGGGGTTTTAATCTTCGCAAGTGGGCTTCCAACGAGCCATCATTACAGGATAATGATACGTCATCTCAAGACACCTTCGACTTTCGTCATCAGGAATCAACCAAAACATTGGGACTTACCTGGAAAGCTGCGGAAGATGTCTTTACTTTTAAATGCAAGATGTCTTTACCGAAATTGGAGCTAAGTGGAGCGCACCTATTAGTAAATCTAATGACCAAAATCTCAGAGAGCTTGAACGGTCACAAAATCGAGTTGTACGGCTGGTGTGACTCAACCGCAGTACTAGGTTGGCTAAGTGGAGAACCAAGTAGATGGAAACCTTTTGTCGCCAACAGAGTCATAAAAATAACCCAGGTCATGCCTTCAAGGTGCTGGCGTTACATTAAATCTAAAGAAAACCCAGCAGATTGCGCTAGCCGTGGTCTATCAGCCAGTGAACTCCAAAATCATATGCTCTGGTGGACCGGCCCTACCTGGCTATCATcattcaaaaaagaaaataaaaaagtcgtaTACGATACTCAATTAGAGACAAAGAATGTGAAACAGTCAGTCAATTCGATACGAATAGATAACACAGTCAACGGCGCTTACAACGTTACTTCAGTTCAGAGTAACAACAGTATATACTTTGACAAGATATCTGACATGAGAGTAATTCAAGACAAGTGGAAACTTATAGTATACTACAACATGGAACCGTACTGgaaagaaactaaattactGTCTGCCCACTATCAACATCTTGAGGAATTATGCTCACAGTTTAAGGAACGGAAATCCCACTGTGACATCATACTGCTCCAGCTACGTCAATGTTTCCGAGAATTagagtataataattatttactacaGCACGGTAATAAGAACACTCGAACTAAAAGAGGCCTTATCGACGGAGTAGGTAGCATCGCAAACAGTTTGTTCGGCGTTTTGGATAGCCAGTTCGCAAAGCAATACGAGATTGACATTTCCAAACTAAGAGAAAACCAAAAGCATTTGGCTACTCTATGGAAAAATCAGACCTCAGTCATAGAAGCGGAAAATAATCTTTTAAAAAGGATCGAATCCACATTAGAAAAACATGACGACCAATTTGCTAAACATCTAAAGGAGTTGGAACAGGCAGAAGTCAAATTAAATGACGCACTTGTCAGGGTAGAGACTGTTGATGAATTCgttatatccagcattattgcCAATAGTATCTACGTAAGTCTCAAGAACATTCAAGACGATTTGCTGGGTACCATTATTAACTCTCATGAATTCAACGTGCATCTACTTACACCAACACAGTTTCGCGAAGAGCTCGGCATCATATCTGGACAGTTGTCTAAGGACTTATCTCTGCCTATAAACAATATCCATTCAGATCTTACAAAACTTTATAAGATTCTTCAAGTTAGAGCTTGCATGACTTCACAATACTTCATCCTCGAAATACAGATACCTTTGGTGAATAGAGACAATTACGAGCTATTCAAGTCCTACTCAGTCCCGCGACAAATTGGGAATGCAGTTGTCAGCATAT
TGGCAAAGGCGGCTTATTTTGCAATTAACCTACAGAAAGATACCTACATCCCGATTTCGGAGAATGAATTTTACCAGAAGTGCACTAACTTTGAAGACAATTTACGTTTATGCAACCTTCAGAAGCCGGTGTATCAAAAGGCAACTGGGGACAATCTATGTGTCAAGGATAATCAGAAAGACCaatgtaaaacaataataagcaCGTGTCAACCTACTTGGACGGAGCTTATCAATGTAAACACATTCATACTCACCTGTTGCGGTCAATGCCCTGTAAGATTTATCTGTGGCTATCAAGTTACACAGAAACATTTGCCAGGTGTGAGTATTATAGCACTAGGAAACGGCTGCGTCATCAAGGGCGAGTCCTTCACTATTACAGCACATAAACAACACGCTAGCAATTTGAATATTGAGCCAGACCTATATCCGATTGAGATCGCACCCGTCAATACCATCATAAATCTGACGATCCCGATCAGTCATATAGAAGAGAGCAACAGCAACTTCAACAAGTCGGTCGAGCTGATAGACCAGCGCATCGCCGAGTTGAAGGCAACAGCGGATCAAGTGGAAGTGGGGAACGAGATCTCCTCTCACGACGTACACCACTACGTCGCCATCTACACCGTGGCAGCTGCAGCGGCGGTCGTCGGCGGGCTCACCTGGTGGCGAgcacgccgcgccgccgcctcctCCGCCAGGCAGCCCGTGCCAGCAGTTGGTGCAGCGCCCGTTGCTAGTAGTATATCCAAATGGGATCGCTTACCAAAATGGTTTTGTTACGAGTGTGCAGCATTGCTGTATAAATTCTATGAGTTTAAGAAGAGATGTCACAATGGACAAAAAGCATTCAAGAAACTCTTGGAGACAAAATCtAGgtcagtaaaaaagaaattcCTCGACGAGAAACATTGGAAGAAGATAACCTTGACTGATGAGGAAGCTTCTCGACGGTTCCAGGCGAAGGCATTGGAACAAAAGTATTTGAAGGCCGACTTCAAATGTACAGATTGCTATAGAACATTTTCACAGGAGGATATGAAGAAGCGGCATGTAAAGTTGAGGCATTGTGAGTCGCTGGGGCCACACGAGTGCCGCCACTGCCGTATGCGATTCAAGTGGAAGTCTCGACTGCAGAAGCATTTGAAGGAACACTTCACTATCTTTAAATGCCTGAGATGTGAATTGTCTTTTCCCGTTGAAATATCTGCGTTCCAACACGACTATTCGCACAACGGAGTCACGTGGACCTGTGCTCACTGCGGACAGGGATTCAGACACAGTTCCACATACTACACCCACTTGAGGAAGCATAAGAGTAAATACGTGTGCACTCTGTGCGGCGTCTCGTTCGTCAGCGAGTTCGGACTGTTCATGCATAAGCGTGTCAAGCATGTCATTAACGAGGATACCAAAGAAGAAAGCTCCAACACTTACTGCAAAGTGTGCGACATTAAATTCGACACGTTGAAAGGGTTTAATGAACACTTTGCCGAATCGGCCTTGCACGCCATTGAGGTAGCTGATGGCGCAACAAGTCTGCCGCTAAAGACCAAGTGTACGAGACGGCGAGTGCCCAGGATATCCACCAACTGTAATATTTGCCTGCGCGCATTCTCGACGTACTCCGCCTTCTCCAAGCATCACGCGCAGGCGCACGCGGGCGCGCCGCTGCAGCGCGCGCGCGAGGGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACCACCACGCGTGTCGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACCACCACGCGTGTCGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACCACCACGCGTGTCGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACG
CTAGCTGTAAGTACACACGTGCCAACGTACCACCACGCGTGTCGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACCACCACGCGTGTCGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACCACCACGCGTGTCGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACAACCACGCGCGTCGCACGGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACAACCACGCGCGTCGCACGGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACAACCACGCGCGTCGCACGGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACAACCACGCGCGTCGCACCGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACAACCACGCGCGTCGCACGGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACAACCACGCGCGTCGCACGGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGCTAGCTGTAAGTACACACGTGCCAACGTACAACCACGCGCGTCGCACGGACGGACAGAGGCAGGTCTGCGAAGTGTGCGGCCTCACGTTAGCTAGTTCGTGGTCGCTGCTTAACCATTTGAACACTCACTCGCGCGTCCGACTCTTCACGTGCAAGACGTGCGGCCTTCAGCTCAGCTCCAAGAGTGTACTCCAGAGGCATCAGCTGACGCACACACACGACAAGAGTTTCTCGTGCGACCGCTGCGAACGGAGGTTCACACACGGCTCGGGGCTCCGCCTCCACATGAAGACGCACGACAAGAAGCACGCCTCCACAAGCAGAAAGAGGAAAGCGGAACCAATAGTGATGGAATACTTCAAGAACTATGGACAACTTCACTAG
Protein Sequence
MSTTMEDLLANQSQIIGMLNQLISNFKKDGADRKTQAYIKKRLSSLETYWHEFQSNHIKLCEFDDKSHEYFTGDAYKRTSEFYIESRAHMQKYLSFDTKTEEKQFLRPATPLAAAGGHASPCPGTSEPWQQPPSSEFQAPRSSGGQGTSSKLDEMLRRQQSNFKAFSRTINLICIDEITSRWQLEDSLRSVQARWSVVDCLHWEIDSELYEENFEYEQSYSRYEQQFNNIKISLNEKMWSQCHQERSTPQMDIPTFNGNFMQWVSFKDLFVESIHKNQTLSNAQKMQFLKSKIKGEAEKLIQHLPISSENYEVSWEILTHRYNNVKLIFNSHMNTLTSLPTLQQQTAAAVKKIYDTTNECLNAIKALGVDISTWDPIIVYLITNKLDADTHIDYIESLKNPRDLPTLKDLVEFLERKFTSLEASKRKGESPKPQQQEPSTNKKTNFFKAFTSNKKIQNSDSDEPKCLVCNSTKQHGIFNCEKFLEMKPNMRLKTVERLSICKRCLRSHRNKKCISTKKCRECEGEHNTLLHNTTNDQISSEPSEAKGSQQSQQSGAHVLQGKTGEILLATALINVTKADGTHQTLRALIDQGSQTSLISENAAQLLEIKRKRCKGIIFGVGAKEMVCKGIIDISISSLHTNYELDIEVYIMKHVVNNLPNQTFTKPSWAHIEDIQLADPEFYRSRSVDLLLGADVYSEIIMSGIIRENPSMPIAQQTRLGWILCGSTNITKSFHCNVVQTDDSDLSRFWSMKEVPDQHNMSTEDIQCLKLYKDTTVRQADGRYPAAAKAVEDQFYMDDLLSGSHSIESALELKSNLIALLKAGGFNLRKWASNEPSLQDNDTSSQDTFDFRHQESTKTLGLTWKAAEDVFTFKCKMSLPKLELSGAHLLVNLMTKISESLNGHKIELYGWCDSTAVLGWLSGEPSRWKPFVANRVIKITQVMPSRCWRYIKSKENPADCASRGLSASELQNHMLWWTGPTWLSSFKKENKKVVYDTQLETKNVKQSVNSIRIDNTVNGAYNVTSVQSNNSIYFDKISDMRVIQDKWKLIVYYNMEPYWKETKLLSAHYQHLEELCSQFKERKSHCDIILLQLRQCFRELEYNNYLLQHGNKNTRTKRGLIDGVGSIANSLFGVLDSQFAKQYEIDISKLRENQKHLATLWKNQTSVIEAENNLLKRIESTLEKHDDQFAKHLKELEQAEVKLNDALVRVETVDEFVISSIIANSIYVSLKNIQDDLLGTIINSHEFNVHLLTPTQFREELGIISGQLSKDLSLPINNIHSDLTKLYKILQVRACMTSQYFILEIQIPLVNRDNYELFKSYSVPRQIGNAVVSILAKAAYFAINLQKDTYIPISENEFYQKCTNFEDNLRLCNLQKPVYQKATGDNLCVKDNQKDQCKTIISTCQPTWTELINVNTFILTCCGQCPVRFICGYQVTQKHLPGVSIIALGNGCVIKGESFTITAHKQHASNLNIEPDLYPIEIAPVNTIINLTIPISHIEESNSNFNKSVELIDQRIAELKATADQVEVGNEISSHDVHHYVAIYTVAAAAAVVGGLTWWRARRAAASSARQPVPAVGAAPVASSISKWDRLPKWFCYECAALLYKFYEFKKRCHNGQKAFKKLLETKSRSVKKKFLDEKHWKKITLTDEEASRRFQAKALEQKYLKADFKCTDCYRTFSQEDMKKRHVKLRHCESLGPHECRHCRMRFKWKSRLQKHLKEHFTIFKCLRCELSFPVEISAFQHDYSHNGVTWTCAHCGQGFRHSSTYYTHLRKHKSKYVCTLCGVSFVSEFGLFMHKRVKHVINEDTKEESSNTYCKVCDIKFDTLKGFNEHFAESALHAIEVADGATSLPLKTKCTRRRVPRISTNCNICLRAFSTYSAFSKHHAQAHAGAPLQRAREGTDGQRQVCEVCGLTLAVSTHVPTYHHACRTDGQRQVCEVCGLTLAVSTHVPTYHHACRTDGQRQVCEVCGLTLAVSTHVPTYHHACRTDGQRQVCEVCGLT
LAVSTHVPTYHHACRTDGQRQVCEVCGLTLAVSTHVPTYHHACRTDGQRQVCEVCGLTLAVSTHVPTYHHACRTDGQRQVCEVCGLTLAVSTHVPTYNHARRTDGQRQVCEVCGLTLAVSTHVPTYNHARRTDGQRQVCEVCGLTLAVSTHVPTYNHARRTDGQRQVCEVCGLTLAVSTHVPTYNHARRTDGQRQVCEVCGLTLAVSTHVPTYNHARRTDGQRQVCEVCGLTLAVSTHVPTYNHARRTDGQRQVCEVCGLTLAVSTHVPTYNHARRTDGQRQVCEVCGLTLASSWSLLNHLNTHSRVRLFTCKTCGLQLSSKSVLQRHQLTHTHDKSFSCDRCERRFTHGSGLRLHMKTHDKKHASTSRKRKAEPIVMEYFKNYGQLH
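
The protein sequence can be cross-checked against the coding sequence by translating it. The sketch below assumes the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper. It removes line breaks and upper-cases the input first, since the coding sequence in this record is wrapped over several lines and contains lower-case stretches.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_at_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop whitespace, ignore case
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the coding sequence in this record.
print(translate("ATGTCCACCACCATGGAGGATCTCTTG"))
```

The first nine codons translate to MSTTMEDLL, matching the start of the protein sequence, and the final codons CAA CTT CAC TAG give QLH followed by a stop, matching its end.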

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_01161878;
90% Identity
iTF_01161878;
80% Identity
iTF_01161878;