Bvim037040.1
Basic Information
- Insect
- Brachylomia viminalis
- Gene Symbol
- -
- Assembly
- GCA_937001565.2
- Location
- CAKZJP020000465.1:508618-519186[+]
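The Location field packs a scaffold accession, a coordinate range, and a strand into one string. A minimal sketch of splitting it apart follows. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # scaffold / contig accession
    start: int   # start coordinate as written in the record
    end: int     # end coordinate as written in the record
    strand: str  # "+" or "-"


# Pattern for strings shaped like "SEQID:START-END[STRAND]".
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into its four components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


loc = parse_location("CAKZJP020000465.1:508618-519186[+]")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = loc.end - loc.start + 1
print(loc.seqid, loc.start, loc.end, loc.strand, span)
```

Under that assumption the gene region covers 10,569 bases on the forward strand of the scaffold.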
Transcription Factor Domain
- TF Family
- zf-GATA
- Domain
- zf-GATA domain
- PFAM
- PF00320
- TF Group
- Zinc-Coordinating Group
- Description
- This domain uses four cysteine residues to coordinate a zinc ion and binds DNA. GATA transcription factors contain two GATA zinc fingers; however, several proteins contain only a single copy of the domain.
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm from  hmm to  ali from  ali to  env from  env to  acc
1   20  0.009     39        5.5    0.2   19        36      80        96      77        96      0.86
2   20  0.009     39        5.5    0.2   19        36      126       142     123       142     0.86
3   20  0.009     39        5.5    0.2   19        36      172       188     169       188     0.86
4   20  0.009     39        5.5    0.2   19        36      218       234     215       234     0.86
5   20  0.009     39        5.5    0.2   19        36      264       280     261       280     0.86
6   20  0.009     39        5.5    0.2   19        36      310       326     307       326     0.86
7   20  0.009     39        5.5    0.2   19        36      356       372     353       372     0.86
8   20  0.009     39        5.5    0.2   19        36      402       418     399       418     0.86
9   20  0.009     39        5.5    0.2   19        36      448       464     445       464     0.86
10  20  0.009     39        5.5    0.2   19        36      494       510     491       510     0.86
11  20  0.009     39        5.5    0.2   19        36      540       556     537       556     0.86
12  20  0.009     39        5.5    0.2   19        36      586       602     583       602     0.86
13  20  0.009     39        5.5    0.2   19        36      632       648     629       648     0.86
14  20  0.009     39        5.5    0.2   19        36      678       694     675       694     0.86
15  20  0.009     39        5.5    0.2   19        36      724       740     721       740     0.86
16  20  0.009     39        5.5    0.2   19        36      770       786     767       786     0.86
17  20  0.009     39        5.5    0.2   19        36      816       832     813       832     0.86
18  20  0.009     39        5.5    0.2   19        36      862       878     859       878     0.86
19  20  0.16      6.8e+02   1.6    0.0   16        27      994       1005    991       1009    0.82
20  20  1.2       5.1e+03   -1.2   0.0   22        27      1067      1072    1061      1079    0.73
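Each hmmscan domain row above carries thirteen whitespace-separated fields. The sketch below shows one way to load such rows and measure the distance between consecutive alignment starts. The `DomainHit` class and `parse_hit` function are hypothetical names, and only four rows from the record are embedded as sample input.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    index: int       # hit number ("#")
    total: int       # total number of hits ("of")
    c_evalue: float  # conditional E-value
    i_evalue: float  # independent E-value
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float       # mean posterior accuracy of the alignment


def parse_hit(row: str) -> DomainHit:
    """Parse one whitespace-separated domain row into a DomainHit."""
    fields = row.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 fields, got {len(fields)}: {row!r}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *map(int, fields[6:12]),
        float(fields[12]),
    )


# Sample rows copied from the record (hits 1, 2, 3 and 19).
rows = [
    "1 20 0.009 39 5.5 0.2 19 36 80 96 77 96 0.86",
    "2 20 0.009 39 5.5 0.2 19 36 126 142 123 142 0.86",
    "3 20 0.009 39 5.5 0.2 19 36 172 188 169 188 0.86",
    "19 20 0.16 6.8e+02 1.6 0.0 16 27 994 1005 991 1009 0.82",
]
hits = [parse_hit(r) for r in rows]

# Distance between the alignment starts of consecutive sample hits.
spacing = [b.ali_from - a.ali_from for a, b in zip(hits, hits[1:])]
print(spacing)
```

Applied to the full table, the same calculation shows that hits 1 through 18 start exactly 46 residues apart, which matches the repeated block visible in the protein sequence.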
Sequence Information
- Coding Sequence
- ATGAAATCGAAGTCGTCAAAGCTGAAGAGTGGCGATGAGAAGCCGCGCCGCAAGCACCAGAAGAGGCCGCACGCCTGCGAGTACTGCAACCAGAAGTTCCTCCACCTGAACATGCTGGAGGTGCACCGGCGCGCGCACGCGGGCGAGGCGCTGGTGCTGCGCTGTCACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAAACTTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTG
CGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGAGTCCGCGTCACGACTACTGCCTGcagcccgcgcccgcgcgcgaCGAGCTGCGCGAGCACGAGGCCACGCACCTGGGCGCGCGGCCCTACCTGTGCACCGTGTGCGGGAAGACGTACAAGAAGAGGGAGACTATGGTGTACCACCGGAAGCGCCACGCGCCGGACAAGGAGTTCGTGTGCGACGTGTGCTCCAAGCGCTTCCCCGCCGCCTGCAAGCTGCACAAGCACCTCCTCACGCACCGCCGCGACGCCTTCGTGCTGCGCTACGAGTGTCCCGTCTGCGCACACATGTTCCACACGCGCTACCACGTGCACATGCACCTCAGCACGCACCAGAAGGAGGGCCTGATCTTAGAAGAGAATCGCAGCGAGATCTTGGCTATGGTTCTACAGAACGCGCGCAAGATCCCGCGCGCGGGCTGCACGCTAGCGCCCGCCGCGGCCGCGCACGAGCcgcctgcgcacgcgcacgcgccgccgcccgACGAGCGCTCGCGCGTGTGCAACATCTGCGGGGCCGTGTTCTCGCACTTCTACTACCTCGAGGAGCACCTCAAGAGCCACGGCGAGCGCATCGCCGTCGCCGACCTCGACAAGCCAGAAGATAAGAAATACATCTGTCCGATCTGTAATAAAGGCTTCAAGCTACACTACTACCTCAAACTCCACAGCTTCACGCATTCGAAGGAGAAGCCCTTCATCTGCCAACAGTGCGGGAAAGGGTTCATCACGAAAGGTAAACTGAAAAGACATTTGGAGACCCACACGGGCCTGAAGAAGTATCAGTGTCATATCTGCTACAAGTTCTTCACGCGGCCCAGCTACCTGCGCATACACGTGCGCACGATACACGGCACGCAGGACTATAACTTCAGGTTCGACAAGCGGTACGGACTCGGCTCGCTCGCTGTGTCGGCCATGACGATGTCGGATGTCAGTCAAAATAGTATataa
- Protein Sequence
- MKSKSSKLKSGDEKPRRKHQKRPHACEYCNQKFLHLNMLEVHRRAHAGEALVLRCHYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVSPRHDYCLQPAPARDELREHEATHLGARPYLCTVCGKTYKKRETMVYHRKRHAPDKEFVCDVCSKRFPAACKLHKHLLTHRRDAFVLRYECPVCAHMFHTRYHVHMHLSTHQKEGLILEENRSEILAMVLQNARKIPRAGCTLAPAAAAHEPPAHAHAPPPDERSRVCNICGAVFSHFYYLEEHLKSHGERIAVADLDKPEDKKYICPICNKGFKLHYYLKLHSFTHSKEKPFICQQCGKGFITKGKLKRHLETHTGLKKYQCHICYKFFTRPSYLRIHVRTIHGTQDYNFRFDKRYGLGSLAVSAMTMSDVSQNSI
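As a quick consistency check between the coding and protein sequences, the CDS can be translated and compared with the listed protein. This is a minimal sketch assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` is hypothetical, and only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Lowercase (soft-masked) bases are upper-cased before lookup.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last six codons plus stop, copied from the record.
print(translate("ATGAAATCGAAGTCG"))
print(translate("GTCAGTCAAAATAGTATataa"))
```

The two fragments translate to the first five and last six residues of the listed protein, so running the full CDS through the same function is a reasonable way to confirm that the two fields agree.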
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00238033;
- 90% Identity
- iTF_00238033;
- 80% Identity
- iTF_00238033;