Agen050975.1
Basic Information
- Insect
- Agriphila geniculea
- Gene Symbol
- -
- Assembly
- GCA_943789515.1
- Location
- CALSUL010000851.1:332792-335161[+]
Transcription Factor Domain
- TF Family
- zf-BED
- Domain
- zf-BED domain
- PFAM
- PF02892
- TF Group
- Zinc-Coordinating Group
- Description
- The BED finger, which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 18 0.0068 9.4 8.1 1.2 18 43 8 30 4 31 0.93 2 18 6.1 8.4e+03 -1.3 0.2 17 43 35 58 32 59 0.67 3 18 0.0012 1.6 10.6 0.0 18 43 67 89 63 90 0.87 4 18 0.84 1.1e+03 1.5 2.1 17 38 97 115 95 121 0.88 5 18 0.012 16 7.4 3.3 14 44 124 151 111 151 0.86 6 18 0.00061 0.84 11.5 0.2 13 43 154 181 152 182 0.87 7 18 0.00064 0.89 11.4 0.4 17 43 187 210 183 211 0.90 8 18 0.0093 13 7.7 0.3 17 41 218 239 213 242 0.79 9 18 0.0042 5.7 8.8 0.2 13 43 303 330 297 331 0.88 10 18 0.074 1e+02 4.8 0.0 15 44 364 390 360 390 0.83 11 18 0.61 8.4e+02 1.9 2.5 14 39 408 430 402 435 0.72 12 18 0.011 15 7.5 5.3 5 30 427 457 423 463 0.75 13 18 0.0047 6.4 8.7 2.2 14 43 466 492 454 493 0.80 14 18 0.13 1.7e+02 4.1 0.8 17 44 498 522 494 522 0.85 15 18 0.92 1.3e+03 1.3 0.7 18 28 529 539 525 551 0.84 16 18 5.1 7.1e+03 -1.1 0.0 35 43 573 581 565 582 0.87 17 18 5.6e-05 0.077 14.8 1.5 5 43 610 647 608 648 0.82 18 18 2.6 3.6e+03 -0.1 0.2 12 27 662 677 655 694 0.85
Sequence Information
- Coding Sequence
- ATGTATTCAAATCTAGATTTTGTCTGCGACTATTGCAGTCGAACCTTCACTCGGAAATACAACTTACAAACGCACATAGAGAACTGCCATCTCAACTCATCATGCTACTGCGACATTTGCGACCAAACGTTTGGTAGTCCCGCCGGCTTACAACTACATTTATCCAGAGGCCATAACCGGCATGGCCAACCTTTCCCAGAATGTGATATCTGCGGGCGAATTTTTActagaaaacaaaatattacatcCCATATGATCACAGTTCATTTACAAGGTATAGGTCCGGAAATAAAATGTCATACTTGCGATAAAACTTTTACTACTGAGAGAAACTTAAAGAGGCATAACAGCCAATTGCACAATCCTGATGCTGAAAACTTAACATGCGATAGTTGTAATAAAGCATTTAAAGGAAAGCACTCATTAATAGCTCATATGCAAGCTATGCATAATTTAGCAGACAAGGGCATCATCAAATGCCCGCTCTGTGATAAAGTCTACACCAACAACAGAAATCTGAAACGCCACGTAGAAATGTATCATGGTGAAAAAGGAGAATTCAAATGTGAAATATGCCCTAAAGTTTACACTTCGAACCAGAGCTTAAGGCGGCATGTTAGAACAAGACATTATTCGGACGATGGAGACACGTGCGTTTGTGAGTATTGTAATAAACATATCGTCGGAAAAGAAAATTTGGATAGTCATGTGTCCTTCTTTCATAAACTCGATGATGATATGGATAGCGACTCAGCTTCTTTGTATTATcaatgtgaaagttgttccaaGGGATTTGGCGAAGAGCCTTTGCTGAGACAACATGTCAAAATGGAGCATTCATTTAACACTTTTTATAATTACTGCCGAAAATCTTTGTTGAGACAAGAAGACGTCAGTAAAGCCAGCAAAAGCATGTTTTACAAATGCGAATATTGTGTCAACGCTTTTAGCAGTGTTTATGAATTGAAAGACCACATGAGGGTGAATCATGATAGGGAATATTCTCTATCTACCTGCAACGTTTGTTTCGATAAGTTTTATAGCAAAGAAACCATGTCTGATCATAAAAAGATTTGCTTGCCTCCGCCTAACGTAAACCGCTGTAGTCACTGCGACAAATTATTTACAGATATATCTAGTTTAGACTTCCATATACGTATTTTTCATCCCCAAGCTCAAATCGCTGATTCTAATATAAGTTCAACAAACGTTGATGATACTTCCGATAATTATAAATGTTCGCATTGTGACCGAGTATATTATAGTGATAGATCTCTCAAACATCACACAAAACTAAAACACACAACTGATGAAGCTGTAGAATGTGAATATTGCGGTAAGATTTGCAGTAATAAATACTACTTAGCGTCCCACACTAAAATAGTGCACAGTAACGATTCCTGGTCGAAATGTGATTATTGTGACAAGCAATTTAAATCAAAGAGGAATATTCGGAGACACATAGAATATACGCATCTAGGAATGCAAAGATATAAGTGTATTGAGTGTGAAACACTTTTTAAAGAGAAGAGAAGTTTAAGGAAACATGTAAGGACTAAACATCCGAATTCGGCAGCTTTTCCTCAATGCCACATTTGCCACAAGAGATTCGAGTCTGCCAAGTCCTGCAAGATTCACCTAAAATTATTACACTCCTTCAATATGAACACCTTTCCTTGTGATCTCTGCTCAGTATCGTTTGGTTCGAATGCGGCATTGTCTATTCATTTGCAAACGAAGCATTTGGCAAAAGACGAAATTTACAAATGCGAAGAATGCAACTTAGTATTTAAAGGGCAAGAAAAATTTGAGCAGCACAATGATCTATGCCATGTTAATTTGGTCCCGAATATAAAGCAAAAGGTTTTGCCTCGATGTATTATATGTATGAAAGATTTTAGCACACGAAAGACATTGAAAAGACATATAAAGAAATTCCACgatgattttgatgttgatgaGCTGGCCACTTTTGGCTCGAGGCGTCGTAACTTTAATGTGGAATGCGAAGATTGTATTAAAAACTTTAGTGACGCATTTCACTTCAATATTTATCAGAAACTTAAGCATCTTAGAGACTCGGTGATCTTCAAATGCGAGTCCTGTCAAACTTCTTACAATTCGTTAGAGTATTTCATACAACGATATAAGCTGACAAATTGTGCTTTTAAAGGGAAAATTATTTTGAGCGACCTTTGCACTGCTGAAATGAGTGATGGTGAATCTTCGTATAATTGCTTCGGGTCGTATCATGAGATGATGGAACCGGAGAGTACTACAAATGATGTTCAAGTAAAAATTGAGCCATTAGAAGAGATGGAACCTTCGATCGTGATAGAACACGTAAAGACTGAACCGTGGACGCCTTAG
- Protein Sequence
- MYSNLDFVCDYCSRTFTRKYNLQTHIENCHLNSSCYCDICDQTFGSPAGLQLHLSRGHNRHGQPFPECDICGRIFTRKQNITSHMITVHLQGIGPEIKCHTCDKTFTTERNLKRHNSQLHNPDAENLTCDSCNKAFKGKHSLIAHMQAMHNLADKGIIKCPLCDKVYTNNRNLKRHVEMYHGEKGEFKCEICPKVYTSNQSLRRHVRTRHYSDDGDTCVCEYCNKHIVGKENLDSHVSFFHKLDDDMDSDSASLYYQCESCSKGFGEEPLLRQHVKMEHSFNTFYNYCRKSLLRQEDVSKASKSMFYKCEYCVNAFSSVYELKDHMRVNHDREYSLSTCNVCFDKFYSKETMSDHKKICLPPPNVNRCSHCDKLFTDISSLDFHIRIFHPQAQIADSNISSTNVDDTSDNYKCSHCDRVYYSDRSLKHHTKLKHTTDEAVECEYCGKICSNKYYLASHTKIVHSNDSWSKCDYCDKQFKSKRNIRRHIEYTHLGMQRYKCIECETLFKEKRSLRKHVRTKHPNSAAFPQCHICHKRFESAKSCKIHLKLLHSFNMNTFPCDLCSVSFGSNAALSIHLQTKHLAKDEIYKCEECNLVFKGQEKFEQHNDLCHVNLVPNIKQKVLPRCIICMKDFSTRKTLKRHIKKFHDDFDVDELATFGSRRRNFNVECEDCIKNFSDAFHFNIYQKLKHLRDSVIFKCESCQTSYNSLEYFIQRYKLTNCAFKGKIILSDLCTAEMSDGESSYNCFGSYHEMMEPESTTNDVQVKIEPLEEMEPSIVIEHVKTEPWTP
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00035013;
- 90% Identity
- iTF_00035013;
- 80% Identity
- iTF_00035013;