Aste005818.1
Basic Information
- Insect
- Anopheles stephensi
- Gene Symbol
- -
- Assembly
- GCA_013141755.1
- Location
- CM023249.1:72533491-72576337[-]
Transcription Factor Domain
- TF Family
- zf-BED
- Domain
- zf-BED domain
- PFAM
- PF02892
- TF Group
- Zinc-Coordinating Group
- Description
- The BED finger, which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain [3].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 20 0.0025 4.3 7.2 2.9 14 38 210 235 201 237 0.81 2 20 0.00062 1.1 9.2 2.2 9 33 234 258 230 264 0.72 3 20 0.00047 0.8 9.5 2.1 6 33 259 286 256 292 0.79 4 20 0.0017 2.9 7.8 2.5 5 32 286 313 283 319 0.79 5 20 0.00072 1.2 9.0 1.8 13 33 321 342 314 348 0.75 6 20 0.0022 3.7 7.4 2.3 14 33 350 370 342 376 0.75 7 20 0.00051 0.87 9.4 1.4 6 30 371 396 368 403 0.79 8 20 0.00091 1.5 8.6 0.9 14 40 406 429 398 431 0.80 9 20 0.00098 1.7 8.5 3.0 13 33 433 454 426 459 0.75 10 20 0.0014 2.3 8.1 1.9 9 33 458 482 454 487 0.72 11 20 0.0012 2 8.3 2.2 6 33 483 510 482 516 0.77 12 20 0.0017 3 7.7 2.7 13 33 517 538 510 544 0.75 13 20 0.0016 2.8 7.8 2.7 13 33 545 566 538 571 0.75 14 20 0.0014 2.4 8.0 3.7 6 33 567 594 565 600 0.78 15 20 0.006 10 6.0 3.2 6 34 595 623 594 627 0.74 16 20 0.00066 1.1 9.1 3.3 13 33 629 650 622 656 0.75 17 20 0.016 28 4.6 4.0 13 37 657 678 650 684 0.70 18 20 0.0021 3.5 7.5 6.1 4 38 677 711 674 712 0.79 19 20 0.0073 12 5.7 6.1 19 43 719 740 714 741 0.93 20 20 0.14 2.4e+02 1.6 4.9 13 44 802 830 793 830 0.84
Sequence Information
- Coding Sequence
- ATGAAAACACTCAAAGTGTCTCGGGTCCACTGGGAAACAGCCATTTTaaGGTCGTCACACCTCACCAACCTGTCTGGCCCTGCTGCTACACATAGTAGCAGGGGCAGCACAGGCCATCTTCCTCCTCCCCCGGGGTCGCACCTTCAGCAACTCCATGGGTCTGGAGTCCTTACGGATCATACAAAATTACAGTATTTGGGCAACATGCAGgTGCATCCCGCACACAGCGGTAGCGGAGGACATCATCCGCCACCGCCGCCCACCCAGGGACCGCCTCAGTCCGCCCATCCGCCACCACCGCAACCGATACCGATCGTGAAGCCGGAGTTTAAGGTGCCGCCGATACCGGCCCacctgatggatgtgcgcggaccggatgggtcgctggtCAAGATCAACATGCCGGAGCATCACGATCCGAGCAAACCGCCGAACAACGTCGAAATGCTGAAGGTGAACATCGAGGATCTCAGCCAGTTTCTGTCGTACCATGAGGTGTTCGGCAAGCTGCCCTCCGACATGCTGACGGGCCCGTCCGCCAGCTTAGTGCCATCGTCCTCTACCAACACCGCTACGACCAGTGCGCATAGCAATGGTAAATTAAATCACATCCGTCAGCATACGGGCGAATCTCCGTTCAAGTGTAGCTACTGTGCGAAAAGCTTCACCCGTAAGGAACACCTGACTAACCACGTTCGCCAGCATACCGGTGAATCGCCTTACCGATGTCCGTACTGCGGTAAAACGTTCACCCGAAAGGAACATCTTACGAACCATGTCAGGTTGCACACCGGCGAGACTCCTTTTCAATGTTCTTATTGTCAGAAAAAGTTCACTAGGAAAGAACATTTAACGAACCACGTCAGGTTGCACACCGGCGAGACACCTTTTCAGTGTTCTTATTGCCAGAAGAAGTTCACTCGCAAAGAACATTTAACGTACCACGTCAGGTTGCACACCGGCGAGACTCCTTTTCAATGTTCTTACTGTCAGAAAAAGTTCACTAGGAAAGAACACTTAACGAACCACGTCAGATGGCACACGGGAGAGACTCCGTACCATTGCACGTACTGCGAGAAAAAGTTCGCCCGCAAAGAGCACCTTACTAATCACGTCAGATTGCACACAGGAGAGACCCCTTATCAATGCACATATTGTGAAAAGAAATTCAGTAGAAGAGAGCGTTTAACGATTCACACAAGaatCCATACCGGAGAGACTCCGTACGCATGCACAtattgtgataaaaaatttagcCGCAAAGAACGATTAACGTATCACATAAGATTACATACCGGAGAAACTCCCTATCAGTGCACGTATTGTGAGAAGAAATTCACTCGCAAAGAACACCTTACAAATCACGTCAGaTTACACACCGGAGAAACTCCCTATCCGTGTACGTACTGTGAAAAGAAGTTCACACGTAAAGAGCATCTTACCAATCACGTCAGATTGCACACCGGAGAGACCCCTTATCAGTGCACCTATTGTGACAAGAAATTCACTCGCAAAGAGCATTTGACTAATCATACAAGATTGCATACCGGAGAAACTCCGTACCAATGTACGTACTGCCAGAAGAAATTCACGCGAAAAGAGCATTTAACAAACCACACCAGATTGCATACGGGCGAAACACCTTACCAGTGTTCTTACTGTCAGAAGAAGTTCACCCGCAAAGAACATCTCACGAATCACGTCAGGCTGCACACCGGCGAGACTCCCTACACGTGTACCTACTGTCAGAAGAAGTTTACGCGTAAAGAGCATTTGACGAATCACGTCAGATTACACACCGGTGAGACTCCGTATCATTGCACCTATTGCGAGAAGAAGTTCATGCGAAAAGAACACCTTACGAATCATATCAGATTGCACACCGGAGAAACTCCCTATCAGTGCACGTATTGCGGGAAGAAATTCACCCGAAAAGAGCATTTAACAAATCATGTCAGATTGCATACCGGAGAGTCTCCCTATCGGTGTTCGTACTGTAACAAGTCATTCACCAGAAAAGAGCACCTCAAGAATCACGTCAGATTGCACACGGGTGATTCACCGCACAAATGTGAGTACTGCAACAAAACGTTCACGCGGAAGGAGCATCTCAACAATCACATGCGCCAGCACAGCGGAGACAATCCACACTGCTGCAATGTGTGCAACAAAACCTTCACCCGGAAGGAGCATTTGATCAACCACATGAGCCGTTCGCACACGGGCGAGCGACCGTTCCAGTGTGACGAGTGTGGCAAATCGTTCCCGCTGAAGGGCAACCTGCTGTTCCACCAGCGCAGCCACACCAAGGGCCAGCCGATGGATCGGCCGTTCCGGTGCGACATGTGCCCGAAGGATTTCATCTGCAAGGGCCACCTGGTGTCGCACCAGCGGTCGCACACGGGCGAGAAGAACCATCACTGCCCGCAGTGCAGCAAATCGTACGTCGAGCGGGGCAACATGCTGCGGCACATGAAGAAAACGCACCCGGACGCGGTCATACCGGTGCTGCCCAAGTTGCCACACATCAAGGTGGAACCGAAGTCTACCGTATCTCAGCCCGCGTCCGTGGTGACGTCGCCGGCTCAGGCCACGTCAACAATAACGTCGATTATCTCCAACAATACTCACCCATcgcagcatcaccatcacatTCAACCGAAAATGGAACAGAAATCTCAGATACATCATCTTCCGCTTCCTCATCAGCATCCAGCTCACCAGCTGCACCACCACATCTCGGCGGGCCCCCCGCCCCCACCGCCCTCGATGCATCTGCCACCATCAAATCCGCAGTTGGCAGCGCTCGCCCTCCATCATCaggtccagcagcagcatcagcagcagcagcaacaacagcagcagcaacagcagccgcaATCTTCGCCTTCAACGGTACGAACCCAGCCATCCCCTCGTACATCACATCAGCCGCCGCCGcatccacagcagcagcaaccgcagcagcagccgccgccggAAAGCCACCAGCAACAGGGACCGGGCCCAGCTGGACCCCTTCCACCTCAGCACCATACGGTCCCGGTTACTATTATAACCCATGCTGGAAGCATTAATCCCGGCAACATGCCGGTGGTGGATGCggtcggtggcggtggcggcggtggcccACCGAACGGTCGGGTCGGTGCAATCCAGATGCATCATCTGGCGGcggcccagcagcagcaggcggccGAAGAGCATTCGGTGGTTTATTGA
- Protein Sequence
- MKTLKVSRVHWETAILRSSHLTNLSGPAATHSSRGSTGHLPPPPGSHLQQLHGSGVLTDHTKLQYLGNMQVHPAHSGSGGHHPPPPPTQGPPQSAHPPPPQPIPIVKPEFKVPPIPAHLMDVRGPDGSLVKINMPEHHDPSKPPNNVEMLKVNIEDLSQFLSYHEVFGKLPSDMLTGPSASLVPSSSTNTATTSAHSNGKLNHIRQHTGESPFKCSYCAKSFTRKEHLTNHVRQHTGESPYRCPYCGKTFTRKEHLTNHVRLHTGETPFQCSYCQKKFTRKEHLTNHVRLHTGETPFQCSYCQKKFTRKEHLTYHVRLHTGETPFQCSYCQKKFTRKEHLTNHVRWHTGETPYHCTYCEKKFARKEHLTNHVRLHTGETPYQCTYCEKKFSRRERLTIHTRIHTGETPYACTYCDKKFSRKERLTYHIRLHTGETPYQCTYCEKKFTRKEHLTNHVRLHTGETPYPCTYCEKKFTRKEHLTNHVRLHTGETPYQCTYCDKKFTRKEHLTNHTRLHTGETPYQCTYCQKKFTRKEHLTNHTRLHTGETPYQCSYCQKKFTRKEHLTNHVRLHTGETPYTCTYCQKKFTRKEHLTNHVRLHTGETPYHCTYCEKKFMRKEHLTNHIRLHTGETPYQCTYCGKKFTRKEHLTNHVRLHTGESPYRCSYCNKSFTRKEHLKNHVRLHTGDSPHKCEYCNKTFTRKEHLNNHMRQHSGDNPHCCNVCNKTFTRKEHLINHMSRSHTGERPFQCDECGKSFPLKGNLLFHQRSHTKGQPMDRPFRCDMCPKDFICKGHLVSHQRSHTGEKNHHCPQCSKSYVERGNMLRHMKKTHPDAVIPVLPKLPHIKVEPKSTVSQPASVVTSPAQATSTITSIISNNTHPSQHHHHIQPKMEQKSQIHHLPLPHQHPAHQLHHHISAGPPPPPPSMHLPPSNPQLAALALHHQVQQQHQQQQQQQQQQQQPQSSPSTVRTQPSPRTSHQPPPHPQQQQPQQQPPPESHQQQGPGPAGPLPPQHHTVPVTIITHAGSINPGNMPVVDAVGGGGGGGPPNGRVGAIQMHHLAAAQQQQAAEEHSVVY
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_00093593;
- 90% Identity
- iTF_00099797;
- 80% Identity
- iTF_00108532;