Acoo004837.1
Basic Information
- Insect
- Atta colombica
- Gene Symbol
- -
- Assembly
- GCA_001594045.1
- Location
- NW:2883874-3047313[-]
Transcription Factor Domain
- TF Family
- zf-BED
- Domain
- zf-BED domain
- PFAM
- PF02892
- TF Group
- Zinc-Coordinating Group
- Description
- The BED finger, named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. It is a domain of about 50 to 60 amino acid residues that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. Because diverse BED fingers are able to bind DNA, it has been suggested that DNA binding is the general function of this domain [3].
- Hmmscan Out
-
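| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|----|----------|----------|-------|------|----------------|--------------|----------------|--------------|----------------|--------------|-----|
| 1 | 22 | 0.00036 | 0.35 | 10.1 | 6.7 | 6 | 44 | 20 | 59 | 19 | 59 | 0.91 |
| 2 | 22 | 2.7e-09 | 2.7e-06 | 26.5 | 1.2 | 4 | 44 | 76 | 113 | 73 | 113 | 0.96 |
| 3 | 22 | 2.5e-07 | 0.00025 | 20.2 | 3.7 | 5 | 44 | 154 | 193 | 152 | 193 | 0.94 |
| 4 | 22 | 2.2e-10 | 2.1e-07 | 30.0 | 2.7 | 4 | 44 | 210 | 247 | 207 | 247 | 0.96 |
| 5 | 22 | 0.00074 | 0.73 | 9.1 | 5.2 | 6 | 43 | 289 | 327 | 288 | 328 | 0.87 |
| 6 | 22 | 1.1e-08 | 1.1e-05 | 24.5 | 1.1 | 4 | 44 | 345 | 382 | 344 | 382 | 0.96 |
| 7 | 22 | 0.00063 | 0.63 | 9.3 | 5.7 | 6 | 44 | 424 | 463 | 423 | 463 | 0.88 |
| 8 | 22 | 1.6e-11 | 1.6e-08 | 33.6 | 1.6 | 4 | 44 | 480 | 517 | 479 | 517 | 0.97 |
| 9 | 22 | 0.00074 | 0.73 | 9.1 | 5.2 | 6 | 43 | 559 | 597 | 558 | 598 | 0.87 |
| 10 | 22 | 5.2e-09 | 5.2e-06 | 25.6 | 1.1 | 4 | 44 | 615 | 652 | 614 | 652 | 0.97 |
| 11 | 22 | 0.00074 | 0.73 | 9.1 | 5.2 | 6 | 43 | 694 | 732 | 693 | 733 | 0.87 |
| 12 | 22 | 1.1e-08 | 1.1e-05 | 24.5 | 1.1 | 4 | 44 | 750 | 787 | 749 | 787 | 0.96 |
| 13 | 22 | 0.0015 | 1.5 | 8.1 | 6.9 | 6 | 44 | 829 | 870 | 828 | 870 | 0.87 |
| 14 | 22 | 8.1e-11 | 8.1e-08 | 31.4 | 1.0 | 4 | 44 | 887 | 924 | 886 | 924 | 0.97 |
| 15 | 22 | 7.8e-05 | 0.077 | 12.2 | 6.9 | 6 | 44 | 966 | 1005 | 965 | 1005 | 0.89 |
| 16 | 22 | 2.7e-09 | 2.7e-06 | 26.5 | 1.2 | 4 | 44 | 1022 | 1059 | 1019 | 1059 | 0.96 |
| 17 | 22 | 2.2e-06 | 0.0022 | 17.2 | 4.7 | 5 | 44 | 1099 | 1138 | 1097 | 1138 | 0.93 |
| 18 | 22 | 1.2e-08 | 1.2e-05 | 24.4 | 1.0 | 4 | 44 | 1155 | 1192 | 1154 | 1192 | 0.96 |
| 19 | 22 | 2.5e-07 | 0.00025 | 20.2 | 3.7 | 5 | 44 | 1233 | 1272 | 1231 | 1272 | 0.94 |
| 20 | 22 | 2.4e-10 | 2.4e-07 | 29.9 | 1.3 | 4 | 44 | 1289 | 1326 | 1286 | 1326 | 0.96 |
| 21 | 22 | 0.00014 | 0.14 | 11.4 | 3.6 | 11 | 44 | 1374 | 1406 | 1368 | 1406 | 0.87 |
| 22 | 22 | 7.4e-05 | 0.073 | 12.3 | 0.2 | 4 | 40 | 1422 | 1454 | 1421 | 1459 | 0.82 |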
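The per-domain hits above can be loaded into typed records for filtering. The following is a minimal sketch, not part of the database's own tooling: the `DomainHit` class, the `parse_row` helper, and the `I_EVALUE_CUTOFF` value of 1e-3 are all assumptions chosen for illustration, and only the first four rows of the table are included as sample input.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One per-domain row, fields in the column order listed above (hypothetical class)."""
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


def parse_row(row: str) -> DomainHit:
    """Split one whitespace-separated row into a DomainHit (hypothetical helper)."""
    fields = row.split()
    if len(fields) != 13:
        raise ValueError(f"expected 13 columns, got {len(fields)}")
    return DomainHit(
        int(fields[0]), int(fields[1]),
        float(fields[2]), float(fields[3]), float(fields[4]), float(fields[5]),
        *(int(x) for x in fields[6:12]),
        float(fields[12]),
    )


# First four rows copied from the record; the remaining rows follow the same layout.
ROWS = [
    "1 22 0.00036 0.35 10.1 6.7 6 44 20 59 19 59 0.91",
    "2 22 2.7e-09 2.7e-06 26.5 1.2 4 44 76 113 73 113 0.96",
    "3 22 2.5e-07 0.00025 20.2 3.7 5 44 154 193 152 193 0.94",
    "4 22 2.2e-10 2.1e-07 30.0 2.7 4 44 210 247 207 247 0.96",
]

# Assumed cutoff for illustration only; the record does not state one.
I_EVALUE_CUTOFF = 1e-3

hits = [parse_row(r) for r in ROWS]
significant = [h for h in hits if h.i_evalue < I_EVALUE_CUTOFF]

for h in significant:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, score {h.score}")
```

With this assumed cutoff, hit 1 (i-Evalue 0.35) is dropped while hits 2 to 4 are kept; a different threshold would change which weak copies of the domain are retained.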
Sequence Information
- Coding Sequence
- ATGGCAGAACAAGAAGCAGGGAACATTATAATCCCAGTTCGTCGTTGGATACGTGgacattatacaaaattaacaaatagtAATGTAGCAAGATGTAATCATTGTAacgtacaattatttatacataagaaTAGATCCTTAGCTATTTTACATAAACACTTAGTGAAGAGACATTCAGacaaattaaatgaagaacagaagaaagaagataaattcGATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGAGGCAACATGCAAAGAATGTAACTTAACAATTAAGTACCAATCAGTATCTTGTCTCAAAAAGCATTTGAAACGTATACACAAGATATTGGGTCCAAATTCAGATCATATAGACAACGAGAGCAATTCAGACAATGACTTGATTGCAAATGAGGATCTAACTCCAAAGACGGACAATTTTATCCCAGTTCGTCTTTATATACGTAAACattacacaaaattaataaggAATAAGATGGCAAAATGCAACTATTGTAATGCAAAATTCGCTATAcgaaataaatctttagattatttacataaacacTTGGTGAAGGCACATCCAGacaaattaaatgaagaacAGAAGAAAGAAGACAAATTCGATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGAGGCAACATGCAAAGAAtgtaacttaataattaagtacCATTCAGTATCTTGTCtcaaaaagcatttaaaacgTAAGCACAAGATATTGAGTCCAAATTCAGATCATATAGACAACGAGGGCAATTCAGACAATGACTTGATTGCAAATAAGGATCTAACTCCAAAGACAGACAATTTTATCCCAGTTCGTCGTTGGATACGTGgacattatacaaaattaacaagTAGTAATGTAGCAAGATGTAATCATTGTAACgtacaatttttcatacataagAATAGATCCTTAGCTATTTTACATGAACACTTAGTGAAAAGACATTTAGacaaattaaatgaagaacAAAAGAATGAAGATAAATTCCATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGAGGCAACATGCACAGAATGTAACTTAACAATTAAGTACCAATCAGTAACTTGTCTCAAAAGTCATTTAAAACGTATACACAAGATATTGGGTCCAAATTCAGATAATGTAGATAACGAGAGCAATTCAGACAATGACTTGATTGCAAATGAGGATCTAACTCCAAAGACAGACATTTTTATCCCAGTTCGTCGTTGGATACGTGgacattatacaaaattaacaagTAGTAATGTAGCAAGATGTAATCATTGTAAGGTACAATTTCTCATACATAAGAATAGATCCTTAGCTATTTTACATGAACACTTAGTGAAGAGACATTCAGacaaattaaatgaagaacagaagaaagaagataaattcCATTGGACTTGGGATTATTTTATCGCAGATAGCGATATAGAGGcaatatgcaaaaaatgtaactCAACAATTAAGTATCAATCAGTATCTTGTCtcaaaaagcatttaaaacgTATGCACAAGATATTGAGTCCAAATTCAGATCATATAGACAACGAGAGCAATTCAGACAATGACTTCATTGCAAACGAGGATCTAACTCAAAAGACAGACAGTTTTATCCCAGTTCGTCGCTGGATACGTGgacattatacaaaattaacaagTAGTAATGTAGCAAGATGTAATCATTGTAACgtacaatttttcatacataagAATAGATCCTTAGCTATTTTACATGAACACTTAGTGAAGAGACATTTAGacaaattaaatgaagaacAAAAGAATGAAGATAAATTCCATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGCGGCAACATGCACAGAATGtaacttaacaattaaataccAATCAGTAACTTGTCTCAAAAGTCATTTAAAACGTATACACAAGATATTGCGTCCAAATTCAGATAATGTAGATAACGAGAGCAAT
TCAGACAATGACTTCATTGCAAACGAGGATCTAACTCAAAAGACAGACAGTTTGATCCCAGTTCGTCGTTGGATACGTGgacattatacaaaattaacaagTAGTAATGTAGCAAGATGTAATCATTGTAACgtacaatttttcatacataagAATAGATCCTTAGCTATTTTACATGAACACTTAGTGAAGAGACATTTAGacaaattaaatgaagaacAAAAGAATGAAGATAAATTCCATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGAGGCAACATGCACAGAATGTAACTTAACAATTAAGTACCAATCAGTAACTTGTCTCAAAAGTCATTTAAAACGTATACACAAGATATTGGGTCCAAATTCAGATAATGTAGATAACGAGAGCAATTCACACAATGACTTGATTGCAAATGAGGATCTAACTCCAAAGACAGACATTTTTATCCCAGTTCGTCGTTGGATACGTGgacattatacaaaattaacaagTAGTAATGTAGCAAGATGTAATCATTGTAAGGTACAATTTCTCATacataagaataagaatagaTCCTTAGCTATTTTACATAAACACTTAGTGAAGAGACATTCAGacaaattaaatgaagaacagaagaaagaagataaattcCATTGGACTTGGGATTATTTTATCGCAGATAGCGATATAGAGGCAATATGCAAAGAATGTAACTCAACAATTAAGTATCAATCAGTATCTTGTCtcaaaaagcatttaaaacgTATGCACAAGATATTGGGTCCAAATTCAGATCATATAGACAACGAGAGCAATTCAGACAATGACTTGATTGCAAATGAGGATCTAACTCCAAAGACAGACAATTTTATCCCAGTTCGTTGTTGGATACGTGgacattatacaaaattaacaagTAGTAATGTAGCAAGATGTAATCATTGTAACGTACAATTCTTCATACATAAGAATAGATCCTCAGCTATTCTACATAAACACTTAGTGAAGAGACATTCAGacaaattaaatgaagaacagaagaaagaagataaattcGATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGAGGCAACATGCAAAGAATGTAACTTAACAATTAAGTACCAATCAGTATCTTGTCTCAAAAAGCATTTGAAACGTATACACAAGATATTGGGTCCAAATTCAGATCATATAGACGAGAGCAATTCAGACAATGATTTGATTGCAAATGAGGATCTAACTCCAAAGACAGACAATTTTATCCCAGTTCGTCTTTCTATACGTAAACattacacaaaattaataaggAATAGGATGGCAAAATGCAACTATTGCAATGCAAAATTCACTACAcgaaataaatctttagattatttacataaacacTTGGTGAAGGCACATCcagagaaattaaatgaagaacAGAAGAAAGAAGACAAATTCCATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGAGGCAACATGCACAGAATGtaacttaacaattaattaccAATCAGTAACTTGTCTCAAAAGTCATTTAAAACGTATACACAAGATATTAGGTCCAAATTCAGATCATATAGACAACGAGAGCAATTCAGACAATGACTTGATTGCAAATGAGGATCTAACTCCAAAGACAGACAATTTTATCCCAGTTCGTCTTTATATACGTAAACattacacaaaattaataaggAATAAGATGGCAAAATGCAACTATTGTAATGCAAAATTCGCTATAcgaaataaatctttagattatttacataaacacTTGGTGAAGGCACATCCAGacaaattaaatgaagaacAGAAGAAAGAAGACAAATTCGATTGGACTTGGGATTATTTTATCGCAGAAAGCGATATAGAGGCAACATGCAAAGAAtgtaacttaataattaagtacCAATCAGTATCTTGTCtcaaaaagcatttaaaacgTAAGCACAAGATATTAGGTCCAAATTCAGA
TCGTGTAGATAACAAGAGCAATTCAGATAATGATACGAGTACAAATGAGGATATAACTCCAAAGACAGACAATTTTATCTCAGATCATCTTTGAATACGTAGACGTTACACAAAATTAACAAGTAGAAATGAGACAAGATGCAATTATTGCAATGcaaaattcaatatacataatagATTGTTAGCTATTTTACATAAACACTTGGTGAAGGCACATCCAACATATTAAgtagaagagaagaaagaagataaattcCATTGAACTTGGGATTATTTTATCGCAGAAAGCGGTACAGAGGCAACATGTATACTCTGTAATGTAACAATTGGGTCCAAACTTACAAGTTTAACACTACACTTGAAACAATGTCTTATGCGATCCTCTCTCGGTCTACTTGGCGTTCAAGCATCTTCCTCGTTCATGGATGAGACTCTTTCTTTAGCATTGGTTCTATCTCGTTACGATATAGAGCTAACGGCTTCTTTTCTCGACACGGTCACACTGTCGCGGCTCACGGGCTTGGAAGCTAGACCCCCACCTTCCGGCTACTCTTTTGTAAGGGACTCCTCCATCGCCGTCTTTCTCTGCCTCCGATTCCGTACTCCCTCCACGCCAGATTGGAAAGTGTGCTCCATCGAGAATTCACTAAGAGCCCAGGATATACCTACCAAGATATACCTACGTTGTCGCGGTTTTATACCGCCGGCCAAGCTAACTTCGAAGATCGTATGA
- Protein Sequence
- MAEQEAGNIIIPVRRWIRGHYTKLTNSNVARCNHCNVQLFIHKNRSLAILHKHLVKRHSDKLNEEQKKEDKFDWTWDYFIAESDIEATCKECNLTIKYQSVSCLKKHLKRIHKILGPNSDHIDNESNSDNDLIANEDLTPKTDNFIPVRLYIRKHYTKLIRNKMAKCNYCNAKFAIRNKSLDYLHKHLVKAHPDKLNEEQKKEDKFDWTWDYFIAESDIEATCKECNLIIKYHSVSCLKKHLKRKHKILSPNSDHIDNEGNSDNDLIANKDLTPKTDNFIPVRRWIRGHYTKLTSSNVARCNHCNVQFFIHKNRSLAILHEHLVKRHLDKLNEEQKNEDKFHWTWDYFIAESDIEATCTECNLTIKYQSVTCLKSHLKRIHKILGPNSDNVDNESNSDNDLIANEDLTPKTDIFIPVRRWIRGHYTKLTSSNVARCNHCKVQFLIHKNRSLAILHEHLVKRHSDKLNEEQKKEDKFHWTWDYFIADSDIEAICKKCNSTIKYQSVSCLKKHLKRMHKILSPNSDHIDNESNSDNDFIANEDLTQKTDSFIPVRRWIRGHYTKLTSSNVARCNHCNVQFFIHKNRSLAILHEHLVKRHLDKLNEEQKNEDKFHWTWDYFIAESDIAATCTECNLTIKYQSVTCLKSHLKRIHKILRPNSDNVDNESNSDNDFIANEDLTQKTDSLIPVRRWIRGHYTKLTSSNVARCNHCNVQFFIHKNRSLAILHEHLVKRHLDKLNEEQKNEDKFHWTWDYFIAESDIEATCTECNLTIKYQSVTCLKSHLKRIHKILGPNSDNVDNESNSHNDLIANEDLTPKTDIFIPVRRWIRGHYTKLTSSNVARCNHCKVQFLIHKNKNRSLAILHKHLVKRHSDKLNEEQKKEDKFHWTWDYFIADSDIEAICKECNSTIKYQSVSCLKKHLKRMHKILGPNSDHIDNESNSDNDLIANEDLTPKTDNFIPVRCWIRGHYTKLTSSNVARCNHCNVQFFIHKNRSSAILHKHLVKRHSDKLNEEQKKEDKFDWTWDYFIAESDIEATCKECNLTIKYQSVSCLKKHLKRIHKILGPNSDHIDESNSDNDLIANEDLTPKTDNFIPVRLSIRKHYTKLIRNRMAKCNYCNAKFTTRNKSLDYLHKHLVKAHPEKLNEEQKKEDKFHWTWDYFIAESDIEATCTECNLTINYQSVTCLKSHLKRIHKILGPNSDHIDNESNSDNDLIANEDLTPKTDNFIPVRLYIRKHYTKLIRNKMAKCNYCNAKFAIRNKSLDYLHKHLVKAHPDKLNEEQKKEDKFDWTWDYFIAESDIEATCKECNLIIKYQSVSCLKKHLKRKHKILGPNSDRVDNKSNSDNDTSTNEDITPKTDNFISDHL*IRRRYTKLTSRNETRCNYCNAKFNIHNRLLAILHKHLVKAHPTY*VEEKKEDKFH*TWDYFIAESGTEATCILCNVTIGSKLTSLTLHLKQCLMRSSLGLLGVQASSSFMDETLSLALVLSRYDIELTASFLDTVTLSRLTGLEARPPPSGYSFVRDSSIAVFLCLRFRTPSTPDWKVCSIENSLRAQDIPTKIYLRCRGFIPPAKLTSKIV
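The correspondence between the coding sequence and the protein sequence above can be spot-checked by translating the first few codons. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; the `translate` function and the `CDS_PREFIX` name are hypothetical, and only the first 30 nucleotides of the coding sequence are used.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0 (hypothetical helper).

    Lowercase (soft-masked) bases are upper-cased first, stop codons are
    written as '*', and codons with unrecognised bases become 'X'.
    Trailing bases that do not fill a codon are ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 nucleotides of the coding sequence listed in this record.
CDS_PREFIX = "ATGGCAGAACAAGAAGCAGGGAACATTATA"

protein_prefix = translate(CDS_PREFIX)
print(protein_prefix)  # MAEQEAGNII, matching the start of the protein sequence
```

Because stop codons are emitted as `*`, the same function applied to the full coding sequence would also show where in-frame stops such as those visible near the end of the protein sequence occur.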
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -