Frequently Asked Questions
This page contains a list of frequently asked questions and answers about bioKepler.
Downloads and Releases
bioKepler is a Kepler suite facilitating rapid development and scalable distributed execution of bioinformatics workflows in Kepler while simplifying access to a wide range of bioinformatics tools executed locally or distributedly. bioKepler contains a set of Kepler actors, called “bioActors”, that are specialized for running bioinformatics tools at scale along with Kepler directors for distributed data-parallel (DDP) execution on Hadoop and Stratosphere engines. For more information, see the About page.
bioKepler is funded by NSF Award DBI-1062565 under CI Reuse and Advances in Bioinformatics programs.
bioKepler is released under the BSD license.
bioKepler is a suite of add-on modules that includes Kepler. The bioKepler suite adds bioActors, demo bioinformatics workflows, and the DDP framework to the Kepler suite.
A bioActor is a Kepler actor that executes a single application or tool, e.g., the blastall bioActor runs blastall, the bowtie bioActor runs bowtie, etc. A bioActor can execute the tool on the local machine; some bioActors additionally can execute the tool on distributed computational resources. For more information, see the bioKepler 1.0 User Guide.
The bioKepler suite contains over 40 example workflows that demonstrate the bioKepler actors and directors. They can be viewed here.
The bioKepler suite includes the Kepler Distributed Data-Parallel (DDP) framework for data-parallel execution on distributed computational resources. Many DDP patterns facilitate data-intensive applications/workflows, which can execute in parallel with partitioned data on distributed computing nodes. For more information, see the bioKepler 1.0 User Guide.
You can send email to one of the Kepler mailing lists, submit a bug report, or chat on the Kepler IRC channel. See the Kepler Contact Us page for more information.
We welcome contributions, e.g., bioActors, demo workflows, etc., to bioKepler. Please contact us by emailing one of the Kepler mailing lists. See the Kepler Contact Us page for more information.
Downloads and Releases
See the bioKepler 1.0 Release page.
See the bioKepler Amazon EC2 page.
We are currently building an OpenStack image for bioKepler. We will announce its completion on the Kepler mailiing lists.
If a bioActor has a DDP sub-workflow, such as the 'HadoopMapOnly' in blast bioActor, it can run in parallel if you have a scalable distributed computing platform, such as a cluster. First check whether a Hadoop/Stratosphere software is already running on the platform. If such a software is already configured and running, just set the config directory path to the DDP engine config parameter for the sub-workflow in the bioActor, e.g., 'HadoopConf' in 'HadoopMapOnly' and select this sub-workflow for the 'Choice' parameter. If you have nodes available but Hadoop/Stratosphere is not running on them, you can use and configure the Hadoop/Stratosphere in bioKepler. For instance, you can change the configuration files in '$HOME/KeplerData/workflows/module/hadoop-1.0/tools/conf/'.
bioKepler includes many bioActors that execute bioinformatics tools. Before you can use a bioActor in a workflow, you must first download and install the bioinformatics tool, and put the binary in your $PATH. For example, you must install blastall before you can use the blastall bioActor. There are several bioActors that use applications and databases from Weizhong Li's group at UCSD; they can be downloaded here.
See the Configuring VNC page.
Open Applications\System Tools\dconf Editor navigate to: org\gnome\desktop\wm\keybindings then change the "show-desktop" keybinding to "", without the quotations, and then restart the service.