Bioshake: a Haskell EDSL for bioinformatics pipelines

Abstract

Background

Typical bioinformatics analyses comprise long-running computational pipelines. An important part of producing reproducible research is managing and executing these pipelines robustly and with minimal errors. Bioshake is a domain-specific language embedded in Haskell for specifying and executing computational pipelines in bioinformatics that significantly reduces the possibility of errors occurring.

Results

Unlike other pipeline frameworks, Bioshake raises many properties to the type level to allow the correctness of a pipeline to be statically checked during compilation, catching errors before any lengthy execution process. Bioshake builds on the Shake build tool to provide robust dependency tracking, parallel execution, reporting, and resumption capabilities. Finally, Bioshake abstracts execution so that jobs can either be executed directly or submitted to a cluster.
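To make the idea of checking a pipeline at compile time concrete, the following is a minimal sketch in plain Haskell. It does not use Bioshake's actual API: the `Format` type, the `File` wrapper, and the stage names `align`, `toBam`, `sortBam`, and `callVariants` are all hypothetical, chosen only to show how tagging each stage with its input and output format lets the compiler reject a mis-ordered pipeline before anything runs.

```haskell
{-# LANGUAGE DataKinds #-}
{-# LANGUAGE KindSignatures #-}

-- Hypothetical file formats, promoted to the type level by DataKinds.
data Format = FastQ | Sam | Bam | SortedBam | Vcf

-- A file path tagged with the format it is expected to hold.
-- The tag is a phantom type parameter and exists only at compile time.
newtype File (f :: Format) = File FilePath

-- Hypothetical stages. Each one only derives an output path here;
-- a real stage would also invoke the external tool.
align :: File 'FastQ -> File 'Sam
align (File p) = File (p ++ ".sam")

toBam :: File 'Sam -> File 'Bam
toBam (File p) = File (p ++ ".bam")

sortBam :: File 'Bam -> File 'SortedBam
sortBam (File p) = File (p ++ ".sorted.bam")

callVariants :: File 'SortedBam -> File 'Vcf
callVariants (File p) = File (p ++ ".vcf")

-- A well-typed pipeline: each stage's output format matches the
-- next stage's input format.
pipeline :: File 'FastQ -> File 'Vcf
pipeline = callVariants . sortBam . toBam . align

-- Rejected by the type checker if uncommented, because callVariants
-- requires sorted input and toBam produces unsorted BAM:
-- broken :: File 'FastQ -> File 'Vcf
-- broken = callVariants . toBam . align

-- Recover the underlying path, whatever the format tag.
outputPath :: File f -> FilePath
outputPath (File p) = p

main :: IO ()
main = putStrLn (outputPath (pipeline (File "sample.fastq")))
-- prints "sample.fastq.sam.bam.sorted.bam.vcf"
```

In this sketch the sorting requirement is encoded as a distinct format tag, so omitting `sortBam` is a compile-time type error rather than a failure hours into a run. How Bioshake itself encodes such properties may differ.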

Conclusions

Bioshake is available at http://github.com/papenfusslab/bioshake.
