Unconference tutorial: Escaping dependency hell -- Docker for reproducible research?

Academic research depends on a software ecosystem of ever-increasing complexity.  Moreover, each researcher's software environment is unique -- each makes use of different tools, different libraries, and different versions.  These details are rarely fully documented, even by the researchers themselves. This poses a substantial barrier to reproducibility.

Docker provides a 'shipping container' to easily share your software environment with others.  Unlike existing solutions, Docker isn't monolithic -- use the parts you like. This has made it [very successful](http://www.cloudwedge.com/4891-docker-bags-40m-in-venture-capital-funding/) in the world of professional software developers [because](http://www.infoworld.com/article/2608903/application-virtualization/docker-founder-solomon-hykes-explains-docker.html) they, like researchers, have developed their own favorite tools and ways of doing things and don't want to change, but still need an easy way for others to run their software.

This tutorial would introduce Docker by illustrating 4 key concepts desirable in any approach to reproducible software environments:
1. A _flexible_ approach: We don't want to make any assumptions about a user's preferred OS, text editor, etc. (Docker runtime)
2. An _extensible_ approach: A user should be able to extend & repackage the environment with any of their favorite tools, with a minimal learning curve.  (Docker containers)
3. A _community_ approach: Common extensions of tools & combinations should be developed & maintained as a community base environment.  This saves time and permits optimization without restricting flexibility of individual users.  (Docker Hub)
4. A _DevOps_ approach: Use scripts instead of manuals to install software. Scripts are human-readable, machine-readable, extensible, portable, & easily versioned.  (Dockerfiles)
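
To make the last two points concrete, here is a minimal sketch of what such an install script might look like. It is an illustration only: the base image `rocker/rstudio` and the `dplyr` package are assumptions chosen for the example, not part of the proposal above.

```dockerfile
# Assumption: start from a community-maintained base image that already
# provides R and RStudio Server (rocker/rstudio is used here as an example).
FROM rocker/rstudio

# Extend the base environment with one additional R package.
# dplyr is an arbitrary example; substitute whatever your analysis needs.
RUN R -e "install.packages('dplyr', repos = 'https://cloud.r-project.org')"
```

Saved as a file named `Dockerfile`, this could be built with something like `docker build -t myuser/myenv .`, where `myuser/myenv` is a hypothetical image name. The file itself is plain text, so it can be read, edited, and versioned like any other script.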

This would be a hands-on demo of running a 'Dockerized' environment, extending it, and committing & sharing those changes. (We would probably do this using RStudio, though I could also demonstrate it with IPython notebooks or other computational environments.)

