Description
I see a lot of reading material on how to choose the cloud service and how to connect to it. I think I would like to see/add another topic with Cloud for Genomics section called "Dockers and Containers".
A major part of researchers preferring to use the same version of tools is so that their results are reproducible. Since in cloud you are creating your own infrastructure, it is very difficult to keep track of all the environment variables and versions that were previously used. That is where dockers and containers come in. Consider docker as a virtual environment which runs your pipeline. This docker/container needs to be created once and then can be used with any machine. Thus providing you with a robust infrastructure which allows you to run multiple pipelines.