[Bioc-devel] RFC: Bioc repository for single-version packages

To summarize some further discussions we had on this, with the bottom 
line that it needs more thought:

The proposal amounts to establishing a third generation of media used 
for academic publishing:
1. Printed paper (since 1665)
2. Portable document format (PDF) files (since the 1990s)
3. Executable documents that contain data, code and text

While there are obvious small-scale solutions for 3., incl. those 
sketched by Martin and me, doing this well carries the same requirements 
and aspirations for scalability, scope, durability, and time-unlimited 
support that we take for granted for 1. and 2. There are millions of 
papers published per year across many disciplines of science.

Besides the technical challenges, there are economic and organizational 
ones. The publishing industry should also have a role to play, although 
of course this is a fluid area.

There are relevant existing efforts, incl. this incomplete list:

Binders (documents with containers):
https://mybinder.readthedocs.io/en/latest/examples.html

Jupyter/RStudio interfaces to published datasets and results:
https://wholetale.org/index.html

The Pachyderm framework for running pipelines on archived data, with
provenance tracking:
http://www.pachyderm.io/

Code Ocean:
https://codeocean.com/


---
Thanks to Martin Morgan and Michael Lawrence for input.


5.11.18 23:17, Martin Morgan wrote: