Skip to content
Prev 536 / 5632 Next

[R-meta] Open Data: preferred way of publishing

Dear Moritz,

It is very nice that you are considering to make your meta-analytic
research more open by sharing data, codes, post pre-prints, etc.

I am sending below two examples of reproducible reports for meta-analysis
manuscripts I have published recently using R Markdown (Rmd)  (literate
programming).

Here are the usual steps I follow:

1) Use excel or alike for preparing the data but export and work with text
files such as csv
2) Create an Rproj. and perform all data munging, visualisation, analysis,
etc in Rmd files (text + codes)
3) Produce a webpage (single) or website (multiple html and a nav bar) as
output using knitr
4) Host all files on a GitHub repository
5) Store data (csv) and codes in a permanent repo such as http://www.osf.io
to get the DOI and a citation
6) Post a pre-print of the manuscript


Example 1
My first one, uses knitr to produce a single page.

webpage: https://emdelponte.github.io/paper-white-mold-meta-analysis/
repo: https://github.com/emdelponte/paper-white-mold-meta-analysis


Example 2
Most recent one is a website + nav bar with four main Rmd files for intro,
data, code and manuscript:

website: https://emdelponte.github.io/paper-FHB-Brazil-meta-analysis/
repo: https://github.com/emdelponte/paper-FHB-Brazil-meta-analysis

this is not a MA paper, but it follows the same structure
https://emdelponte.github.io/paper-FGSC-fitness/index.html

I prepared a template for a research compendium (data + code + manuscript +
figures) for this last example
website: https://emdelponte.github.io/research-compendium-website/
report: https://github.com/emdelponte/research-compendium-website


Finally, I post the manuscript (bioRxiv or PeerJ) with the following
addition:

"Data processing and analyses.
All data processing and analyses, as well as graphical work, were performed
running R version 3.4.3 (R Core Team, 2017). Texts and scripts were
prepared as R Markdown documents. A collection of these latter files were
rendered as a website, using the render_site function of the R package
rmarkdown (Allaire et al. 2017), where all analysis are documented,
reproducible and openly available at
https://github.com/emdelponte/paper-FGSC-fitness. The data in text format
are deposited at the Open Science Framework data repository and available
at https://osf.io/c2mbr/."


One of the most important aspects to ensure reproducibility of methods is a
clear documentation, besides access to all files. I found that this is not
only good for others but for myself  when I need to go back to previous
analysis and understand what and why I did something!

Hope these are useful. Any question, let me know.

Best wishes,

Emerson



Prof. Emerson M. Del Ponte
Departamento de Fitopatologia
Universidade Federal de Vi?osa
Vi?osa, MG - Brasil
+55 (31) 3899-1103
Twitter: @edelponte


2018-01-27 13:19 GMT-02:00 Moritz Tobiasch <moritztobiasch at gmail.com>: