This repository contains all materials for the Introduction to Reproducibility with Pytho/R workshop. You can choose to work in either Python or R, with dedicated folders for each:
python-workshop/
: All Python notebooks and scripts.r-workshop/
: All R scripts and examples.
- Understand the basic concepts of research reproducibility.
- Apply best practices for reproducible workflows.
- Automate data processing and analysis in Python and R.
- Connect to DataverseNL via API to programmatically download and upload data.
- Python Building Blocks
- Reproducible Analysis
- Working with pyDataverse
- Python Data Analysis
- Python Data Transformations
- Python: Python 3.6+.
- R: R 4.0+ with RStudio (optional).
- Jupyter: Jupyter Notebook, JupyterLab or Google Colab.
- pyDataverse: Docs - Release v0.3.1.
- DSRI Account: Register here (Only available to Maastricht University scholars)
- DataverseNL Account: Required for exercises involving data uploads to the DataverseNL sandbox. Create an account at demo.dataverse.nl
This workshop integrates materials from Valentin Danchev's Reproducible Data Science with Python, which is available under the Creative Commons Attribution-ShareAlike 4.0. We also draw a lot of inspiration from The Turing Way handbook, particularly the section of the Guide for Reproducible Research.Β
Many examples were initially developed for the Maastricht University's Global Studies Methods Track programme, and we also got inspiration from the famous Data Analysis and Visualization in Python for Ecologists by Data Carpentry.
This workshop ultimately builds up on the Coding Basics for Researchers series, organized by Carien Hilvering and Erik Jansen during the pandemic in collaboration with the Institute of Data Science, which has been adapted to the current Open Science context.
- BITSS Resource Library has a collection of materials on research transparency and reproducibility.
- Awesome Reproducible Research is a curated list of reproducible research case studies, projects, tutorials, and media
- Ten simple rules for writing and sharing computational analyses in Jupyter Notebooks: doi: 10.1371/journal.pcbi.1007007
- For those based in the Netherlands, the eScience Center is continuously hosting training on digital skills.
- Valentin Danchev (2021). Reproducible Data Science with Python. valdanchev.github.io/reproducible-data-science-python/.
- The Turing Way Community. (2022). The Turing Way: A handbook for reproducible, ethical and collaborative research. Zenodo. doi: 10.5281/zenodo.3233853.
- Data Carpentry. Data Analysis and Visualization in Python for Ecologists. datacarpentry.org/python-ecology-lesson/.
- Baker, M. (2016). 1,500 scientists lift the lid on reproducibility. Nature, 533, 452β454. doi: 10.1038/533452a.
Copyright (c) 2024 Maastricht University