Reproducibility in Research

Julia Romanowska

22.11.2022

WHY BOTHER?

It’s all about us the data

  • if we don’t document, organize, keep track and inform about the changes:

    • nobody else can help us

    • nobody else can continue our work (bus factor)

    • nobody will trust and use our method or cite our paper

What should we do?

1. organize

2. document

  • README files are essential

  • maintain a change log

  • comment the code/scripts

    • if using R, Rmarkdown/Quarto is highly recommended!
  • keep digital notebook/blog

3. track changes

  • with 1. and 2. tracking changes is easy

  • and/or use git

4. share information

  • don’t just dump the data

  • meta-data is just as important

    • http://goodtables.io/ - checking data
    • what is ‘good data’?
    • https://create.frictionlessdata.io/ - app to help annotate datasets