Talk

What is reproducible research?

Or rather, what is reproducible data analysis?

(This slide was taken from https://github.com/mark-andrews/sips2019)

Necessary criteria for reproducible data analysis

(This slide was taken from https://github.com/mark-andrews/sips2019)

Software tools for reproducible data analysis

What is git?

Why use git?

Git

Staging, local, remote

Staging area to local to remote

Staging area to local to remote

Github.com

exercise

Exercise continued

Packrat

Coding style

Coding style syntax R

functions

add_a_to_b <- function(a = "a long argument",
                       separator = ", ",
                       b = "another long argument") {
  str_c(a, separator, b) # only use return for early returns
}

Pipes

bop(
  scoop(
    hop(foo_foo, through = forest),
    up = field_mice
  ), 
  on = head
)
foo_foo <- hop(foo_foo, through = forest)
foo_foo <- scoop(foo_foo, up = field_mice)
foo_foo <- bop(foo_foo, on = head)
foo_foo %>%
  hop(through = forest) %>%
  scoop(up = field_mice) %>%
  bop(on = head)

some command for the command line

Some things can’t be done within rstudio.