A student who has met the objectives of the course will be able to:
Explain why reproducible data analysis is important, as well as identify relevant challenges and explain replicability versus reproducibility
Describe the components of a reproducible data analysis
Use Tidyverse R to perform exploratory data analysis (EDA) for data insights, including using ggplot to visualize multilayer data from e.g. high-througput -omics platforms
Use Tidyverse R to perform data cleansing, transformation, visualization and communication
Use RStudio and github for collaborative analysis projects
Perform and interpret standard dimension reduction and clustering techniques, as well as basic statistical tests and models