Welcome to the tidyverse

In this course, you'll learn the basics of the Tidyverse, an integrated set of R packages that are designed to be easy to learn, easy to use, and solve the most frequent data analysis problems. We'll cover a subset of R for Data Science, focusing on data visualization with ggplot2 and data manipulation with dplyr.

We'll also touch briefly on how to get your data into R (whether it's a csv, database, or spreadsheet), and how to tidy it into a form that you'll find easy to work with.

During the course, you'll alternate between listening and doing, and you should expect to spend at least 50% of your time writing code in RStudio on your own laptop.

About the instructors

Hadley Wickham is a Chief Scientist at RStudio, and an Adjunct Professor of Statistics at the University of Auckland, Stanford University, and Rice University. He builds tools (computational and cognitive) that make data science easier, faster, and more fun.

Jenny Bryan is a part of Hadley Wickham’s team at RStudio. We develop open source  packages to make data science faster, easier and more fun.

Attendance requirements

  • Maximum number of attendees is 50.

  • Bring a laptop with up-to-date versions of both R and RStudio. The current version of R is 4.2.1 and the most recent version of the RStudio IDE is  2022.07.1+554 and this is what we recommend, unless you have a good reason otherwise. For more discussion of how to think about keeping R/RStudio current, see https://happygitwithr.com/install-r-rstudio.html#install-r-rstudio.

  • Install the current version of the tidyverse package and the palmerpenguins dataset. In R, do: install.packages(c("tidyverse", "palmerpenguins")).