Data Transformation I
Week 09, Fall 2025
- Start: Monday, October 20
- End: Friday, October 24
Summary
This week we will dive into data visualization and transformation. In doing so, we will move from using strictly Base R, to supplementing Base R with the tidyverse.
Learning Objectives
After completing this week, you are expected to be able to:
- Understand the difference between a
tibbleand adata.frame. - Visualize data using
ggplot2. - Transform data using
dplyr, specifically using the single table verbs:selectto pick columns (variables) based on their namesfilterto pick rows (observations) based on their valuesmutateto add new columns using functions of existing variablessummarizeto create single number statistical summaries of columnsarrangeto change the ordering of rows
Reading
Additional Reading
| Link | Source |
|---|---|
tibble Vignette |
tibble Documentation |
| Tidy Data Vignette | tidyr Documentation |
Cheatsheets
| Link | Source |
|---|---|
ggplot2 |
Posit Cheatsheets |
dplyr |
Posit Cheatsheets |
readr |
Posit Cheatsheets |
tidyr |
Posit Cheatsheets |
Data
- Package:
palmerpenguins - Package:
dplyr::starwars - Package:
Lahman - Package:
nycflights13
Video
| Title | Link | Mirror |
|---|---|---|
| 9.1 - Welcome to Week 09 | 9.1 - YouTube | 9.1 - ClassTranscribe |
| 9.2 - Data and Tibbles | 9.2 - YouTube | 9.2 - ClassTranscribe |
| 9.3 - Data Visualization with ggplot2 | 9.3 - YouTube | 9.3 - ClassTranscribe |
| 9.4 - Data Manipulation with dplyr | 9.4 - YouTube | 9.4 - ClassTranscribe |
| 9.5 - Lab 06 | 9.5 - YouTube | 9.5 - ClassTranscribe |
Assignments
| Assignment | Deadline | Credit |
|---|---|---|
| Lab 05 | Thursday, October 23 | 100% |
| Quiz 05 | Thursday, October 23 | 105% |
| Lab 06 | Thursday, October 30 | 100% |
| Quiz 06 | Thursday, October 30 | 105% |
Office Hours
There has been a slight change in office hours this week:
- Noah’s Office Hours now on Wednesday, 12:00 PM - 2:00 PM
- Varad’s Office Hours now on Thursday, 9:30 AM - 11:00 AM