PSTAT 100: Data Science Concepts and Analysis
Course Material
Acknowledgements: Special thanks to previous instructors Dr. Trevor Ruiz, Dr. Alex Franks, and Dr. Laura Baracaldo for graciously providing material and guidance for this course.
Readings:
- For Tuesday:
- R4DS Chapter 2: Workflow: basics
- R4DS Chapter 27: A field guide to base R
- For Thursday:
- Hadley Wickham Tidy Data, Journal of Statistical Software (2014)
- R4DS Chapter 5: Data tidying
- R4DS Chapter 3: Data transformation
- R4DS Chapter 19: Joins
Optional Reading:
- I2R, Chapter 2: Some R basics
- I2R, Chapter 3: Data in R
- LDS Chapter 2: Questions and Data Scope
- Tidyverse Official Site
Lectures:
Tues. Lecture 1: Course Introduction; Intro to Data
Thurs. Lecture 2: Data, Part II
Lab:
There is no required lab for this week, as we will not be having any Sections on Monday. However, please keep in mind the following:
If you have never programmed in
R
before, please read Chapter 2: SomeR
basics and Chapter 3: Data inR
, from “An Introduction to R”, and Chapters 2 and 27 from R4DS.- This Lab provides a summary of important information about the basics of programming in
R
.
- This Lab provides a summary of important information about the basics of programming in
If you have some exposure to programming in
R
(or have already read Chapters 2 and 3 from I2R), but would like more practice with dataframes, we encourage you to complete the following lab: [Click Here]
Please note that, by virtue of being prerequisite, these concepts and topics are potentially testable on In-Class Assignments (and are also crucial for success in our future PSTAT 100 endeavors!).
Textbook Abbreviations
- R4DS: R for Data Science
- LDS: Learning Data Science
- I2R: An Introduction to R
Week | Topic | Lab | Homework | Project |
---|---|---|---|---|
1 | Introduction to Data | |||
2 | Statistical Graphics | L1 | HW01 | |
3 | Exploring and Cleaning Data | L2 | MP01 | |
4 | Sampling Techniques | L3 | ||
5 | Missingness, and KDE | L4 | HW02 | |
6 | Principal Components Analysis, and an Intro to Statistics | L5 | MP02 | |
7 | Introduction to Statistical Modeling | L6 | HW03 | |
8 | Regression | L7 | ||
9 | More Regression | MP03 | ||
10 | Classification and Clustering | L8 | ||
11 | Finals | Final Proj. |
- L: lab
- HW: Homework
- MP: Mini-Project