Statistics 600

Applied statistics and data analysis I

Syllabus

Notes:

Introduction to R
Least squares
Decomposing variance
Misspecification and confounding
Diagnostics, transformations
Model selection
Prediction
Fixed effects and clustered data

Problem sets:

Problem set 1 (solutions)
Problem set 2 (solutions)
Problem set 3 (solutions)
Problem set 4

Code examples:

1.R Sampling properties of regression coefficient estimates
2.R Coverage probabilities for regression coefficient interval estimates
3.R Basic NHANES analysis
4.R Large sample behavior of the test statistic, R^2, and partial R^2
5.R Confidence and prediction bands for polynomial regression
6.R Cook's distances
7.R Box-Cox transformations
8.R Ridge regression solution paths
9.R Ridge regression MSE
10.R Fixed/random effects, GLS, and clustered data
10.R Fixed/random effects, GLS, and clustered data
11.R Nonparametric regression

Data sets:

NHANES-1.gz

Exam 1, with solutions (#4c corrected on 10/20)