Data Analysis for Economics
Statistical Learning and Causal Inference

Lectures

Introduction (slides)

  • Topics: statistical learning; parametric vs. nonparametric models; model complexity; training and test errors; bias-variance tradeoff; adaptive statistical models; causal inference; the do-operator; treatment effect; randomized experiment; observational studies; self-selection; causal diagrams; the back-door criterion; quasi-experimental design; regression; matching; instrumental variables; fixed effects; difference-in-differences; regression discontinuity; structural estimation; counterfactual simulation
  • Notes and resources: link
  • codes: R

Part I

Foundations of Statistical Learning (slides)

  • Topics: learning theory; VC analysis; approximation-generalization tradeoff; bias-variance tradeoff; information theory; KL divergence; cross entropy; maximum likelihood; decision theory; bayes classifier; regression function; discriminative vs. generative model; scientific model
  • Notes and resources: link

Regression (slides)

  • Topics: linear regression; OLS estimator; maximum likelihood estimator; Frisch-Waugh-Lovell theorem; gradient descent; hypothesis testing; robust standard error; bootstrap; regression diagnostics; polynomial regression; piecewise constant model; regression splines; generalized additive models
  • Notes and resources: link
  • codes: R

Significance Testing, P-Hacking, and Publication Bias (slides)

  • Topics: p-values; multiple testing; Bonferroni correction; publication bias; p-hacking; data-snooping
  • Notes and resources: link

Classification and Discrete Choice Models (slides)

  • Topics: binary and multiclass classification; generalized linear models; logistic regression; similarity-based methods; K-nearest neighbors (KNN); ROC curve; discrete choice models; random utility maximization (RUM); probit; conditional logit; independence of irrelevant alternatives (IIA)
  • Notes and resources: link
  • codes: R

Model Selection and Regularization (slides)

  • Topics: cross validation; information criteria; forward stepwise regression; regularization; ridge regression; lasso; elastic net; smoothing splines
  • Notes and resources: link
  • codes: R