Statistical stories
Statistical stories
Posts
Light
Dark
Automatic
Recent Posts
Causal Inference 4: School Autonomy and Regression Discontinuity
In this post we discuss what is perhaps the most elegant technique for doing causal inference: regression discontinuity (RD). Although it is known today as a toolkit in the econometric toolbox, it was originally developed by an education researcher in 1960.
Last updated on Aug 3, 2021
22 min read
Causal Inference 3: Synthetic Control
So far, we first saw that fixed effects was able to get rid of confounders that did not change over time. We then noted that difference in differences designs could do better, as they got rid of confounders that changed over time parallel to the whole control group.
Last updated on Jul 29, 2021
19 min read
Causal Inference 2: Difference in Differences
In the previous post we explored the fixed effects approach to causal inference. Here we discuss the difference in differences approach, which is less widely applicable, but can make a stronger claim as to uncovering a cause.
Last updated on Jul 29, 2021
10 min read
Causal Inference 1: Fixed Effects and a Modest Specification Curve
Over the past few decades, economists have been the main drivers of causal analysis in social science. The most influential text has probably been Angrist and Pischke’s Mostly Harmless Econometrics.
Last updated on Jul 25, 2021
14 min read
Predicting High School Graduation From Kindergarten Data. Hyperparameter tuning (part 2)
In the previous blogpost, we used a basic decision tree and logistic regression to predict who would graduate high school among a bunch of kindergarten kids. In this post, we bring the random forest and XgBoost algorithms to bear on the data set, to see if they can improve the predictions in the test data set.
Last updated on Jul 2, 2021
9 min read
See all posts
Cite
×