Here is an excellent talk by Michael Manapat at the PyData Seattle 2015 conference. I wish that this style of talk — of really digging deep with specific examples — becomes more common!
Michael Manapat: Counterfactual evaluation of machine learning models
The slides can be found here, and the paper that it’s partially based on is here.