44 Comments

User's avatar
Carina's avatar

Great post! A related point is that people try interventions after an extreme outcome—such as a goalie having a horrible game—and if performance improves, we give credit to the intervention when regression to the mean is a likely explanation.

Any time an intervention is a reaction to unusually poor performance, this is an issue.

It’s even harder for people to wrap their minds around this when we don’t have trend data. For example, I’ve seen a few news stories about racism in home appraisals using the following anecdote: A Black homeowner gets a low appraisal, then stages the house to make it look like a white family lives there, and gets a much higher appraisal. They conclude that “being Black cost me $200k in the appraisal.”

But homeowners only seek a second appraisal when they feel they have been lowballed, for example if the neighbor’s houses have higher values. It was probably an unusual / extremely low appraisal. So it makes sense that the second appraisal is higher—it’s probably closer to the average you’d get if the home were appraised over and over--but you can’t attribute the extra $200k to photographs of white people.

Of course I’m willing to believe there is racial bias in home appraisals, but to measure it we would need a study with many data points and (ideally) randomization. We definitely need more than anecdotes about people who got an unusually low appraisal and then tried again.

Expand full comment
Robert Foster's avatar

One other potential meaning of "regression to the mean" comes from the process of using Bayesian inference to estimate a player's ability. Using Bayesian inference, the estimate of a player's ability is a weighted combination of the population mean (over all players) and the player's mean (over their respective performance), with the weights being determined both by the sample size and by the respective variances of the distribution of population abilities and the variance of performance for a player. In that way, for small sample sizes the estimate for a player's ability is "regressed" heavily towards the population average, for larger sample sizes less "regressed" because we have more confidence that the average truly represents the players ability and is not just noise.

Expand full comment
42 more comments...

No posts