MENTAL MODEL #127

Simpson's Paradox

Core Concept

Simpson's Paradox is a phenomenon in probability and statistics in which a trend that appears in each of several groups of data disappears, or even reverses, when the groups are combined. The reversal is not a statistical error. It arises from confounding variables left out of the analysis or from large differences in group sizes. It typically occurs when a hidden variable influences both of the variables being studied and is unevenly distributed across the groups. Overall trends in aggregated data can therefore mislead when taken alone: accurate conclusions require examining the subgroup data and identifying and controlling for potential confounders.
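
One way to see why the reversal is possible is to write the aggregated rate as a weighted average of the subgroup rates. The notation here is an assumption introduced for illustration and does not come from the text: Y is the outcome, X is the group or treatment being compared, and Z is the hidden variable.

```latex
\[
P(Y \mid X = x) \;=\; \sum_{z} P(Y \mid X = x,\, Z = z)\; P(Z = z \mid X = x)
\]
```

The weights P(Z = z | X = x) differ from one value of x to another whenever Z is unevenly distributed across the groups. That is why P(Y | X = a, Z = z) can exceed P(Y | X = b, Z = z) for every z while P(Y | X = a) is still smaller than P(Y | X = b).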

Application Examples

Example 1: University Admission Rates
Overall admission data at a university show that male applicants have a higher acceptance rate than female applicants. However, when admissions are analyzed by individual colleges (e.g., College of Engineering and College of Arts), it becomes clear that women have higher acceptance rates than men in most colleges. This apparent contradiction arises because women tend to apply to colleges with lower overall admission rates, while men tend to apply to colleges with higher admission rates. As a result, the overall data create a misleading impression of gender bias against women.
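
The reversal described above can be reproduced with a small calculation. The following is a minimal sketch: the two colleges and all application counts are hypothetical figures invented for illustration, not data from any real university.

```python
# Hypothetical application counts, invented for illustration only.
# Layout: college -> gender -> (admitted, applied)
admissions = {
    "Engineering": {"men": (480, 800), "women": (65, 100)},
    "Arts": {"men": (40, 200), "women": (225, 900)},
}


def rate(admitted, applied):
    """Acceptance rate as a fraction between 0 and 1."""
    return admitted / applied


def overall_rate(data, gender):
    """Pool admitted and applied counts across all colleges for one gender."""
    admitted = sum(groups[gender][0] for groups in data.values())
    applied = sum(groups[gender][1] for groups in data.values())
    return rate(admitted, applied)


# Acceptance rate within each college, split by gender
per_college = {
    college: {gender: rate(*counts) for gender, counts in groups.items()}
    for college, groups in admissions.items()
}

# Acceptance rate after combining both colleges
overall = {gender: overall_rate(admissions, gender) for gender in ("men", "women")}

for college, rates in per_college.items():
    print(f"{college}: men {rates['men']:.0%}, women {rates['women']:.0%}")
print(f"Overall: men {overall['men']:.0%}, women {overall['women']:.0%}")
```

In this made-up data set women are admitted at a higher rate in both colleges, yet men have the higher combined rate, because most women apply to the college that admits few applicants of either gender.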

Example 2: Effectiveness of Medical Treatment
A study comparing two drugs for treating kidney stones shows that Drug A has a lower overall cure rate than Drug B. However, when patients are grouped by stone size (small vs. large), Drug A outperforms Drug B in both subgroups. The reason is that Drug A is more often used to treat patients with large stones (which have a generally lower cure rate), while Drug B is predominantly used for patients with small stones (which have a higher baseline cure rate). This distribution masks the true effectiveness of Drug A in the aggregated data.
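
Controlling for stone size can be sketched as a size-adjusted comparison. The counts below are illustrative assumptions (the text gives no numbers), and the adjustment shown is plain direct standardization: each drug's subgroup cure rates are averaged using the same stone-size mix, here the mix of all patients combined. The function names are hypothetical helpers, not part of any library.

```python
# Illustrative counts, assumed for this sketch; the text gives no figures.
# Layout: drug -> stone size -> (cured, treated)
trials = {
    "A": {"small": (81, 87), "large": (192, 263)},
    "B": {"small": (234, 270), "large": (55, 80)},
}


def crude_rate(strata):
    """Cure rate after pooling all stone sizes together."""
    cured = sum(c for c, _ in strata.values())
    treated = sum(n for _, n in strata.values())
    return cured / treated


def adjusted_rate(strata, weights):
    """Weighted average of per-size cure rates under a shared size mix."""
    return sum(weights[size] * c / n for size, (c, n) in strata.items())


# Shared weights: each stone size's share of all patients, both drugs combined
totals = {
    size: sum(trials[drug][size][1] for drug in trials)
    for size in ("small", "large")
}
n_all = sum(totals.values())
weights = {size: n / n_all for size, n in totals.items()}

crude = {drug: crude_rate(strata) for drug, strata in trials.items()}
adjusted = {drug: adjusted_rate(strata, weights) for drug, strata in trials.items()}

for drug in trials:
    print(f"Drug {drug}: crude {crude[drug]:.1%}, size-adjusted {adjusted[drug]:.1%}")
```

With these assumed counts the crude comparison favors Drug B, while the size-adjusted comparison favors Drug A, which matches the subgroup results. The choice of weights is a design decision: any fixed size mix applied to both drugs removes the imbalance, and the combined population is only one reasonable option.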

Key Takeaways

1. Be cautious of trends that may reverse or vanish when data are aggregated.
2. Analyze subgroup data thoroughly instead of relying only on overall trends.
3. Identify and account for potential confounding variables or omitted factors.
4. Simpson's Paradox has significant implications in medicine, social sciences, and business decision-making.
5. Understanding this paradox helps prevent misleading conclusions and supports more accurate decision-making.
