Free Throw: The Result Sits Deep
Simulate 20-shot sessions under
- Her real result, 11, sits deep in the lower tail
- Very few simulated sessions made 11 or fewer
Decision: genuine evidence to question the 80% claim
Decide and Justify From the Tail
Both decisions came from tail position, not gut feeling.
- Coin: 1 in 32 → not rare enough → keep the model
- Free throw: deep tail → rare → doubt the claim
Same procedure, two answers, both defensible from the distribution.
A Fair Coin Gave Nine Heads
In our 200 fair-coin trials, a few landed on 9 heads — extreme.
But these came from a coin we built to be fair.
- Does 9 heads prove the coin is unfair?
Commit to yes or no before advancing.
Surprising Is Evidence to Doubt, Not Disproof
That 9-heads trial came from a fair coin — the sim's own counterexample.
- An extreme result can't prove the model false
- It gives reason to doubt, proportional to rarity
Evidence, never proof. This is the core of the standard.
A Fitting Result Never Proves the Model True
A result in the middle is typical — but that doesn't prove the model.
- Many models produce the same unsurprising result
and both make "6 heads" ordinary
"Consistent with" means "not contradicted" — weaker than "true."
Three Cautions About Surprise and Proof
Surprising → reason to doubt, not proof of false
Fits the model → not proof the model is true
Run-to-run wobble is the variability we measure, not a defect
Each caution blocks a different overclaim. Together they keep you honest.
Decide and Justify Two Cases
Spinner model: red
- You observe 8 reds — surprising?
- You observe 4 reds — surprising? (careful!)
Decide and justify both. "Not surprising" ≠ "proven."
Key Takeaways From Lesson Two
✓ Build a reference distribution; read center and spread
✓ Decide by tail position: tiny fraction → surprising
✓ Many reps make the decision robust
Surprising = evidence to doubt, not proof of false
A fitting result never proves the model true
Coming Up Next: The Same Engine, Reused
The tail-reasoning you just learned powers the rest of the unit.
It returns to build a margin of error, and to decide whether a difference between two treatments is real or chance.
Click to begin the narrated lesson
Decide if a model is consistent with data