Unit 1
Core Concepts & Formulas
C1Null & Alternative Hypothesis
H₀ (Null hypothesis): a statement of no effect, no difference, or equality.
H₁ / Hₐ (Alternative hypothesis): the claim we are trying to find evidence for.
H₀ is assumed true until evidence proves otherwise. We never "accept" H₀ — we either reject or fail to reject it.
H₁ / Hₐ (Alternative hypothesis): the claim we are trying to find evidence for.
H₀ is assumed true until evidence proves otherwise. We never "accept" H₀ — we either reject or fail to reject it.
C2Significance Level (α)
α is the probability of rejecting H₀ when it is actually true (Type I error rate).
Common choices: α = 0.05 (5%) or α = 0.01 (1%).
We reject H₀ when p-value ≤ α.
Common choices: α = 0.05 (5%) or α = 0.01 (1%).
We reject H₀ when p-value ≤ α.
C3p-value
The probability of obtaining a test statistic at least as extreme as the observed value, assuming H₀ is true.
Small p-value → strong evidence against H₀.
p-value ≤ α → Reject H₀ | p-value > α → Fail to Reject H₀
Small p-value → strong evidence against H₀.
p-value ≤ α → Reject H₀ | p-value > α → Fail to Reject H₀
C4Test Statistics
z-test (known σ, large n): z = (x̄ − μ₀) / (σ/√n)
One-sample t-test (unknown σ): t = (x̄ − μ₀) / (s/√n), df = n−1
Two-sample t-test: t = (x̄₁ − x̄₂) / √(s₁²/n₁ + s₂²/n₂)
Proportion z-test: z = (p̂ − p₀) / √(p₀(1−p₀)/n)
Chi-square test: χ² = Σ (O − E)² / E
One-sample t-test (unknown σ): t = (x̄ − μ₀) / (s/√n), df = n−1
Two-sample t-test: t = (x̄₁ − x̄₂) / √(s₁²/n₁ + s₂²/n₂)
Proportion z-test: z = (p̂ − p₀) / √(p₀(1−p₀)/n)
Chi-square test: χ² = Σ (O − E)² / E
C5Type I & Type II Errors
Type I Error (α): Rejecting H₀ when it is TRUE — "false positive".
Type II Error (β): Failing to reject H₀ when it is FALSE — "false negative".
Power (1 − β): Probability of correctly rejecting a false H₀. Power increases with larger n or larger effect size.
Type II Error (β): Failing to reject H₀ when it is FALSE — "false negative".
Power (1 − β): Probability of correctly rejecting a false H₀. Power increases with larger n or larger effect size.
C6One-Tailed vs. Two-Tailed Tests
Two-tailed: H₁: μ ≠ μ₀ → reject if |z| or |t| exceeds critical value; p-value × 2.
Left-tailed: H₁: μ < μ₀ → reject in left tail only.
Right-tailed: H₁: μ > μ₀ → reject in right tail only.
Left-tailed: H₁: μ < μ₀ → reject in left tail only.
Right-tailed: H₁: μ > μ₀ → reject in right tail only.
C7Conditions for Each Test
z-test: σ known; population normal or n ≥ 30.
t-test: σ unknown; population approx. normal or n ≥ 30 (CLT).
Proportion z-test: np₀ ≥ 10 and n(1−p₀) ≥ 10; random sample.
Chi-square GOF: all expected counts E ≥ 5; random sample.
Chi-square independence: all E ≥ 5; random sample; two categorical variables.
t-test: σ unknown; population approx. normal or n ≥ 30 (CLT).
Proportion z-test: np₀ ≥ 10 and n(1−p₀) ≥ 10; random sample.
Chi-square GOF: all expected counts E ≥ 5; random sample.
Chi-square independence: all E ≥ 5; random sample; two categorical variables.
C8Power & Sample Size
Power = P(reject H₀ | H₀ is false) = 1 − β.
Power increases when: n ↑, α ↑, effect size ↑, or σ ↓.
Minimum sample size for a proportion CI: n ≥ (z*/ME)² · p̂(1−p̂)
Power increases when: n ↑, α ↑, effect size ↑, or σ ↓.
Minimum sample size for a proportion CI: n ≥ (z*/ME)² · p̂(1−p̂)
C9Confidence Intervals & Tests
A two-tailed test at level α corresponds to a (1 − α)×100% CI.
If μ₀ falls outside the CI → reject H₀ at level α.
CI formula: x̄ ± z* · (σ/√n) or x̄ ± t* · (s/√n)
If μ₀ falls outside the CI → reject H₀ at level α.
CI formula: x̄ ± z* · (σ/√n) or x̄ ± t* · (s/√n)
C10Paired t-Test
Used when two measurements come from the same subject (before/after).
Let d = difference for each pair: t = d̄ / (s_d / √n), df = n−1.
This eliminates individual variability and increases power.
Let d = difference for each pair: t = d̄ / (s_d / √n), df = n−1.
This eliminates individual variability and increases power.
⭐ Must-Memorize Formulas & Rules
z-stat: z = (x̄ − μ₀)/(σ/√n)
t-stat (1-sample): t = (x̄ − μ₀)/(s/√n), df = n−1
Proportion: z = (p̂ − p₀)/√(p₀q₀/n)
Chi-square: χ² = Σ(O−E)²/E, df = (r−1)(c−1)
p-value ≤ α → Reject H₀ (statistically significant)
Type I error rate = α ; Type II error rate = β ; Power = 1 − β
Two-tailed p-value = 2 × (one-tail area)
Paired t: t = d̄ / (s_d/√n), where d = x₁ − x₂ for each pair
Unit 2
Concept Examples
Example 1 — One-Sample z-Test
A factory claims its bolts have mean diameter μ = 10 mm. A sample of n = 64 bolts gives x̄ = 10.15 mm, σ = 0.8 mm. Test at α = 0.05.
Step 1: H₀: μ = 10; H₁: μ ≠ 10 (two-tailed)
Step 2: z = (10.15 − 10)/(0.8/√64) = 0.15/0.1 = 1.50
Step 3: Critical z = ±1.96. Since |1.50| < 1.96, fail to reject H₀.
Step 2: z = (10.15 − 10)/(0.8/√64) = 0.15/0.1 = 1.50
Step 3: Critical z = ±1.96. Since |1.50| < 1.96, fail to reject H₀.
✓ Conclusion: Insufficient evidence to reject the claim. p-value ≈ 0.134 > 0.05.
Example 2 — Proportion z-Test
A company claims 60% of customers are satisfied. In a random sample of 200 customers, 108 are satisfied. Test at α = 0.05.
p̂ = 108/200 = 0.54
H₀: p = 0.60; H₁: p ≠ 0.60
z = (0.54 − 0.60)/√(0.60 × 0.40/200) = −0.06/0.0346 ≈ −1.73
p-value ≈ 0.083 > 0.05
H₀: p = 0.60; H₁: p ≠ 0.60
z = (0.54 − 0.60)/√(0.60 × 0.40/200) = −0.06/0.0346 ≈ −1.73
p-value ≈ 0.083 > 0.05
✓ Fail to reject H₀. No significant evidence the satisfaction rate differs from 60%.
Example 3 — Type I & II Error Context
A medical test screens for a disease. H₀: patient is healthy.
Type I Error: Diagnosing a healthy patient as sick (false positive). Probability = α.
Type II Error: Missing a sick patient (false negative). Probability = β.
In medicine, reducing β (increasing power) is often most critical.
Type II Error: Missing a sick patient (false negative). Probability = β.
In medicine, reducing β (increasing power) is often most critical.
✓ Lowering α raises β (if n is fixed). Increasing n reduces both.