IMS1 Ch. 17
Math 215
cpr 1 dataset we explored in Ch. 14| group | died | survived | total |
|---|---|---|---|
| control | 39 | 11 | 50 |
| treatment | 26 | 14 | 40 |
| total | 65 | 25 | 90 |
Difference in proportions of “survived”: \[\hat{p}_T-\hat{p}_C=\frac{14}{40}-\frac{11}{50}=0.13\]

Sampling distribution of \(\hat{p}_1-\hat{p}_2\)
The sampling distribution of \(\hat{p}_1-\hat{p}_2\) based on samples of size \(n_1\) and \(n_2\) and population proportions \(p_1\) and \(p_2\) will be approximately normal with mean \(p_1-p_2\) and standard error \[SE(\hat{p}_1-\hat{p}_2)=\sqrt{\frac{p_1(1-p_1)}{n_1}+\frac{p_2(1-p_2)}{n_2}}\]
if the following technical conditions are met:
In the CPR example we expect
Since there are at least 10 expected successes and failures in each group a normal approximation of the null distribution is appropriate
For the CPR example the Z score is \[Z=\frac{(\hat{p}_T-\hat{p}_C)-0}{SE}=\frac{0.13}{0.095}=1.37\]
Standard normal curve with shaded area corresponding to p-value
The 2-sided p-value is the area under the density curve for \(N(0,1)\) that is more extreme than the observed difference (\(\leq-1.37\) or \(\geq1.37\))
Compare this p-value (0.171) to the one we calculated using random permutation (0.166)
Let’s compute 1,000 differences in bootstrapped proportions using the CPR data.
The 95% bootstrap percentile confidence interval for the difference in survival rates (treatment - control) is between -0.055 and 0.313.
| Type | Interval |
|---|---|
| Bootstrap Percentile | (-0.055, 0.313) |
| Bootstrap SE | (-0.055, 0.315) |
| Normal Approximation | (-0.057, 0.317) |