Inference for a Single Proportion

IMS2 Ch. 16
Math 115

Yurk

Medical Consultants

  • Some organ donors work with a medical consultant who helps them throughout the process
  • The average complication rate for liver donor surgeries in the United States is about 10%
  • One consultant claims she has low rate of complications compared to national average. Is her claim supported?
  • Let \(p\) be the consultant’s long-run complication rate

Hypotheses:

  • \(H_0: p = 0.1\)
  • \(H_A: p < 0.1\)

Data:

  • consult data set is available on our Moodle page
  • She has served as a consultant for 62 liver donor surgeries
    • 3 (4.8%) resulted in complications
    • \(\hat{p} = 0.048\)

Payday Loan Regulations

  • Borrowers use payday loans to get a cash advance before their next payday
  • Borrower writes a check for loan amount + service fee
  • Lender holds check until borrower’s payday
  • Very high APR equivalent (often over 300%)
  • Some borrowers take out second loan to pay off first, and so on
  • Michigan already has a law that limits the number of payday loans a borrower can hold (2)
  • Do most payday borrowers support additional regulation that would require payday lenders to do a credit check?
  • Let \(p\) be the proportion of payday borrowers in MI that support additional regulation.

Hypotheses:

  • \(H_0: p = 0.5\)
  • \(H_A: p > 0.5\)

Data:

  • payday data set is available on our Moodle page
  • Researchers selected a random sample of 826 payday borrowers
    • 424 (51.3%) said they would support a regulation
    • \(\hat{p}=0.513\)

Mathematical Model for a Proportion

  • We have learned that if certain conditions are met we can use a mathematical model to make inferences about a population
  • There is a version of the Central Limit Theorem for a single proportion

Sampling distribution of \(\hat{p}\)

The sampling distribution of \(\hat{p}\) based on a sample of size \(n\) from a poplation with true proportion \(p\) will be approximately normal with mean \(p\) and standard error \[SE=\sqrt{\frac{p(1-p)}{n}}\]
if the following technical conditions are met:

  1. independent observations (e.g., observations from SRS)
  2. (success-failure condition) at least 10 expected successes and at least 10 expected failures. (i.e., \(np\geq 10\) and \(n(1-p)\geq 10\))

Checking Technical Conditions

Consultant study

  • The success-failure condition is not met. Under the null hypothesis, we expect \(62\times 0.1 = 6.2\) complications (less than 10)
  • Cannot model null distribution using a normal distribution
  • Use randomization instead (parametric bootstrap simulation)

Payday study

  • The success-failure condition is met. Under the null hypothesis, we expect \(0.5\times 826 = 413\) people to support the legislation and \((1-0.5)\times 826 = 413\) to not support the legislation.
  • It is appropriate to model the null distribution using a normal distribution

Hypothesis Test Using Normal Approximation

  • Let \(p_0\) be the proportion under the null hypothesis
  • We will use the Z score as the test statistic \[Z = \frac{\hat{p}-p_0}{SE(\hat{p})}=\frac{\hat{p}-p_0}{\sqrt{p_0(1-p_0)/n}}\]
  • Recall that if \(\hat{p}\) is normally distributed, then \(Z\) has a standard normal distribution, \(N(0,1)\)
  • For the payday study \(p_0=0.5\), and \(\hat{p}=0.513\), so \[Z = \frac{0.513 - 0.5}{\sqrt{0.5\cdot(1-0.5)/826}}=0.765\]
  • The p-value is the probability that we would obtain a \(Z\) score at least as large as 0.765 if the null hypothesis is true
  • We compute the p-value by finding the area under the density curve for \(N(0,1)\) that is beyond 0.765

Normal model, \(N(0,1)\). P-value is area to the right of Z = 0.765 (shaded region).

  • We are unable to reject the null hypothesis (p-value = 0.222)
  • Note that we cannot claim that 50% of payday borrowers support the new legislation (we cannot accept the null hypothesis)
  • However, 50% is a plausible value for the parameter

Hypothesis Test Using Randomization

  • In the consultant study we cannot use a normal model for the null distribution
  • However, we can use parametric bootstrap simulation to approximate the null distribution
  • We simulate 100 random samples of 62 liver donors from a population in which the null hypothesis is true (10% complication rate)

Parametric bootstrap simulation is equivalent to the following physical simulation:

  • For each donor simulate the outcome by spinning a spinner with 10% of the area representing “complication” and 90% representing “no complication”
  • For each sample, spin the spinner 62 times and record the proportion of complications in the sample
  • Repeat to obtain proportions for 100 simulated samples
  • We can do the simulations using the Jamovi module
  • Dotplot of 100 simulated proportions assuming \(H_0\)

Approximate null distribution with observed proportion of surgeries with complications (0.048) inticated by dashed vertical line.

  • 9 out of 100 simulated proportions were less than or equal to the observed value of \(\hat{p}=0.048\)
  • The approximate p-value is 9/100 = 0.09
  • A p-value based on more simulations will be more accurate
  • Let’s do 1,000 simulations this time
  • With a large number of simulations, it is better to visualize the simulated proportions using a histogram

Approximate null distribution with observed proportion of surgeries with complications (0.048) inticated by dashed vertical line.

  • 111 out of 1,000 simulated proportions were less than or equal to the observed value of \(\hat{p}=0.048\)
  • The approximate p-value is 111/1,000 = 0.111
  • With a p-value of 0.111 we are unable to reject the null hypothesis at the \(\alpha=0.05\) level
  • It is plausible that the consultant has the same complication rate as the national average of 10%

Confidence Interval

  • We can also use a normal distribution to find a confidence interval if technical conditions are met
  • Earlier we used \(p_0\) as the mean and in the computation of SE, because we were trying to approximate the null distribution
  • A confidence interval estimates the value of the parameter
  • The best point-estimate we have is the sample proportion \(\hat{p}\), so we use that as the mean and in the computation of SE

Checking Conditions for CI

  • The success-failure condition is easier to check in this situation.
  • \(n\hat{p}\) is the number of observed success, and \(n(1-\hat{p})\) is the number of observed failures.
  • We just need to check if there were 10 successes and 10 failures in the sample.

Consultant study

  • The success-failure condition is not met. There were 3 successes and 59 failures
  • Cannot use normal approximation to find a CI
  • Use randomization instead (bootstrap as in Chapter 12)

Payday study

  • The success-failure condition is met. There were 424 successes and 402 failures
  • It is appropriate to use a normal approximation to find a CI

Confidence Interval Using a Normal Approximation

  • If a normal approximation is appropriate, a confidence interval for a proportion can be written as \[\hat{p}\pm z^{\ast}\times SE\]
  • SE is estimated using \[SE\approx\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\]
  • \(z^{\ast}\) is determined by the confidence level (e.g., 1.96 for 95%, 2.58 for 99%)
  • The standard error for the proportion of borrowers that support the new regulation is \[SE \approx \sqrt{\frac{0.513\cdot(1-0.513)}{826}}=0.0174\]
  • The 95% confidence interval is \[0.513\pm1.96\cdot0.0174 = 0.513\pm0.034\]
  • We are 95% confident that the proportion of payday borrowers that support the new regulation is between 0.479 and 0.547

Confidence Interval Using a Bootstrapping

  • We use bootstrapping to find a 95% bootstrap percentile confidence interval for the complication rate for the medical consultant
  • This time we take repeated samples (with replacement) from our original sample to approximate the variability of the sampling distribution
  • This is different from the parametric bootstrap simulation we used for hypothesis testing, because it does not assume \(H_0\) is true
  • We will start with 100 bootstraps, because it is easier to visualize

100 bootstrap proportions.

  • The 95% bootstrap percentile confidence interval is obtained by calculating the 2.5% and 97.5% percentiles for the bootstrapped statistics
  • This gives us an approximate 95% confidence interval for the long-run complication proportion of 0.00766 to 0.105
  • We can get a better estimate by increasing the number of bootstraps to 1,000

1,000 bootstrap proportions.

  • This gives us an approximate 95% confidence interval for the long-run complication proportion of 0 to 0.113
  • We are 95% confident that the consultant’s long-run complication rate is between 0 and 0.113
  • Note that the national rate (0.1) is within the confidence interval, indicating that it is plausible that the consultant has the same complication rate as the national average