10.1: Inference for Two Proportions
In Chapter 9, we covered the test for comparing a proportion to a hypothesized value. In this section we want to explore a test to compare two population proportions.
Like testing means, the usual null hypothesis will be that proportions are the same. We will usually denote each of the two proportions with a subscript, say 1 and 2. Here are some possible two‐tailed and one‐tailed Hypotheses:
𝐻𝑜:𝑝1=𝑝2𝐻𝑎:𝑝1≠𝑝2𝐻𝑜:𝑝1≥𝑝2𝐻𝑎:𝑝1<𝑝2𝐻𝑜:𝑝1≤𝑝2𝐻𝑎:𝑝1>𝑝2
Notice that the Null Hypothesis can be written as 𝐻𝑜:𝑝1−𝑝2=0, meaning we want to look at the distribution of the difference of sample proportions as a random variable.
Distribution of difference of sample proportions
Suppose we take a sample of 𝑛1
from population 1 and 𝑛2 from population 2. Let 𝑋1 be the number of success in sample 1 and 𝑋2
be the number of success in sample 2.
𝑝̂ 1=𝑋1𝑛1
represents the proportion of successes in sample 1
𝑝̂ 2=𝑋2𝑛2
represents the proportion of successes in sample 2
As long as there are at least 10 successes and 10 failures in each sample, then the difference of sample proportions 𝑝̂ 1−𝑝̂ 2
will have a Normal Distribution.
𝜇𝑝̂ 1−𝑝̂ 2=𝑝1−𝑝2
𝜎𝑝̂ 1−𝑝̂ 2=𝑝1(1−𝑝1)𝑛1+𝑝2(1−𝑝2)𝑛2‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√
If 𝑛1𝑝1,𝑛1(1−𝑝1),𝑛2𝑝2,𝑛2(1−𝑝2) are all at least 10, then the Probability Distribution of 𝑝̂ 1−𝑝̂ 2 is approximately Normal.
Combining all of the above into a single formula:
Example
12% of North Americans claim left‐handedness. With regard to gender, men are slightly more likely than women to be left‐handed, with most studies indicating that about 13% of men and about 11% of women are left‐handed82.
𝑝𝑚
= 0.13 = proportion of men who are left‐handed
𝑝𝑤
= 0.11 = proportion of women who are left‐handed
𝑝𝑚‐𝑝𝑤
= difference in proportion of men and women who are left‐handed
Solution
Suppose we take a sample of 100 men and 150 women. Let’s investigate the random variable 𝑝̂ 𝑚−𝑝̂ 𝑤
100(0.13) = 13 100(1‐0.13) = 87
150(0.11) = 16.5 150(1‐0.11) = 133.5
Since all values are greater than 10, 𝑝̂ 𝑚−𝑝̂ 𝑤
has approximately a normal distribution.
𝜇𝑝̂ 𝑚−𝑝̂ 𝑤=0.13−0.11=0.02
𝜎𝑝̂ 𝑚−𝑝̂ 𝑤=0.13(1−0.13)100+0.11(1−0.11)150‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√=0.0422
Hypothesis test for difference of proportions
In conducting a Hypothesis test where the Null hypothesis assumes equal proportions, it is best practice to pool or combine the sample proportions into a single estimated proportion 𝑝¯
, and use an estimated standard error, 𝑆𝑝̂ 𝑚−𝑝̂ 𝑤
:
𝑝¯=𝑋1+𝑋2𝑛1+𝑛2
𝑠𝑝̂ 1−𝑝̂ 2=𝑝¯(1−𝑝¯)𝑛1+𝑝¯(1−𝑝¯)𝑛2‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√
The test statistic will have a Normal Distribution as long as there are at least 10 successes and 10 failures in both samples.
𝑍=(𝑝̂ 1−𝑝̂ 2)−(𝑝1−𝑝2)𝑝¯(1−𝑝¯)𝑛1+𝑝¯(1−𝑝¯)𝑛2‾‾‾‾‾‾‾‾‾‾‾‾‾‾√
Example
Under current United States law, private sales between owners are exempt from background check requirements. This is sometimes called the “Gun Show Loophole” as it may allow criminals, terrorists and the mentally ill to purchase assault weapons, such as those used in mass shootings.83
In an August 2016 Study, Pew Research analyzed American’s opinions about gun laws and rights.84 Pew took a representative sample of 990 men and 1020 women and asked them several questions. In particular, they asked the sampled Americans if background checks required at gun stores should be made universal and extended to all sales of guns between private owners or at gun shows. 772 out 990 men said yes, while 857 out of 1020 women said yes.
Is there a difference in the proportion of men and women who support universal background checks for purchasing guns? Design and conduct the test with a significance level of 1%.
Solution
Design
𝐻𝑜:𝑝𝑚=𝑝𝑤
(There is no difference in the proportion of support for background checks by gender)
𝐻𝑎:𝑝𝑚≠𝑝𝑤
(There is a difference in the proportion of support for background checks by gender)
Model: Two proportion 𝑍
test. This is a two‐tailed test with 𝛼
= 0.01.
Model Assumptions: for men there are 772 yes and 218 no. For women there are 857 yes and 163 no. Since all these numbers exceed 10, the model is appropriate.
Decision Rules:
Critical Value Method ‐ Reject 𝐻𝑜
if 𝑍 > 2.58 or 𝑍
< ‐2.58.
𝑃
‐value method ‐ Reject 𝐻𝑜 if 𝑝
‐value <0.01
Data/Results
𝑝̂ 𝑚=772990=0.780𝑝̂ 𝑤=8571020=0.840𝑝¯=772+857990+1020=0.810
𝑍=(0.780−0.840)−00.810(1−0.810)990+0.810(1−0.810)1020‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√=−3.45p-value =0.0005<𝛼
Reject 𝐻𝑜
under both methods
Conclusion
There is a difference in the proportion of support for background checks by gender. Women are more likely to support background checks.
“Introductory Statistics Inferential Statistics and Probability – A Holistic Approach (Geraghty) Inferential Statistics and Probability – A Holistic Approach” by Maurice A. Geraghty is licensed under CC BY-SA 4.0