10.1: Inference for Two Proportions

In Chapter 9, we covered the test for comparing a proportion to a hypothesized value. In this section we want to explore a test to compare two population proportions.

Like testing means, the usual null hypothesis will be that proportions are the same. We will usually denote each of the two proportions with a subscript, say 1 and 2. Here are some possible two‐tailed and one‐tailed Hypotheses:

𝐻𝑜:𝑝1=𝑝2𝐻𝑎:𝑝1𝑝2𝐻𝑜:𝑝1𝑝2𝐻𝑎:𝑝1<𝑝2𝐻𝑜:𝑝1𝑝2𝐻𝑎:𝑝1>𝑝2

Notice that the Null Hypothesis can be written as 𝐻𝑜:𝑝1𝑝2=0, meaning we want to look at the distribution of the difference of sample proportions as a random variable.

 

Distribution of difference of sample proportions

Suppose we take a sample of 𝑛1

from population 1 and 𝑛2 from population 2. Let 𝑋1 be the number of success in sample 1 and 𝑋2

be the number of success in sample 2.

𝑝̂ 1=𝑋1𝑛1

represents the proportion of successes in sample 1

𝑝̂ 2=𝑋2𝑛2

represents the proportion of successes in sample 2

As long as there are at least 10 successes and 10 failures in each sample, then the difference of sample proportions 𝑝̂ 1𝑝̂ 2

will have a Normal Distribution.

 

Central Limit Theorem for the difference of proportions 𝑝̂ 1𝑝̂ 2

𝜇𝑝̂ 1𝑝̂ 2=𝑝1𝑝2

𝜎𝑝̂ 1𝑝̂ 2=𝑝1(1𝑝1)𝑛1+𝑝2(1𝑝2)𝑛2‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√

If 𝑛1𝑝1,𝑛1(1𝑝1),𝑛2𝑝2,𝑛2(1𝑝2) are all at least 10, then  the Probability Distribution of 𝑝̂ 1𝑝̂ 2 is approximately Normal.

Combining all of the above into a single formula:

𝑍=(𝑝̂ 1𝑝̂ 2)(𝑝1𝑝2)𝑝1(1𝑝1)𝑛1+𝑝2(1𝑝2)𝑛2‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√

Example

12% of North Americans claim left‐handedness. With regard to gender, men are slightly more likely than women to be left‐handed, with most studies indicating that about 13% of men and about 11% of women are left‐handed82.

𝑝𝑚

= 0.13 = proportion of men who are left‐handed

𝑝𝑤

= 0.11 = proportion of women who are left‐handed

𝑝𝑚𝑝𝑤

= difference in proportion of men and women who are left‐handed

Solution

Suppose we take a sample of 100 men and 150 women. Let’s investigate the random variable 𝑝̂ 𝑚𝑝̂ 𝑤

100(0.13) = 13            100(1‐0.13) = 87

150(0.11) = 16.5         150(1‐0.11) = 133.5

Since all values are greater than 10, 𝑝̂ 𝑚𝑝̂ 𝑤

has approximately a normal distribution.

𝜇𝑝̂ 𝑚𝑝̂ 𝑤=0.130.11=0.02

𝜎𝑝̂ 𝑚𝑝̂ 𝑤=0.13(10.13)100+0.11(10.11)150‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√=0.0422

 

Hypothesis test for difference of proportions

In conducting a Hypothesis test where the Null hypothesis assumes equal proportions, it is best practice to pool or combine the sample proportions into a single estimated proportion 𝑝¯

, and use an estimated standard error, 𝑆𝑝̂ 𝑚𝑝̂ 𝑤

:

𝑝¯=𝑋1+𝑋2𝑛1+𝑛2

𝑠𝑝̂ 1𝑝̂ 2=𝑝¯(1𝑝¯)𝑛1+𝑝¯(1𝑝¯)𝑛2‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√

The test statistic will have a Normal Distribution as long as there are at least 10 successes and 10 failures in both samples.

𝑍=(𝑝̂ 1𝑝̂ 2)(𝑝1𝑝2)𝑝¯(1𝑝¯)𝑛1+𝑝¯(1𝑝¯)𝑛2‾‾‾‾‾‾‾‾‾‾‾‾‾‾√

Example

clipboard_e22bc23857f2ae9154d71fd0d5a6db557.png

Under current United States law, private sales between owners are exempt from background check requirements. This is sometimes called the “Gun Show Loophole” as it may allow criminals, terrorists and the mentally ill to purchase assault weapons, such as those used in mass shootings.83

In an August 2016 Study, Pew Research analyzed American’s opinions about gun laws and rights.84 Pew took a representative sample of 990 men and 1020 women and asked them several questions. In particular, they asked the sampled Americans if background checks required at gun stores should be made universal and extended to all sales of guns between private owners or at gun shows. 772 out 990 men said yes, while 857 out of 1020 women said yes.

Is there a difference in the proportion of men and women who support universal background checks for purchasing guns? Design and conduct the test with a significance level of 1%.

Solution

Design

𝐻𝑜:𝑝𝑚=𝑝𝑤

(There is no difference in the proportion of support for background checks by gender)

𝐻𝑎:𝑝𝑚𝑝𝑤

(There is a difference in the proportion of support for background checks by gender)

Model: Two proportion 𝑍

test. This is a two‐tailed test with 𝛼

 = 0.01.

Model Assumptions: for men there are 772 yes and 218 no. For women there are 857 yes and 163 no. Since all these numbers exceed 10, the model is appropriate.

Decision Rules:

Critical Value Method ‐ Reject 𝐻𝑜

if 𝑍 > 2.58 or 𝑍

< ‐2.58.

𝑃

‐value method ‐ Reject 𝐻𝑜 if 𝑝

‐value <0.01

Data/Results

𝑝̂ 𝑚=772990=0.780𝑝̂ 𝑤=8571020=0.840𝑝¯=772+857990+1020=0.810

𝑍=(0.7800.840)00.810(10.810)990+0.810(10.810)1020‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾‾√=3.45p-value =0.0005<𝛼

Reject 𝐻𝑜

under both methods

Conclusion

There is a difference in the proportion of support for background checks by gender. Women are more likely to support background checks.

 

“Introductory Statistics Inferential Statistics and Probability – A Holistic Approach (Geraghty) Inferential Statistics and Probability – A Holistic Approach” by Maurice A. Geraghty is licensed under CC BY-SA 4.0

Share This Book