10.1: Inference for Two Proportions

Colorado Online

10.1: Inference for Two Proportions

In Chapter 9, we covered the test for comparing a proportion to a hypothesized value. In this section we want to explore a test to compare two population proportions.

Like testing means, the usual null hypothesis will be that proportions are the same. We will usually denote each of the two proportions with a subscript, say 1 and 2. Here are some possible two‐tailed and one‐tailed Hypotheses:

[latex]H_{0}:p_{1}=p_{2}[/latex] [latex]H_{0}:p_{1}\geq p_{2}[/latex] [latex]H_{0}:p_{1} \leq p_{2}[/latex]

[latex]H_{a}:p_{1}\neq p_{2}[/latex] [latex]H_{a}:p_{1}< p_{2}[/latex] [latex]H_{a}:p_{1}> p_{2}[/latex]

Notice that the Null Hypothesis can be written as [latex]H_{0}:p_{1}-p_{2}=0[/latex], meaning we want to look at the distribution of the difference of sample proportions as a random variable.

Distribution of difference of sample proportions

Suppose we take a sample of n₁from population 1 and n₂ from population 2. Let X₁be the number of successes in sample 1 and X₂ be the number of successes in sample 2.

[latex]\hat{p_{1}}=\frac{X_{1}}{n_{1}}[/latex] represents the proportion of successes in sample 1

[latex]\hat{p_{2}}=\frac{X_{2}}{n_{2}}[/latex] represents the proportion of successes in sample 2

As long as there are at least 10 successes and 10 failures in each sample, then the difference of sample proportions [latex]\hat{p_{1}}-\hat{p_{2}}[/latex] will have a Normal Distribution.

Central Limit Theorem for the difference of proportions [latex]\hat{p}_{1}-\hat{p}_{2}[/latex]

[latex]\mu _{\hat{p}_{1}-\hat{p}_{1}}=p_{1}-p_{2}[/latex]

[latex]\sigma _{\hat{p}_{1}-\hat{p}_{2}}=\sqrt{\frac{p_{1}(1-p_{1})}{n_{1}}+\frac{p_{2}(1-p_{2})}{n_{2}}}[/latex]

If [latex]n_{1}p_{1},n_{1}(1-p_{1}),n_{2}p_{2},n_{2}(1-p_{2})[/latex] are all at least 10, then the Probability Distribution of [latex]\hat{p}_{1}-\hat{p}_{2}[/latex] is approximately Normal.

Combining all of the above into a single formula:

[latex]\large Z=\frac{(\hat{p}_{1}-\hat{p}_{2})-(p_{1}-\hat{p}_{2})}{\sqrt{\frac{p_{1}(1-p_{1})}{n_{1}}+\frac{p_{2}(1-p_{2})}{n2}}}[/latex]

Example

12% of North Americans claim left‐handedness. With regard to gender, men are slightly more likely than women to be left‐handed, with most studies indicating that about 13% of men and about 11% of women are left‐handed⁸².

p_m= 0.13 = proportion of men who are left‐handed

p_w= 0.11 = proportion of women who are left‐handed

p_m– p_w = difference in proportion of men and women who are left‐handed

Solution

Suppose we take a sample of 100 men and 150 women. Let’s investigate the random variable [latex]\hat{p_{m}}-\hat{p_{w}}[/latex]

100(0.13) = 13 100(1‐0.13) = 87

150(0.11) = 16.5 150(1‐0.11) = 133.5

Since all values are greater than 10, [latex]\hat{p_{m}}-\hat{p_{w}}[/latex] has approximately a normal distribution.

[latex]\mu _{\hat{p}_{m}-\hat{p}_{w}}=0.13-0.11=0.02[/latex]
[latex]\sigma_{\hat{p}_{m}-\hat{p_{w}}}=\sqrt{\frac{0.13(1-0.13)}{100}+\frac{0.11(1-0.11)}{150}}=0.0422[/latex]

Hypothesis test for difference of proportions

In conducting a Hypothesis test where the Null hypothesis assumes equal proportions, it is best practice to pool or combine the sample proportions into a single estimated proportion [latex]\bar{p}[/latex], and use an estimated standard error, [latex]S_{\hat{p}_{m}-\hat{p}_{w}}[/latex]:

[latex]\bar{p}=\frac{X_{1}+X_{2}}{n_{1}+n_{2}}[/latex]

[latex]S_{\hat{p}_{1}-\hat{p}_{2}}=\sqrt{\frac{\bar{p}(1-\bar{p})}{n_{1}}+\frac{\bar{p}(1-\bar{p})}{n_{2}}}[/latex]

The test statistic will have a Normal Distribution as long as there are at least 10 successes and 10 failures in both samples.

[latex]\large Z=\frac{(\hat{p}_{1}-\hat{p}_{2})-(p_{1}-p_{2})}{\sqrt{\frac{\bar{p}(1-\bar{p})}{n_{1}}+\frac{\bar{p}(1-\bar{p})}{n_{2}}}}[/latex]

Example

Under current United States law, private sales between owners are exempt from background check requirements. This is sometimes called the “Gun Show Loophole” as it may allow criminals, terrorists and the mentally ill to purchase assault weapons, such as those used in mass shootings.⁸³

In an August 2016 Study, Pew Research analyzed American’s opinions about gun laws and rights.⁸⁴ Pew took a representative sample of 990 men and 1020 women and asked them several questions. In particular, they asked the sampled Americans if background checks required at gun stores should be made universal and extended to all sales of guns between private owners or at gun shows. 772 out 990 men said yes, while 857 out of 1020 women said yes.

Is there a difference in the proportion of men and women who support universal background checks for purchasing guns? Design and conduct the test with a significance level of 1%.

Solution

Design

[latex]H_{0}:p_{m}=p_{w}[/latex] (There is no difference in the proportion of support for background checks by gender)

[latex]H_{a}:p_{m}\neq p_{w}[/latex] (There is a difference in the proportion of support for background checks by gender)

Model: Two proportion [latex]Z[/latex] test. This is a two‐tailed test with [latex]\alpha =0.01[/latex]

Model Assumptions: for men there are 772 yes and 218 no. For women there are 857 yes and 163 no. Since all these numbers exceed 10, the model is appropriate.

Decision Rules:

Critical Value Method ‐ Reject [latex]H_{0}[/latex] if [latex]Z>2.58[/latex] or [latex]Z<-2.58[/latex].

>𝑃‐value method ‐ Reject [latex]H_{0}[/latex] if 𝑝‐value <0.01

Data/Results

[latex]\hat{P}_{m}=\frac{772}{990}=0.870[/latex]

[latex]\hat{P}_{w}=\frac{857}{1020}=0.840[/latex]

[latex]\bar{P}=\frac{772+857}{990+1020}=0.810[/latex]

[latex]Z=\frac{(0.780-0.840)-0}{\sqrt{\frac{0.810)(1-0.810}{990}+\frac{0.810(1-0.810)}{1020}}}=-3.45[/latex], p-value [latex]=0.0005<\alpha[/latex]

Reject [latex]H_{0}[/latex] under both methods

Conclusion

There is a difference in the proportion of support for background checks by gender. Women are more likely to support background checks.

“Introductory Statistics Inferential Statistics and Probability – A Holistic Approach (Geraghty) Inferential Statistics and Probability – A Holistic Approach” by Maurice A. Geraghty is licensed under CC BY-SA 4.0

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Distribution of difference of sample proportions

Central Limit Theorem for the difference of proportions [latex]\hat{p}_{1}-\hat{p}_{2}[/latex]

License

Share This Book