{"id":534,"date":"2024-10-18T02:30:23","date_gmt":"2024-10-18T02:30:23","guid":{"rendered":"https:\/\/pressbooks.ccconline.org\/mat1260\/?post_type=chapter&#038;p=534"},"modified":"2025-01-08T21:51:53","modified_gmt":"2025-01-08T21:51:53","slug":"7-3-sampling-distribution-of-the-sample-proportions","status":"publish","type":"chapter","link":"https:\/\/pressbooks.ccconline.org\/mat1260\/chapter\/7-3-sampling-distribution-of-the-sample-proportions\/","title":{"raw":"7.3: Sampling Distribution of the Sample Proportions","rendered":"7.3: Sampling Distribution of the Sample Proportions"},"content":{"raw":"<div id=\"lobjh\" class=\"\">\r\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<h2 class=\"textbox__title\">Learning Objectives<\/h2>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ul>\r\n \t<li id=\"apply_sampling_distribution_of_sample_proportion\">Apply the sampling distribution of the sample proportion (when appropriate). In particular, be able to identify unusual samples from a given population.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>\r\nThe first step to drawing conclusions about parameters based on the accompanying statistics is to understand how sample statistics behave relative to the parameter that summarizes the entire population. We begin with the behavior of sample proportion relative to population proportion (when the variable of interest is categorical). After that, we will explore the behavior of sample mean relative to population mean (when the variable of interest is quantitative).\r\n\r\n<\/div>\r\n<div id=\"N10B0C\" class=\"section\">\r\n<div class=\"sectionContain\">\r\n<h2><span title=\"Quick scroll up\">Behavior of Sample Proportion p\u0302<\/span><\/h2>\r\n<div class=\"examplewrap\">\r\n<div class=\"example clearfix\">\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Example<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<p id=\"N10B17\">Approximately 60% of all part-time college students in the United States are female. (In other words, the population proportion of females among part-time college students is p = .6.) What would you expect to see in terms of the behavior of sample proportion of females ([latex]\\hat{p}[\/latex]) if random samples of size 100 were taken from the population of all part-time college students?<\/p>\r\n<p id=\"N10B2B\">As we saw before, due to sampling variability, sample proportion in random samples of size 100 will take numerical values which vary according to the laws of chance: in other words, sample proportion is a<em>\u00a0random variable<\/em>. To summarize the behavior of any random variable, we focus on three features of its distribution: the center, the spread, and the shape.<\/p>\r\n<p id=\"N10B31\">Based only on our intuition, we would expect the following:<\/p>\r\n<p id=\"N10B35\"><em>Center<\/em>: Some sample proportions will be on the low sidesay, .55 or .58\u2014 while others will be on the high side\u2014 say, .61 or .66. It is reasonable to expect all the sample proportions in repeated random samples to average out to the underlying population proportion, .6. In other words, the mean of the distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0should be p.<\/p>\r\n<p id=\"N10B4B\"><em>Spread<\/em>: For samples of 100, we would expect sample proportions of females not to stray too far from the population proportion .6. Sample proportions lower than .5 or higher than .7 would be rather surprising. On the other hand, if we were only taking samples of size 10, we would not be at all surprised by a sample proportion of females even as low as 4\/10 = .4, or as high as 8\/10 = .8. Thus, sample size plays a role in the spread of the distribution of sample proportion: there should be less spread for larger samples, more spread for smaller samples.<\/p>\r\n<p id=\"N10B50\"><em>Shape<\/em>: Sample proportions closest to .6 would be most common, and sample proportions far from .6 in either direction would be progressively less likely. In other words, the shape of the distribution of sample proportion should bulge in the middle and taper at the ends: it should be somewhat\u00a0<em>normal.<\/em><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<h2>Comment<\/h2>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"N10B5E\" class=\"section\">\r\n<div class=\"sectionContain\">\r\n<p id=\"N10B65\">The\u00a0<em>distribution<\/em>\u00a0of the values of the sample proportions ([latex]\\hat{p}[\/latex]) in repeated\u00a0<em>samples<\/em>\u00a0is called the\u00a0<em>sampling distribution of\u00a0<\/em>[latex]\\hat{p}[\/latex].<\/p>\r\n<p id=\"N10B95\">The purpose of the next activity is to check whether our intuition about the center, spread and shape of the sampling distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0was right via simulations.<\/p>\r\n\r\n<div class=\"figurewrap\">\r\n<div class=\"figure clearfix\">\r\n<div id=\"uwrap__i_0\" class=\"youtube\">\r\n\r\n[embed]https:\/\/www.youtube.com\/embed\/2bIC4EmejkQ[\/embed]\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Did I get this?<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n[h5p id=\"145\"]\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div>\r\n<p id=\"N10BED\">At this point, we have a good sense of what happens as we take random samples from a population. Our simulation suggests that our initial intuition about the shape and center of the sampling distribution is correct. If the population has a proportion of p, then random samples of the same size drawn from the population will have sample proportions close to p. More specifically, the distribution of sample proportions will have a mean of p. We also observed that for this situation, the sample proportions are approximately normal. We will see later that this is not always the case. But if sample proportions are normally distributed, then the distribution is centered at p. Now we want to use simulation to help us think more about the variability we expect to see in the sample proportions. Our intuition tells us that larger samples will better approximate the population, so we might expect less variability in large samples. In the next walk-through we will use simulations to investigate this idea. After that walk-through, we will tie these ideas to more formal theory.<\/p>\r\n\r\n<div class=\"figurewrap\">\r\n<div class=\"figure clearfix\">\r\n<div id=\"uwrap__i_1\" class=\"youtube\">\r\n\r\n[embed]https:\/\/www.youtube.com\/embed\/tUvXeJ3A3_s?enablejsapi=1&amp;rel=0&amp;vq=large[\/embed]\r\n\r\n<\/div>\r\n<div>\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Did I get this?<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n[h5p id=\"146\"]\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<p id=\"N10B23\">Again, the simulations on the previous page reinforced what makes sense to our intuition. Larger random samples will better approximate the population proportion. When the sample size is large, sample proportions will be closer to p. In other words, the sampling distribution for large samples has less variability. Advanced probability theory confirms our observations and gives a more precise way to describe the standard deviation of the sample proportions. This is described next.<\/p>\r\n\r\n<div id=\"N10B26\" class=\"section\">\r\n<div class=\"sectionContain\">\r\n<h2><span title=\"Quick scroll up\">The Sampling Distribution of the Sample Proportion<\/span><\/h2>\r\n<p id=\"N10B2D\">If repeated random samples of a given size n are taken from a population of values for a categorical variable, where the proportion in the category of interest is p, then the mean of all sample proportions ([latex]\\hat{p}[\/latex]) is the population proportion (p). As for the spread of all sample proportions, theory dictates the behavior much more precisely than saying that there is less spread for larger samples. In fact, the standard deviation of all sample proportions ([latex]\\hat{p}[\/latex]) is exactly<\/p>\r\n[latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}[\/latex]\r\n<p id=\"N10B7A\">Since sample size n appears in the denominator of the square root, the standard deviation does decrease as sample size increases. Finally, the shape of the distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0will be approximately normal as long as the sample size n is large enough. The convention is to require both np and n(1 \u2013 p) to be at least 10.<\/p>\r\n<p id=\"N10B8E\">We can summarize all of the above by the following:<\/p>\r\n<p id=\"N10B91\">[latex]\\hat{p}[\/latex]\u00a0has a normal distribution with a mean of\u00a0[latex]\\mu _{\\hat{p}}=p[\/latex]\u00a0and standard deviation\u00a0[latex]\\sigma _{\\hat{p}}=\\sqrt{\\frac{p(1-p)}{n}}[\/latex]\u00a0(and as long as np and n(1 \u2013 p) are at least 10).<\/p>\r\n<p id=\"N10C06\">Let\u2019s apply this result to our example and see how it compares with our simulation.<\/p>\r\n<p id=\"N10C09\">In our example, n = 25 (sample size) and p = 0.6. Note that np = 15 \u2265 10 and n(1 \u2013 p) = 10 \u2265 10. Therefore we can conclude that\u00a0<span id=\"MathJax-Element-8-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-74\" class=\"mjx-math\"><span id=\"MJXc-Node-75\" class=\"mjx-mrow\"><span id=\"MJXc-Node-76\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-79\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-77\" class=\"mjx-mrow\"><span id=\"MJXc-Node-78\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is approximately a normal distribution with mean p = 0.6 and standard deviation [latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}=\\sqrt{\\frac{0.6\\left(1-0.6\\right)}{25}}=0.097[\/latex]\u00a0(which is very close to what we saw in our simulation).<\/p>\r\n\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Learn by Doing<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nAccording to the National Postsecondary Student Aid Study conducted by the U.S. Department of Education in 2008, 62% of graduates from public universities had student loans.\r\n\r\n[h5p id=\"147\"]\r\n\r\n<img id=\"N10070\" class=\"img-responsive popimg aligncenter\" title=\"four different sample distributions\" src=\"https:\/\/oli.cmu.edu\/repository\/webcontent\/72712ec00a0001dc418a87e73e8ebb77\/_u4_probability\/_m4_sampling_distributions\/webcontent\/tutors\/9_2_img1.gif\" alt=\"four different sample distributions\" \/>\r\n\r\n[h5p id=\"148\"]\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Learn by Doing<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n[h5p id=\"149\"]\r\n\r\n<\/div>\r\n<\/div>\r\n<p id=\"N10B11\">If a sampling distribution is normally shaped, then we can apply the Standard Deviation Rule and use z-scores to determine probabilities. Let\u2019s look at some examples.<\/p>\r\n\r\n<div class=\"examplewrap\">\r\n<div class=\"example clearfix\">\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Example<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<p id=\"N10B16\">A random sample of 100 students is taken from the population of all part-time students in the United States, for which the overall proportion of females is .6.<\/p>\r\n<p id=\"N10B19\">(a) There is a 95% chance that the sample proportion (<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>) falls between what two values? First note that the distribution of\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0has the mean p = .6, standard deviation\u00a0[latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}=\\sqrt{\\frac{0.6\\left(1-0.6\\right)}{100}}=0.05[\/latex], and a shape that is close to normal, since np = 100(.6) = 60 and n(1 \u2013 p) = 100(.4) = 40 are both greater than 10. The Standard Deviation Rule applies: the probability is approximately .95 that\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls within 2 standard deviations of the mean, that is, between 0.6 \u2013 2(.05) and 0.6 + 2(.05). There is roughly a 95% chance that\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls in the interval (.5, .7).<\/p>\r\n<p id=\"N10BD0\">(b) What is the probability that sample proportion\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is less than or equal to .56?<\/p>\r\n<p id=\"N10BE4\">To find\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><\/span><\/span><\/span>, we standardize .56 to z = (.56-.60) \/ .05 = -.80:<\/p>\r\n<p id=\"N10C10\"><span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span class=\"mjx-mi MJXc-space3\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">Z<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2212<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">8<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">2<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">1<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">1<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">9<\/span><\/span><\/span><\/span><\/span><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\nTo see the impact of the sample size on these probability calculations, consider the following variation of our example.\r\n<div class=\"examplewrap\">\r\n<div class=\"example clearfix\">\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Example<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<p id=\"N10C78\">A random sample of\u00a0<em>2,500<\/em>\u00a0students is taken from the population of all part-time students in the United States, for which the overall proportion of females is .6.<\/p>\r\n<p id=\"N10C7E\">(a) There is a 95% chance that the sample proportion (<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-122\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-120\" class=\"mjx-mrow\"><span id=\"MJXc-Node-121\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>) falls between what two values? First note that the distribution of\u00a0<span id=\"MathJax-Element-10-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-123\" class=\"mjx-math\"><span id=\"MJXc-Node-124\" class=\"mjx-mrow\"><span id=\"MJXc-Node-125\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-128\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-126\" class=\"mjx-mrow\"><span id=\"MJXc-Node-127\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0has the mean p = .6, standard deviation\u00a0[latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}=\\sqrt{\\frac{0.6\\left(1-0.6\\right)}{2500}}=0.01[\/latex], and a shape that is close to normal, since np = 2500(.6) = 1500 and n(1 \u2013 p) = 2500(.4) = 1000 are both greater than 10. The standard deviation rule applies: the probability is approximately .95 that\u00a0<span id=\"MathJax-Element-12-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-170\" class=\"mjx-math\"><span id=\"MJXc-Node-171\" class=\"mjx-mrow\"><span id=\"MJXc-Node-172\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-175\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-173\" class=\"mjx-mrow\"><span id=\"MJXc-Node-174\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls within 2 standard deviations of the mean, that is, between 0.6 \u2013 2(.01) and 0.6 + 2(.01). There is roughly a 95% chance that\u00a0<span id=\"MathJax-Element-13-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-176\" class=\"mjx-math\"><span id=\"MJXc-Node-177\" class=\"mjx-mrow\"><span id=\"MJXc-Node-178\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-181\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-179\" class=\"mjx-mrow\"><span id=\"MJXc-Node-180\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls in the interval (.58, .62).<\/p>\r\n<p id=\"N10D38\">(b) What is the probability that sample proportion\u00a0<span id=\"MathJax-Element-14-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-182\" class=\"mjx-math\"><span id=\"MJXc-Node-183\" class=\"mjx-mrow\"><span id=\"MJXc-Node-184\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-187\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-185\" class=\"mjx-mrow\"><span id=\"MJXc-Node-186\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is less than .56?<\/p>\r\n<p id=\"N10D4C\">To find\u00a0<span id=\"MathJax-Element-15-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-188\" class=\"mjx-math\"><span id=\"MJXc-Node-189\" class=\"mjx-mrow\"><span id=\"MJXc-Node-190\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span id=\"MJXc-Node-191\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span id=\"MJXc-Node-192\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-195\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-193\" class=\"mjx-mrow\"><span id=\"MJXc-Node-194\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><span id=\"MJXc-Node-196\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span id=\"MJXc-Node-197\" class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span id=\"MJXc-Node-198\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span id=\"MJXc-Node-199\" class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span id=\"MJXc-Node-200\" class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span id=\"MJXc-Node-201\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><\/span><\/span><\/span>, we standardize .56 to z = (.56 \u2013 .60) \/ .01 = -4.00:<\/p>\r\n<p id=\"N10D78\"><span id=\"MathJax-Element-16-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-202\" class=\"mjx-math\"><span id=\"MJXc-Node-203\" class=\"mjx-mrow\"><span id=\"MJXc-Node-204\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span id=\"MJXc-Node-205\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span id=\"MJXc-Node-206\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-209\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-207\" class=\"mjx-mrow\"><span id=\"MJXc-Node-208\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><span id=\"MJXc-Node-210\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span id=\"MJXc-Node-211\" class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span id=\"MJXc-Node-212\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span id=\"MJXc-Node-213\" class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span id=\"MJXc-Node-214\" class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span id=\"MJXc-Node-215\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span id=\"MJXc-Node-216\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span id=\"MJXc-Node-217\" class=\"mjx-mi MJXc-space3\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span id=\"MJXc-Node-218\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span id=\"MJXc-Node-219\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">Z<\/span><\/span><span id=\"MJXc-Node-220\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span id=\"MJXc-Node-221\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2212<\/span><\/span><span id=\"MJXc-Node-222\" class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">4<\/span><\/span><span id=\"MJXc-Node-223\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span id=\"MJXc-Node-224\" class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span id=\"MJXc-Node-225\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span id=\"MJXc-Node-226\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span id=\"MJXc-Node-227\" class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><\/span><\/span><\/span>, approximately.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"N10DC9\" class=\"section\">\r\n<div class=\"sectionContain\">\r\n<h2><span title=\"Quick scroll up\">Comment<\/span><\/h2>\r\n<p id=\"N10DD0\">As long as the sample is truly random, the distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0is centered at p, no matter what size sample has been taken. Larger samples have less spread. Specifically, when we multiplied the sample size by 25, increasing it from 100 to 2,500, the standard deviation was reduced to 1\/5 of the original standard deviation. Sample proportion strays less from population proportion .6 when the sample is larger: it tends to fall anywhere between .5 and .7 for samples of size 100, whereas it tends to fall between .58 and .62 for samples of size 2,500. It is not so improbable to take a value as low as .56 for samples of 100 (probability is more than 20%) but it is almost impossible to take a value as low as low as .56 for samples of 2,500 (probability is virtually zero).<\/p>\r\n\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<h3 class=\"textbox__title\">Learn by Doing<\/h3>\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\nThe proportion of left-handed people in the general population is about 0.1. Suppose a random sample of 225 people is observed.\r\n\r\n[h5p id=\"150\"]\r\n\r\n<\/div>\r\n<\/div>\r\n<h2><span title=\"Quick scroll up\">Theoretical Comment (Optional)<\/span><\/h2>\r\n<p id=\"N10EA0\">The above results for the distribution of sample proportion\u00a0[latex]\\hat{p}[\/latex]\u00a0are directly related to the results already obtained for the distribution of sample count X in a binomial experiment. Remember that X had mean np, standard deviation\u00a0[latex]\\sqrt{\\mathcal{np}\\left(1-\\mathcal{p}\\right)}[\/latex], and a shape that allowed for normal approximations as long as both np and n(1 \u2013 p) were at least 10. Since sample proportion is\u00a0[latex]\\hat{\\mathcal{p}}=\\frac{\\mathcal{x}}{\\mathcal{n}}[\/latex], we could derive the mean and standard deviation of\u00a0[latex]\\hat{p}[\/latex]\u00a0by applying the Rules for Means and Variances:<\/p>\r\n[latex]\\mu_{\\hat{\\mathcal{p}}}=\\mu_\\frac{\\mathcal{x}}{\\mathcal{n}}=\\ \\frac{1}{\\mathcal{n}}\\mu_\\mathcal{x}=\\frac{1}{\\mathcal{n}}\\left(\\mathcal{np}\\right)[\/latex]\r\n\r\nand\r\n\r\n[latex]\\sigma_\\mathcal{p}^2=\\sigma_{\\frac{\\mathcal{x}}{\\mathcal{n}}}^2=\\frac{1}{\\mathcal{n}^2}\\sigma^2\\mathcal{x}=\\frac{1}{\\mathcal{n}^2}\\left(\\mathcal{np}\\right)\\left(1-\\mathcal{p}\\right)=\\frac{1}{\\mathcal{n}}\\mathcal{p}\\left(1-\\mathcal{p}\\right)[\/latex]\r\n\r\nso\r\n\r\n[latex]\\sigma_{\\hat{\\mathcal{p}}}=\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}[\/latex]\r\n\r\nThe requirements that np and n(1 \u2013 p) be at least 10 are the same, whether we are focusing on the distribution of sample count or the distribution of sample proportion. After all, the shape of\u00a0<span id=\"MathJax-Element-27-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-411\" class=\"mjx-math\"><span id=\"MJXc-Node-412\" class=\"mjx-mrow\"><span id=\"MJXc-Node-413\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-416\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-414\" class=\"mjx-mrow\"><span id=\"MJXc-Node-415\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is the same as the shape of X: the scale of the horizontal axis is just uniformly divided by n.\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>","rendered":"<div id=\"lobjh\" class=\"\">\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<h2 class=\"textbox__title\">Learning Objectives<\/h2>\n<\/header>\n<div class=\"textbox__content\">\n<ul>\n<li id=\"apply_sampling_distribution_of_sample_proportion\">Apply the sampling distribution of the sample proportion (when appropriate). In particular, be able to identify unusual samples from a given population.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<p>The first step to drawing conclusions about parameters based on the accompanying statistics is to understand how sample statistics behave relative to the parameter that summarizes the entire population. We begin with the behavior of sample proportion relative to population proportion (when the variable of interest is categorical). After that, we will explore the behavior of sample mean relative to population mean (when the variable of interest is quantitative).<\/p>\n<\/div>\n<div id=\"N10B0C\" class=\"section\">\n<div class=\"sectionContain\">\n<h2><span title=\"Quick scroll up\">Behavior of Sample Proportion p\u0302<\/span><\/h2>\n<div class=\"examplewrap\">\n<div class=\"example clearfix\">\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Example<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p id=\"N10B17\">Approximately 60% of all part-time college students in the United States are female. (In other words, the population proportion of females among part-time college students is p = .6.) What would you expect to see in terms of the behavior of sample proportion of females ([latex]\\hat{p}[\/latex]) if random samples of size 100 were taken from the population of all part-time college students?<\/p>\n<p id=\"N10B2B\">As we saw before, due to sampling variability, sample proportion in random samples of size 100 will take numerical values which vary according to the laws of chance: in other words, sample proportion is a<em>\u00a0random variable<\/em>. To summarize the behavior of any random variable, we focus on three features of its distribution: the center, the spread, and the shape.<\/p>\n<p id=\"N10B31\">Based only on our intuition, we would expect the following:<\/p>\n<p id=\"N10B35\"><em>Center<\/em>: Some sample proportions will be on the low sidesay, .55 or .58\u2014 while others will be on the high side\u2014 say, .61 or .66. It is reasonable to expect all the sample proportions in repeated random samples to average out to the underlying population proportion, .6. In other words, the mean of the distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0should be p.<\/p>\n<p id=\"N10B4B\"><em>Spread<\/em>: For samples of 100, we would expect sample proportions of females not to stray too far from the population proportion .6. Sample proportions lower than .5 or higher than .7 would be rather surprising. On the other hand, if we were only taking samples of size 10, we would not be at all surprised by a sample proportion of females even as low as 4\/10 = .4, or as high as 8\/10 = .8. Thus, sample size plays a role in the spread of the distribution of sample proportion: there should be less spread for larger samples, more spread for smaller samples.<\/p>\n<p id=\"N10B50\"><em>Shape<\/em>: Sample proportions closest to .6 would be most common, and sample proportions far from .6 in either direction would be progressively less likely. In other words, the shape of the distribution of sample proportion should bulge in the middle and taper at the ends: it should be somewhat\u00a0<em>normal.<\/em><\/p>\n<\/div>\n<\/div>\n<h2>Comment<\/h2>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"N10B5E\" class=\"section\">\n<div class=\"sectionContain\">\n<p id=\"N10B65\">The\u00a0<em>distribution<\/em>\u00a0of the values of the sample proportions ([latex]\\hat{p}[\/latex]) in repeated\u00a0<em>samples<\/em>\u00a0is called the\u00a0<em>sampling distribution of\u00a0<\/em>[latex]\\hat{p}[\/latex].<\/p>\n<p id=\"N10B95\">The purpose of the next activity is to check whether our intuition about the center, spread and shape of the sampling distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0was right via simulations.<\/p>\n<div class=\"figurewrap\">\n<div class=\"figure clearfix\">\n<div id=\"uwrap__i_0\" class=\"youtube\">\n<p><iframe loading=\"lazy\" id=\"oembed-1\" title=\"Behavior of Sample Proportion 1\" width=\"500\" height=\"375\" src=\"https:\/\/www.youtube.com\/embed\/2bIC4EmejkQ?feature=oembed&#38;rel=0&#38;rel=0\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Did I get this?<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<div id=\"h5p-145\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-145\" class=\"h5p-iframe\" data-content-id=\"145\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"7.3 Did I get this?\"><\/iframe><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div>\n<p id=\"N10BED\">At this point, we have a good sense of what happens as we take random samples from a population. Our simulation suggests that our initial intuition about the shape and center of the sampling distribution is correct. If the population has a proportion of p, then random samples of the same size drawn from the population will have sample proportions close to p. More specifically, the distribution of sample proportions will have a mean of p. We also observed that for this situation, the sample proportions are approximately normal. We will see later that this is not always the case. But if sample proportions are normally distributed, then the distribution is centered at p. Now we want to use simulation to help us think more about the variability we expect to see in the sample proportions. Our intuition tells us that larger samples will better approximate the population, so we might expect less variability in large samples. In the next walk-through we will use simulations to investigate this idea. After that walk-through, we will tie these ideas to more formal theory.<\/p>\n<div class=\"figurewrap\">\n<div class=\"figure clearfix\">\n<div id=\"uwrap__i_1\" class=\"youtube\">\n<p>https:\/\/youtube.com\/watch?v=tUvXeJ3A3_s%3Fenablejsapi%3D1%26rel%3D0%26vq%3Dlarge<\/p>\n<\/div>\n<div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Did I get this?<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<div id=\"h5p-146\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-146\" class=\"h5p-iframe\" data-content-id=\"146\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"7.3 Did I get this? 1\"><\/iframe><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p id=\"N10B23\">Again, the simulations on the previous page reinforced what makes sense to our intuition. Larger random samples will better approximate the population proportion. When the sample size is large, sample proportions will be closer to p. In other words, the sampling distribution for large samples has less variability. Advanced probability theory confirms our observations and gives a more precise way to describe the standard deviation of the sample proportions. This is described next.<\/p>\n<div id=\"N10B26\" class=\"section\">\n<div class=\"sectionContain\">\n<h2><span title=\"Quick scroll up\">The Sampling Distribution of the Sample Proportion<\/span><\/h2>\n<p id=\"N10B2D\">If repeated random samples of a given size n are taken from a population of values for a categorical variable, where the proportion in the category of interest is p, then the mean of all sample proportions ([latex]\\hat{p}[\/latex]) is the population proportion (p). As for the spread of all sample proportions, theory dictates the behavior much more precisely than saying that there is less spread for larger samples. In fact, the standard deviation of all sample proportions ([latex]\\hat{p}[\/latex]) is exactly<\/p>\n<p>[latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}[\/latex]<\/p>\n<p id=\"N10B7A\">Since sample size n appears in the denominator of the square root, the standard deviation does decrease as sample size increases. Finally, the shape of the distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0will be approximately normal as long as the sample size n is large enough. The convention is to require both np and n(1 \u2013 p) to be at least 10.<\/p>\n<p id=\"N10B8E\">We can summarize all of the above by the following:<\/p>\n<p id=\"N10B91\">[latex]\\hat{p}[\/latex]\u00a0has a normal distribution with a mean of\u00a0[latex]\\mu _{\\hat{p}}=p[\/latex]\u00a0and standard deviation\u00a0[latex]\\sigma _{\\hat{p}}=\\sqrt{\\frac{p(1-p)}{n}}[\/latex]\u00a0(and as long as np and n(1 \u2013 p) are at least 10).<\/p>\n<p id=\"N10C06\">Let\u2019s apply this result to our example and see how it compares with our simulation.<\/p>\n<p id=\"N10C09\">In our example, n = 25 (sample size) and p = 0.6. Note that np = 15 \u2265 10 and n(1 \u2013 p) = 10 \u2265 10. Therefore we can conclude that\u00a0<span id=\"MathJax-Element-8-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-74\" class=\"mjx-math\"><span id=\"MJXc-Node-75\" class=\"mjx-mrow\"><span id=\"MJXc-Node-76\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-79\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-77\" class=\"mjx-mrow\"><span id=\"MJXc-Node-78\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is approximately a normal distribution with mean p = 0.6 and standard deviation [latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}=\\sqrt{\\frac{0.6\\left(1-0.6\\right)}{25}}=0.097[\/latex]\u00a0(which is very close to what we saw in our simulation).<\/p>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Learn by Doing<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p>According to the National Postsecondary Student Aid Study conducted by the U.S. Department of Education in 2008, 62% of graduates from public universities had student loans.<\/p>\n<div id=\"h5p-147\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-147\" class=\"h5p-iframe\" data-content-id=\"147\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"7.3 Did I get this? 2\"><\/iframe><\/div>\n<\/div>\n<p><img decoding=\"async\" id=\"N10070\" class=\"img-responsive popimg aligncenter\" title=\"four different sample distributions\" src=\"https:\/\/oli.cmu.edu\/repository\/webcontent\/72712ec00a0001dc418a87e73e8ebb77\/_u4_probability\/_m4_sampling_distributions\/webcontent\/tutors\/9_2_img1.gif\" alt=\"four different sample distributions\" \/><\/p>\n<div id=\"h5p-148\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-148\" class=\"h5p-iframe\" data-content-id=\"148\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"7.3  Learn by doing 1\"><\/iframe><\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Learn by Doing<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<div id=\"h5p-149\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-149\" class=\"h5p-iframe\" data-content-id=\"149\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"7.3  Learn by doing 2\"><\/iframe><\/div>\n<\/div>\n<\/div>\n<\/div>\n<p id=\"N10B11\">If a sampling distribution is normally shaped, then we can apply the Standard Deviation Rule and use z-scores to determine probabilities. Let\u2019s look at some examples.<\/p>\n<div class=\"examplewrap\">\n<div class=\"example clearfix\">\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Example<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p id=\"N10B16\">A random sample of 100 students is taken from the population of all part-time students in the United States, for which the overall proportion of females is .6.<\/p>\n<p id=\"N10B19\">(a) There is a 95% chance that the sample proportion (<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>) falls between what two values? First note that the distribution of\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0has the mean p = .6, standard deviation\u00a0[latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}=\\sqrt{\\frac{0.6\\left(1-0.6\\right)}{100}}=0.05[\/latex], and a shape that is close to normal, since np = 100(.6) = 60 and n(1 \u2013 p) = 100(.4) = 40 are both greater than 10. The Standard Deviation Rule applies: the probability is approximately .95 that\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls within 2 standard deviations of the mean, that is, between 0.6 \u2013 2(.05) and 0.6 + 2(.05). There is roughly a 95% chance that\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls in the interval (.5, .7).<\/p>\n<p id=\"N10BD0\">(b) What is the probability that sample proportion\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is less than or equal to .56?<\/p>\n<p id=\"N10BE4\">To find\u00a0<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><\/span><\/span><\/span>, we standardize .56 to z = (.56-.60) \/ .05 = -.80:<\/p>\n<p id=\"N10C10\"><span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span class=\"mjx-mi MJXc-space3\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">Z<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2212<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">8<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">2<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">1<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">1<\/span><\/span><span class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">9<\/span><\/span><\/span><\/span><\/span><\/p>\n<\/div>\n<\/div>\n<p>To see the impact of the sample size on these probability calculations, consider the following variation of our example.<\/p>\n<div class=\"examplewrap\">\n<div class=\"example clearfix\">\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Example<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p id=\"N10C78\">A random sample of\u00a0<em>2,500<\/em>\u00a0students is taken from the population of all part-time students in the United States, for which the overall proportion of females is .6.<\/p>\n<p id=\"N10C7E\">(a) There is a 95% chance that the sample proportion (<span class=\"mjx-chtml MathJax_CHTML\"><span class=\"mjx-math\"><span class=\"mjx-mrow\"><span class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-122\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-120\" class=\"mjx-mrow\"><span id=\"MJXc-Node-121\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>) falls between what two values? First note that the distribution of\u00a0<span id=\"MathJax-Element-10-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-123\" class=\"mjx-math\"><span id=\"MJXc-Node-124\" class=\"mjx-mrow\"><span id=\"MJXc-Node-125\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-128\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-126\" class=\"mjx-mrow\"><span id=\"MJXc-Node-127\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0has the mean p = .6, standard deviation\u00a0[latex]\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}=\\sqrt{\\frac{0.6\\left(1-0.6\\right)}{2500}}=0.01[\/latex], and a shape that is close to normal, since np = 2500(.6) = 1500 and n(1 \u2013 p) = 2500(.4) = 1000 are both greater than 10. The standard deviation rule applies: the probability is approximately .95 that\u00a0<span id=\"MathJax-Element-12-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-170\" class=\"mjx-math\"><span id=\"MJXc-Node-171\" class=\"mjx-mrow\"><span id=\"MJXc-Node-172\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-175\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-173\" class=\"mjx-mrow\"><span id=\"MJXc-Node-174\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls within 2 standard deviations of the mean, that is, between 0.6 \u2013 2(.01) and 0.6 + 2(.01). There is roughly a 95% chance that\u00a0<span id=\"MathJax-Element-13-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-176\" class=\"mjx-math\"><span id=\"MJXc-Node-177\" class=\"mjx-mrow\"><span id=\"MJXc-Node-178\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-181\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-179\" class=\"mjx-mrow\"><span id=\"MJXc-Node-180\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0falls in the interval (.58, .62).<\/p>\n<p id=\"N10D38\">(b) What is the probability that sample proportion\u00a0<span id=\"MathJax-Element-14-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-182\" class=\"mjx-math\"><span id=\"MJXc-Node-183\" class=\"mjx-mrow\"><span id=\"MJXc-Node-184\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-187\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-185\" class=\"mjx-mrow\"><span id=\"MJXc-Node-186\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is less than .56?<\/p>\n<p id=\"N10D4C\">To find\u00a0<span id=\"MathJax-Element-15-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-188\" class=\"mjx-math\"><span id=\"MJXc-Node-189\" class=\"mjx-mrow\"><span id=\"MJXc-Node-190\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span id=\"MJXc-Node-191\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span id=\"MJXc-Node-192\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-195\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-193\" class=\"mjx-mrow\"><span id=\"MJXc-Node-194\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><span id=\"MJXc-Node-196\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span id=\"MJXc-Node-197\" class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span id=\"MJXc-Node-198\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span id=\"MJXc-Node-199\" class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span id=\"MJXc-Node-200\" class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span id=\"MJXc-Node-201\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><\/span><\/span><\/span>, we standardize .56 to z = (.56 \u2013 .60) \/ .01 = -4.00:<\/p>\n<p id=\"N10D78\"><span id=\"MathJax-Element-16-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-202\" class=\"mjx-math\"><span id=\"MJXc-Node-203\" class=\"mjx-mrow\"><span id=\"MJXc-Node-204\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span id=\"MJXc-Node-205\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span id=\"MJXc-Node-206\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-209\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-207\" class=\"mjx-mrow\"><span id=\"MJXc-Node-208\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><span id=\"MJXc-Node-210\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span id=\"MJXc-Node-211\" class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span id=\"MJXc-Node-212\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span id=\"MJXc-Node-213\" class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">5<\/span><\/span><span id=\"MJXc-Node-214\" class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">6<\/span><\/span><span id=\"MJXc-Node-215\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span id=\"MJXc-Node-216\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span id=\"MJXc-Node-217\" class=\"mjx-mi MJXc-space3\"><span class=\"mjx-char MJXc-TeX-math-I\">P<\/span><\/span><span id=\"MJXc-Node-218\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">(<\/span><\/span><span id=\"MJXc-Node-219\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">Z<\/span><\/span><span id=\"MJXc-Node-220\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2264<\/span><\/span><span id=\"MJXc-Node-221\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">\u2212<\/span><\/span><span id=\"MJXc-Node-222\" class=\"mjx-mn\"><span class=\"mjx-char MJXc-TeX-main-R\">4<\/span><\/span><span id=\"MJXc-Node-223\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">.<\/span><\/span><span id=\"MJXc-Node-224\" class=\"mjx-mn MJXc-space1\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><span id=\"MJXc-Node-225\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">)<\/span><\/span><span id=\"MJXc-Node-226\" class=\"mjx-mo MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">=<\/span><\/span><span id=\"MJXc-Node-227\" class=\"mjx-mn MJXc-space3\"><span class=\"mjx-char MJXc-TeX-main-R\">0<\/span><\/span><\/span><\/span><\/span>, approximately.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"N10DC9\" class=\"section\">\n<div class=\"sectionContain\">\n<h2><span title=\"Quick scroll up\">Comment<\/span><\/h2>\n<p id=\"N10DD0\">As long as the sample is truly random, the distribution of\u00a0[latex]\\hat{p}[\/latex]\u00a0is centered at p, no matter what size sample has been taken. Larger samples have less spread. Specifically, when we multiplied the sample size by 25, increasing it from 100 to 2,500, the standard deviation was reduced to 1\/5 of the original standard deviation. Sample proportion strays less from population proportion .6 when the sample is larger: it tends to fall anywhere between .5 and .7 for samples of size 100, whereas it tends to fall between .58 and .62 for samples of size 2,500. It is not so improbable to take a value as low as .56 for samples of 100 (probability is more than 20%) but it is almost impossible to take a value as low as low as .56 for samples of 2,500 (probability is virtually zero).<\/p>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<h3 class=\"textbox__title\">Learn by Doing<\/h3>\n<\/header>\n<div class=\"textbox__content\">\n<p>The proportion of left-handed people in the general population is about 0.1. Suppose a random sample of 225 people is observed.<\/p>\n<div id=\"h5p-150\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-150\" class=\"h5p-iframe\" data-content-id=\"150\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"7.3 Learn by doing 5\"><\/iframe><\/div>\n<\/div>\n<\/div>\n<\/div>\n<h2><span title=\"Quick scroll up\">Theoretical Comment (Optional)<\/span><\/h2>\n<p id=\"N10EA0\">The above results for the distribution of sample proportion\u00a0[latex]\\hat{p}[\/latex]\u00a0are directly related to the results already obtained for the distribution of sample count X in a binomial experiment. Remember that X had mean np, standard deviation\u00a0[latex]\\sqrt{\\mathcal{np}\\left(1-\\mathcal{p}\\right)}[\/latex], and a shape that allowed for normal approximations as long as both np and n(1 \u2013 p) were at least 10. Since sample proportion is\u00a0[latex]\\hat{\\mathcal{p}}=\\frac{\\mathcal{x}}{\\mathcal{n}}[\/latex], we could derive the mean and standard deviation of\u00a0[latex]\\hat{p}[\/latex]\u00a0by applying the Rules for Means and Variances:<\/p>\n<p>[latex]\\mu_{\\hat{\\mathcal{p}}}=\\mu_\\frac{\\mathcal{x}}{\\mathcal{n}}=\\ \\frac{1}{\\mathcal{n}}\\mu_\\mathcal{x}=\\frac{1}{\\mathcal{n}}\\left(\\mathcal{np}\\right)[\/latex]<\/p>\n<p>and<\/p>\n<p>[latex]\\sigma_\\mathcal{p}^2=\\sigma_{\\frac{\\mathcal{x}}{\\mathcal{n}}}^2=\\frac{1}{\\mathcal{n}^2}\\sigma^2\\mathcal{x}=\\frac{1}{\\mathcal{n}^2}\\left(\\mathcal{np}\\right)\\left(1-\\mathcal{p}\\right)=\\frac{1}{\\mathcal{n}}\\mathcal{p}\\left(1-\\mathcal{p}\\right)[\/latex]<\/p>\n<p>so<\/p>\n<p>[latex]\\sigma_{\\hat{\\mathcal{p}}}=\\sqrt{\\frac{\\mathcal{p}\\left(1-\\mathcal{p}\\right)}{\\mathcal{n}}}[\/latex]<\/p>\n<p>The requirements that np and n(1 \u2013 p) be at least 10 are the same, whether we are focusing on the distribution of sample count or the distribution of sample proportion. After all, the shape of\u00a0<span id=\"MathJax-Element-27-Frame\" class=\"mjx-chtml MathJax_CHTML\"><span id=\"MJXc-Node-411\" class=\"mjx-math\"><span id=\"MJXc-Node-412\" class=\"mjx-mrow\"><span id=\"MJXc-Node-413\" class=\"mjx-mover\"><span class=\"mjx-stack\"><span class=\"mjx-over\"><span id=\"MJXc-Node-416\" class=\"mjx-mo\"><span class=\"mjx-char MJXc-TeX-main-R\">\u02c6<\/span><\/span><\/span><span class=\"mjx-op\"><span id=\"MJXc-Node-414\" class=\"mjx-mrow\"><span id=\"MJXc-Node-415\" class=\"mjx-mi\"><span class=\"mjx-char MJXc-TeX-math-I\">p<\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span><\/span>\u00a0is the same as the shape of X: the scale of the horizontal axis is just uniformly divided by n.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"author":150,"menu_order":20,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[48],"contributor":[],"license":[],"class_list":["post-534","chapter","type-chapter","status-publish","hentry","chapter-type-numberless"],"part":419,"_links":{"self":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/534","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/users\/150"}],"version-history":[{"count":10,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/534\/revisions"}],"predecessor-version":[{"id":1002,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/534\/revisions\/1002"}],"part":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/parts\/419"}],"metadata":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapters\/534\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/media?parent=534"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/pressbooks\/v2\/chapter-type?post=534"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/contributor?post=534"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/mat1260\/wp-json\/wp\/v2\/license?post=534"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}