{"id":390,"date":"2022-05-18T16:39:10","date_gmt":"2022-05-18T16:39:10","guid":{"rendered":"https:\/\/pressbooks.ccconline.org\/accintrostats\/chapter\/two-population-means-with-unknown-standard-deviations\/"},"modified":"2022-11-09T16:30:58","modified_gmt":"2022-11-09T16:30:58","slug":"two-population-means-with-unknown-standard-deviations","status":"publish","type":"chapter","link":"https:\/\/pressbooks.ccconline.org\/accintrostats\/chapter\/two-population-means-with-unknown-standard-deviations\/","title":{"raw":"Chapter 11.4: Two Population Means with Unknown Standard Deviations","rendered":"Chapter 11.4: Two Population Means with Unknown Standard Deviations"},"content":{"raw":"&nbsp;\r\n<ol>\r\n \t<li>The two independent samples are simple random samples from two distinct populations.<\/li>\r\n \t<li>For the two distinct populations:\r\n<ul id=\"fs-idm39244848\">\r\n \t<li>if the sample sizes are small, the distributions are important (should be normal)<\/li>\r\n \t<li>if the sample sizes are large, the distributions are not important (need not be normal)<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ol>\r\n<div id=\"fs-idm107349216\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">NOTE<\/div>\r\n<p id=\"fs-idm165699296\">The test comparing two independent population means with unknown and possibly unequal population standard deviations is called the Aspin-Welch t-test. The degrees of freedom formula was developed by Aspin-Welch.<\/p>\r\n\r\n<\/div>\r\nThe comparison of two population means is very common. A difference between the two samples depends on both the means and the standard deviations. Very different means can occur by chance if there is great variation among the individual samples. In order to account for the variation, we take the difference of the sample means, \\({\\overline{X}}_{1}\\) \u2013 \\({\\overline{X}}_{2}\\), and divide by the standard error in order to standardize the difference. The result is a t-score test statistic.\r\n\r\nBecause we do not know the population standard deviations, we estimate them using the two sample standard deviations from our independent samples. For the hypothesis test, we calculate the estimated standard deviation, or <span data-type=\"term\">standard error<\/span>, of <strong>the difference in sample means<\/strong>, \\({\\overline{X}}_{1}\\) \u2013 \\({\\overline{X}}_{2}\\).\r\n<div id=\"std_err\" data-type=\"equation\"><span data-type=\"title\">The standard error is:<\/span>\\(\\sqrt{\\frac{\\left({s}_{1}{\\right)}^{2}}{{n}_{1}}+\\frac{\\left({s}_{2}{\\right)}^{2}}{{n}_{2}}}\\)<\/div>\r\nThe test statistic (<em data-effect=\"italics\">t<\/em>-score) is calculated as follows:\r\n<div id=\"t-score2\" data-type=\"equation\">\\(\\frac{\\text{(}{\\overline{x}}_{1}\u2013{\\overline{x}}_{2}\\text{)}\u2013\\text{(}{\\mu }_{1}\u2013{\\mu }_{2}\\text{)}}{\\sqrt{\\frac{{\\text{(}{s}_{1}\\text{)}}^{2}}{{n}_{1}}+\\frac{{\\text{(}{s}_{2}\\text{)}}^{2}}{{n}_{2}}}}\\)<\/div>\r\n<div data-type=\"list\">\r\n<div data-type=\"title\">where:<\/div>\r\n<ul>\r\n \t<li><em data-effect=\"italics\">s<\/em><sub>1<\/sub> and <em data-effect=\"italics\">s<\/em><sub>2<\/sub>, the sample standard deviations, are estimates of <em data-effect=\"italics\">\u03c3<\/em><sub>1<\/sub> and <em data-effect=\"italics\">\u03c3<\/em><sub>2<\/sub>, respectively.<\/li>\r\n \t<li><em data-effect=\"italics\">\u03c3<\/em><sub>1<\/sub> and <em data-effect=\"italics\">\u03c3<\/em><sub>1<\/sub> are the unknown population standard deviations.<\/li>\r\n \t<li>\\({\\overline{x}}_{1}\\) and \\({\\overline{x}}_{2}\\) are the sample means. <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> and <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> are the population means.<\/li>\r\n<\/ul>\r\n<\/div>\r\n<p id=\"element-256\" class=\"finger\">The number of <span data-type=\"term\">degrees of freedom (<em data-effect=\"italics\">df<\/em>)<\/span> requires a somewhat complicated calculation. However, a computer or calculator calculates it easily. The <em data-effect=\"italics\">df<\/em> are not always a whole number. The test statistic calculated previously is approximated by the Student's <em data-effect=\"italics\">t<\/em>-distribution with <em data-effect=\"italics\">df<\/em> as follows:<\/p>\r\n\r\n<div id=\"eq_df\" data-type=\"equation\"><span data-type=\"title\">Degrees of freedom<\/span>\\(df=\\frac{{\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}{\\left(\\frac{1}{{n}_{1}\u20131}\\right){\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}\\right)}^{2}+\\left(\\frac{1}{{n}_{2}\u20131}\\right){\\left(\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}\\)<\/div>\r\n<p id=\"element-316\">When both sample sizes <em data-effect=\"italics\">n<\/em><sub>1<\/sub> and <em data-effect=\"italics\">n<\/em><sub>2<\/sub> are five or larger, the Student's <em data-effect=\"italics\">t<\/em> approximation is very good. Notice that the sample variances (<em data-effect=\"italics\">s<\/em><sub>1<\/sub>)<sup>2<\/sup> and (<em data-effect=\"italics\">s<\/em><sub>2<\/sub>)<sup>2<\/sup> are not pooled. (If the question comes up, do not pool the variances.)<\/p>\r\n\r\n<div id=\"fs-idm74088160\" class=\"finger\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">NOTE<\/div>\r\n<p id=\"fs-idm378573120\">It is not necessary to compute this by hand. A calculator or computer easily computes it.<\/p>\r\n\r\n<\/div>\r\n<div class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div data-type=\"title\">Independent groups<\/div>\r\nThe average amount of time boys and girls aged seven to 11 spend playing sports each day is believed to be the same. A study is done and data are collected, resulting in the data in <a class=\"autogenerated-content\" href=\"#uid888\">(Figure)<\/a>. Each populations has a normal distribution.\r\n<table id=\"uid888\" summary=\"This table presents the sample size in the second column, average hours a day in the third column, and the sample standard deviation in the fourth column. The first row is for girls and the second row is for boys.\">\r\n<thead valign=\"top\">\r\n<tr>\r\n<th data-align=\"center\"><\/th>\r\n<th data-align=\"center\">Sample Size<\/th>\r\n<th data-align=\"center\">Average Number of Hours Playing Sports Per Day<\/th>\r\n<th data-align=\"center\">Sample Standard Deviation<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody valign=\"top\">\r\n<tr>\r\n<td>Girls<\/td>\r\n<td>9<\/td>\r\n<td>2<\/td>\r\n<td>\\(0.866\\)<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Boys<\/td>\r\n<td>16<\/td>\r\n<td>3.2<\/td>\r\n<td>1.00<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<div data-type=\"exercise\">\r\n<div id=\"id6322791\" data-type=\"problem\">\r\n\r\nIs there a difference in the mean amount of time boys and girls aged seven to 11 play sports each day? Test at the 5% level of significance.\r\n\r\n<\/div>\r\n<div id=\"id21329984\" data-type=\"solution\">\r\n\r\n<strong>The population standard deviations are not known.<\/strong> Let <em data-effect=\"italics\">g<\/em> be the subscript for girls and <em data-effect=\"italics\">b<\/em> be the subscript for boys. Then, <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> is the population mean for girls and <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em> is the population mean for boys. This is a test of two <strong>independent groups<\/strong>, two population <strong>means<\/strong>.\r\n\r\n<span data-type=\"term\">Random variable<\/span>: \\({\\overline{X}}_{g}-{\\overline{X}}_{b}\\) = difference in the sample mean amount of time girls and boys play sports each day. <span data-type=\"newline\">\r\n<\/span><em data-effect=\"italics\">H<\/em><sub>0<\/sub>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em>\u2003\u2003<em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> \u2013 <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em> = 0 <span data-type=\"newline\">\r\n<\/span><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> \u2260 <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em>\u2003\u2003<em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> \u2013 <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em> \u2260 0 <span data-type=\"newline\">\r\n<\/span>The words <strong>\"the same\"<\/strong> tell you <em data-effect=\"italics\">H<sub>0<\/sub><\/em> has an \"=\". Since there are no other words to indicate <em data-effect=\"italics\">H<sub>a<\/sub><\/em>, assume it says <strong>\"is different.\"<\/strong> This is a two-tailed test.\r\n\r\n<strong>Distribution for the test:<\/strong> Use <em data-effect=\"italics\">t<sub>df<\/sub><\/em> where <em data-effect=\"italics\">df<\/em> is calculated using the <em data-effect=\"italics\">df<\/em> formula for independent groups, two population means. Using a calculator, <em data-effect=\"italics\">df<\/em> is approximately 18.8462. <strong>Do not pool the variances.<\/strong>\r\n\r\n<strong>Calculate the <em data-effect=\"italics\">p<\/em>-value using a Student's <em data-effect=\"italics\">t<\/em>-distribution:<\/strong><em data-effect=\"italics\">p<\/em>-value = 0.0054\r\n<p id=\"element-90\"><strong>Graph:<\/strong><\/p>\r\n\r\n<div id=\"hyptest22_cmp1\" class=\"bc-figure figure\"><span id=\"id18963169\" data-type=\"media\" data-alt=\"This is a normal distribution curve representing the difference in the average amount of time girls and boys play sports all day. The mean is equal to zero, and the values -1.2, 0, and 1.2 are labeled on the horizontal axis. Two vertical lines extend from -1.2 and 1.2 to the curve. The region to the left of x = -1.2 and the region to the right of x = 1.2 are shaded to represent the p-value. The area of each region is 0.0028.\"><img src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/05\/fig-ch10_02_01-1.jpg\" alt=\"This is a normal distribution curve representing the difference in the average amount of time girls and boys play sports all day. The mean is equal to zero, and the values -1.2, 0, and 1.2 are labeled on the horizontal axis. Two vertical lines extend from -1.2 and 1.2 to the curve. The region to the left of x = -1.2 and the region to the right of x = 1.2 are shaded to represent the p-value. The area of each region is 0.0028.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\r\n<span data-type=\"newline\">\r\n<\/span>\\({s}_{g}=0.866\\)<span data-type=\"newline\">\r\n<\/span>\\({s}_{b}=1\\)<span data-type=\"newline\">\r\n<\/span>So, \\({\\overline{x}}_{g}\u2013{\\overline{x}}_{b}\\) = 2 \u2013 3.2 = \u20131.2 <span data-type=\"newline\">\r\n<\/span>Half the <em data-effect=\"italics\">p<\/em>-value is below \u20131.2 and half is above 1.2.\r\n\r\n<strong>Make a decision:<\/strong> Since <em data-effect=\"italics\">\u03b1<\/em> &gt; <em data-effect=\"italics\">p<\/em>-value, reject <em data-effect=\"italics\">H<sub>0<\/sub><\/em>. This means you reject <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em>. The means are different.\r\n<div id=\"fs-idp61535456\" class=\"statistics calculator\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<p id=\"fs-idp144406384\">Press <code>STAT<\/code>. Arrow over to <code>TESTS<\/code> and press <code>4:2-SampTTest<\/code>. Arrow over to Stats and press <code>ENTER<\/code>. Arrow down and enter <code>2<\/code> for the first sample mean, <code>\\(0.866\\)<\/code> for Sx1, <code>9<\/code> for n1, <code>3.2<\/code> for the second sample mean, <code>1<\/code> for Sx2, and <code>16<\/code> for n2. Arrow down to \u03bc1: and arrow to <code>does not equal<\/code> \u03bc2. Press <code>ENTER<\/code>. Arrow down to Pooled: and <code>No<\/code>. Press <code>ENTER<\/code>. Arrow down to <code>Calculate<\/code> and press <code>ENTER<\/code>. The <em data-effect=\"italics\">p<\/em>-value is <em data-effect=\"italics\">p<\/em> = 0.0054, the dfs are approximately 18.8462, and the test statistic is -3.14. Do the procedure again but instead of Calculate do Draw.<\/p>\r\n\r\n<\/div>\r\n<strong>Conclusion:<\/strong> At the 5% level of significance, the sample data show there is sufficient evidence to conclude that the mean number of hours that girls and boys aged seven to 11 play sports per day is different (mean number of hours boys aged seven to 11 play sports per day is greater than the mean number of hours played by girls OR the mean number of hours girls aged seven to 11 play sports per day is greater than the mean number of hours played by boys).\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp67783984\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"fs-idm235264144\" data-type=\"exercise\">\r\n<div id=\"fs-idm162906288\" data-type=\"problem\">\r\n<p id=\"fs-idm193306272\">Two samples are shown in <a class=\"autogenerated-content\" href=\"#fs-idm99631856\">(Figure)<\/a>. Both have normal distributions. The means for the two populations are thought to be the same. Is there a difference in the means? Test at the 5% level of significance.<\/p>\r\n\r\n<table id=\"fs-idm99631856\" summary=\"\">\r\n<thead>\r\n<tr>\r\n<th><\/th>\r\n<th>Sample Size<\/th>\r\n<th>Sample Mean<\/th>\r\n<th>Sample Standard Deviation<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>Population A<\/td>\r\n<td>25<\/td>\r\n<td>5<\/td>\r\n<td>1<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Population B<\/td>\r\n<td>16<\/td>\r\n<td>4.7<\/td>\r\n<td>1.2<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm50414208\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">NOTE<\/div>\r\n<p id=\"fs-idm204425680\">When the sum of the sample sizes is larger than 30 (<em data-effect=\"italics\">n<\/em><sub>1<\/sub> + <em data-effect=\"italics\">n<\/em><sub>2<\/sub> &gt; 30) you can use the normal distribution to approximate the Student's <em data-effect=\"italics\">t<\/em>.<\/p>\r\n\r\n<\/div>\r\n<div class=\"textbox textbox--examples\" data-type=\"example\">\r\n<p id=\"element-980\">A study is done by a community group in two neighboring colleges to determine which one graduates students with more math classes. College A samples 11 graduates. Their average is four math classes with a standard deviation of 1.5 math classes. College B samples nine graduates. Their average is 3.5 math classes with a standard deviation of one math class. The community group believes that a student who graduates from college A <strong>has taken more math classes,<\/strong> on the average. Both populations have a normal distribution. Test at a 1% significance level. Answer the following questions.<\/p>\r\n&nbsp;\r\n<div id=\"fs-idm178911728\" data-type=\"exercise\">\r\n<div id=\"fs-idm177203296\" data-type=\"problem\">\r\n<p id=\"fs-idm171395200\">a. Is this a test of two means or two proportions?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm36568576\" data-type=\"solution\">\r\n<p id=\"fs-idm78962816\">a. two means<\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm194746624\" data-type=\"exercise\">\r\n<div id=\"fs-idm227630576\" data-type=\"problem\">\r\n<p id=\"fs-idm227630448\">b. Are the populations standard deviations known or unknown?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm217237904\" data-type=\"solution\">\r\n<p id=\"fs-idm217237776\">b. unknown<\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm164304912\" data-type=\"exercise\">\r\n<div id=\"fs-idm35318288\" data-type=\"problem\">\r\n<p id=\"fs-idm220657088\">c. Which distribution do you use to perform the test?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm153443152\" data-type=\"solution\">\r\n<p id=\"fs-idm145309056\">c. Student's <em data-effect=\"italics\">t<\/em><\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm130995312\" data-type=\"exercise\">\r\n<div id=\"fs-idm143520064\" data-type=\"problem\">\r\n<p id=\"fs-idm143519936\">d. What is the random variable?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm222952320\" data-type=\"solution\">\r\n<p id=\"fs-idm222952192\">d. \\({\\overline{X}}_{A}-{\\overline{X}}_{B}\\)<\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm301029872\" data-type=\"exercise\">\r\n<div id=\"fs-idm176701568\" data-type=\"problem\">\r\n<p id=\"fs-idm151272736\">e. What are the null and alternate hypotheses? Write the null and alternate hypotheses in words and in symbols.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm147502464\" data-type=\"solution\">\r\n<p id=\"fs-idm206451408\">e.<\/p>\r\n\r\n<ul id=\"fs-idp21552016\">\r\n \t<li>\\({H}_{o}:{\\mu }_{A}\\le {\\mu }_{B}\\)<\/li>\r\n \t<li>\\({H}_{a}:{\\mu }_{A}&gt;{\\mu }_{B}\\)<\/li>\r\n<\/ul>\r\n&nbsp;\r\n\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm39370224\" data-type=\"exercise\">\r\n<div id=\"fs-idm154517184\" data-type=\"problem\">\r\n<p id=\"fs-idm154517056\">f. Is this test right-, left-, or two-tailed?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm168558752\" data-type=\"solution\">\r\n<p id=\"fs-idm168558624\">f.<\/p>\r\n\r\n<div id=\"eip-idm29208032\" class=\"bc-figure figure\"><span id=\"eip-idm4786768\" data-type=\"media\" data-alt=\"This is a normal distribution curve with mean equal to 0. A vertical line near the tail of the curve to the right of zero extends from the axis to the curve. The region under the curve to the right of the line is shaded.\"><img src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/CNX_Stats_C10_M02_001anno-1.jpg\" alt=\"This is a normal distribution curve with mean equal to 0. A vertical line near the tail of the curve to the right of zero extends from the axis to the curve. The region under the curve to the right of the line is shaded.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\r\n<span data-type=\"newline\">\r\n<\/span>right\r\n\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm146752960\" data-type=\"exercise\">\r\n<div id=\"fs-idm148661808\" data-type=\"problem\">\r\n<p id=\"fs-idm148661680\">g. What is the <em data-effect=\"italics\">p<\/em>-value?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm79675952\" data-type=\"solution\">\r\n<p id=\"fs-idm106536048\">g. 0.1928<\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm176691440\" data-type=\"exercise\">\r\n<div id=\"fs-idp29101808\" data-type=\"problem\">\r\n<p id=\"fs-idp29101936\">h. Do you reject or not reject the null hypothesis?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm36842944\" data-type=\"solution\">\r\n<p id=\"fs-idm36842816\">h. Do not reject.<\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm65144592\" data-type=\"exercise\">\r\n<div id=\"fs-idm153855872\" data-type=\"problem\">\r\n<p id=\"fs-idm41056320\">i. <strong>Conclusion:<\/strong><\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm31976880\" data-type=\"solution\">\r\n<p id=\"fs-idm31976752\">i. At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student who graduates from college A has taken more math classes, on the average, than a student who graduates from college B.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp145514592\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<div id=\"eip-idp166826064\" data-type=\"exercise\">\r\n<div id=\"eip-idp166826320\" data-type=\"problem\">\r\n<p id=\"fs-idm170467472\">A study is done to determine if Company A retains its workers longer than Company B. Company A samples 15 workers, and their average time with the company is five years with a standard deviation of 1.2. Company B samples 20 workers, and their average time with the company is 4.5 years with a standard deviation of 0.8. The populations are normally distributed.<\/p>\r\n\r\n<ol id=\"fs-idp55515664\" type=\"a\">\r\n \t<li>Are the population standard deviations known?<\/li>\r\n \t<li>Conduct an appropriate hypothesis test. At the 5% significance level, what is your conclusion?<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm22935408\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<p id=\"fs-idm10086800\">A professor at a large community college wanted to determine whether there is a difference in the means of final exam scores between students who took his statistics course online and the students who took his face-to-face statistics class. He believed that the mean of the final exam scores for the online class would be lower than that of the face-to-face class. Was the professor correct? The randomly selected 30 final exam scores from each group are listed in <a class=\"autogenerated-content\" href=\"#fs-idm34056944\">(Figure)<\/a> and <a class=\"autogenerated-content\" href=\"#fs-idm124378288\">(Figure)<\/a>.<\/p>\r\n\r\n<table id=\"fs-idm34056944\" summary=\"\"><caption><span data-type=\"title\">Online Class<\/span><\/caption>\r\n<tbody>\r\n<tr>\r\n<td>67.6<\/td>\r\n<td>41.2<\/td>\r\n<td>85.3<\/td>\r\n<td>55.9<\/td>\r\n<td>82.4<\/td>\r\n<td>91.2<\/td>\r\n<td>73.5<\/td>\r\n<td>94.1<\/td>\r\n<td>64.7<\/td>\r\n<td>64.7<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>70.6<\/td>\r\n<td>38.2<\/td>\r\n<td>61.8<\/td>\r\n<td>88.2<\/td>\r\n<td>70.6<\/td>\r\n<td>58.8<\/td>\r\n<td>91.2<\/td>\r\n<td>73.5<\/td>\r\n<td>82.4<\/td>\r\n<td>35.5<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>94.1<\/td>\r\n<td>88.2<\/td>\r\n<td>64.7<\/td>\r\n<td>55.9<\/td>\r\n<td>88.2<\/td>\r\n<td>97.1<\/td>\r\n<td>85.3<\/td>\r\n<td>61.8<\/td>\r\n<td>79.4<\/td>\r\n<td>79.4<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<table id=\"fs-idm124378288\" summary=\"\"><caption><span data-type=\"title\">Face-to-face Class<\/span><\/caption>\r\n<tbody>\r\n<tr>\r\n<td>77.9<\/td>\r\n<td>95.3<\/td>\r\n<td>81.2<\/td>\r\n<td>74.1<\/td>\r\n<td>98.8<\/td>\r\n<td>88.2<\/td>\r\n<td>85.9<\/td>\r\n<td>92.9<\/td>\r\n<td>87.1<\/td>\r\n<td>88.2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>69.4<\/td>\r\n<td>57.6<\/td>\r\n<td>69.4<\/td>\r\n<td>67.1<\/td>\r\n<td>97.6<\/td>\r\n<td>85.9<\/td>\r\n<td>88.2<\/td>\r\n<td>91.8<\/td>\r\n<td>78.8<\/td>\r\n<td>71.8<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>98.8<\/td>\r\n<td>61.2<\/td>\r\n<td>92.9<\/td>\r\n<td>90.6<\/td>\r\n<td>97.6<\/td>\r\n<td>100<\/td>\r\n<td>95.3<\/td>\r\n<td>83.5<\/td>\r\n<td>92.9<\/td>\r\n<td>89.4<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<div id=\"fs-idm298748224\" data-type=\"exercise\">\r\n<div id=\"fs-idm181409216\" data-type=\"problem\">\r\n<p id=\"fs-idm28696464\">Is the mean of the Final Exam scores of the online class lower than the mean of the Final Exam scores of the face-to-face class? Test at a 5% significance level. Answer the following questions:<\/p>\r\n\r\n<ol id=\"fs-idp180387984\" type=\"a\">\r\n \t<li>Is this a test of two means or two proportions?<\/li>\r\n \t<li>Are the population standard deviations known or unknown?<\/li>\r\n \t<li>Which distribution do you use to perform the test?<\/li>\r\n \t<li>What is the random variable?<\/li>\r\n \t<li>What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in symbols.<\/li>\r\n \t<li>Is this test right, left, or two tailed?<\/li>\r\n \t<li>What is the <em data-effect=\"italics\">p<\/em>-value?<\/li>\r\n \t<li>Do you reject or not reject the null hypothesis?<\/li>\r\n \t<li>At the ___ level of significance, from the sample data, there ______ (is\/is not) sufficient evidence to conclude that ______.<\/li>\r\n<\/ol>\r\n<p id=\"fs-idp169691824\">(See the conclusion in <a class=\"autogenerated-content\" href=\"#element-968\">(Figure)<\/a>, and write yours in a similar fashion)<\/p>\r\n\r\n<div id=\"fs-idm11616112\" class=\"statistics calculator\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<p id=\"fs-idm206958928\">First put the data for each group into two lists (such as L1 and L2). Press STAT. Arrow over to TESTS and press 4:2SampTTest. Make sure Data is highlighted and press ENTER. Arrow down and enter L1 for the first list and L2 for the second list. Arrow down to <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub>: and arrow to \u2260 <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> (does not equal). Press ENTER. Arrow down to Pooled: No. Press ENTER. Arrow down to Calculate and press ENTER.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm107478368\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Note<\/div>\r\n<p id=\"fs-idm218153776\">Be careful not to mix up the information for Group 1 and Group 2!<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm168184384\" data-type=\"solution\">\r\n<ol id=\"fs-idm11713040\" type=\"a\">\r\n \t<li>two means<\/li>\r\n \t<li>unknown<\/li>\r\n \t<li>Student's <em data-effect=\"italics\">t<\/em><\/li>\r\n \t<li>\\({\\overline{X}}_{1}\u2013{\\overline{X}}_{2}\\)<\/li>\r\n \t<li>\r\n<ol id=\"fs-idm109578096\">\r\n \t<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em> Null hypothesis: the means of the final exam scores are equal for the online and face-to-face statistics classes.<\/li>\r\n \t<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> &lt; <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em> Alternative hypothesis: the mean of the final exam scores of the online class is less than the mean of the final exam scores of the face-to-face class.<\/li>\r\n<\/ol>\r\n<\/li>\r\n \t<li>left-tailed<\/li>\r\n \t<li><em data-effect=\"italics\">p<\/em>-value = 0.0011\r\n<div id=\"fs-idm98197120\" class=\"bc-figure figure\"><span id=\"fs-idp598000\" data-type=\"media\" data-alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0011.\"><img src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/CNX_Stats_C10_M02_002annoN-1.jpg\" alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0011.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div><\/li>\r\n \t<li>Reject the null hypothesis<\/li>\r\n \t<li>The professor was correct. The evidence shows that the mean of the final exam scores for the online class is lower than that of the face-to-face class. <span data-type=\"newline\">\r\n<\/span>At the <u data-effect=\"underline\">5%<\/u> level of significance, from the sample data, there is (is\/is not) sufficient evidence to conclude that the mean of the final exam scores for the online class is less than <u data-effect=\"underline\">the mean of final exam scores of the face-to-face class.<\/u><\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<span data-type=\"title\">Cohen's Standards for Small, Medium, and Large Effect Sizes<\/span><span data-type=\"term\">Cohen's <em data-effect=\"italics\">d<\/em><\/span> is a measure of effect size based on the differences between two means. Cohen\u2019s <em data-effect=\"italics\">d<\/em>, named for United States statistician Jacob Cohen, measures the relative strength of the differences between the means of two populations based on sample data. The calculated value of effect size is then compared to Cohen\u2019s standards of small, medium, and large effect sizes.\r\n<table id=\"fs-idp93235584\" summary=\"\"><caption><span data-type=\"title\">Cohen's Standard Effect Sizes<\/span><\/caption>\r\n<thead>\r\n<tr>\r\n<th>Size of effect<\/th>\r\n<th><em data-effect=\"italics\">d<\/em><\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>Small<\/td>\r\n<td>0.2<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>medium<\/td>\r\n<td>0.5<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Large<\/td>\r\n<td>0.8<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<p id=\"fs-idm52480352\">Cohen's <em data-effect=\"italics\">d<\/em> is the measure of the difference between two means divided by the pooled standard deviation: \\(d=\\frac{{\\overline{x}}_{1}\u2013{\\overline{x}}_{2}}{{s}_{pooled}}\\) where \\({s}_{pooled}=\\sqrt{\\frac{\\left({n}_{1}\u20131\\right){s}_{1}^{2}+\\left({n}_{2}\u20131\\right){s}_{2}^{2}}{{n}_{1}+{n}_{2}\u20132}}\\)<\/p>\r\n\r\n<div id=\"fs-idm100194752\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"fs-idm122689856\" data-type=\"exercise\">\r\n<div id=\"fs-idm122689600\" data-type=\"problem\">\r\n<p id=\"fs-idm1441280\">Calculate Cohen\u2019s <em data-effect=\"italics\">d<\/em> for <a class=\"autogenerated-content\" href=\"#element-968\">(Figure)<\/a>. Is the size of the effect small, medium, or large? Explain what the size of the effect means for this problem.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm42685344\" data-type=\"solution\">\r\n<p id=\"fs-idp179952160\"><em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> = 4 <em data-effect=\"italics\">s<\/em><sub>1<\/sub> = 1.5 <em data-effect=\"italics\">n<\/em><sub>1<\/sub> = 11 <span data-type=\"newline\">\r\n<\/span><em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> = 3.5 <em data-effect=\"italics\">s<\/em><sub>2<\/sub> = 1 <em data-effect=\"italics\">n<\/em><sub>2<\/sub> = 9 <span data-type=\"newline\">\r\n<\/span><em data-effect=\"italics\">d<\/em> = 0.384 <span data-type=\"newline\">\r\n<\/span>The effect is small because 0.384 is between Cohen\u2019s value of 0.2 for small effect size and 0.5 for medium effect size. The size of the differences of the means for the two colleges is small indicating that there is not a significant difference between them.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm112602048\" class=\"textbox textbox--examples\" data-type=\"example\">\r\n<div id=\"fs-idm163588752\" data-type=\"exercise\">\r\n<div id=\"fs-idm118829104\" data-type=\"problem\">\r\n<p id=\"fs-idm174261728\">Calculate Cohen\u2019s <em data-effect=\"italics\">d<\/em> for <a class=\"autogenerated-content\" href=\"#fs-idm22935408\">(Figure)<\/a>. Is the size of the effect small, medium or large? Explain what the size of the effect means for this problem.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm177693104\" data-type=\"solution\">\r\n<p id=\"fs-idm86592608\"><em data-effect=\"italics\">d<\/em> = 0.834; Large, because 0.834 is greater than Cohen\u2019s 0.8 for a large effect size. The size of the differences between the means of the Final Exam scores of online students and students in a face-to-face class is large indicating a significant difference.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp145071264\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">Try It<\/div>\r\n<p id=\"fs-idm62120944\">Weighted alpha is a measure of risk-adjusted performance of stocks over a period of a year. A high positive weighted alpha signifies a stock whose price has risen while a small positive weighted alpha indicates an unchanged stock price during the time period. Weighted alpha is used to identify companies with strong upward or downward trends. The weighted alpha for the top 30 stocks of banks in the northeast and in the west as identified by Nasdaq on May 24, 2013 are listed in <a class=\"autogenerated-content\" href=\"#fs-idm242901296\">(Figure)<\/a> and <a class=\"autogenerated-content\" href=\"#fs-idm167801152\">(Figure)<\/a>, respectively.<\/p>\r\n\r\n<table id=\"fs-idm242901296\" summary=\"\"><caption><span data-type=\"title\">Northeast<\/span><\/caption>\r\n<tbody>\r\n<tr>\r\n<td>94.2<\/td>\r\n<td>75.2<\/td>\r\n<td>69.6<\/td>\r\n<td>52.0<\/td>\r\n<td>48.0<\/td>\r\n<td>41.9<\/td>\r\n<td>36.4<\/td>\r\n<td>33.4<\/td>\r\n<td>31.5<\/td>\r\n<td>27.6<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>77.3<\/td>\r\n<td>71.9<\/td>\r\n<td>67.5<\/td>\r\n<td>50.6<\/td>\r\n<td>46.2<\/td>\r\n<td>38.4<\/td>\r\n<td>35.2<\/td>\r\n<td>33.0<\/td>\r\n<td>28.7<\/td>\r\n<td>26.5<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>76.3<\/td>\r\n<td>71.7<\/td>\r\n<td>56.3<\/td>\r\n<td>48.7<\/td>\r\n<td>43.2<\/td>\r\n<td>37.6<\/td>\r\n<td>33.7<\/td>\r\n<td>31.8<\/td>\r\n<td>28.5<\/td>\r\n<td>26.0<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<table id=\"fs-idm167801152\" summary=\"\"><caption><span data-type=\"title\">West<\/span><\/caption>\r\n<tbody>\r\n<tr>\r\n<td>126.0<\/td>\r\n<td>70.6<\/td>\r\n<td>65.2<\/td>\r\n<td>51.4<\/td>\r\n<td>45.5<\/td>\r\n<td>37.0<\/td>\r\n<td>33.0<\/td>\r\n<td>29.6<\/td>\r\n<td>23.7<\/td>\r\n<td>22.6<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>116.1<\/td>\r\n<td>70.6<\/td>\r\n<td>58.2<\/td>\r\n<td>51.2<\/td>\r\n<td>43.2<\/td>\r\n<td>36.0<\/td>\r\n<td>31.4<\/td>\r\n<td>28.7<\/td>\r\n<td>23.5<\/td>\r\n<td>21.6<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>78.2<\/td>\r\n<td>68.2<\/td>\r\n<td>55.6<\/td>\r\n<td>50.3<\/td>\r\n<td>39.0<\/td>\r\n<td>34.1<\/td>\r\n<td>31.0<\/td>\r\n<td>25.3<\/td>\r\n<td>23.4<\/td>\r\n<td>21.5<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<div id=\"fs-idm120507648\" data-type=\"exercise\">\r\n<div id=\"eip-idp51089200\" data-type=\"problem\">\r\n<p id=\"fs-idm100910544\">Is there a difference in the weighted alpha of the top 30 stocks of banks in the northeast and in the west? Test at a 5% significance level. Answer the following questions:<\/p>\r\n\r\n<ol id=\"fs-idp33249296\" type=\"a\">\r\n \t<li>Is this a test of two means or two proportions?<\/li>\r\n \t<li>Are the population standard deviations known or unknown?<\/li>\r\n \t<li>Which distribution do you use to perform the test?<\/li>\r\n \t<li>What is the random variable?<\/li>\r\n \t<li>What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in symbols.<\/li>\r\n \t<li>Is this test right, left, or two tailed?<\/li>\r\n \t<li>What is the <em data-effect=\"italics\">p<\/em>-value?<\/li>\r\n \t<li>Do you reject or not reject the null hypothesis?<\/li>\r\n \t<li>At the ___ level of significance, from the sample data, there ______ (is\/is not) sufficient evidence to conclude that ______.<\/li>\r\n \t<li>Calculate Cohen\u2019s <em data-effect=\"italics\">d<\/em> and interpret it.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"footnotes\" data-depth=\"1\">\r\n<h3 data-type=\"title\">References<\/h3>\r\nData from Graduating Engineer + Computer Careers. Available online at http:\/\/www.graduatingengineer.com\r\n<p id=\"fs-idp6735712\">Data from <em data-effect=\"italics\">Microsoft Bookshelf<\/em>.<\/p>\r\n<p id=\"fs-idm54110288\">Data from the United States Senate website, available online at www.Senate.gov (accessed June 17, 2013).<\/p>\r\n<p id=\"fs-idp26924704\">\u201cList of current United States Senators by Age.\u201d Wikipedia. Available online at http:\/\/en.wikipedia.org\/wiki\/List_of_current_United_States_Senators_by_age (accessed June 17, 2013).<\/p>\r\n<p id=\"fs-idp67778720\">\u201cSectoring by Industry Groups.\u201d Nasdaq. Available online at http:\/\/www.nasdaq.com\/markets\/barchart-sectors.aspx?page=sectors&amp;base=industry (accessed June 17, 2013).<\/p>\r\n<p id=\"fs-idp92967520\">\u201cStrip Clubs: Where Prostitution and Trafficking Happen.\u201d Prostitution Research and Education, 2013. Available online at www.prostitutionresearch.com\/ProsViolPosttrauStress.html (accessed June 17, 2013).<\/p>\r\n<p id=\"fs-idp130720928\">\u201cWorld Series History.\u201d Baseball-Almanac, 2013. Available online at http:\/\/www.baseball-almanac.com\/ws\/wsmenu.shtml (accessed June 17, 2013).<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm178063424\" class=\"summary\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Chapter Review<\/h3>\r\n<p id=\"fs-idm116979232\">Two population means from independent samples where the population standard deviations are not known<\/p>\r\n\r\n<ul id=\"fs-idm111534432\">\r\n \t<li>Random Variable: \\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\) = the difference of the sampling means<\/li>\r\n \t<li>Distribution: Student's <em data-effect=\"italics\">t<\/em>-distribution with degrees of freedom (variances not pooled)<\/li>\r\n<\/ul>\r\n<\/div>\r\n<div id=\"fs-idm147993376\" class=\"formula-review\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Formula Review<\/h3>\r\n<p id=\"fs-idm5111056\">Standard error: <em data-effect=\"italics\">SE<\/em> = \\(\\sqrt{\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}}\\)<\/p>\r\n<p id=\"fs-idm105811424\">Test statistic (<em data-effect=\"italics\">t<\/em>-score): <em data-effect=\"italics\">t<\/em> = \\(\\frac{\\left({\\overline{x}}_{1}-{\\overline{x}}_{2}\\right)-\\left({\\mu }_{1}-{\\mu }_{2}\\right)}{\\sqrt{\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}}}\\)<\/p>\r\n<p id=\"fs-idm96358640\">Degrees of freedom:<span data-type=\"newline\">\r\n<\/span> \\(df=\u00a0\\frac{{\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\u00a0\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}{\\left(\\frac{1}{{n}_{1}-1}\\right){\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}\\right)}^{2}+\\left(\\frac{1}{{n}_{2}-1}\\right){\\left(\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}\\)<\/p>\r\n<p id=\"fs-idm98284352\">where:<\/p>\r\n<p id=\"fs-idm92032864\"><em data-effect=\"italics\">s<\/em><sub>1<\/sub> and <em data-effect=\"italics\">s<\/em><sub>2<\/sub> are the sample standard deviations, and <em data-effect=\"italics\">n<\/em><sub>1<\/sub> and <em data-effect=\"italics\">n<\/em><sub>2<\/sub> are the sample sizes.<\/p>\r\n<p id=\"fs-idm32210368\">\\({\\overline{x}}_{1}\\) and \\({\\overline{x}}_{2}\\) are the sample means.<\/p>\r\n<p id=\"fs-idm117522208\">Cohen\u2019s <em data-effect=\"italics\">d<\/em> is the measure of effect size:<\/p>\r\n<p id=\"fs-idm117521824\">\\(d=\\frac{{\\overline{x}}_{1}-{\\overline{x}}_{2}}{{s}_{pooled}}\\)<span data-type=\"newline\">\r\n<\/span>where \\({s}_{pooled}=\\sqrt{\\frac{\\left({n}_{1}-1\\right){s}_{1}^{2}+\\left({n}_{2}-1\\right){s}_{2}^{2}}{{n}_{1}+{n}_{2}-2}}\\)<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm9464368\" class=\"practice\" data-depth=\"1\">\r\n<p id=\"fs-idm124875168\"><em data-effect=\"italics\">Use the following information to answer the next 15 exercises:<\/em> Indicate if the hypothesis test is for<\/p>\r\n\r\n<ol id=\"fs-idm45665008\" type=\"a\">\r\n \t<li>independent group means, population standard deviations, and\/or variances known<\/li>\r\n \t<li>independent group means, population standard deviations, and\/or variances unknown<\/li>\r\n \t<li>matched or paired samples<\/li>\r\n \t<li>single mean<\/li>\r\n \t<li>two proportions<\/li>\r\n \t<li>single proportion<\/li>\r\n<\/ol>\r\n<div id=\"fs-idm22935904\" data-type=\"exercise\">\r\n<div id=\"fs-idm124875424\" data-type=\"problem\">\r\n<p id=\"fs-idm84013008\">It is believed that 70% of males pass their drivers test in the first attempt, while 65% of females pass the test in the first attempt. Of interest is whether the proportions are in fact equal.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm235455488\" data-type=\"solution\">\r\n<p id=\"fs-idm173588752\">two proportions<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm146907648\" data-type=\"exercise\">\r\n<div id=\"fs-idm146907392\" data-type=\"problem\">\r\n<p id=\"fs-idm146907136\">A new laundry detergent is tested on consumers. Of interest is the proportion of consumers who prefer the new brand over the leading competitor. A study is done to test this.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm161821520\" data-type=\"exercise\">\r\n<div id=\"fs-idm161821264\" data-type=\"problem\">\r\n<p id=\"fs-idm164006592\">A new windshield treatment claims to repel water more effectively. Ten windshields are tested by simulating rain without the new treatment. The same windshields are then treated, and the experiment is run again. A hypothesis test is conducted.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm127047328\" data-type=\"solution\">\r\n<p id=\"fs-idm127047072\">matched or paired samples<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm120536064\" data-type=\"exercise\">\r\n<div id=\"fs-idm68027168\" data-type=\"problem\">\r\n<p id=\"fs-idm68026912\">The known standard deviation in salary for all mid-level professionals in the financial industry is \\$11,000. Company A and Company B are in the financial industry. Suppose samples are taken of mid-level professionals from Company A and from Company B. The sample mean salary for mid-level professionals in Company A is \\$80,000. The sample mean salary for mid-level professionals in Company B is \\$96,000. Company A and Company B management want to know if their mid-level professionals are paid differently, on average.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp21825264\" data-type=\"exercise\">\r\n<div id=\"fs-idp21825520\" data-type=\"problem\">\r\n<p id=\"fs-idm175592768\">The average worker in Germany gets eight weeks of paid vacation.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm81349024\" data-type=\"solution\">\r\n<p id=\"fs-idm81348768\">single mean<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm170345552\" data-type=\"exercise\">\r\n<div id=\"fs-idm170345296\" data-type=\"problem\">\r\n<p id=\"fs-idm39186912\">According to a television commercial, 80% of dentists agree that Ultrafresh toothpaste is the best on the market.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp3576048\" data-type=\"exercise\">\r\n<div id=\"fs-idm27505984\" data-type=\"problem\">\r\n<p id=\"fs-idm27505728\">It is believed that the average grade on an English essay in a particular school system for females is higher than for males. A random sample of 31 females had a mean score of 82 with a standard deviation of three, and a random sample of 25 males had a mean score of 76 with a standard deviation of four.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm27710704\" data-type=\"solution\">\r\n<p id=\"fs-idm54785888\">independent group means, population standard deviations and\/or variances unknown<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm32687376\" data-type=\"exercise\">\r\n<div id=\"fs-idm32687120\" data-type=\"problem\">\r\n<p id=\"fs-idm32686864\">The league mean batting average is 0.280 with a known standard deviation of 0.06. The Rattlers and the Vikings belong to the league. The mean batting average for a sample of eight Rattlers is 0.210, and the mean batting average for a sample of eight Vikings is 0.260. There are 24 players on the Rattlers and 19 players on the Vikings. Are the batting averages of the Rattlers and Vikings statistically different?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm36046944\" data-type=\"exercise\">\r\n<div id=\"fs-idm36046688\" data-type=\"problem\">\r\n<p id=\"fs-idm3190160\">In a random sample of 100 forests in the United States, 56 were coniferous or contained conifers. In a random sample of 80 forests in Mexico, 40 were coniferous or contained conifers. Is the proportion of conifers in the United States statistically more than the proportion of conifers in Mexico?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm7896320\" data-type=\"solution\">\r\n<p id=\"fs-idm7896064\">two proportions<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm84811344\" data-type=\"exercise\">\r\n<div id=\"fs-idm84811088\" data-type=\"problem\">\r\n<p id=\"fs-idm16418448\">A new medicine is said to help improve sleep. Eight subjects are picked at random and given the medicine. The means hours slept for each person were recorded before starting the medication and after.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm111997184\" data-type=\"exercise\">\r\n<div id=\"fs-idm111996928\" data-type=\"problem\">\r\n<p id=\"fs-idm35434864\">It is thought that teenagers sleep more than adults on average. A study is done to verify this. A sample of 16 teenagers has a mean of 8.9 hours slept and a standard deviation of 1.2. A sample of 12 adults has a mean of 6.9 hours slept and a standard deviation of 0.6.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm5318624\" data-type=\"solution\">\r\n<p id=\"fs-idm5318368\">independent group means, population standard deviations and\/or variances unknown<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm83613168\" data-type=\"exercise\">\r\n<div id=\"fs-idm83612912\" data-type=\"problem\">\r\n<p id=\"fs-idm122472032\">Varsity athletes practice five times a week, on average.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm155074576\" data-type=\"exercise\">\r\n<div id=\"fs-idm187988384\" data-type=\"problem\">\r\n<p id=\"fs-idm187988128\">A sample of 12 in-state graduate school programs at school A has a mean tuition of \\$64,000 with a standard deviation of \\$8,000. At school B, a sample of 16 in-state graduate programs has a mean of \\$80,000 with a standard deviation of \\$6,000. On average, are the mean tuitions different?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm63313632\" data-type=\"solution\">\r\n<p id=\"fs-idm86332592\">independent group means, population standard deviations and\/or variances unknown<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm86331952\" data-type=\"exercise\">\r\n<div id=\"fs-idm109695328\" data-type=\"problem\">\r\n<p id=\"fs-idm109695072\">A new WiFi range booster is being offered to consumers. A researcher tests the native range of 12 different routers under the same conditions. The ranges are recorded. Then the researcher uses the new WiFi range booster and records the new ranges. Does the new WiFi range booster do a better job?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm152397632\" data-type=\"exercise\">\r\n<div id=\"fs-idm187246864\" data-type=\"problem\">\r\n<p id=\"fs-idm187246608\">A high school principal claims that 30% of student athletes drive themselves to school, while 4% of non-athletes drive themselves to school. In a sample of 20 student athletes, 45% drive themselves to school. In a sample of 35 non-athlete students, 6% drive themselves to school. Is the percent of student athletes who drive themselves to school more than the percent of nonathletes?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm178089840\" data-type=\"solution\">\r\n<p id=\"fs-idm174648448\">two proportions<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<span data-type=\"newline\">\r\n<\/span><em data-effect=\"italics\">Use the following information to answer the next three exercises:<\/em> A study is done to determine which of two soft drinks has more sugar. There are 13 cans of Beverage A in a sample and six cans of Beverage B. The mean amount of sugar in Beverage A is 36 grams with a standard deviation of 0.6 grams. The mean amount of sugar in Beverage B is 38 grams with a standard deviation of 0.8 grams. The researchers believe that Beverage B has more sugar than Beverage A, on average. Both populations have normal distributions.\r\n<div id=\"fs-idm174647808\" data-type=\"exercise\">\r\n<div id=\"fs-idm80739952\" data-type=\"problem\">\r\n<p id=\"fs-idm174134640\">Are standard deviations known or unknown?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm213711120\" data-type=\"exercise\">\r\n<div id=\"fs-idm213710864\" data-type=\"problem\">\r\n<p id=\"fs-idm213710608\">What is the random variable?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm10993824\" data-type=\"solution\">\r\n<p id=\"fs-idm10993568\">The random variable is the difference between the mean amounts of sugar in the two soft drinks.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm106857024\" data-type=\"exercise\">\r\n<div id=\"fs-idm38595520\" data-type=\"problem\">\r\n<p id=\"fs-idm38595264\">Is this a one-tailed or two-tailed test?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<em data-effect=\"italics\">Use the following information to answer the next 12 exercises:<\/em> The U.S. Center for Disease Control reports that the mean life expectancy was 47.6 years for whites born in 1900 and 33.0 years for nonwhites. Suppose that you randomly survey death records for people born in 1900 in a certain county. Of the 124 whites, the mean life span was 45.3 years with a standard deviation of 12.7 years. Of the 82 nonwhites, the mean life span was 34.1 years with a standard deviation of 15.6 years. Conduct a hypothesis test to see if the mean life spans in the county were the same for whites and nonwhites.\r\n<div id=\"fs-idm77134224\" data-type=\"exercise\">\r\n<div id=\"fs-idm77133968\" data-type=\"problem\">\r\n<p id=\"fs-idm82811296\">Is this a test of means or proportions?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm82810784\" data-type=\"solution\">\r\n<p id=\"fs-idm5951248\">means<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm87532368\" data-type=\"exercise\">\r\n<div id=\"fs-idm87532112\" data-type=\"problem\">\r\n<p id=\"fs-idm87531856\">State the null and alternative hypotheses.<\/p>\r\n\r\n<ol id=\"fs-idm126031792\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: __________<\/li>\r\n \t<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: __________<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp13932064\" data-type=\"exercise\">\r\n<div id=\"fs-idm162071840\" data-type=\"problem\">\r\n<p id=\"fs-idm162071584\">Is this a right-tailed, left-tailed, or two-tailed test?<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm104897488\" data-type=\"solution\">\r\n<p id=\"fs-idm104897232\">two-tailed<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm104977072\" data-type=\"exercise\">\r\n<div id=\"fs-idm104976816\" data-type=\"problem\">\r\n<p id=\"fs-idm161761248\">In symbols, what is the random variable of interest for this test?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm147822240\" data-type=\"exercise\">\r\n<div id=\"fs-idm147821984\" data-type=\"problem\">\r\n<p id=\"fs-idm147821728\">In words, define the random variable of interest for this test.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm2327840\" data-type=\"solution\">\r\n<p id=\"fs-idm2327584\">the difference between the mean life spans of whites and nonwhites<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm17957104\" data-type=\"exercise\">\r\n<div id=\"fs-idm17956848\" data-type=\"problem\">\r\n<p id=\"fs-idm100280816\">Which distribution (normal or Student's <em data-effect=\"italics\">t<\/em>) would you use for this hypothesis test?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm121049024\" data-type=\"exercise\">\r\n<div id=\"fs-idm121048768\" data-type=\"problem\">\r\n<p id=\"fs-idm121048512\">Explain why you chose the distribution you did for <a class=\"autogenerated-content\" href=\"#fs-idm17957104\">(Figure)<\/a>.<\/p>\r\n\r\n<\/div>\r\n<div id=\"eip-idp105137344\" data-type=\"solution\">\r\n<p id=\"eip-idp105137600\">This is a comparison of two population means with unknown population standard deviations.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm96874480\" data-type=\"exercise\">\r\n<div id=\"fs-idm96874224\" data-type=\"problem\">\r\n<p id=\"fs-idm263989856\">Calculate the test statistic and <em data-effect=\"italics\">p<\/em>-value.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm120924688\" data-type=\"exercise\">\r\n<div id=\"fs-idm120924432\" data-type=\"problem\">\r\n<p id=\"fs-idm120924176\">Sketch a graph of the situation. Label the horizontal axis. Mark the hypothesized difference and the sample difference. Shade the area corresponding to the <em data-effect=\"italics\">p<\/em>-value.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idp104610560\" data-type=\"solution\">\r\n<p id=\"fs-idp104610816\">Check student\u2019s solution.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm47382320\" data-type=\"exercise\">\r\n<div id=\"fs-idm47382064\" data-type=\"problem\">\r\n<p id=\"fs-idm174192304\">Find the <em data-effect=\"italics\">p<\/em>-value.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm129306608\" data-type=\"exercise\">\r\n<div id=\"fs-idm129306352\" data-type=\"problem\">\r\n<p id=\"fs-idm148755040\">At a pre-conceived <em data-effect=\"italics\">\u03b1<\/em> = 0.05, what is your:<\/p>\r\n\r\n<ol id=\"fs-idp598432\" type=\"a\">\r\n \t<li>Decision:<\/li>\r\n \t<li>Reason for the decision:<\/li>\r\n \t<li>Conclusion (write out in a complete sentence):<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idm110404384\" data-type=\"solution\">\r\n<ol id=\"fs-idp229240368\" type=\"a\">\r\n \t<li>Reject the null hypothesis<\/li>\r\n \t<li><em data-effect=\"italics\">p<\/em>-value &lt; 0.05<\/li>\r\n \t<li>There is not enough evidence at the 5% level of significance to support the claim that life expectancy in the 1900s is different between whites and nonwhites.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm127412752\" data-type=\"exercise\">\r\n<div id=\"fs-idm126248464\" data-type=\"problem\">\r\n<p id=\"fs-idm126248208\">Does it appear that the means are the same? Why or why not?<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm173509824\" class=\"free-response\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Homework<\/h3>\r\n<em data-effect=\"italics\">DIRECTIONS: For each of the word problems, use a solution sheet to do the hypothesis test. The solution sheet is found in <a href=\"\/contents\/c0449c55-aa47-4f1c-bd5f-0521652f0e82\">Appendix E<\/a>. Please feel free to make copies of the solution sheets. For the online version of the book, it is suggested that you copy the .doc or the .pdf files.<\/em>\r\n<div id=\"eip-792\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\r\n<div data-type=\"title\">NOTE<\/div>\r\n<p id=\"eip-idp140444868962512\">If you are using a Student's <em data-effect=\"italics\">t<\/em>-distribution for a homework problem in what follows, including for paired data, you may assume that the underlying population is normally distributed. (When using these tests in a real situation, you must first prove that assumption, however.)<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm164128\" data-type=\"exercise\">\r\n<div id=\"fs-idm163872\" data-type=\"problem\">\r\n<p id=\"fs-idm163616\"><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm8894688\" data-type=\"exercise\">\r\n<div id=\"fs-idm8894432\" data-type=\"problem\">\r\n<p id=\"fs-idm185269312\">1)A student at a four-year college claims that mean enrollment at four\u2013year colleges is higher than at two\u2013year colleges in the United States. Two surveys are conducted. Of the 35 two\u2013year colleges surveyed, the mean enrollment was 5,068 with a standard deviation of 4,777. Of the 35 four-year colleges surveyed, the mean enrollment was 5,466 with a standard deviation of 8,191.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idp138385488\" data-type=\"solution\">\r\n\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm39083984\" data-type=\"exercise\">\r\n<div id=\"fs-idm39083728\" data-type=\"problem\">\r\n<p id=\"fs-idm53880736\">2) At Rachel\u2019s 11<sup>th<\/sup> birthday party, eight girls were timed to see how long (in seconds) they could hold their breath in a relaxed position. After a two-minute rest, they timed themselves while jumping. The girls thought that the mean difference between their jumping and relaxed times would be zero. Test their hypothesis.<\/p>\r\n\r\n<table id=\"fs-idm53880480\" summary=\"The table presents the first column as the relaxed time in seconds and the second column as the jumping time in seconds.\">\r\n<thead>\r\n<tr>\r\n<th>Relaxed time (seconds)<\/th>\r\n<th>Jumping time (seconds)<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>26<\/td>\r\n<td>21<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>47<\/td>\r\n<td>40<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>30<\/td>\r\n<td>28<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>22<\/td>\r\n<td>21<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>23<\/td>\r\n<td>25<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>45<\/td>\r\n<td>43<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>37<\/td>\r\n<td>35<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>29<\/td>\r\n<td>32<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm122985232\" data-type=\"exercise\">\r\n<div id=\"fs-idm122984976\" data-type=\"problem\">\r\n<p id=\"fs-idm122984720\">3) Mean entry-level salaries for college graduates with mechanical engineering degrees and electrical engineering degrees are believed to be approximately the same. A recruiting office thinks that the mean mechanical engineering salary is actually lower than the mean electrical engineering salary. The recruiting office randomly surveys 50 entry level mechanical engineers and 60 entry level electrical engineers. Their mean salaries were \\$46,100 and \\$46,700, respectively. Their standard deviations were \\$3,450 and \\$4,210, respectively. Conduct a hypothesis test to determine if you agree that the mean entry-level mechanical engineering salary is lower than the mean entry-level electrical engineering salary.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm188189904\" data-type=\"solution\">\r\n\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm62206688\" data-type=\"exercise\">\r\n<div id=\"fs-idm62206432\" data-type=\"problem\">\r\n<p id=\"fs-idm105723088\">4) Marketing companies have collected data implying that teenage girls use more ring tones on their cellular phones than teenage boys do. In one particular study of 40 randomly chosen teenage girls and boys (20 of each) with cellular phones, the mean number of ring tones for the girls was 3.2 with a standard deviation of 1.5. The mean for the boys was 1.7 with a standard deviation of 0.8. Conduct a hypothesis test to determine if the means are approximately the same or if the girls\u2019 mean is higher than the boys\u2019 mean.<\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<em data-effect=\"italics\">Use the information from <a class=\"autogenerated-content\" href=\"\/contents\/3ef830bc-5247-460a-9007-e3fd762e5e93\">(Figure)<\/a> to answer the next four exercises.<\/em>\r\n<div id=\"fs-idm50796704\" data-type=\"exercise\">\r\n<div id=\"fs-idm50796448\" data-type=\"problem\">\r\n<p id=\"fs-idm98236496\">5) Using the data from Lap 1 only, conduct a hypothesis test to determine if the mean time for completing a lap in races is the same as it is in practices.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm73210192\" data-type=\"solution\">\r\n\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm2195248\" data-type=\"exercise\">\r\n<div id=\"fs-idm2194992\" data-type=\"problem\">\r\n<p id=\"fs-idm241101904\">6) Repeat the test in <a href=\"#fs-idm50796704\">Exercise 10.83<\/a>, but use Lap 5 data this time.<\/p>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm170930784\" data-type=\"exercise\">\r\n<div id=\"fs-idm54834800\" data-type=\"problem\">\r\n<p id=\"fs-idm54834544\">7) Repeat the test in <a href=\"#fs-idm50796704\">Exercise 10.83<\/a>, but this time combine the data from Laps 1 and 5.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm39528240\" data-type=\"solution\"><\/div>\r\n<\/div>\r\n<div id=\"fs-idm80120272\" data-type=\"exercise\">\r\n<div id=\"fs-idm80120016\" data-type=\"problem\">\r\n<p id=\"fs-idm80119760\"><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<em data-effect=\"italics\">Use the following information to answer the next two exercises.<\/em> The Eastern and Western Major League Soccer conferences have a new Reserve Division that allows new players to develop their skills. Data for a randomly picked date showed the following annual goals.\r\n<table summary=\"Eastern and Western Major League Soccer\">\r\n<thead>\r\n<tr>\r\n<th>Western<\/th>\r\n<th>Eastern<\/th>\r\n<\/tr>\r\n<\/thead>\r\n<tbody>\r\n<tr>\r\n<td>Los Angeles 9<\/td>\r\n<td>D.C. United 9<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>FC Dallas 3<\/td>\r\n<td>Chicago 8<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Chivas USA 4<\/td>\r\n<td>Columbus 7<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Real Salt Lake 3<\/td>\r\n<td>New England 6<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>Colorado 4<\/td>\r\n<td>MetroStars 5<\/td>\r\n<\/tr>\r\n<tr>\r\n<td>San Jose 4<\/td>\r\n<td>Kansas City 3<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<p id=\"eip-549\"><em data-effect=\"italics\">Conduct a hypothesis test to answer the next two exercises.<\/em><\/p>\r\n&nbsp;\r\n<div id=\"fs-idm67199056\" data-type=\"exercise\">\r\n<div id=\"fs-idm67198800\" data-type=\"problem\">\r\n<p id=\"fs-idm67198544\">8) The <strong>exact<\/strong> distribution for the hypothesis test is:<\/p>\r\n\r\n<ol id=\"fs-idm91569200\" type=\"a\">\r\n \t<li>the normal distribution<\/li>\r\n \t<li>the Student's <em data-effect=\"italics\">t<\/em>-distribution<\/li>\r\n \t<li>the uniform distribution<\/li>\r\n \t<li>the exponential distribution<\/li>\r\n<\/ol>\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm3207760\" data-type=\"exercise\">\r\n<div id=\"fs-idm104292672\" data-type=\"problem\">\r\n<p id=\"fs-idm104292416\">9) If the level of significance is 0.05, the conclusion is:<\/p>\r\n\r\n<ol id=\"fs-idm104292032\" type=\"a\">\r\n \t<li>There is sufficient evidence to conclude that the <strong>W<\/strong> Division teams score fewer goals, on average, than the <strong>E<\/strong> teams<\/li>\r\n \t<li>There is insufficient evidence to conclude that the <strong>W<\/strong> Division teams score more goals, on average, than the <strong>E<\/strong> teams.<\/li>\r\n \t<li>There is insufficient evidence to conclude that the <strong>W<\/strong> teams score fewer goals, on average, than the <strong>E<\/strong> teams score.<\/li>\r\n \t<li>Unable to determine<\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"fs-idm178991312\" data-type=\"solution\">\r\n<p id=\"fs-idm178991056\"><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm108118736\" data-type=\"exercise\">\r\n<div id=\"fs-idm122622576\" data-type=\"problem\">\r\n<p id=\"eip-idp273849808\">10) Suppose a statistics instructor believes that there is no significant difference between the mean class scores of statistics day students on Exam 2 and statistics night students on Exam 2. She takes random samples from each of the populations. The mean and standard deviation for 35 statistics day students were 75.86 and 16.91. The mean and standard deviation for 37 statistics night students were 75.41 and 19.73. The \u201cday\u201d subscript refers to the statistics day students. The \u201cnight\u201d subscript refers to the statistics night students. A concluding statement is:<\/p>\r\n\r\n<ol id=\"fs-idm122621936\" type=\"a\">\r\n \t<li>There is sufficient evidence to conclude that statistics night students' mean on Exam 2 is better than the statistics day students' mean on Exam 2.<\/li>\r\n \t<li>There is insufficient evidence to conclude that the statistics day students' mean on Exam 2 is better than the statistics night students' mean on Exam 2.<\/li>\r\n \t<li>There is insufficient evidence to conclude that there is a significant difference between the means of the statistics day students and night students on Exam 2.<\/li>\r\n \t<li>There is sufficient evidence to conclude that there is a significant difference between the means of the statistics day students and night students on Exam 2.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm81478720\" data-type=\"exercise\">\r\n<div id=\"fs-idp12884432\" data-type=\"problem\">\r\n<p id=\"fs-idp12884688\">11) Researchers interviewed street prostitutes in Canada and the United States. The mean age of the 100 Canadian prostitutes upon entering prostitution was 18 with a standard deviation of six. The mean age of the 130 United States prostitutes upon entering prostitution was 20 with a standard deviation of eight. Is the mean age of entering prostitution in Canada lower than the mean age in the United States? Test at a 1% significance level.<\/p>\r\n\r\n<\/div>\r\n<div id=\"fs-idm189266736\" data-type=\"solution\">\r\n<p id=\"fs-idm125542800\"><\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm65734896\" data-type=\"exercise\">\r\n<div id=\"fs-idm65734640\" data-type=\"problem\">\r\n<p id=\"fs-idp44866512\">12) A powder diet is tested on 49 people, and a liquid diet is tested on 36 different people. Of interest is whether the liquid diet yields a higher mean weight loss than the powder diet. The powder diet group had a mean weight loss of 42 pounds with a standard deviation of 12 pounds. The liquid diet group had a mean weight loss of 45 pounds with a standard deviation of 14 pounds.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<div data-type=\"exercise\">\r\n<div data-type=\"problem\">\r\n\r\n13) Suppose a statistics instructor believes that there is no significant difference between the mean class scores of statistics day students on Exam 2 and statistics night students on Exam 2. She takes random samples from each of the populations. The mean and standard deviation for 35 statistics day students were 75.86 and 16.91, respectively. The mean and standard deviation for 37 statistics night students were 75.41 and 19.73. The \u201cday\u201d subscript refers to the statistics day students. The \u201cnight\u201d subscript refers to the statistics night students. An appropriate alternative hypothesis for the hypothesis test is:\r\n<ol id=\"eip-idp47881856\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> &gt; <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\r\n \t<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> &lt; <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\r\n \t<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> = <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\r\n \t<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> \u2260 <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\r\n<\/ol>\r\n<\/div>\r\n<div id=\"eip-605\" data-type=\"solution\">\r\n\r\n&nbsp;\r\n\r\n1)\r\n<p id=\"fs-idm89316432\">Subscripts: 1: two-year colleges; 2: four-year colleges<\/p>\r\n\r\n<ol id=\"fs-idm89316048\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> \u2265 <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em><\/li>\r\n \t<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> &lt; <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em><\/li>\r\n \t<li>\\({\\overline{X}}_{1}\u2013{\\overline{X}}_{2}\\) is the difference between the mean enrollments of the two-year colleges and the four-year colleges.<\/li>\r\n \t<li>Student\u2019s-<em data-effect=\"italics\">t<\/em><\/li>\r\n \t<li>test statistic: -0.2480<\/li>\r\n \t<li><em data-effect=\"italics\">p<\/em>-value: 0.4019<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>\r\n<ol id=\"fs-idp113692848\" type=\"i\">\r\n \t<li>Alpha: 0.05<\/li>\r\n \t<li>Decision: Do not reject<\/li>\r\n \t<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &gt; alpha<\/li>\r\n \t<li>Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean enrollment at four-year colleges is higher than at two-year colleges.<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n3)\r\n<p id=\"fs-idp57225296\">Subscripts: 1: mechanical engineering; 2: electrical engineering<\/p>\r\n\r\n<ol id=\"fs-idp195702880\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> \u2265 <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\r\n \t<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> &lt; <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\r\n \t<li>\\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\) is the difference between the mean entry level salaries of mechanical engineers and electrical engineers.<\/li>\r\n \t<li><em data-effect=\"italics\">t<\/em><sub>108<\/sub><\/li>\r\n \t<li>test statistic: <em data-effect=\"italics\">t<\/em> = \u20130.82<\/li>\r\n \t<li><em data-effect=\"italics\">p<\/em>-value: 0.2061<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>\r\n<ol id=\"fs-idp3469344\" type=\"i\">\r\n \t<li>Alpha: 0.05<\/li>\r\n \t<li>Decision: Do not reject the null hypothesis.<\/li>\r\n \t<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &gt; alpha<\/li>\r\n \t<li>Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the mean entry-level salaries of mechanical engineers is lower than that of electrical engineers.'<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n5)\r\n<ol id=\"fs-idp184583584\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> = <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\r\n \t<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> \u2260 <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\r\n \t<li>\\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\) is the difference between the mean times for completing a lap in races and in practices.<\/li>\r\n \t<li><em data-effect=\"italics\">t<\/em><sub>20.32<\/sub><\/li>\r\n \t<li>test statistic: \u20134.70<\/li>\r\n \t<li><em data-effect=\"italics\">p<\/em>-value: 0.0001<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>\r\n<ol id=\"fs-idp173433584\" type=\"i\">\r\n \t<li>Alpha: 0.05<\/li>\r\n \t<li>Decision: Reject the null hypothesis.<\/li>\r\n \t<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &lt; alpha<\/li>\r\n \t<li>Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for completing a lap in races is different from that in practices.<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n7)\r\n<div id=\"fs-idm170930784\" data-type=\"exercise\">\r\n<div id=\"fs-idm39528240\" data-type=\"solution\">\r\n<ol id=\"fs-idp172770560\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> = <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\r\n \t<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> \u2260 <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\r\n \t<li>is the difference between the mean times for completing a lap in races and in practices.<\/li>\r\n \t<li><em data-effect=\"italics\">t<\/em><sub>40.94<\/sub><\/li>\r\n \t<li>test statistic: \u20135.08<\/li>\r\n \t<li><em data-effect=\"italics\">p<\/em>-value: zero<\/li>\r\n \t<li>Check student\u2019s solution.<\/li>\r\n \t<li>\r\n<ol id=\"fs-idp77676896\" type=\"i\">\r\n \t<li>Alpha: 0.05<\/li>\r\n \t<li>Decision: Reject the null hypothesis.<\/li>\r\n \t<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &lt; alpha<\/li>\r\n \t<li>Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for completing a lap in races is different from that in practices.<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm80120272\" data-type=\"exercise\">\r\n<div id=\"fs-idm80120016\" data-type=\"problem\">\r\n<p id=\"fs-idm80119760\">\u00a0In two to three complete sentences, explain in detail how you might use Terri Vogel\u2019s data to answer the following question. \u201cDoes Terri Vogel drive faster in races than she does in practices?\u201d<\/p>\r\n9) c\r\n\r\n11)\r\n<p id=\"fs-idm189266480\">Test: two independent sample means, population standard deviations unknown.<\/p>\r\n<p id=\"fs-idm7792944\">Random variable: \\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\)<\/p>\r\n<p id=\"fs-idm96995296\">Distribution: <em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> = <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> &lt; <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> The mean age of entering prostitution in Canada is lower than the mean age in the United States.<\/p>\r\n\r\n<div id=\"fs-idm122708400\" class=\"bc-figure figure\"><span id=\"fs-idm5558192\" data-type=\"media\" data-alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0157.\"><img src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/CNX_Stats_C10_M02_005anno-1.jpg\" alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0157.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\r\n<p id=\"fs-idm95276832\">Graph: left-tailed<\/p>\r\n<p id=\"fs-idm5467280\"><em data-effect=\"italics\">p<\/em>-value : 0.0151<\/p>\r\n<p id=\"fs-idm171100928\">Decision: Do not reject <em data-effect=\"italics\">H<\/em><sub>0<\/sub>.<\/p>\r\n<p id=\"fs-idm125542800\">Conclusion: At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that the mean age of entering prostitution in Canada is lower than the mean age in the United States.<\/p>\r\n13) d\r\n\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textbox shaded\" data-type=\"glossary\">\r\n<h3 data-type=\"glossary-title\">Glossary<\/h3>\r\n<dl>\r\n \t<dt>Degrees of Freedom (<em data-effect=\"italics\">df<\/em>)<\/dt>\r\n \t<dd id=\"id21146024\">the number of objects in a sample that are free to vary.<\/dd>\r\n<\/dl>\r\n<dl>\r\n \t<dt>Standard Deviation<\/dt>\r\n \t<dd id=\"id10539417\">A number that is equal to the square root of the variance and measures how far data values are from their mean; notation: <em data-effect=\"italics\">s<\/em> for sample standard deviation and <em data-effect=\"italics\">\u03c3<\/em> for population standard deviation.<\/dd>\r\n<\/dl>\r\n<dl id=\"variable\">\r\n \t<dt>Variable (Random Variable)<\/dt>\r\n \t<dd id=\"id1167450448490\">a characteristic of interest in a population being studied. Common notation for variables are upper-case Latin letters <em data-effect=\"italics\">X<\/em>, <em data-effect=\"italics\">Y<\/em>, <em data-effect=\"italics\">Z<\/em>,... Common notation for a specific value from the domain (set of all possible values of a variable) are lower-case Latin letters <em data-effect=\"italics\">x<\/em>, <em data-effect=\"italics\">y<\/em>, <em data-effect=\"italics\">z<\/em>,.... For example, if <em data-effect=\"italics\">X<\/em> is the number of children in a family, then <em data-effect=\"italics\">x<\/em> represents a specific integer 0, 1, 2, 3, .... Variables in statistics differ from variables in intermediate algebra in two following ways.\r\n<ul id=\"fs-idp40452080\">\r\n \t<li>The domain of the random variable (RV) is not necessarily a numerical set; the domain may be expressed in words; for example, if <em data-effect=\"italics\">X<\/em> = hair color, then the domain is {black, blond, gray, green, orange}.<\/li>\r\n \t<li>We can tell what specific value <em data-effect=\"italics\">x<\/em> of the random variable <em data-effect=\"italics\">X<\/em> takes only after performing the experiment.<\/li>\r\n<\/ul>\r\n<\/dd>\r\n<\/dl>\r\n<\/div>","rendered":"<p>&nbsp;<\/p>\n<ol>\n<li>The two independent samples are simple random samples from two distinct populations.<\/li>\n<li>For the two distinct populations:\n<ul id=\"fs-idm39244848\">\n<li>if the sample sizes are small, the distributions are important (should be normal)<\/li>\n<li>if the sample sizes are large, the distributions are not important (need not be normal)<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<div id=\"fs-idm107349216\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">NOTE<\/div>\n<p id=\"fs-idm165699296\">The test comparing two independent population means with unknown and possibly unequal population standard deviations is called the Aspin-Welch t-test. The degrees of freedom formula was developed by Aspin-Welch.<\/p>\n<\/div>\n<p>The comparison of two population means is very common. A difference between the two samples depends on both the means and the standard deviations. Very different means can occur by chance if there is great variation among the individual samples. In order to account for the variation, we take the difference of the sample means, \\({\\overline{X}}_{1}\\) \u2013 \\({\\overline{X}}_{2}\\), and divide by the standard error in order to standardize the difference. The result is a t-score test statistic.<\/p>\n<p>Because we do not know the population standard deviations, we estimate them using the two sample standard deviations from our independent samples. For the hypothesis test, we calculate the estimated standard deviation, or <span data-type=\"term\">standard error<\/span>, of <strong>the difference in sample means<\/strong>, \\({\\overline{X}}_{1}\\) \u2013 \\({\\overline{X}}_{2}\\).<\/p>\n<div id=\"std_err\" data-type=\"equation\"><span data-type=\"title\">The standard error is:<\/span>\\(\\sqrt{\\frac{\\left({s}_{1}{\\right)}^{2}}{{n}_{1}}+\\frac{\\left({s}_{2}{\\right)}^{2}}{{n}_{2}}}\\)<\/div>\n<p>The test statistic (<em data-effect=\"italics\">t<\/em>-score) is calculated as follows:<\/p>\n<div id=\"t-score2\" data-type=\"equation\">\\(\\frac{\\text{(}{\\overline{x}}_{1}\u2013{\\overline{x}}_{2}\\text{)}\u2013\\text{(}{\\mu }_{1}\u2013{\\mu }_{2}\\text{)}}{\\sqrt{\\frac{{\\text{(}{s}_{1}\\text{)}}^{2}}{{n}_{1}}+\\frac{{\\text{(}{s}_{2}\\text{)}}^{2}}{{n}_{2}}}}\\)<\/div>\n<div data-type=\"list\">\n<div data-type=\"title\">where:<\/div>\n<ul>\n<li><em data-effect=\"italics\">s<\/em><sub>1<\/sub> and <em data-effect=\"italics\">s<\/em><sub>2<\/sub>, the sample standard deviations, are estimates of <em data-effect=\"italics\">\u03c3<\/em><sub>1<\/sub> and <em data-effect=\"italics\">\u03c3<\/em><sub>2<\/sub>, respectively.<\/li>\n<li><em data-effect=\"italics\">\u03c3<\/em><sub>1<\/sub> and <em data-effect=\"italics\">\u03c3<\/em><sub>1<\/sub> are the unknown population standard deviations.<\/li>\n<li>\\({\\overline{x}}_{1}\\) and \\({\\overline{x}}_{2}\\) are the sample means. <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> and <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> are the population means.<\/li>\n<\/ul>\n<\/div>\n<p id=\"element-256\" class=\"finger\">The number of <span data-type=\"term\">degrees of freedom (<em data-effect=\"italics\">df<\/em>)<\/span> requires a somewhat complicated calculation. However, a computer or calculator calculates it easily. The <em data-effect=\"italics\">df<\/em> are not always a whole number. The test statistic calculated previously is approximated by the Student&#8217;s <em data-effect=\"italics\">t<\/em>-distribution with <em data-effect=\"italics\">df<\/em> as follows:<\/p>\n<div id=\"eq_df\" data-type=\"equation\"><span data-type=\"title\">Degrees of freedom<\/span>\\(df=\\frac{{\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}{\\left(\\frac{1}{{n}_{1}\u20131}\\right){\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}\\right)}^{2}+\\left(\\frac{1}{{n}_{2}\u20131}\\right){\\left(\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}\\)<\/div>\n<p id=\"element-316\">When both sample sizes <em data-effect=\"italics\">n<\/em><sub>1<\/sub> and <em data-effect=\"italics\">n<\/em><sub>2<\/sub> are five or larger, the Student&#8217;s <em data-effect=\"italics\">t<\/em> approximation is very good. Notice that the sample variances (<em data-effect=\"italics\">s<\/em><sub>1<\/sub>)<sup>2<\/sup> and (<em data-effect=\"italics\">s<\/em><sub>2<\/sub>)<sup>2<\/sup> are not pooled. (If the question comes up, do not pool the variances.)<\/p>\n<div id=\"fs-idm74088160\" class=\"finger\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">NOTE<\/div>\n<p id=\"fs-idm378573120\">It is not necessary to compute this by hand. A calculator or computer easily computes it.<\/p>\n<\/div>\n<div class=\"textbox textbox--examples\" data-type=\"example\">\n<div data-type=\"title\">Independent groups<\/div>\n<p>The average amount of time boys and girls aged seven to 11 spend playing sports each day is believed to be the same. A study is done and data are collected, resulting in the data in <a class=\"autogenerated-content\" href=\"#uid888\">(Figure)<\/a>. Each populations has a normal distribution.<\/p>\n<table id=\"uid888\" summary=\"This table presents the sample size in the second column, average hours a day in the third column, and the sample standard deviation in the fourth column. The first row is for girls and the second row is for boys.\">\n<thead valign=\"top\">\n<tr>\n<th data-align=\"center\"><\/th>\n<th data-align=\"center\">Sample Size<\/th>\n<th data-align=\"center\">Average Number of Hours Playing Sports Per Day<\/th>\n<th data-align=\"center\">Sample Standard Deviation<\/th>\n<\/tr>\n<\/thead>\n<tbody valign=\"top\">\n<tr>\n<td>Girls<\/td>\n<td>9<\/td>\n<td>2<\/td>\n<td>\\(0.866\\)<\/td>\n<\/tr>\n<tr>\n<td>Boys<\/td>\n<td>16<\/td>\n<td>3.2<\/td>\n<td>1.00<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div data-type=\"exercise\">\n<div id=\"id6322791\" data-type=\"problem\">\n<p>Is there a difference in the mean amount of time boys and girls aged seven to 11 play sports each day? Test at the 5% level of significance.<\/p>\n<\/div>\n<div id=\"id21329984\" data-type=\"solution\">\n<p><strong>The population standard deviations are not known.<\/strong> Let <em data-effect=\"italics\">g<\/em> be the subscript for girls and <em data-effect=\"italics\">b<\/em> be the subscript for boys. Then, <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> is the population mean for girls and <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em> is the population mean for boys. This is a test of two <strong>independent groups<\/strong>, two population <strong>means<\/strong>.<\/p>\n<p><span data-type=\"term\">Random variable<\/span>: \\({\\overline{X}}_{g}-{\\overline{X}}_{b}\\) = difference in the sample mean amount of time girls and boys play sports each day. <span data-type=\"newline\"><br \/>\n<\/span><em data-effect=\"italics\">H<\/em><sub>0<\/sub>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em>\u2003\u2003<em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> \u2013 <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em> = 0 <span data-type=\"newline\"><br \/>\n<\/span><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> \u2260 <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em>\u2003\u2003<em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> \u2013 <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em> \u2260 0 <span data-type=\"newline\"><br \/>\n<\/span>The words <strong>&#8220;the same&#8221;<\/strong> tell you <em data-effect=\"italics\">H<sub>0<\/sub><\/em> has an &#8220;=&#8221;. Since there are no other words to indicate <em data-effect=\"italics\">H<sub>a<\/sub><\/em>, assume it says <strong>&#8220;is different.&#8221;<\/strong> This is a two-tailed test.<\/p>\n<p><strong>Distribution for the test:<\/strong> Use <em data-effect=\"italics\">t<sub>df<\/sub><\/em> where <em data-effect=\"italics\">df<\/em> is calculated using the <em data-effect=\"italics\">df<\/em> formula for independent groups, two population means. Using a calculator, <em data-effect=\"italics\">df<\/em> is approximately 18.8462. <strong>Do not pool the variances.<\/strong><\/p>\n<p><strong>Calculate the <em data-effect=\"italics\">p<\/em>-value using a Student&#8217;s <em data-effect=\"italics\">t<\/em>-distribution:<\/strong><em data-effect=\"italics\">p<\/em>-value = 0.0054<\/p>\n<p id=\"element-90\"><strong>Graph:<\/strong><\/p>\n<div id=\"hyptest22_cmp1\" class=\"bc-figure figure\"><span id=\"id18963169\" data-type=\"media\" data-alt=\"This is a normal distribution curve representing the difference in the average amount of time girls and boys play sports all day. The mean is equal to zero, and the values -1.2, 0, and 1.2 are labeled on the horizontal axis. Two vertical lines extend from -1.2 and 1.2 to the curve. The region to the left of x = -1.2 and the region to the right of x = 1.2 are shaded to represent the p-value. The area of each region is 0.0028.\"><img decoding=\"async\" src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/05\/fig-ch10_02_01-1.jpg\" alt=\"This is a normal distribution curve representing the difference in the average amount of time girls and boys play sports all day. The mean is equal to zero, and the values -1.2, 0, and 1.2 are labeled on the horizontal axis. Two vertical lines extend from -1.2 and 1.2 to the curve. The region to the left of x = -1.2 and the region to the right of x = 1.2 are shaded to represent the p-value. The area of each region is 0.0028.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\n<p><span data-type=\"newline\"><br \/>\n<\/span>\\({s}_{g}=0.866\\)<span data-type=\"newline\"><br \/>\n<\/span>\\({s}_{b}=1\\)<span data-type=\"newline\"><br \/>\n<\/span>So, \\({\\overline{x}}_{g}\u2013{\\overline{x}}_{b}\\) = 2 \u2013 3.2 = \u20131.2 <span data-type=\"newline\"><br \/>\n<\/span>Half the <em data-effect=\"italics\">p<\/em>-value is below \u20131.2 and half is above 1.2.<\/p>\n<p><strong>Make a decision:<\/strong> Since <em data-effect=\"italics\">\u03b1<\/em> &gt; <em data-effect=\"italics\">p<\/em>-value, reject <em data-effect=\"italics\">H<sub>0<\/sub><\/em>. This means you reject <em data-effect=\"italics\">\u03bc<sub>g<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<sub>b<\/sub><\/em>. The means are different.<\/p>\n<div id=\"fs-idp61535456\" class=\"statistics calculator\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<p id=\"fs-idp144406384\">Press <code>STAT<\/code>. Arrow over to <code>TESTS<\/code> and press <code>4:2-SampTTest<\/code>. Arrow over to Stats and press <code>ENTER<\/code>. Arrow down and enter <code>2<\/code> for the first sample mean, <code>\\(0.866\\)<\/code> for Sx1, <code>9<\/code> for n1, <code>3.2<\/code> for the second sample mean, <code>1<\/code> for Sx2, and <code>16<\/code> for n2. Arrow down to \u03bc1: and arrow to <code>does not equal<\/code> \u03bc2. Press <code>ENTER<\/code>. Arrow down to Pooled: and <code>No<\/code>. Press <code>ENTER<\/code>. Arrow down to <code>Calculate<\/code> and press <code>ENTER<\/code>. The <em data-effect=\"italics\">p<\/em>-value is <em data-effect=\"italics\">p<\/em> = 0.0054, the dfs are approximately 18.8462, and the test statistic is -3.14. Do the procedure again but instead of Calculate do Draw.<\/p>\n<\/div>\n<p><strong>Conclusion:<\/strong> At the 5% level of significance, the sample data show there is sufficient evidence to conclude that the mean number of hours that girls and boys aged seven to 11 play sports per day is different (mean number of hours boys aged seven to 11 play sports per day is greater than the mean number of hours played by girls OR the mean number of hours girls aged seven to 11 play sports per day is greater than the mean number of hours played by boys).<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp67783984\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"fs-idm235264144\" data-type=\"exercise\">\n<div id=\"fs-idm162906288\" data-type=\"problem\">\n<p id=\"fs-idm193306272\">Two samples are shown in <a class=\"autogenerated-content\" href=\"#fs-idm99631856\">(Figure)<\/a>. Both have normal distributions. The means for the two populations are thought to be the same. Is there a difference in the means? Test at the 5% level of significance.<\/p>\n<table id=\"fs-idm99631856\" summary=\"\">\n<thead>\n<tr>\n<th><\/th>\n<th>Sample Size<\/th>\n<th>Sample Mean<\/th>\n<th>Sample Standard Deviation<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Population A<\/td>\n<td>25<\/td>\n<td>5<\/td>\n<td>1<\/td>\n<\/tr>\n<tr>\n<td>Population B<\/td>\n<td>16<\/td>\n<td>4.7<\/td>\n<td>1.2<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm50414208\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">NOTE<\/div>\n<p id=\"fs-idm204425680\">When the sum of the sample sizes is larger than 30 (<em data-effect=\"italics\">n<\/em><sub>1<\/sub> + <em data-effect=\"italics\">n<\/em><sub>2<\/sub> &gt; 30) you can use the normal distribution to approximate the Student&#8217;s <em data-effect=\"italics\">t<\/em>.<\/p>\n<\/div>\n<div class=\"textbox textbox--examples\" data-type=\"example\">\n<p id=\"element-980\">A study is done by a community group in two neighboring colleges to determine which one graduates students with more math classes. College A samples 11 graduates. Their average is four math classes with a standard deviation of 1.5 math classes. College B samples nine graduates. Their average is 3.5 math classes with a standard deviation of one math class. The community group believes that a student who graduates from college A <strong>has taken more math classes,<\/strong> on the average. Both populations have a normal distribution. Test at a 1% significance level. Answer the following questions.<\/p>\n<p>&nbsp;<\/p>\n<div id=\"fs-idm178911728\" data-type=\"exercise\">\n<div id=\"fs-idm177203296\" data-type=\"problem\">\n<p id=\"fs-idm171395200\">a. Is this a test of two means or two proportions?<\/p>\n<\/div>\n<div id=\"fs-idm36568576\" data-type=\"solution\">\n<p id=\"fs-idm78962816\">a. two means<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm194746624\" data-type=\"exercise\">\n<div id=\"fs-idm227630576\" data-type=\"problem\">\n<p id=\"fs-idm227630448\">b. Are the populations standard deviations known or unknown?<\/p>\n<\/div>\n<div id=\"fs-idm217237904\" data-type=\"solution\">\n<p id=\"fs-idm217237776\">b. unknown<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm164304912\" data-type=\"exercise\">\n<div id=\"fs-idm35318288\" data-type=\"problem\">\n<p id=\"fs-idm220657088\">c. Which distribution do you use to perform the test?<\/p>\n<\/div>\n<div id=\"fs-idm153443152\" data-type=\"solution\">\n<p id=\"fs-idm145309056\">c. Student&#8217;s <em data-effect=\"italics\">t<\/em><\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm130995312\" data-type=\"exercise\">\n<div id=\"fs-idm143520064\" data-type=\"problem\">\n<p id=\"fs-idm143519936\">d. What is the random variable?<\/p>\n<\/div>\n<div id=\"fs-idm222952320\" data-type=\"solution\">\n<p id=\"fs-idm222952192\">d. \\({\\overline{X}}_{A}-{\\overline{X}}_{B}\\)<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm301029872\" data-type=\"exercise\">\n<div id=\"fs-idm176701568\" data-type=\"problem\">\n<p id=\"fs-idm151272736\">e. What are the null and alternate hypotheses? Write the null and alternate hypotheses in words and in symbols.<\/p>\n<\/div>\n<div id=\"fs-idm147502464\" data-type=\"solution\">\n<p id=\"fs-idm206451408\">e.<\/p>\n<ul id=\"fs-idp21552016\">\n<li>\\({H}_{o}:{\\mu }_{A}\\le {\\mu }_{B}\\)<\/li>\n<li>\\({H}_{a}:{\\mu }_{A}&gt;{\\mu }_{B}\\)<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm39370224\" data-type=\"exercise\">\n<div id=\"fs-idm154517184\" data-type=\"problem\">\n<p id=\"fs-idm154517056\">f. Is this test right-, left-, or two-tailed?<\/p>\n<\/div>\n<div id=\"fs-idm168558752\" data-type=\"solution\">\n<p id=\"fs-idm168558624\">f.<\/p>\n<div id=\"eip-idm29208032\" class=\"bc-figure figure\"><span id=\"eip-idm4786768\" data-type=\"media\" data-alt=\"This is a normal distribution curve with mean equal to 0. A vertical line near the tail of the curve to the right of zero extends from the axis to the curve. The region under the curve to the right of the line is shaded.\"><img decoding=\"async\" src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/CNX_Stats_C10_M02_001anno-1.jpg\" alt=\"This is a normal distribution curve with mean equal to 0. A vertical line near the tail of the curve to the right of zero extends from the axis to the curve. The region under the curve to the right of the line is shaded.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\n<p><span data-type=\"newline\"><br \/>\n<\/span>right<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm146752960\" data-type=\"exercise\">\n<div id=\"fs-idm148661808\" data-type=\"problem\">\n<p id=\"fs-idm148661680\">g. What is the <em data-effect=\"italics\">p<\/em>-value?<\/p>\n<\/div>\n<div id=\"fs-idm79675952\" data-type=\"solution\">\n<p id=\"fs-idm106536048\">g. 0.1928<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm176691440\" data-type=\"exercise\">\n<div id=\"fs-idp29101808\" data-type=\"problem\">\n<p id=\"fs-idp29101936\">h. Do you reject or not reject the null hypothesis?<\/p>\n<\/div>\n<div id=\"fs-idm36842944\" data-type=\"solution\">\n<p id=\"fs-idm36842816\">h. Do not reject.<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm65144592\" data-type=\"exercise\">\n<div id=\"fs-idm153855872\" data-type=\"problem\">\n<p id=\"fs-idm41056320\">i. <strong>Conclusion:<\/strong><\/p>\n<\/div>\n<div id=\"fs-idm31976880\" data-type=\"solution\">\n<p id=\"fs-idm31976752\">i. At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student who graduates from college A has taken more math classes, on the average, than a student who graduates from college B.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp145514592\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<div id=\"eip-idp166826064\" data-type=\"exercise\">\n<div id=\"eip-idp166826320\" data-type=\"problem\">\n<p id=\"fs-idm170467472\">A study is done to determine if Company A retains its workers longer than Company B. Company A samples 15 workers, and their average time with the company is five years with a standard deviation of 1.2. Company B samples 20 workers, and their average time with the company is 4.5 years with a standard deviation of 0.8. The populations are normally distributed.<\/p>\n<ol id=\"fs-idp55515664\" type=\"a\">\n<li>Are the population standard deviations known?<\/li>\n<li>Conduct an appropriate hypothesis test. At the 5% significance level, what is your conclusion?<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm22935408\" class=\"textbox textbox--examples\" data-type=\"example\">\n<p id=\"fs-idm10086800\">A professor at a large community college wanted to determine whether there is a difference in the means of final exam scores between students who took his statistics course online and the students who took his face-to-face statistics class. He believed that the mean of the final exam scores for the online class would be lower than that of the face-to-face class. Was the professor correct? The randomly selected 30 final exam scores from each group are listed in <a class=\"autogenerated-content\" href=\"#fs-idm34056944\">(Figure)<\/a> and <a class=\"autogenerated-content\" href=\"#fs-idm124378288\">(Figure)<\/a>.<\/p>\n<table id=\"fs-idm34056944\" summary=\"\">\n<caption><span data-type=\"title\">Online Class<\/span><\/caption>\n<tbody>\n<tr>\n<td>67.6<\/td>\n<td>41.2<\/td>\n<td>85.3<\/td>\n<td>55.9<\/td>\n<td>82.4<\/td>\n<td>91.2<\/td>\n<td>73.5<\/td>\n<td>94.1<\/td>\n<td>64.7<\/td>\n<td>64.7<\/td>\n<\/tr>\n<tr>\n<td>70.6<\/td>\n<td>38.2<\/td>\n<td>61.8<\/td>\n<td>88.2<\/td>\n<td>70.6<\/td>\n<td>58.8<\/td>\n<td>91.2<\/td>\n<td>73.5<\/td>\n<td>82.4<\/td>\n<td>35.5<\/td>\n<\/tr>\n<tr>\n<td>94.1<\/td>\n<td>88.2<\/td>\n<td>64.7<\/td>\n<td>55.9<\/td>\n<td>88.2<\/td>\n<td>97.1<\/td>\n<td>85.3<\/td>\n<td>61.8<\/td>\n<td>79.4<\/td>\n<td>79.4<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table id=\"fs-idm124378288\" summary=\"\">\n<caption><span data-type=\"title\">Face-to-face Class<\/span><\/caption>\n<tbody>\n<tr>\n<td>77.9<\/td>\n<td>95.3<\/td>\n<td>81.2<\/td>\n<td>74.1<\/td>\n<td>98.8<\/td>\n<td>88.2<\/td>\n<td>85.9<\/td>\n<td>92.9<\/td>\n<td>87.1<\/td>\n<td>88.2<\/td>\n<\/tr>\n<tr>\n<td>69.4<\/td>\n<td>57.6<\/td>\n<td>69.4<\/td>\n<td>67.1<\/td>\n<td>97.6<\/td>\n<td>85.9<\/td>\n<td>88.2<\/td>\n<td>91.8<\/td>\n<td>78.8<\/td>\n<td>71.8<\/td>\n<\/tr>\n<tr>\n<td>98.8<\/td>\n<td>61.2<\/td>\n<td>92.9<\/td>\n<td>90.6<\/td>\n<td>97.6<\/td>\n<td>100<\/td>\n<td>95.3<\/td>\n<td>83.5<\/td>\n<td>92.9<\/td>\n<td>89.4<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div id=\"fs-idm298748224\" data-type=\"exercise\">\n<div id=\"fs-idm181409216\" data-type=\"problem\">\n<p id=\"fs-idm28696464\">Is the mean of the Final Exam scores of the online class lower than the mean of the Final Exam scores of the face-to-face class? Test at a 5% significance level. Answer the following questions:<\/p>\n<ol id=\"fs-idp180387984\" type=\"a\">\n<li>Is this a test of two means or two proportions?<\/li>\n<li>Are the population standard deviations known or unknown?<\/li>\n<li>Which distribution do you use to perform the test?<\/li>\n<li>What is the random variable?<\/li>\n<li>What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in symbols.<\/li>\n<li>Is this test right, left, or two tailed?<\/li>\n<li>What is the <em data-effect=\"italics\">p<\/em>-value?<\/li>\n<li>Do you reject or not reject the null hypothesis?<\/li>\n<li>At the ___ level of significance, from the sample data, there ______ (is\/is not) sufficient evidence to conclude that ______.<\/li>\n<\/ol>\n<p id=\"fs-idp169691824\">(See the conclusion in <a class=\"autogenerated-content\" href=\"#element-968\">(Figure)<\/a>, and write yours in a similar fashion)<\/p>\n<div id=\"fs-idm11616112\" class=\"statistics calculator\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<p id=\"fs-idm206958928\">First put the data for each group into two lists (such as L1 and L2). Press STAT. Arrow over to TESTS and press 4:2SampTTest. Make sure Data is highlighted and press ENTER. Arrow down and enter L1 for the first list and L2 for the second list. Arrow down to <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub>: and arrow to \u2260 <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> (does not equal). Press ENTER. Arrow down to Pooled: No. Press ENTER. Arrow down to Calculate and press ENTER.<\/p>\n<\/div>\n<div id=\"fs-idm107478368\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Note<\/div>\n<p id=\"fs-idm218153776\">Be careful not to mix up the information for Group 1 and Group 2!<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm168184384\" data-type=\"solution\">\n<ol id=\"fs-idm11713040\" type=\"a\">\n<li>two means<\/li>\n<li>unknown<\/li>\n<li>Student&#8217;s <em data-effect=\"italics\">t<\/em><\/li>\n<li>\\({\\overline{X}}_{1}\u2013{\\overline{X}}_{2}\\)<\/li>\n<li>\n<ol id=\"fs-idm109578096\">\n<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em> Null hypothesis: the means of the final exam scores are equal for the online and face-to-face statistics classes.<\/li>\n<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> &lt; <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em> Alternative hypothesis: the mean of the final exam scores of the online class is less than the mean of the final exam scores of the face-to-face class.<\/li>\n<\/ol>\n<\/li>\n<li>left-tailed<\/li>\n<li><em data-effect=\"italics\">p<\/em>-value = 0.0011\n<div id=\"fs-idm98197120\" class=\"bc-figure figure\"><span id=\"fs-idp598000\" data-type=\"media\" data-alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0011.\"><img decoding=\"async\" src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/CNX_Stats_C10_M02_002annoN-1.jpg\" alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0011.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\n<\/li>\n<li>Reject the null hypothesis<\/li>\n<li>The professor was correct. The evidence shows that the mean of the final exam scores for the online class is lower than that of the face-to-face class. <span data-type=\"newline\"><br \/>\n<\/span>At the <u data-effect=\"underline\">5%<\/u> level of significance, from the sample data, there is (is\/is not) sufficient evidence to conclude that the mean of the final exam scores for the online class is less than <u data-effect=\"underline\">the mean of final exam scores of the face-to-face class.<\/u><\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<p><span data-type=\"title\">Cohen&#8217;s Standards for Small, Medium, and Large Effect Sizes<\/span><span data-type=\"term\">Cohen&#8217;s <em data-effect=\"italics\">d<\/em><\/span> is a measure of effect size based on the differences between two means. Cohen\u2019s <em data-effect=\"italics\">d<\/em>, named for United States statistician Jacob Cohen, measures the relative strength of the differences between the means of two populations based on sample data. The calculated value of effect size is then compared to Cohen\u2019s standards of small, medium, and large effect sizes.<\/p>\n<table id=\"fs-idp93235584\" summary=\"\">\n<caption><span data-type=\"title\">Cohen&#8217;s Standard Effect Sizes<\/span><\/caption>\n<thead>\n<tr>\n<th>Size of effect<\/th>\n<th><em data-effect=\"italics\">d<\/em><\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Small<\/td>\n<td>0.2<\/td>\n<\/tr>\n<tr>\n<td>medium<\/td>\n<td>0.5<\/td>\n<\/tr>\n<tr>\n<td>Large<\/td>\n<td>0.8<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p id=\"fs-idm52480352\">Cohen&#8217;s <em data-effect=\"italics\">d<\/em> is the measure of the difference between two means divided by the pooled standard deviation: \\(d=\\frac{{\\overline{x}}_{1}\u2013{\\overline{x}}_{2}}{{s}_{pooled}}\\) where \\({s}_{pooled}=\\sqrt{\\frac{\\left({n}_{1}\u20131\\right){s}_{1}^{2}+\\left({n}_{2}\u20131\\right){s}_{2}^{2}}{{n}_{1}+{n}_{2}\u20132}}\\)<\/p>\n<div id=\"fs-idm100194752\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"fs-idm122689856\" data-type=\"exercise\">\n<div id=\"fs-idm122689600\" data-type=\"problem\">\n<p id=\"fs-idm1441280\">Calculate Cohen\u2019s <em data-effect=\"italics\">d<\/em> for <a class=\"autogenerated-content\" href=\"#element-968\">(Figure)<\/a>. Is the size of the effect small, medium, or large? Explain what the size of the effect means for this problem.<\/p>\n<\/div>\n<div id=\"fs-idm42685344\" data-type=\"solution\">\n<p id=\"fs-idp179952160\"><em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> = 4 <em data-effect=\"italics\">s<\/em><sub>1<\/sub> = 1.5 <em data-effect=\"italics\">n<\/em><sub>1<\/sub> = 11 <span data-type=\"newline\"><br \/>\n<\/span><em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> = 3.5 <em data-effect=\"italics\">s<\/em><sub>2<\/sub> = 1 <em data-effect=\"italics\">n<\/em><sub>2<\/sub> = 9 <span data-type=\"newline\"><br \/>\n<\/span><em data-effect=\"italics\">d<\/em> = 0.384 <span data-type=\"newline\"><br \/>\n<\/span>The effect is small because 0.384 is between Cohen\u2019s value of 0.2 for small effect size and 0.5 for medium effect size. The size of the differences of the means for the two colleges is small indicating that there is not a significant difference between them.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm112602048\" class=\"textbox textbox--examples\" data-type=\"example\">\n<div id=\"fs-idm163588752\" data-type=\"exercise\">\n<div id=\"fs-idm118829104\" data-type=\"problem\">\n<p id=\"fs-idm174261728\">Calculate Cohen\u2019s <em data-effect=\"italics\">d<\/em> for <a class=\"autogenerated-content\" href=\"#fs-idm22935408\">(Figure)<\/a>. Is the size of the effect small, medium or large? Explain what the size of the effect means for this problem.<\/p>\n<\/div>\n<div id=\"fs-idm177693104\" data-type=\"solution\">\n<p id=\"fs-idm86592608\"><em data-effect=\"italics\">d<\/em> = 0.834; Large, because 0.834 is greater than Cohen\u2019s 0.8 for a large effect size. The size of the differences between the means of the Final Exam scores of online students and students in a face-to-face class is large indicating a significant difference.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idp145071264\" class=\"statistics try\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">Try It<\/div>\n<p id=\"fs-idm62120944\">Weighted alpha is a measure of risk-adjusted performance of stocks over a period of a year. A high positive weighted alpha signifies a stock whose price has risen while a small positive weighted alpha indicates an unchanged stock price during the time period. Weighted alpha is used to identify companies with strong upward or downward trends. The weighted alpha for the top 30 stocks of banks in the northeast and in the west as identified by Nasdaq on May 24, 2013 are listed in <a class=\"autogenerated-content\" href=\"#fs-idm242901296\">(Figure)<\/a> and <a class=\"autogenerated-content\" href=\"#fs-idm167801152\">(Figure)<\/a>, respectively.<\/p>\n<table id=\"fs-idm242901296\" summary=\"\">\n<caption><span data-type=\"title\">Northeast<\/span><\/caption>\n<tbody>\n<tr>\n<td>94.2<\/td>\n<td>75.2<\/td>\n<td>69.6<\/td>\n<td>52.0<\/td>\n<td>48.0<\/td>\n<td>41.9<\/td>\n<td>36.4<\/td>\n<td>33.4<\/td>\n<td>31.5<\/td>\n<td>27.6<\/td>\n<\/tr>\n<tr>\n<td>77.3<\/td>\n<td>71.9<\/td>\n<td>67.5<\/td>\n<td>50.6<\/td>\n<td>46.2<\/td>\n<td>38.4<\/td>\n<td>35.2<\/td>\n<td>33.0<\/td>\n<td>28.7<\/td>\n<td>26.5<\/td>\n<\/tr>\n<tr>\n<td>76.3<\/td>\n<td>71.7<\/td>\n<td>56.3<\/td>\n<td>48.7<\/td>\n<td>43.2<\/td>\n<td>37.6<\/td>\n<td>33.7<\/td>\n<td>31.8<\/td>\n<td>28.5<\/td>\n<td>26.0<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table id=\"fs-idm167801152\" summary=\"\">\n<caption><span data-type=\"title\">West<\/span><\/caption>\n<tbody>\n<tr>\n<td>126.0<\/td>\n<td>70.6<\/td>\n<td>65.2<\/td>\n<td>51.4<\/td>\n<td>45.5<\/td>\n<td>37.0<\/td>\n<td>33.0<\/td>\n<td>29.6<\/td>\n<td>23.7<\/td>\n<td>22.6<\/td>\n<\/tr>\n<tr>\n<td>116.1<\/td>\n<td>70.6<\/td>\n<td>58.2<\/td>\n<td>51.2<\/td>\n<td>43.2<\/td>\n<td>36.0<\/td>\n<td>31.4<\/td>\n<td>28.7<\/td>\n<td>23.5<\/td>\n<td>21.6<\/td>\n<\/tr>\n<tr>\n<td>78.2<\/td>\n<td>68.2<\/td>\n<td>55.6<\/td>\n<td>50.3<\/td>\n<td>39.0<\/td>\n<td>34.1<\/td>\n<td>31.0<\/td>\n<td>25.3<\/td>\n<td>23.4<\/td>\n<td>21.5<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div id=\"fs-idm120507648\" data-type=\"exercise\">\n<div id=\"eip-idp51089200\" data-type=\"problem\">\n<p id=\"fs-idm100910544\">Is there a difference in the weighted alpha of the top 30 stocks of banks in the northeast and in the west? Test at a 5% significance level. Answer the following questions:<\/p>\n<ol id=\"fs-idp33249296\" type=\"a\">\n<li>Is this a test of two means or two proportions?<\/li>\n<li>Are the population standard deviations known or unknown?<\/li>\n<li>Which distribution do you use to perform the test?<\/li>\n<li>What is the random variable?<\/li>\n<li>What are the null and alternative hypotheses? Write the null and alternative hypotheses in words and in symbols.<\/li>\n<li>Is this test right, left, or two tailed?<\/li>\n<li>What is the <em data-effect=\"italics\">p<\/em>-value?<\/li>\n<li>Do you reject or not reject the null hypothesis?<\/li>\n<li>At the ___ level of significance, from the sample data, there ______ (is\/is not) sufficient evidence to conclude that ______.<\/li>\n<li>Calculate Cohen\u2019s <em data-effect=\"italics\">d<\/em> and interpret it.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"footnotes\" data-depth=\"1\">\n<h3 data-type=\"title\">References<\/h3>\n<p>Data from Graduating Engineer + Computer Careers. Available online at http:\/\/www.graduatingengineer.com<\/p>\n<p id=\"fs-idp6735712\">Data from <em data-effect=\"italics\">Microsoft Bookshelf<\/em>.<\/p>\n<p id=\"fs-idm54110288\">Data from the United States Senate website, available online at www.Senate.gov (accessed June 17, 2013).<\/p>\n<p id=\"fs-idp26924704\">\u201cList of current United States Senators by Age.\u201d Wikipedia. Available online at http:\/\/en.wikipedia.org\/wiki\/List_of_current_United_States_Senators_by_age (accessed June 17, 2013).<\/p>\n<p id=\"fs-idp67778720\">\u201cSectoring by Industry Groups.\u201d Nasdaq. Available online at http:\/\/www.nasdaq.com\/markets\/barchart-sectors.aspx?page=sectors&amp;base=industry (accessed June 17, 2013).<\/p>\n<p id=\"fs-idp92967520\">\u201cStrip Clubs: Where Prostitution and Trafficking Happen.\u201d Prostitution Research and Education, 2013. Available online at www.prostitutionresearch.com\/ProsViolPosttrauStress.html (accessed June 17, 2013).<\/p>\n<p id=\"fs-idp130720928\">\u201cWorld Series History.\u201d Baseball-Almanac, 2013. Available online at http:\/\/www.baseball-almanac.com\/ws\/wsmenu.shtml (accessed June 17, 2013).<\/p>\n<\/div>\n<div id=\"fs-idm178063424\" class=\"summary\" data-depth=\"1\">\n<h3 data-type=\"title\">Chapter Review<\/h3>\n<p id=\"fs-idm116979232\">Two population means from independent samples where the population standard deviations are not known<\/p>\n<ul id=\"fs-idm111534432\">\n<li>Random Variable: \\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\) = the difference of the sampling means<\/li>\n<li>Distribution: Student&#8217;s <em data-effect=\"italics\">t<\/em>-distribution with degrees of freedom (variances not pooled)<\/li>\n<\/ul>\n<\/div>\n<div id=\"fs-idm147993376\" class=\"formula-review\" data-depth=\"1\">\n<h3 data-type=\"title\">Formula Review<\/h3>\n<p id=\"fs-idm5111056\">Standard error: <em data-effect=\"italics\">SE<\/em> = \\(\\sqrt{\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}}\\)<\/p>\n<p id=\"fs-idm105811424\">Test statistic (<em data-effect=\"italics\">t<\/em>-score): <em data-effect=\"italics\">t<\/em> = \\(\\frac{\\left({\\overline{x}}_{1}-{\\overline{x}}_{2}\\right)-\\left({\\mu }_{1}-{\\mu }_{2}\\right)}{\\sqrt{\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}}}\\)<\/p>\n<p id=\"fs-idm96358640\">Degrees of freedom:<span data-type=\"newline\"><br \/>\n<\/span> \\(df=\u00a0\\frac{{\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}+\u00a0\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}{\\left(\\frac{1}{{n}_{1}-1}\\right){\\left(\\frac{{\\left({s}_{1}\\right)}^{2}}{{n}_{1}}\\right)}^{2}+\\left(\\frac{1}{{n}_{2}-1}\\right){\\left(\\frac{{\\left({s}_{2}\\right)}^{2}}{{n}_{2}}\\right)}^{2}}\\)<\/p>\n<p id=\"fs-idm98284352\">where:<\/p>\n<p id=\"fs-idm92032864\"><em data-effect=\"italics\">s<\/em><sub>1<\/sub> and <em data-effect=\"italics\">s<\/em><sub>2<\/sub> are the sample standard deviations, and <em data-effect=\"italics\">n<\/em><sub>1<\/sub> and <em data-effect=\"italics\">n<\/em><sub>2<\/sub> are the sample sizes.<\/p>\n<p id=\"fs-idm32210368\">\\({\\overline{x}}_{1}\\) and \\({\\overline{x}}_{2}\\) are the sample means.<\/p>\n<p id=\"fs-idm117522208\">Cohen\u2019s <em data-effect=\"italics\">d<\/em> is the measure of effect size:<\/p>\n<p id=\"fs-idm117521824\">\\(d=\\frac{{\\overline{x}}_{1}-{\\overline{x}}_{2}}{{s}_{pooled}}\\)<span data-type=\"newline\"><br \/>\n<\/span>where \\({s}_{pooled}=\\sqrt{\\frac{\\left({n}_{1}-1\\right){s}_{1}^{2}+\\left({n}_{2}-1\\right){s}_{2}^{2}}{{n}_{1}+{n}_{2}-2}}\\)<\/p>\n<\/div>\n<div id=\"fs-idm9464368\" class=\"practice\" data-depth=\"1\">\n<p id=\"fs-idm124875168\"><em data-effect=\"italics\">Use the following information to answer the next 15 exercises:<\/em> Indicate if the hypothesis test is for<\/p>\n<ol id=\"fs-idm45665008\" type=\"a\">\n<li>independent group means, population standard deviations, and\/or variances known<\/li>\n<li>independent group means, population standard deviations, and\/or variances unknown<\/li>\n<li>matched or paired samples<\/li>\n<li>single mean<\/li>\n<li>two proportions<\/li>\n<li>single proportion<\/li>\n<\/ol>\n<div id=\"fs-idm22935904\" data-type=\"exercise\">\n<div id=\"fs-idm124875424\" data-type=\"problem\">\n<p id=\"fs-idm84013008\">It is believed that 70% of males pass their drivers test in the first attempt, while 65% of females pass the test in the first attempt. Of interest is whether the proportions are in fact equal.<\/p>\n<\/div>\n<div id=\"fs-idm235455488\" data-type=\"solution\">\n<p id=\"fs-idm173588752\">two proportions<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm146907648\" data-type=\"exercise\">\n<div id=\"fs-idm146907392\" data-type=\"problem\">\n<p id=\"fs-idm146907136\">A new laundry detergent is tested on consumers. Of interest is the proportion of consumers who prefer the new brand over the leading competitor. A study is done to test this.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm161821520\" data-type=\"exercise\">\n<div id=\"fs-idm161821264\" data-type=\"problem\">\n<p id=\"fs-idm164006592\">A new windshield treatment claims to repel water more effectively. Ten windshields are tested by simulating rain without the new treatment. The same windshields are then treated, and the experiment is run again. A hypothesis test is conducted.<\/p>\n<\/div>\n<div id=\"fs-idm127047328\" data-type=\"solution\">\n<p id=\"fs-idm127047072\">matched or paired samples<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm120536064\" data-type=\"exercise\">\n<div id=\"fs-idm68027168\" data-type=\"problem\">\n<p id=\"fs-idm68026912\">The known standard deviation in salary for all mid-level professionals in the financial industry is \\$11,000. Company A and Company B are in the financial industry. Suppose samples are taken of mid-level professionals from Company A and from Company B. The sample mean salary for mid-level professionals in Company A is \\$80,000. The sample mean salary for mid-level professionals in Company B is \\$96,000. Company A and Company B management want to know if their mid-level professionals are paid differently, on average.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idp21825264\" data-type=\"exercise\">\n<div id=\"fs-idp21825520\" data-type=\"problem\">\n<p id=\"fs-idm175592768\">The average worker in Germany gets eight weeks of paid vacation.<\/p>\n<\/div>\n<div id=\"fs-idm81349024\" data-type=\"solution\">\n<p id=\"fs-idm81348768\">single mean<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm170345552\" data-type=\"exercise\">\n<div id=\"fs-idm170345296\" data-type=\"problem\">\n<p id=\"fs-idm39186912\">According to a television commercial, 80% of dentists agree that Ultrafresh toothpaste is the best on the market.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idp3576048\" data-type=\"exercise\">\n<div id=\"fs-idm27505984\" data-type=\"problem\">\n<p id=\"fs-idm27505728\">It is believed that the average grade on an English essay in a particular school system for females is higher than for males. A random sample of 31 females had a mean score of 82 with a standard deviation of three, and a random sample of 25 males had a mean score of 76 with a standard deviation of four.<\/p>\n<\/div>\n<div id=\"fs-idm27710704\" data-type=\"solution\">\n<p id=\"fs-idm54785888\">independent group means, population standard deviations and\/or variances unknown<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm32687376\" data-type=\"exercise\">\n<div id=\"fs-idm32687120\" data-type=\"problem\">\n<p id=\"fs-idm32686864\">The league mean batting average is 0.280 with a known standard deviation of 0.06. The Rattlers and the Vikings belong to the league. The mean batting average for a sample of eight Rattlers is 0.210, and the mean batting average for a sample of eight Vikings is 0.260. There are 24 players on the Rattlers and 19 players on the Vikings. Are the batting averages of the Rattlers and Vikings statistically different?<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm36046944\" data-type=\"exercise\">\n<div id=\"fs-idm36046688\" data-type=\"problem\">\n<p id=\"fs-idm3190160\">In a random sample of 100 forests in the United States, 56 were coniferous or contained conifers. In a random sample of 80 forests in Mexico, 40 were coniferous or contained conifers. Is the proportion of conifers in the United States statistically more than the proportion of conifers in Mexico?<\/p>\n<\/div>\n<div id=\"fs-idm7896320\" data-type=\"solution\">\n<p id=\"fs-idm7896064\">two proportions<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm84811344\" data-type=\"exercise\">\n<div id=\"fs-idm84811088\" data-type=\"problem\">\n<p id=\"fs-idm16418448\">A new medicine is said to help improve sleep. Eight subjects are picked at random and given the medicine. The means hours slept for each person were recorded before starting the medication and after.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm111997184\" data-type=\"exercise\">\n<div id=\"fs-idm111996928\" data-type=\"problem\">\n<p id=\"fs-idm35434864\">It is thought that teenagers sleep more than adults on average. A study is done to verify this. A sample of 16 teenagers has a mean of 8.9 hours slept and a standard deviation of 1.2. A sample of 12 adults has a mean of 6.9 hours slept and a standard deviation of 0.6.<\/p>\n<\/div>\n<div id=\"fs-idm5318624\" data-type=\"solution\">\n<p id=\"fs-idm5318368\">independent group means, population standard deviations and\/or variances unknown<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm83613168\" data-type=\"exercise\">\n<div id=\"fs-idm83612912\" data-type=\"problem\">\n<p id=\"fs-idm122472032\">Varsity athletes practice five times a week, on average.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm155074576\" data-type=\"exercise\">\n<div id=\"fs-idm187988384\" data-type=\"problem\">\n<p id=\"fs-idm187988128\">A sample of 12 in-state graduate school programs at school A has a mean tuition of \\$64,000 with a standard deviation of \\$8,000. At school B, a sample of 16 in-state graduate programs has a mean of \\$80,000 with a standard deviation of \\$6,000. On average, are the mean tuitions different?<\/p>\n<\/div>\n<div id=\"fs-idm63313632\" data-type=\"solution\">\n<p id=\"fs-idm86332592\">independent group means, population standard deviations and\/or variances unknown<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm86331952\" data-type=\"exercise\">\n<div id=\"fs-idm109695328\" data-type=\"problem\">\n<p id=\"fs-idm109695072\">A new WiFi range booster is being offered to consumers. A researcher tests the native range of 12 different routers under the same conditions. The ranges are recorded. Then the researcher uses the new WiFi range booster and records the new ranges. Does the new WiFi range booster do a better job?<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm152397632\" data-type=\"exercise\">\n<div id=\"fs-idm187246864\" data-type=\"problem\">\n<p id=\"fs-idm187246608\">A high school principal claims that 30% of student athletes drive themselves to school, while 4% of non-athletes drive themselves to school. In a sample of 20 student athletes, 45% drive themselves to school. In a sample of 35 non-athlete students, 6% drive themselves to school. Is the percent of student athletes who drive themselves to school more than the percent of nonathletes?<\/p>\n<\/div>\n<div id=\"fs-idm178089840\" data-type=\"solution\">\n<p id=\"fs-idm174648448\">two proportions<\/p>\n<\/div>\n<\/div>\n<p><span data-type=\"newline\"><br \/>\n<\/span><em data-effect=\"italics\">Use the following information to answer the next three exercises:<\/em> A study is done to determine which of two soft drinks has more sugar. There are 13 cans of Beverage A in a sample and six cans of Beverage B. The mean amount of sugar in Beverage A is 36 grams with a standard deviation of 0.6 grams. The mean amount of sugar in Beverage B is 38 grams with a standard deviation of 0.8 grams. The researchers believe that Beverage B has more sugar than Beverage A, on average. Both populations have normal distributions.<\/p>\n<div id=\"fs-idm174647808\" data-type=\"exercise\">\n<div id=\"fs-idm80739952\" data-type=\"problem\">\n<p id=\"fs-idm174134640\">Are standard deviations known or unknown?<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm213711120\" data-type=\"exercise\">\n<div id=\"fs-idm213710864\" data-type=\"problem\">\n<p id=\"fs-idm213710608\">What is the random variable?<\/p>\n<\/div>\n<div id=\"fs-idm10993824\" data-type=\"solution\">\n<p id=\"fs-idm10993568\">The random variable is the difference between the mean amounts of sugar in the two soft drinks.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm106857024\" data-type=\"exercise\">\n<div id=\"fs-idm38595520\" data-type=\"problem\">\n<p id=\"fs-idm38595264\">Is this a one-tailed or two-tailed test?<\/p>\n<\/div>\n<\/div>\n<p><em data-effect=\"italics\">Use the following information to answer the next 12 exercises:<\/em> The U.S. Center for Disease Control reports that the mean life expectancy was 47.6 years for whites born in 1900 and 33.0 years for nonwhites. Suppose that you randomly survey death records for people born in 1900 in a certain county. Of the 124 whites, the mean life span was 45.3 years with a standard deviation of 12.7 years. Of the 82 nonwhites, the mean life span was 34.1 years with a standard deviation of 15.6 years. Conduct a hypothesis test to see if the mean life spans in the county were the same for whites and nonwhites.<\/p>\n<div id=\"fs-idm77134224\" data-type=\"exercise\">\n<div id=\"fs-idm77133968\" data-type=\"problem\">\n<p id=\"fs-idm82811296\">Is this a test of means or proportions?<\/p>\n<\/div>\n<div id=\"fs-idm82810784\" data-type=\"solution\">\n<p id=\"fs-idm5951248\">means<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm87532368\" data-type=\"exercise\">\n<div id=\"fs-idm87532112\" data-type=\"problem\">\n<p id=\"fs-idm87531856\">State the null and alternative hypotheses.<\/p>\n<ol id=\"fs-idm126031792\" type=\"a\">\n<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: __________<\/li>\n<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: __________<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idp13932064\" data-type=\"exercise\">\n<div id=\"fs-idm162071840\" data-type=\"problem\">\n<p id=\"fs-idm162071584\">Is this a right-tailed, left-tailed, or two-tailed test?<\/p>\n<\/div>\n<div id=\"fs-idm104897488\" data-type=\"solution\">\n<p id=\"fs-idm104897232\">two-tailed<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm104977072\" data-type=\"exercise\">\n<div id=\"fs-idm104976816\" data-type=\"problem\">\n<p id=\"fs-idm161761248\">In symbols, what is the random variable of interest for this test?<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm147822240\" data-type=\"exercise\">\n<div id=\"fs-idm147821984\" data-type=\"problem\">\n<p id=\"fs-idm147821728\">In words, define the random variable of interest for this test.<\/p>\n<\/div>\n<div id=\"fs-idm2327840\" data-type=\"solution\">\n<p id=\"fs-idm2327584\">the difference between the mean life spans of whites and nonwhites<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm17957104\" data-type=\"exercise\">\n<div id=\"fs-idm17956848\" data-type=\"problem\">\n<p id=\"fs-idm100280816\">Which distribution (normal or Student&#8217;s <em data-effect=\"italics\">t<\/em>) would you use for this hypothesis test?<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm121049024\" data-type=\"exercise\">\n<div id=\"fs-idm121048768\" data-type=\"problem\">\n<p id=\"fs-idm121048512\">Explain why you chose the distribution you did for <a class=\"autogenerated-content\" href=\"#fs-idm17957104\">(Figure)<\/a>.<\/p>\n<\/div>\n<div id=\"eip-idp105137344\" data-type=\"solution\">\n<p id=\"eip-idp105137600\">This is a comparison of two population means with unknown population standard deviations.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm96874480\" data-type=\"exercise\">\n<div id=\"fs-idm96874224\" data-type=\"problem\">\n<p id=\"fs-idm263989856\">Calculate the test statistic and <em data-effect=\"italics\">p<\/em>-value.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm120924688\" data-type=\"exercise\">\n<div id=\"fs-idm120924432\" data-type=\"problem\">\n<p id=\"fs-idm120924176\">Sketch a graph of the situation. Label the horizontal axis. Mark the hypothesized difference and the sample difference. Shade the area corresponding to the <em data-effect=\"italics\">p<\/em>-value.<\/p>\n<\/div>\n<div id=\"fs-idp104610560\" data-type=\"solution\">\n<p id=\"fs-idp104610816\">Check student\u2019s solution.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm47382320\" data-type=\"exercise\">\n<div id=\"fs-idm47382064\" data-type=\"problem\">\n<p id=\"fs-idm174192304\">Find the <em data-effect=\"italics\">p<\/em>-value.<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm129306608\" data-type=\"exercise\">\n<div id=\"fs-idm129306352\" data-type=\"problem\">\n<p id=\"fs-idm148755040\">At a pre-conceived <em data-effect=\"italics\">\u03b1<\/em> = 0.05, what is your:<\/p>\n<ol id=\"fs-idp598432\" type=\"a\">\n<li>Decision:<\/li>\n<li>Reason for the decision:<\/li>\n<li>Conclusion (write out in a complete sentence):<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idm110404384\" data-type=\"solution\">\n<ol id=\"fs-idp229240368\" type=\"a\">\n<li>Reject the null hypothesis<\/li>\n<li><em data-effect=\"italics\">p<\/em>-value &lt; 0.05<\/li>\n<li>There is not enough evidence at the 5% level of significance to support the claim that life expectancy in the 1900s is different between whites and nonwhites.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idm127412752\" data-type=\"exercise\">\n<div id=\"fs-idm126248464\" data-type=\"problem\">\n<p id=\"fs-idm126248208\">Does it appear that the means are the same? Why or why not?<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"fs-idm173509824\" class=\"free-response\" data-depth=\"1\">\n<h3 data-type=\"title\">Homework<\/h3>\n<p><em data-effect=\"italics\">DIRECTIONS: For each of the word problems, use a solution sheet to do the hypothesis test. The solution sheet is found in <a href=\"\/contents\/c0449c55-aa47-4f1c-bd5f-0521652f0e82\">Appendix E<\/a>. Please feel free to make copies of the solution sheets. For the online version of the book, it is suggested that you copy the .doc or the .pdf files.<\/em><\/p>\n<div id=\"eip-792\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<div data-type=\"title\">NOTE<\/div>\n<p id=\"eip-idp140444868962512\">If you are using a Student&#8217;s <em data-effect=\"italics\">t<\/em>-distribution for a homework problem in what follows, including for paired data, you may assume that the underlying population is normally distributed. (When using these tests in a real situation, you must first prove that assumption, however.)<\/p>\n<\/div>\n<div id=\"fs-idm164128\" data-type=\"exercise\">\n<div id=\"fs-idm163872\" data-type=\"problem\">\n<p id=\"fs-idm163616\">\n<\/div>\n<\/div>\n<div id=\"fs-idm8894688\" data-type=\"exercise\">\n<div id=\"fs-idm8894432\" data-type=\"problem\">\n<p id=\"fs-idm185269312\">1)A student at a four-year college claims that mean enrollment at four\u2013year colleges is higher than at two\u2013year colleges in the United States. Two surveys are conducted. Of the 35 two\u2013year colleges surveyed, the mean enrollment was 5,068 with a standard deviation of 4,777. Of the 35 four-year colleges surveyed, the mean enrollment was 5,466 with a standard deviation of 8,191.<\/p>\n<\/div>\n<div id=\"fs-idp138385488\" data-type=\"solution\">\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm39083984\" data-type=\"exercise\">\n<div id=\"fs-idm39083728\" data-type=\"problem\">\n<p id=\"fs-idm53880736\">2) At Rachel\u2019s 11<sup>th<\/sup> birthday party, eight girls were timed to see how long (in seconds) they could hold their breath in a relaxed position. After a two-minute rest, they timed themselves while jumping. The girls thought that the mean difference between their jumping and relaxed times would be zero. Test their hypothesis.<\/p>\n<table id=\"fs-idm53880480\" summary=\"The table presents the first column as the relaxed time in seconds and the second column as the jumping time in seconds.\">\n<thead>\n<tr>\n<th>Relaxed time (seconds)<\/th>\n<th>Jumping time (seconds)<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>26<\/td>\n<td>21<\/td>\n<\/tr>\n<tr>\n<td>47<\/td>\n<td>40<\/td>\n<\/tr>\n<tr>\n<td>30<\/td>\n<td>28<\/td>\n<\/tr>\n<tr>\n<td>22<\/td>\n<td>21<\/td>\n<\/tr>\n<tr>\n<td>23<\/td>\n<td>25<\/td>\n<\/tr>\n<tr>\n<td>45<\/td>\n<td>43<\/td>\n<\/tr>\n<tr>\n<td>37<\/td>\n<td>35<\/td>\n<\/tr>\n<tr>\n<td>29<\/td>\n<td>32<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm122985232\" data-type=\"exercise\">\n<div id=\"fs-idm122984976\" data-type=\"problem\">\n<p id=\"fs-idm122984720\">3) Mean entry-level salaries for college graduates with mechanical engineering degrees and electrical engineering degrees are believed to be approximately the same. A recruiting office thinks that the mean mechanical engineering salary is actually lower than the mean electrical engineering salary. The recruiting office randomly surveys 50 entry level mechanical engineers and 60 entry level electrical engineers. Their mean salaries were \\$46,100 and \\$46,700, respectively. Their standard deviations were \\$3,450 and \\$4,210, respectively. Conduct a hypothesis test to determine if you agree that the mean entry-level mechanical engineering salary is lower than the mean entry-level electrical engineering salary.<\/p>\n<\/div>\n<div id=\"fs-idm188189904\" data-type=\"solution\">\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm62206688\" data-type=\"exercise\">\n<div id=\"fs-idm62206432\" data-type=\"problem\">\n<p id=\"fs-idm105723088\">4) Marketing companies have collected data implying that teenage girls use more ring tones on their cellular phones than teenage boys do. In one particular study of 40 randomly chosen teenage girls and boys (20 of each) with cellular phones, the mean number of ring tones for the girls was 3.2 with a standard deviation of 1.5. The mean for the boys was 1.7 with a standard deviation of 0.8. Conduct a hypothesis test to determine if the means are approximately the same or if the girls\u2019 mean is higher than the boys\u2019 mean.<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<p><em data-effect=\"italics\">Use the information from <a class=\"autogenerated-content\" href=\"\/contents\/3ef830bc-5247-460a-9007-e3fd762e5e93\">(Figure)<\/a> to answer the next four exercises.<\/em><\/p>\n<div id=\"fs-idm50796704\" data-type=\"exercise\">\n<div id=\"fs-idm50796448\" data-type=\"problem\">\n<p id=\"fs-idm98236496\">5) Using the data from Lap 1 only, conduct a hypothesis test to determine if the mean time for completing a lap in races is the same as it is in practices.<\/p>\n<\/div>\n<div id=\"fs-idm73210192\" data-type=\"solution\">\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm2195248\" data-type=\"exercise\">\n<div id=\"fs-idm2194992\" data-type=\"problem\">\n<p id=\"fs-idm241101904\">6) Repeat the test in <a href=\"#fs-idm50796704\">Exercise 10.83<\/a>, but use Lap 5 data this time.<\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm170930784\" data-type=\"exercise\">\n<div id=\"fs-idm54834800\" data-type=\"problem\">\n<p id=\"fs-idm54834544\">7) Repeat the test in <a href=\"#fs-idm50796704\">Exercise 10.83<\/a>, but this time combine the data from Laps 1 and 5.<\/p>\n<\/div>\n<div id=\"fs-idm39528240\" data-type=\"solution\"><\/div>\n<\/div>\n<div id=\"fs-idm80120272\" data-type=\"exercise\">\n<div id=\"fs-idm80120016\" data-type=\"problem\">\n<p id=\"fs-idm80119760\">\n<\/div>\n<\/div>\n<p><em data-effect=\"italics\">Use the following information to answer the next two exercises.<\/em> The Eastern and Western Major League Soccer conferences have a new Reserve Division that allows new players to develop their skills. Data for a randomly picked date showed the following annual goals.<\/p>\n<table summary=\"Eastern and Western Major League Soccer\">\n<thead>\n<tr>\n<th>Western<\/th>\n<th>Eastern<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Los Angeles 9<\/td>\n<td>D.C. United 9<\/td>\n<\/tr>\n<tr>\n<td>FC Dallas 3<\/td>\n<td>Chicago 8<\/td>\n<\/tr>\n<tr>\n<td>Chivas USA 4<\/td>\n<td>Columbus 7<\/td>\n<\/tr>\n<tr>\n<td>Real Salt Lake 3<\/td>\n<td>New England 6<\/td>\n<\/tr>\n<tr>\n<td>Colorado 4<\/td>\n<td>MetroStars 5<\/td>\n<\/tr>\n<tr>\n<td>San Jose 4<\/td>\n<td>Kansas City 3<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p id=\"eip-549\"><em data-effect=\"italics\">Conduct a hypothesis test to answer the next two exercises.<\/em><\/p>\n<p>&nbsp;<\/p>\n<div id=\"fs-idm67199056\" data-type=\"exercise\">\n<div id=\"fs-idm67198800\" data-type=\"problem\">\n<p id=\"fs-idm67198544\">8) The <strong>exact<\/strong> distribution for the hypothesis test is:<\/p>\n<ol id=\"fs-idm91569200\" type=\"a\">\n<li>the normal distribution<\/li>\n<li>the Student&#8217;s <em data-effect=\"italics\">t<\/em>-distribution<\/li>\n<li>the uniform distribution<\/li>\n<li>the exponential distribution<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<div id=\"fs-idm3207760\" data-type=\"exercise\">\n<div id=\"fs-idm104292672\" data-type=\"problem\">\n<p id=\"fs-idm104292416\">9) If the level of significance is 0.05, the conclusion is:<\/p>\n<ol id=\"fs-idm104292032\" type=\"a\">\n<li>There is sufficient evidence to conclude that the <strong>W<\/strong> Division teams score fewer goals, on average, than the <strong>E<\/strong> teams<\/li>\n<li>There is insufficient evidence to conclude that the <strong>W<\/strong> Division teams score more goals, on average, than the <strong>E<\/strong> teams.<\/li>\n<li>There is insufficient evidence to conclude that the <strong>W<\/strong> teams score fewer goals, on average, than the <strong>E<\/strong> teams score.<\/li>\n<li>Unable to determine<\/li>\n<\/ol>\n<\/div>\n<div id=\"fs-idm178991312\" data-type=\"solution\">\n<p id=\"fs-idm178991056\">\n<\/div>\n<\/div>\n<div id=\"fs-idm108118736\" data-type=\"exercise\">\n<div id=\"fs-idm122622576\" data-type=\"problem\">\n<p id=\"eip-idp273849808\">10) Suppose a statistics instructor believes that there is no significant difference between the mean class scores of statistics day students on Exam 2 and statistics night students on Exam 2. She takes random samples from each of the populations. The mean and standard deviation for 35 statistics day students were 75.86 and 16.91. The mean and standard deviation for 37 statistics night students were 75.41 and 19.73. The \u201cday\u201d subscript refers to the statistics day students. The \u201cnight\u201d subscript refers to the statistics night students. A concluding statement is:<\/p>\n<ol id=\"fs-idm122621936\" type=\"a\">\n<li>There is sufficient evidence to conclude that statistics night students&#8217; mean on Exam 2 is better than the statistics day students&#8217; mean on Exam 2.<\/li>\n<li>There is insufficient evidence to conclude that the statistics day students&#8217; mean on Exam 2 is better than the statistics night students&#8217; mean on Exam 2.<\/li>\n<li>There is insufficient evidence to conclude that there is a significant difference between the means of the statistics day students and night students on Exam 2.<\/li>\n<li>There is sufficient evidence to conclude that there is a significant difference between the means of the statistics day students and night students on Exam 2.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idm81478720\" data-type=\"exercise\">\n<div id=\"fs-idp12884432\" data-type=\"problem\">\n<p id=\"fs-idp12884688\">11) Researchers interviewed street prostitutes in Canada and the United States. The mean age of the 100 Canadian prostitutes upon entering prostitution was 18 with a standard deviation of six. The mean age of the 130 United States prostitutes upon entering prostitution was 20 with a standard deviation of eight. Is the mean age of entering prostitution in Canada lower than the mean age in the United States? Test at a 1% significance level.<\/p>\n<\/div>\n<div id=\"fs-idm189266736\" data-type=\"solution\">\n<p id=\"fs-idm125542800\">\n<\/div>\n<\/div>\n<div id=\"fs-idm65734896\" data-type=\"exercise\">\n<div id=\"fs-idm65734640\" data-type=\"problem\">\n<p id=\"fs-idp44866512\">12) A powder diet is tested on 49 people, and a liquid diet is tested on 36 different people. Of interest is whether the liquid diet yields a higher mean weight loss than the powder diet. The powder diet group had a mean weight loss of 42 pounds with a standard deviation of 12 pounds. The liquid diet group had a mean weight loss of 45 pounds with a standard deviation of 14 pounds.<\/p>\n<\/div>\n<\/div>\n<div data-type=\"exercise\">\n<div data-type=\"problem\">\n<p>13) Suppose a statistics instructor believes that there is no significant difference between the mean class scores of statistics day students on Exam 2 and statistics night students on Exam 2. She takes random samples from each of the populations. The mean and standard deviation for 35 statistics day students were 75.86 and 16.91, respectively. The mean and standard deviation for 37 statistics night students were 75.41 and 19.73. The \u201cday\u201d subscript refers to the statistics day students. The \u201cnight\u201d subscript refers to the statistics night students. An appropriate alternative hypothesis for the hypothesis test is:<\/p>\n<ol id=\"eip-idp47881856\" type=\"a\">\n<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> &gt; <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\n<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> &lt; <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\n<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> = <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\n<li><em data-effect=\"italics\">\u03bc<\/em><sub>day<\/sub> \u2260 <em data-effect=\"italics\">\u03bc<\/em><sub>night<\/sub><\/li>\n<\/ol>\n<\/div>\n<div id=\"eip-605\" data-type=\"solution\">\n<p>&nbsp;<\/p>\n<p>1)<\/p>\n<p id=\"fs-idm89316432\">Subscripts: 1: two-year colleges; 2: four-year colleges<\/p>\n<ol id=\"fs-idm89316048\" type=\"a\">\n<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> \u2265 <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em><\/li>\n<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<sub>1<\/sub><\/em> &lt; <em data-effect=\"italics\">\u03bc<sub>2<\/sub><\/em><\/li>\n<li>\\({\\overline{X}}_{1}\u2013{\\overline{X}}_{2}\\) is the difference between the mean enrollments of the two-year colleges and the four-year colleges.<\/li>\n<li>Student\u2019s-<em data-effect=\"italics\">t<\/em><\/li>\n<li>test statistic: -0.2480<\/li>\n<li><em data-effect=\"italics\">p<\/em>-value: 0.4019<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>\n<ol id=\"fs-idp113692848\" type=\"i\">\n<li>Alpha: 0.05<\/li>\n<li>Decision: Do not reject<\/li>\n<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &gt; alpha<\/li>\n<li>Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean enrollment at four-year colleges is higher than at two-year colleges.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<p>3)<\/p>\n<p id=\"fs-idp57225296\">Subscripts: 1: mechanical engineering; 2: electrical engineering<\/p>\n<ol id=\"fs-idp195702880\" type=\"a\">\n<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> \u2265 <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\n<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> &lt; <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\n<li>\\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\) is the difference between the mean entry level salaries of mechanical engineers and electrical engineers.<\/li>\n<li><em data-effect=\"italics\">t<\/em><sub>108<\/sub><\/li>\n<li>test statistic: <em data-effect=\"italics\">t<\/em> = \u20130.82<\/li>\n<li><em data-effect=\"italics\">p<\/em>-value: 0.2061<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>\n<ol id=\"fs-idp3469344\" type=\"i\">\n<li>Alpha: 0.05<\/li>\n<li>Decision: Do not reject the null hypothesis.<\/li>\n<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &gt; alpha<\/li>\n<li>Conclusion: At the 5% significance level, there is insufficient evidence to conclude that the mean entry-level salaries of mechanical engineers is lower than that of electrical engineers.&#8217;<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<p>5)<\/p>\n<ol id=\"fs-idp184583584\" type=\"a\">\n<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> = <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\n<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> \u2260 <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\n<li>\\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\) is the difference between the mean times for completing a lap in races and in practices.<\/li>\n<li><em data-effect=\"italics\">t<\/em><sub>20.32<\/sub><\/li>\n<li>test statistic: \u20134.70<\/li>\n<li><em data-effect=\"italics\">p<\/em>-value: 0.0001<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>\n<ol id=\"fs-idp173433584\" type=\"i\">\n<li>Alpha: 0.05<\/li>\n<li>Decision: Reject the null hypothesis.<\/li>\n<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &lt; alpha<\/li>\n<li>Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for completing a lap in races is different from that in practices.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<p>7)<\/p>\n<div data-type=\"exercise\">\n<div data-type=\"solution\">\n<ol id=\"fs-idp172770560\" type=\"a\">\n<li><em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> = <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\n<li><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u00b5<sub>1<\/sub><\/em> \u2260 <em data-effect=\"italics\">\u00b5<sub>2<\/sub><\/em><\/li>\n<li>is the difference between the mean times for completing a lap in races and in practices.<\/li>\n<li><em data-effect=\"italics\">t<\/em><sub>40.94<\/sub><\/li>\n<li>test statistic: \u20135.08<\/li>\n<li><em data-effect=\"italics\">p<\/em>-value: zero<\/li>\n<li>Check student\u2019s solution.<\/li>\n<li>\n<ol id=\"fs-idp77676896\" type=\"i\">\n<li>Alpha: 0.05<\/li>\n<li>Decision: Reject the null hypothesis.<\/li>\n<li>Reason for Decision: <em data-effect=\"italics\">p<\/em>-value &lt; alpha<\/li>\n<li>Conclusion: At the 5% significance level, there is sufficient evidence to conclude that the mean time for completing a lap in races is different from that in practices.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div data-type=\"exercise\">\n<div data-type=\"problem\">\n<p>\u00a0In two to three complete sentences, explain in detail how you might use Terri Vogel\u2019s data to answer the following question. \u201cDoes Terri Vogel drive faster in races than she does in practices?\u201d<\/p>\n<p>9) c<\/p>\n<p>11)<\/p>\n<p id=\"fs-idm189266480\">Test: two independent sample means, population standard deviations unknown.<\/p>\n<p id=\"fs-idm7792944\">Random variable: \\({\\overline{X}}_{1}-{\\overline{X}}_{2}\\)<\/p>\n<p id=\"fs-idm96995296\">Distribution: <em data-effect=\"italics\">H<sub>0<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> = <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub><em data-effect=\"italics\">H<sub>a<\/sub><\/em>: <em data-effect=\"italics\">\u03bc<\/em><sub>1<\/sub> &lt; <em data-effect=\"italics\">\u03bc<\/em><sub>2<\/sub> The mean age of entering prostitution in Canada is lower than the mean age in the United States.<\/p>\n<div id=\"fs-idm122708400\" class=\"bc-figure figure\"><span id=\"fs-idm5558192\" data-type=\"media\" data-alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0157.\"><img decoding=\"async\" src=\"https:\/\/pressbooks.ccconline.org\/acccomposition1\/wp-content\/uploads\/sites\/83\/2022\/08\/CNX_Stats_C10_M02_005anno-1.jpg\" alt=\"This is a normal distribution curve with mean equal to zero. A vertical line near the tail of the curve to the left of zero extends from the axis to the curve. The region under the curve to the left of the line is shaded representing p-value = 0.0157.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/div>\n<p id=\"fs-idm95276832\">Graph: left-tailed<\/p>\n<p id=\"fs-idm5467280\"><em data-effect=\"italics\">p<\/em>-value : 0.0151<\/p>\n<p id=\"fs-idm171100928\">Decision: Do not reject <em data-effect=\"italics\">H<\/em><sub>0<\/sub>.<\/p>\n<p>Conclusion: At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that the mean age of entering prostitution in Canada is lower than the mean age in the United States.<\/p>\n<p>13) d<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"textbox shaded\" data-type=\"glossary\">\n<h3 data-type=\"glossary-title\">Glossary<\/h3>\n<dl>\n<dt>Degrees of Freedom (<em data-effect=\"italics\">df<\/em>)<\/dt>\n<dd id=\"id21146024\">the number of objects in a sample that are free to vary.<\/dd>\n<\/dl>\n<dl>\n<dt>Standard Deviation<\/dt>\n<dd id=\"id10539417\">A number that is equal to the square root of the variance and measures how far data values are from their mean; notation: <em data-effect=\"italics\">s<\/em> for sample standard deviation and <em data-effect=\"italics\">\u03c3<\/em> for population standard deviation.<\/dd>\n<\/dl>\n<dl id=\"variable\">\n<dt>Variable (Random Variable)<\/dt>\n<dd id=\"id1167450448490\">a characteristic of interest in a population being studied. Common notation for variables are upper-case Latin letters <em data-effect=\"italics\">X<\/em>, <em data-effect=\"italics\">Y<\/em>, <em data-effect=\"italics\">Z<\/em>,&#8230; Common notation for a specific value from the domain (set of all possible values of a variable) are lower-case Latin letters <em data-effect=\"italics\">x<\/em>, <em data-effect=\"italics\">y<\/em>, <em data-effect=\"italics\">z<\/em>,&#8230;. For example, if <em data-effect=\"italics\">X<\/em> is the number of children in a family, then <em data-effect=\"italics\">x<\/em> represents a specific integer 0, 1, 2, 3, &#8230;. Variables in statistics differ from variables in intermediate algebra in two following ways.<\/p>\n<ul id=\"fs-idp40452080\">\n<li>The domain of the random variable (RV) is not necessarily a numerical set; the domain may be expressed in words; for example, if <em data-effect=\"italics\">X<\/em> = hair color, then the domain is {black, blond, gray, green, orange}.<\/li>\n<li>We can tell what specific value <em data-effect=\"italics\">x<\/em> of the random variable <em data-effect=\"italics\">X<\/em> takes only after performing the experiment.<\/li>\n<\/ul>\n<\/dd>\n<\/dl>\n<\/div>\n","protected":false},"author":32,"menu_order":4,"template":"","meta":{"pb_show_title":"","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-390","chapter","type-chapter","status-publish","hentry"],"part":382,"_links":{"self":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/390","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/users\/32"}],"version-history":[{"count":2,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/390\/revisions"}],"predecessor-version":[{"id":695,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/390\/revisions\/695"}],"part":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/parts\/382"}],"metadata":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapters\/390\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/media?parent=390"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/pressbooks\/v2\/chapter-type?post=390"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/contributor?post=390"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.ccconline.org\/accintrostats\/wp-json\/wp\/v2\/license?post=390"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}