420 likes | 660 Vues
Welcome to Chapter 12 MBA 541. B ENEDICTINE U NIVERSITY Two-Sample Tests & Analysis of Variance Analysis of Variance Chapter 12. Chapter 12. Please, Read Chapter 12 in Lind before viewing this presentation. Statistical Techniques in Business & Economics Lind. Goals.
E N D
Welcome to Chapter 12MBA 541 BENEDICTINEUNIVERSITY • Two-Sample Tests & Analysis of Variance • Analysis of Variance • Chapter 12
Chapter 12 Please, Read Chapter 12 in Lind before viewing this presentation. Statistical Techniques in Business & Economics Lind
Goals When you have completed this chapter, you will be able to: • ONE • List the characteristics of the F distribution. • TWO • Conduct a test of hypothesis to determine whether the variances of two populations are equal. • THREE • Discuss the general idea of analysis of variance. • FOUR • Organize data into a one-way and a two-way ANOVA table.
Goals When you have completed this chapter, you will be able to: • FIVE • Conduct a test of hypothesis among three or more treatment means. • SIX • Develop confidence intervals for the difference in treatment means. • SEVEN • Define and understand the terms treatments and blocks. • EIGHT • Conduct a test of hypothesis to determine whether there is a difference among block means.
The F Distribution • There is a “family” of F distributions. • Each member of the family is determined by two parameters: the numerator degrees of freedom and the denominator degrees of freedom. • The F distribution is continuous. • Its values range from 0 to ∞. • The F distribution cannot be negative. • The F distribution is positively skewed. • The F distribution is asymptotic. • As the values of x increase, the F curve approaches the X-axis but never touches it.
Comparing Two Population Variances • The test statistic for comparing two variances is given by: where, s1² and s2² are the sample variances for the two samples. The larger s² is placed in the numerator. • The degrees of freedom are: n1-1 for the numerator and n2-1 for the denominator. • The null hypothesis is rejected if the computed value of the test statistic is greater than the critical value.
Example 1 • Colin, a stockbroker at Critical Securities, reported that the mean rate of return on a sample of 10 internet stocks was 12.6 percent with a standard deviation of 3.9 percent. • The mean rate of return on a sample of 8 utility stocks was 10.9 percent with a standard deviation of 3.5 percent. • At the 0.05 significance level, can Colin conclude that there is more variation in the internet stocks?
Example 1 (Continued) • Step 1: State the null and alternate hypotheses. H0: σ1²≤ σ2² H1: σ1²> σ2² • Step 2: State the level of significance. The 0.05 significance level is stated in the problem. • Step 3: Find the appropriate test statistic. The test statistic is the F distribution. • Step 4: State the decision rule. The null hypothesis is rejected if F is greater than 3.68 or if p is less than 0.05. There are n1-1 or 9 degrees of freedom in the numerator and n2-1 or 7 degrees of freedom in the denominator.
Example 1 (Continued) • Step 5: Compute the value of F and make a decision. • The p(F>1.2416) is 0.3965. • Because the computed F of 1.2416 is less than the critical F value of 3.68 and the p-value of 0.3965 is greater than the required level of significance of 0.05, the decision is to not reject the null hypothesis. • There is insufficient evidence to show more variation in the internet stocks.
The ANOVA Test • The F distribution is also used for testing whether three or more sample means came from the same or equal populations. • This technique is called analysis of variance or ANOVA. • The null and alternate hypotheses for four sample means is given by: Ho: μ1= μ2= μ3= μ4 H1: The means are not all equal.
The ANOVA Assumptions • ANOVA requires the following conditions: • The sampled populations follow the normal distribution, • The samples are independent, and • The populations have equal standard deviations.
The ANOVA Test • The F distribution is used as the test statistic for the ANOVA Test via the following equation. • The degrees of freedom for the F statistic in ANOVA: • If there are k populations being sampled, the numerator degrees of freedom is k-1, and • If there are a total of n observations, then the denominator degrees of freedom is n-k.
The ANOVA Test • ANOVA divides the Total Variation into the variation due to the treatment, Treatment Variation, and to the error component, Random Variation. • In the following table, i stands for the ith observation, is the overall or grand mean, and k is the number of treatment groups.
The ANOVA Table • SST is Treatment Variation • SSE is Random Variation • TSS is Total Variation
Example 2 • Rosenbaum Restaurants specialize in meals for families. Katy Polsby, President, developed a new meat loaf dinner. • Before making it a part of the regular menu, she decides to test it in several of her restaurants. • She would like to know if there is a difference in the mean number of dinners sold per day at the Anyor, Loris, and Lander restaurants. • Use the 0.05 significance level.
Example 2 (Continued) Step 1: State the null and alternate hypotheses. H0: μAynor = μLoris = μLander H1: The means are not all equal. Step 2: State the level of significance. The 0.05 significance level is stated in the problem. Step 3: Find the appropriate test statistic. The test statistic is the F distribution.
Example 2 (Continued) • Step 4: State the decision rule. The numerator degrees of freedom, n-k, is equal to 3-1 or 2. The denominator degrees of freedom, n-k, is equal to 13-3 or 10. The critical value of F at 2 and 10 degrees of freedom (with α =0.05) is 4.10. The null hypothesis is rejected if F is greater than 4.10 or if p is less than 0.05. • Step 5: Select the sample, perform the calculations, and make a decision The ANOVA calculations follow.
Example 2 (Continued) Shortcut: SST = TSS – SSE = 86 – 9.75 = 76.25
Example 2 (Continued) • As shown, the computed value for F is 39.103. • The p(F>39.103) is 0.000018 • Because an F of 39.103 is greater than the critical F value of 4.10, and the p-value of 0.000018 is less than the required level of significance of 0.05, the decision is to reject the null hypothesis and conclude that at least two of the treatment means are not the same. • The mean number of meals sold at the three locations is not the same. • The ANOVA tables on the next two slides are from the Minitab and Excel applications.
Example 2 (Continued) Analysis of Variance Source DF SS MS F P Factor 2 76.250 38.125 39.10 0.000 Error 10 9.750 0.975 Total 12 86.000 Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev ---------+---------+---------+------- Aynor 4 12.750 0.957 (---*---) Loris 4 11.500 1.291 (---*---) Lander 5 17.000 0.707 (---*---) ---------+---------+---------+------- Pooled StDev = 0.987 12.5 15.0 17.5
Inferences About Treatment Means • When I reject the null hypothesis that the means are equal, I want to know which treatment means differ. • One of the simplest procedures is through the use of confidence intervals around the difference in treatment means.
Confidence Interval for theDifference Between Two Means • The confidence interval is given by the following equation • Where, t is obtained from the t table with degrees of freedom equal to (n-k), and • If the confidence interval around the difference in treatment means includes zero, there is not a difference between the treatment means.
Example 3 • Determine the 95% confidence interval for the difference in the mean number of meat loaf dinners sold in Lander and Aynor. • Can Katy conclude that there is a difference between the two restaurants?
Example 3 (Continued) • Because zero is not in the interval, we conclude that this pair of means differs. • The mean number of meals sold in Aynor is different from Lander.
Two-Factor ANOVA • Sometimes, there are other causes of variation. • For the two-factor ANOVA, we test whether there is a significant difference between the treatment effect and whether there is a difference in the blocking effect (a second treatment variable). where r is the number of blocks, Xb is the sample mean of block b, and XG is the overall or grand mean. • In the following ANOVA table, all sums of squares are computed as before, with the addition of the SSB.
Example 4 • The Bieber Manufacturing Co. operates 24 hours a day, five days a week. • The workers rotate shifts each week. • Todd Bieber, the owner, is interested in whether there is a difference in the number of units produced when the employees work on various shifts. • A sample of five workers is selected and their output recorded on each shift. • At the 0.05 significance level, can we conclude that there is a difference in the mean production by shift and in the mean production by employee?
Example 4 (Continued) For the Treatment Effect: • Step 1: State the null and alternate hypotheses. H0: μ1 = μ2 = μ3 H1: The means are not all equal. • Step 2: State the level of significance. The 0.05 significance level is stated in the problem. • Step 3: Find the appropriate test statistic. The test statistic is the F distribution. • Step 4: State the decision rule. The null hypothesis is rejected if F is greater than 4.46 or if p is less than 0.05. The degrees of freedom are (2,8). • Step 5: Perform the calculations and make a decision.
Example 4 (Continued) For the Block Effect: • Step 1: State the null and alternate hypotheses. H0: μ1 = μ2 = μ3= μ4 = μ5 H1: The means are not all equal. • Step 2: State the level of significance. The 0.05 significance level is stated in the problem. • Step 3: Find the appropriate test statistic. The test statistic is the F distribution. • Step 4: State the decision rule. The null hypothesis is rejected if F is greater than 3.84 or if p is less than 0.05. The degrees of freedom are (4,8). • Step 5: Perform the calculations and make a decision.
Example 4 (Continued) Note: XG = 28.87
Example 4 (Continued) Compute the remaining sums of squares as before: TSS = 139.73 SST = 62.53 SSE = 43.47 = (139.73 – 62.53 – 33.73) df (block) = 4 = (b – 1) df (treatment) = 2 = (k – 1) df (error) = 8 = (k – 1 )(b – 1)
Example 4 (Continued) • For the Treatment Effect (time of day): • Because the computed F of 5.75 is greater than the critical F of 4.10, and the p-value of 0.03 is less than the required level of significance of 0.05, the null hypothesis is rejected. • There is a significant difference in the mean number of units produced for the different time periods. • For the Block Effect (different employees): • Because the computed F of 1.55 is less than the critical F of 3.84, and the p-value of 0.28 is greater than the required level of significance of 0.05, the null hypothesis is not rejected. • There is no significant difference in the mean number of units produced for the different employees.
Example 4 (Continued) • Output Using Minitab. • Two-Way ANOVA: Units versus Worker, Shift Analysis of Variance for Units Source DF SS MS F P Worker 4 33.73 8.43 1.55 0.276 Shift 2 62.53 31.27 5.75 0.028 Error 8 43.47 5.43 Total 14 139.73
Example 4 (Continued) • Output Using Excel.