1 / 25

Lecture Slides

Lecture Slides. Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola. 13-1 Review and Preview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks Test for Matched Pairs 13-4 Wilcoxon Rank-Sum Test for Two Independent Samples 13-5 Kruskal-Wallis Test

sherryp
Télécharger la présentation

Lecture Slides

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Lecture Slides Elementary StatisticsEleventh Edition and the Triola Statistics Series by Mario F. Triola

  2. 13-1 Review and Preview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks Test for Matched Pairs 13-4 Wilcoxon Rank-Sum Test for Two Independent Samples 13-5 Kruskal-Wallis Test 13-6 Rank Correction 13-7 Runs Test for Randomness Chapter 13Nonparametric Statistics

  3. Section 13-6 Rank Correlation

  4. Key Concept This section describes the nonparametric method of the rank correlation test, which uses paired data to test for an association between two variables. In Chapter 10 we used paired sample data to compute values for the linear correlation coefficient r, but in this section we use ranks as a the basis for computing the rank correlation coefficient .

  5. Definition The rank correlation test (or Spearman’s rank correlation test) is a non-parametric test that uses ranks of sample data consisting of matched pairs. It is used to test for an association between two variables.

  6. Advantages Rank correlation has these advantages over the parametric methods discussed in Chapter 10: • The nonparametric method of rank correlation can be used in a wider variety of circumstances than the parametric method of linear correlation.  With rank correlation, we can analyze paired data that are ranks or can be converted to ranks. • Rank correlation can be used to detect some (not all) relationships that are not linear.

  7. Objective Compute the rank correlation coefficient and use it to test for an association between two variables. (There is no correlation between the two variables.) (There is a correlation between the two variables.)

  8. Notation = rank correlation coefficient for sample paired data ( is a sample statistic) = rank correlation coefficient for all the population data( is a population parameter) n = number of pairs of sample data d = difference between ranks for the two values within a pair

  9. Requirements The sample paired data have been randomly selected. Note: Unlike the parametric methods of Section 10-2, there is no requirement that the sample pairs of data have a bivariate normal distribution. There is no requirement of a normal distribution for any population.

  10. Rank Correlation Test Statistic No ties: After converting the data in each sample to ranks, if there are no ties among ranks for either variable, the exact value of the test statistic can be calculated using this formula:

  11. Rank Correlation Test Statistic Ties: After converting the data in each sample to ranks, if either variable has ties among its ranks, the exact value of the test statistic can be found by using Formula 10-1 with the ranks:

  12. Critical values: If , critical values are found in Table A-9. If , use Formula 13-1. Rank Correlation where the value of z corresponds to the significance level. (For example, if ,z = 1.96.)

  13. Disadvantages A disadvantage of rank correlation is its efficiency rating of 0.91, as described in Section 13-1. This efficiency rating shows that with all other circumstances being equal, the nonparametric approach of rank correlation requires 100 pairs of sample data to achieve the same results as only 91 pairs of sample observations analyzed through parametric methods, assuming that the stricter requirements of the parametric approach are met.

  14. Figure 13-4Rank Correlation for Testing

  15. Figure 13-4Rank Correlation for Testing

  16. Example: Table 13-1 lists overall quality scores and selectivity rankings of a sample of national universities (based on data from U.S. News and World Report). Find the value of the rank correlation coefficient and use it to determine whether there is a correlation between the overall quality scores and the selectivity rankings. Use a 0.05 significance level. Based on the result, does it appear that national universities with higher overall quality scores are more difficult to get into?

  17. Example: Requirement is satisfied: paired data are a simple random sample The selectivity data consist of ranks that are not normally distributed. So, we use the rank correlation coefficient to test for a relationship between overall quality score and selectivity rank. The null and alternative hypotheses are as follows:

  18. Example: Since neither variable has ties in the ranks:

  19. Example: From Table A-9, using and , the critical values are . Because the test statistic of is not between the critical values of –0.738 and 0.738, we reject the null hypothesis. There is sufficient evidence to support a claim of a correlation between overall quality score and selectivity ranking. It appears that Universities with higher quality scores are more selective and are more difficult to get into.

  20. Example: Detecting a Nonlinear Pattern An experiment involves a growing population of bacteria. Table 13-8 lists randomly selected times (in hr) after the experiment is begun, and the number of bacteria present. Use a 0.05 significance level to test the claim that there is a correlation between time and population size.

  21. Example: Detecting a Nonlinear Pattern Requirement is satisfied: date are from a simple random sample The hypotheses are: We follow the rank correlation procedure summarized in Figure 13-5. The original values are not ranks, so we convert them to ranks and enter the results in Table 13-9.

  22. Example: Detecting a Nonlinear Pattern There are no ties among ranks of either list.

  23. Example: Detecting a Nonlinear Pattern Since is less than 30, use Table A-9 Critical values are The test statistic rs = 1 is not between and , so we reject the null hypothesis of (no correlation). There is sufficient evidence to conclude there is a correlation between time and population size.

  24. Example: Detecting a Nonlinear Pattern If this example is done using the methods of Section 10-2, the linear correlation coefficient is r = 0.0621 and critical values of –0.632 and 0.632. This leads to the conclusion that there is not enough evidence to support the claim of a significant linear correlation, whereas the nonlinear test found that there was enough evidence. The Minitab scatter diagram shows that there is a non-linear relationship that the parametric method would not have detected.

  25. Recap In this section we have discussed: Rank correlation which is the non-parametric equivalent of testing for correlation described in Chapter 10. It uses ranks of matched pairs to test for association. Sometimes rank correlation can detect non-linear correlation that the parametric test will not recognize.

More Related