200 likes | 296 Vues
Learn about methods and calculations for evaluating validity in statistics, including t-tests, correlations, Bland-Altman analysis, Spearman's rho, and criterion-referenced tests. Explore sample and population measurements to ensure accurate interpretation. Reflect on the role of statistics in human judgment.
E N D
Chapter 4 Calculating and Evaluating Validity
When We Have Validity . . . We have an acceptably accurate measurement. We are measuring what we intend to measure. We are interpreting and applying the measurement appropriately.
Population and Sample Population: The designated group being measured Sample: A representative subgroup of the population Inferential statistics: Generalization about a population based on what is learned about the sample
Using t-tests to Compare Sample Means Purpose of t-tests: Compare two sample means Determine if a sample represents a population Types: Paired vs. independent One-tailed vs. two-tailed
Interpreting t-tests p value: The probability that the difference between two means is a coincidence of random sampling Sometimes a t-test comparison of means doesn’t give us everything we need to know; we need to compare individual differences in scores.
Correlations Defined: Measurement of the strength of the relationship between two variables. Pearson Product-Moment Correlation: Compares individual differences between two methods of measurement
Results of a PearsonProduct-Moment Correlationfor the VO2max-12-minute run example
Methods for Evaluating a Correlation r value (or correlation coefficient) Trendline R2 (or coefficient of determination)
Slope of a Trendline Defined: The change in y value per unit change in x value. y value = the slope * the x value + the y intercept (point where y = 0) Or y = mx + b
Bland-Altman Analysis A simple approach for evaluating criterion validity—through error analysis: Data taken from an unusually diverse group of subjects will show an artificially high correlation. Analysis focuses attention on error scores.
Evaluating Validity of Ranked Data Ordinal numbers: Ranked numbers that give a place in line but no information about distances between numbers; place holders. Interval numbers: Numbers that are separated by equal intervals; scalar numbers.
Spearman’s rho A special correlation used in cases where one variable is an ordinal number Calculating Spearman’s rho: Excel doesn’t offer the calculation Some websites can do it; for example, www.wessa.net/rankcorr.wasp
Evaluating the Validity of Criterion-Referenced Tests Validity ratio: The ratio of number of scores classified correctly to the total number of scores. Validity ratio = (CR+NCR)/(CR+CW+NCR) where CR = competent and classified correctly NCW = not competent and wrongly classified CW = competent but wrongly classified and NCR = not competent and classified correctly
Example Diagram for Evaluating the Validity of a Criterion-Referenced Measurement
Your Viewpoint After reading the chapter, go back and read the chapter-opening quote from Henry Clay: “Statistics are no substitute for human judgment.” Think also about the famous quote, “There are three kinds of lies: lies, damned lies, and statistics.” Do you agree with the sentiments expressed by these authors? Why or why not?