360 likes | 864 Vues
Validity Lecture Overview. Overview of the concept Different types of validity Threats to validity and strategies for handling them Examples of validity issues from the literature Discussion of validity issues with respect to student projects. “Hypothesis” validity “Construct” validity
E N D
ValidityLecture Overview • Overview of the concept • Different types of validity • Threats to validity and strategies for handling them • Examples of validity issues from the literature • Discussion of validity issues with respect to student projects
“Hypothesis” validity “Construct” validity “Content” validity “Convergent” validity “Ecological” validity “Internal” validity “Statistical conclusion”validity “Concurrent” validity “External” validity “Predictive” validity “Criterion-related” validity “Discriminant” validity Validity Descriptors
Taxonomy of Validity • Validity as it pertains to assessment • Validity as it pertains to causal inference • Validity as it pertains to generalization of findings to real-world phenomena
Validity of an Assessment Tool • Validity represents an overall judgment of the degree to which both empirical evidence and theoretical considerations support the interpretation of the score and the implications for action that this interpretation entails (Cronbach, 1971).
Validity of an Assessment Tool • Score validation is an empirical evaluation of the meaning and consequences of measurement (Messick, 1989).
Validity Features • Validity applies to all assessments, including performance/behavioral assessments • Validity is not a property of the “test” per se, but rather of the meaning of the test score • Validation is an ongoing process
Features • Validity is not just a measurement principle, it is a social value that has powerful implications whenever evaluative judgments and decisions are made
Types of “Assessment” Validity • Content validity • Degree to which “Test” items adequately sample the universe of relevant items for a given domain
Types of “Assessment” Validity • Criterion-related validity • Degree to which a “Test” score relates to some relevant external criterion • Concurrent validity • Predictive validity
Types of “Assessment” Validity • Construct validity* • Ongoing, integrated summary of the evidence supporting the interpretation and utility of “Test” scores • Combines information from content validity, criterion-related validity, and discriminant/convergent validity
Convergent/Discriminant Construct Validation • Convergent validity • Empirical evidence demonstrating communality between the test score and other indicators of the same construct • Discriminant validity • Empirical evidence demonstrating a lack of communality with the test score and indicators of a different construct
Threats to Assessment Validity • Construct underrepresentation • Exists when the assessment fails to include important facets of the construct (i.e., assessment is too narrow) • Examples?
Threats to Assessment Validity • Construct-irrelevant variance • Exists when the assessment contains reliable variance associated with other distinct constructs (i.e., assessment is too broad) • Examples?
Evidence for Assessment Validity • Evidence of content relevance and representativeness • The extent to which test scores are consistent with theoretical predictions • Evidence examining the extent to which score properties and interpretations generalize to and across groups, settings, and tasks
Evidence for Assessment Validity • Evidence on the fidelity of the scoring structure to the structure of the construct being tapped • Evidence from criterion-related studies including convergent and discriminant studies • Evidence pertaining to the consequential aspect of test use and score interpretation, especially as it relates to issues of bias, and fairness
Strategies for Enhancing Assessment Validity • Avoid sole reliance on measures that lack validation data (e.g., new author-constructed measures) • Employ multiple indicators of the focal construct whenever possible • Employ indicators from more than one assessment modality domain • Discussion?
Types of Validity Pertinent to Drawing Causal Inferences • Internal validity • Degree to which causal inferences can be made between a measured or manipulated variable (i.e. independent variable) and another measured variable (dependent variable) A B C
Types of Validity Pertinent to Drawing Causal Inferences • Statistical conclusion validity • Concerned with sources of random error and with the appropriate use of statistics and statistical tests (as opposed to systematic bias as in the case of internal validity)
Types of Validity Pertinent to Drawing Causal Inferences • External validity • Refers to the degree to which the observed causal relationship is generalizable across persons, settings, and occasions • Important distinction between generalizing to a specified population (or setting or occasion) vs. generalizing across populations
Types of Validity Pertinent to Drawing Causal Inferences • Construct validity* • The degree to which causal inferences concerning one variable’s effect on another can be generalized to examplars of the constructs in question • In every day practice, this form of validity deals with the issue of “confounds”
Threats to Internal Validity • History • Maturation • Testing • Instrumentation • Statistical regression
Example of a“Testing” Threat Data from Jaimez & Telch (in preparation)
Threats to Internal Validity • Selection • Mortality • Ambiguity about the direction of causal influence
Threats to Internal Validity • Diffusion of treatments • Compensatory equalization of treatments • Compensatory rivalry by respondents • Resentful demoralization of respondents
Threats to Internal Validity • Compensatory equalization of treatments • Compensatory rivalry among participants • Resentful demoralization • Mortality
Threats to Construct Validity About Cause and Effect • Construct underrepresentation • Mono-operation bias • Mono-method bias • Confounding constructs and levels of constructs
Threats to Construct Validity About Cause and Effect • Construct irrelevancies (i.e., confounds) • Interaction of different treatments • Hypothesis-guessing within experimental conditions • Evaluation apprehension (demand characteristics) • Experimenter expectancies • Interaction of testing and treatment
Threats to External Validity • Interaction of selection and treatment • Interaction of setting and treatment • Interaction of history and treatment
Strategies for Enhancing External Validity • Employ random sampling to obtain a representative sample if time, resources, and feasibility permit • Employ heterogeneous samples whenever possible • Conduct analyses to determine whether the causal relationship holds across characteristics of subjects, settings, etc