270 likes | 421 Vues
This text explores bivariate methods to analyze the relationship between two variables, such as education and income. Key concepts include scatterplots, correlation, and covariance, providing insights into how these statistical measures relate to one another. The interpretation of correlation coefficients, such as Pearson's r, is also discussed, emphasizing that correlation does not imply causation. Examples illustrate how covariance and correlation can vary in strength and direction, aiding in the comprehension of the complexities of variable relationships.
E N D
Bivariate Methods • Relationship between two variables • e.g, as education , what does income do? • Scatterplot
Linear Correlation Source: Earickson, RJ, and Harlin, JM. 1994. Geographic Measurement and Quantitative Analysis. USA: Macmillan College Publishing Co., p. 209.
Wet – May 29/30 Avg. – June 26/28 Dry – August 22 Pond Branch - PG 11.25m DEM R2=0.79 R2=0.79 R2=0.71 Glyndon – LIDAR 0.5m DEM 11x11 R2=0.24 R2=0.10 R2=0.29
Covariance: Interpreting Scatterplots • General sense (direction and strength) • Subjective judgment • More objective approach • Extent to which variables Y and X vary together • Covariance
i=n S 1 (xi - x)(yi - y) Cov [X, Y] = n - 1 i=1 Covariance Formulae
2 3 4 5 1 Covariance Example
How Does Covariance Work? • X and Y are positively related • xi > x yi > y • xi < x yi < y • X and Y are negatively related • xi > x yi < y • xi < x yi > y __ __ __ __ __ __ __ __
Interpreting Covariances • Direction & magnitude • Cov[X,Y] > 0 positive • Cov[X, Y] < 0 negative • abs(Cov[X, Y]) ↑ strength ↑ • Magnitude ~ units
Covariance Correlation • Magnitude ~ units • Multiple pairs of variables not comparable • Standardized covariance • Compare one such measure to another
i=n S (xi - x)(yi - y) i=1 (n - 1) sXsY r = r = i=n ZxZy S Cov [X, Y] r = sXsY i=1 (n - 1) Pearson’s product-moment correlation coefficient
Pearson’s Correlation Coefficient • r [–1, +1] • abs(r) ↑ strength ↑ • r cannot be interpreted proportionally • ranges for interpreting r values • 0 - 0.2 Negligible • 0.2 - 0.4 Weak • 0.4 - 0.6 Moderate • 0.6 - 0.8 Strong • 0.8 - 1.0 Very strong
Example • X = TVDI, Y = Soil Moisture • Cov[X, Y] = -0.017063 • SX = 0.170, SY = 0.115 • r ?
Pearson’s r - Assumptions • interval or ratio • Selected randomly • Linear • Joint bivariate normal distribution
Interpreting Correlation Coefficients • Correlation is not the same as causation! • Correlation suggests an association between variables • Both X and Y are influenced by Z
Interpreting Correlation Coefficients • Causative chain (i.e. X A B Y) • e.g. rainfall soil moisture ground water runoff • Mutual relationship • e.g., income & social status • 4. Spurious relationship • e.g., Temperature (different units) • 5. A true causal relationship (X Y)
Interpreting Correlation Coefficients • A result of chance • e.g., your annual income vs. annual population of the world
Interpreting Correlation Coefficients 7. Outliers (Source: Fang et al., 2001, Science, p1723a)
Interpreting Correlation Coefficients • Lack of independence • Social data • Geographic data • Spatial autocorrelation
A Significance Test for r • An estimator • r r • r = 0 ? • t-test
r r r n - 2 ttest = = = SEr 1 - r2 1 - r2 n - 2 A Significance Test for r df = n - 2
r n - 2 ttest = 1 - r2 A Significance Test for r • H0: r = 0 • HA: r¹ 0