
Simple Linear Regression


Presentation Transcript


  1. Simple Linear Regression Lecture XXVIII

  2. Overview • Most of the material for this lecture is from George Casella and Roger L. Berger Statistical Inference (Belmont, California: Duxbury Press, 1990) Chapter 12, pp. 554-577.

  3. The purpose of regression analysis is to explore the relationship between two variables. • In this course, the relationship we will be interested in can be expressed as y_i = a + b x_i + e_i, where y_i is a random variable and x_i is a variable hypothesized to affect or drive y_i. The coefficients a and b are the intercept and slope parameters, respectively.

  4. These parameters are assumed to be fixed but unknown. • The residual e_i is assumed to be an unobserved random error; under typical assumptions E[e_i] = 0. • Thus, the expected value of y_i given x_i becomes E[y_i | x_i] = a + b x_i.

  5. The goal of regression analysis is to estimate a and b and to say something about the significance of the relationship. • From a terminology standpoint, y is typically referred to as the dependent variable and x as the independent variable. Casella and Berger prefer the terminology of y as the response variable and x as the predictor variable.

  6. This relationship is a linear regression in that it is linear in the parameters a and b. Abstracting for a moment, the traditional Cobb-Douglas production function can be written as y = A x_1^{b_1} x_2^{b_2}; taking the natural log of both sides yields ln(y) = ln(A) + b_1 ln(x_1) + b_2 ln(x_2), which is again linear in the parameters.
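
As an illustration of the log-linearization (a hedged sketch with made-up data and parameter values, not the lecture's example), the logged Cobb-Douglas equation can be fit by linear least squares:

```python
import numpy as np

# Hypothetical example: simulate a two-input Cobb-Douglas technology
#   y = A * x1**b1 * x2**b2 * exp(e)
# and recover the parameters from the log-linear form
#   ln y = ln A + b1 ln x1 + b2 ln x2 + e,
# which is linear in the parameters ln A, b1, and b2.
rng = np.random.default_rng(0)
n = 200
x1 = rng.uniform(1.0, 10.0, n)
x2 = rng.uniform(1.0, 10.0, n)
A, b1, b2 = 2.0, 0.4, 0.5                       # assumed "true" values
y = A * x1**b1 * x2**b2 * np.exp(rng.normal(0.0, 0.1, n))

X = np.column_stack([np.ones(n), np.log(x1), np.log(x2)])   # [1, ln x1, ln x2]
coef, *_ = np.linalg.lstsq(X, np.log(y), rcond=None)
print(coef)   # roughly (ln 2, 0.4, 0.5) = (0.69, 0.4, 0.5)
```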

  7. Simple Linear Regression • The setup for simple linear regression is that we have a sample of n pairs of variables (x_1, y_1), …, (x_n, y_n). Further, we want to summarize this relationship by fitting a line through the data. • Based on the sample data, we first describe the data as follows: • The sample means x̄ = (1/n) Σ_{i=1}^n x_i and ȳ = (1/n) Σ_{i=1}^n y_i.

  8. The sums of squares: S_xx = Σ_{i=1}^n (x_i − x̄)², S_yy = Σ_{i=1}^n (y_i − ȳ)², and the sum of cross-products S_xy = Σ_{i=1}^n (x_i − x̄)(y_i − ȳ).

  9. The most common estimators given this formulation are then given by b̂ = S_xy / S_xx and â = ȳ − b̂ x̄.
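
A minimal sketch (hypothetical data, not the lecture's example) of computing these estimators directly from the sample means and sums of squares:

```python
import numpy as np

# Hypothetical sample of n pairs (x_i, y_i)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.9])

x_bar, y_bar = x.mean(), y.mean()            # sample means
S_xx = np.sum((x - x_bar) ** 2)              # sum of squares of x
S_xy = np.sum((x - x_bar) * (y - y_bar))     # sum of cross products

b_hat = S_xy / S_xx                          # slope estimate
a_hat = y_bar - b_hat * x_bar                # intercept estimate
print(a_hat, b_hat)

# Cross-check against NumPy's built-in least squares line fit
b_check, a_check = np.polyfit(x, y, deg=1)   # returns (slope, intercept)
assert np.allclose([a_hat, b_hat], [a_check, b_check])
```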

  10. Least Squares: A Mathematical Solution • Following on our theme in the discussion of linear projections: “Our first derivation of estimates of a and b makes no statistical assumptions about the observations (x_i, y_i)…. Think of drawing through this cloud of points a straight line that comes ‘as close as possible’ to all the points.”

  11. This definition involves minimizing the sum of squared errors in the choice of a and b: min_{a,b} Σ_{i=1}^n (y_i − a − b x_i)².

  12. Focusing on a first, the first-order condition with respect to a is −2 Σ_{i=1}^n (y_i − a − b x_i) = 0, which implies â = ȳ − b x̄.

  13. Taking the first-order condition with respect to b yields −2 Σ_{i=1}^n x_i (y_i − a − b x_i) = 0, or Σ_{i=1}^n x_i y_i − a Σ_{i=1}^n x_i − b Σ_{i=1}^n x_i² = 0.

  14. Going from this result to the traditional estimator requires the identities Σ_{i=1}^n x_i y_i − n x̄ ȳ = Σ_{i=1}^n (x_i − x̄)(y_i − ȳ) = S_xy and Σ_{i=1}^n x_i² − n x̄² = Σ_{i=1}^n (x_i − x̄)² = S_xx.

  15. The least squares estimator of b then becomes b̂ = S_xy / S_xx.

  16. Computing the simple least squares representation:

  17. First, we derive the projection matrix P_X = X (X'X)^{-1} X', where X is the matrix whose columns are a vector of ones and the x_i; with 12 observations it is a 12 x 12 matrix. The projection of y onto the column space of X can then be calculated as ŷ = P_X y.

  18. Comparing these results with the estimated values of y from the model, ŷ_i = â + b̂ x_i, shows that the two approaches yield the same fitted values.
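
The transcript does not reproduce the lecture's 12-observation data, but the projection calculation itself is standard; a sketch with made-up data (assuming X stacks a column of ones and the xs) is:

```python
import numpy as np

# Hypothetical data with n = 12 observations, so P is 12 x 12 as on the slide
rng = np.random.default_rng(1)
x = np.linspace(1.0, 12.0, 12)
y = 1.5 + 0.8 * x + rng.normal(0.0, 0.5, 12)

X = np.column_stack([np.ones_like(x), x])    # design matrix [1, x]
P = X @ np.linalg.inv(X.T @ X) @ X.T         # projection (hat) matrix
y_hat = P @ y                                # projection of y onto col(X)

# The projection reproduces the fitted values a_hat + b_hat * x
a_hat, b_hat = np.linalg.lstsq(X, y, rcond=None)[0]
assert np.allclose(y_hat, a_hat + b_hat * x)
print(P.shape)                               # (12, 12)
```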

  19. Best Linear Unbiased Estimators: A Statistical Solution • We now assume a linear relationship between the xs and ys, E[y_i] = a + b x_i, and that Var(y_i) = σ² for every observation.

  20. The implications of this variance assumption are significant. Note that we assume that each observation has the same variance regardless of the value of the independent variable. In traditional regression terms, this implies that the errors are homoscedastic.

  21. One way to state these assumptions is y_i = a + b x_i + e_i with E[e_i] = 0 and Var(e_i) = σ² for all i. This specification is consistent with our assumptions, since the model is homoscedastic and linear in the parameters.

  22. Based on this formulation, we can define the linear estimators of a and b as weighted sums of the observations, Σ_{i=1}^n d_i y_i for fixed weights d_i. An unbiased estimator of b can further be defined as a linear estimator whose expected value is the true value of the parameter: E[Σ_{i=1}^n d_i y_i] = b for all values of a and b.

  23. The linear estimator that satisfies these unbiasedness conditions and yields the smallest variance of the estimate is referred to as the best linear unbiased estimator (or BLUE). In this example, we need to show that the least squares estimator b̂ = S_xy / S_xx attains the smallest variance among all linear unbiased estimators of b.
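
The slide's own equations are not in the transcript; spelling out the unbiasedness conditions under the stated assumptions gives:

```latex
% Expectation of a linear estimator under E[y_i] = a + b x_i:
\begin{align*}
E\!\left[\sum_{i=1}^{n} d_i y_i\right]
  = \sum_{i=1}^{n} d_i\,E[y_i]
  = \sum_{i=1}^{n} d_i\,(a + b x_i)
  = a \sum_{i=1}^{n} d_i + b \sum_{i=1}^{n} d_i x_i .
\end{align*}
% For this to equal b for every a and b, the weights must satisfy
%   \sum_i d_i = 0   and   \sum_i d_i x_i = 1 .
```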

  24. Given that the y_i are uncorrelated, the variance of the linear estimator can be written as Var(Σ_{i=1}^n d_i y_i) = Σ_{i=1}^n d_i² Var(y_i) = σ² Σ_{i=1}^n d_i².

  25. The problem of minimizing the variance then becomes choosing the d_i to minimize this sum subject to the unbiasedness constraints Σ_{i=1}^n d_i = 0 and Σ_{i=1}^n d_i x_i = 1.

  26. Using the results from the first n first-order conditions and the second constraint first, we have

  27. Substituting this result into the first n first-order conditions yields:

  28. Substituting these conditions into the first constraint, we get
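
The equations for slides 26-28 are also missing from the transcript; one standard way to complete the constrained minimization (a reconstruction using the constraints above, not the slides' own algebra) is:

```latex
% Minimize Var(sum d_i y_i) = sigma^2 sum d_i^2 subject to the
% unbiasedness constraints sum d_i = 0 and sum d_i x_i = 1.
\begin{align*}
\mathcal{L} &= \sigma^2 \sum_{i=1}^{n} d_i^2
              + \lambda_1 \sum_{i=1}^{n} d_i
              + \lambda_2 \Bigl(\sum_{i=1}^{n} d_i x_i - 1\Bigr), \\
\frac{\partial \mathcal{L}}{\partial d_i}
            &= 2\sigma^2 d_i + \lambda_1 + \lambda_2 x_i = 0
             \;\Longrightarrow\; d_i = c_1 + c_2 x_i \text{ for constants } c_1, c_2, \\
\sum_{i=1}^{n} d_i = 0
            &\;\Longrightarrow\; c_1 = -c_2\bar{x}
             \;\Longrightarrow\; d_i = c_2\,(x_i - \bar{x}), \\
\sum_{i=1}^{n} d_i x_i = 1
            &\;\Longrightarrow\; c_2 \sum_{i=1}^{n} (x_i - \bar{x})\,x_i = c_2\,S_{xx} = 1
             \;\Longrightarrow\; d_i = \frac{x_i - \bar{x}}{S_{xx}}, \\
\sum_{i=1}^{n} d_i y_i
            &= \frac{S_{xy}}{S_{xx}} = \hat{b},
             \qquad
             \operatorname{Var}(\hat{b}) = \sigma^2 \sum_{i=1}^{n} d_i^2 = \frac{\sigma^2}{S_{xx}} .
\end{align*}
```

The minimizing weights are exactly the least squares weights, which is the BLUE result stated on the next slide.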

  29. This proves that simple least squares is BLUE under fairly general conditions. Note that we did not assume normality in this proof. The only assumptions were that the errors have zero expectation and that they are uncorrelated with a common, identical variance.
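
As an informal empirical check of this claim (not part of the lecture; sample size, parameter values, and error distribution below are made up), a simulation with non-normal errors reproduces both the unbiasedness and the σ²/S_xx variance from the derivation above:

```python
import numpy as np

rng = np.random.default_rng(2)
a_true, b_true, n, reps = 1.0, 0.5, 30, 20_000
x = np.linspace(0.0, 10.0, n)
S_xx = np.sum((x - x.mean()) ** 2)

b_hats = np.empty(reps)
for r in range(reps):
    e = rng.uniform(-1.0, 1.0, n)            # mean-zero, non-normal errors
    y = a_true + b_true * x + e
    b_hats[r] = np.sum((x - x.mean()) * (y - y.mean())) / S_xx

print(b_hats.mean())                         # close to b_true = 0.5
print(b_hats.var(), (1.0 / 3.0) / S_xx)      # Var(e) = 1/3 for U(-1, 1)
```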
