
Topic 4: Ordinary Least Squares


Presentation Transcript


  1. Topic 4: Ordinary Least Squares

  2. Suppose that X is a non-random variable and Y is a random variable that is affected by X in a linear fashion and by the random disturbance e, with E(e) = 0. That is, E(Y) = b1 + b2X, or, equivalently, Y = b1 + b2X + e.
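
A minimal NumPy sketch of this data-generating process (the parameter values b1 = 2, b2 = 0.5 and the normal disturbance are purely illustrative assumptions, not taken from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative values for the intercept and slope (not from the slides)
b1, b2 = 2.0, 0.5
X = np.linspace(0.0, 10.0, 50)                    # non-random X, chosen by the investigator
e = rng.normal(loc=0.0, scale=1.0, size=X.size)   # random disturbance with E(e) = 0

Y = b1 + b2 * X + e                               # Y = b1 + b2*X + e
print(Y[:5])
```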

  3.–9. [Figures: a scatter of observed points in the (X, Y) plane (slide 3), then the same points shown together with the actual line Y = b1 + b2X, with observations lying above and below the line (slides 4–9).]

  10. [Figure: the actual line Y = b1 + b2X and the fitted line Y = b^1 + b^2X, with points A, B and C marked on a vertical through one observation. BC is an error of estimation; AC is the effect of the random factor.]

  11. The Ordinary Least Squares (OLS) estimates are obtained by minimising the sum of the squares of each of these errors. The OLS estimates are computed from the values of X and the actual values of Y (YA) as follows:

  12. Error of estimation: e ≡ |YA – YE|, where YE is the estimated value of Y.
Σe² ≡ Σ[YA – YE]²
Σe² ≡ Σ[YA – (b1 + b2X)]²
d(Σe²)/db1 ≡ 2Σ[YA – (b1 + b2X)](–1) = 0
d(Σe²)/db2 ≡ 2Σ[YA – (b1 + b2X)](–X) = 0

  13. Σ[Y – (b1 + b2X)](–1) = 0
–N·YMEAN + N·b1 + b2·N·XMEAN = 0
b1 = YMEAN – b2XMEAN ….. (1)

  14. d(Σe²)/db2 ≡ 2Σ[Y – (b1 + b2X)](–X) = 0
Σ[Y – (b1 + b2X)](–X) = 0
b1ΣX + b2ΣX² = ΣXY ……….. (2)
b1 = YMEAN – b2XMEAN ….. (1)

  15. Solving (1) and (2) gives the estimates below (with the superscripts for Y dropped):
b^1 = [(ΣY)(ΣX²) – (ΣX)(ΣXY)] / [NΣX² – (ΣX)²]
b^2 = [NΣXY – (ΣX)(ΣY)] / [NΣX² – (ΣX)²]

  16. Alternatively, b^1 = YMEAN – b^2XMEAN and b^2 = Covariance(X, Y) / Variance(X).
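
As a numerical sanity check, the sketch below (illustrative data, assuming NumPy) computes b^1 and b^2 both from the closed-form sums of slide 15 and from the covariance/variance form of slide 16, and verifies that the two routes agree:

```python
import numpy as np

X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # illustrative data
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])
N = X.size

# Closed-form sums (slide 15)
den = N * np.sum(X**2) - np.sum(X)**2
b2_hat = (N * np.sum(X * Y) - np.sum(X) * np.sum(Y)) / den
b1_hat = (np.sum(Y) * np.sum(X**2) - np.sum(X) * np.sum(X * Y)) / den

# Alternative form (slide 16): b^2 = Cov(X, Y)/Var(X), b^1 = YMEAN - b^2*XMEAN
b2_alt = np.cov(X, Y, bias=True)[0, 1] / np.var(X)
b1_alt = Y.mean() - b2_alt * X.mean()

print(b1_hat, b2_hat)
print(np.isclose(b1_hat, b1_alt), np.isclose(b2_hat, b2_alt))   # both True
```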

  17. Two Important Results: (a) Σei ≡ Σ(Yi – YiE) = 0 and (b) ΣX2iei ≡ ΣX2i(Yi – YiE) = 0, where YiE is the estimated value of Yi and X2i is the same as Xi from before.
Proof of (a): Σ(Yi – YiE) = Σ(Yi – b^1 – b^2X2i) = ΣYi – Σb^1 – Σb^2X2i = nYMEAN – nb^1 – nb^2XMEAN = n(YMEAN – b^1 – b^2XMEAN) = 0 [since b^1 = YMEAN – b^2XMEAN].

  18. See the lecture notes for a proof of part (b).
Total sum of squares (TSS) ≡ Σ(Yi – YMEAN)²
Residual sum of squares (RSS) ≡ Σ(Yi – YiE)²
Explained sum of squares (ESS) ≡ Σ(YiE – YMEAN)²

  19. To prove that TSS = RSS + ESS:
TSS ≡ Σ(Yi – YMEAN)² = Σ{(Yi – YiE) + (YiE – YMEAN)}²
= Σ(Yi – YiE)² + Σ(YiE – YMEAN)² + 2Σ(Yi – YiE)(YiE – YMEAN)
= RSS + ESS + 2Σ(Yi – YiE)(YiE – YMEAN)

  20. Σ(Yi – YiE)(YiE – YMEAN)
= Σ(Yi – YiE)YiE – YMEANΣ(Yi – YiE)
= Σ(Yi – YiE)YiE [by (a) above]
and Σ(Yi – YiE)YiE = Σ(Yi – YiE)(b^1 + b^2Xi) = b^1Σ(Yi – YiE) + b^2ΣXi(Yi – YiE) = 0 [by (a) and (b) above].
Hence TSS = RSS + ESS.

  21. R² ≡ ESS/TSS. Since TSS = RSS + ESS, it follows that 0 ≤ R² ≤ 1.
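
A short check (same illustrative data as above, assuming NumPy) of results (a) and (b), the TSS = RSS + ESS decomposition, and R²:

```python
import numpy as np

X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # illustrative data
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])

b2_hat = np.cov(X, Y, bias=True)[0, 1] / np.var(X)
b1_hat = Y.mean() - b2_hat * X.mean()
Y_E = b1_hat + b2_hat * X                  # estimated values
resid = Y - Y_E

# Results (a) and (b): residuals sum to zero and are orthogonal to X
print(np.isclose(resid.sum(), 0.0), np.isclose((X * resid).sum(), 0.0))

# TSS = RSS + ESS, and R^2 = ESS/TSS lies between 0 and 1
TSS = np.sum((Y - Y.mean())**2)
RSS = np.sum((Y - Y_E)**2)
ESS = np.sum((Y_E - Y.mean())**2)
print(np.isclose(TSS, RSS + ESS), ESS / TSS)
```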

  22. Topic 5: Properties of Estimators

  23. In the discussion that follows, q^ is an estimator of the parameter of interest, q.
Bias of q^ ≡ E(q^) – q
q^ is unbiased if Bias of q^ = 0.
q^ is negatively biased if Bias of q^ < 0.
q^ is positively biased if Bias of q^ > 0.

  24. The mean squared error (MSE) of estimation for q^ is given by MSEq^ ≡ E[(q^ – q)²].
MSEq^ ≡ E[(q^ – q)²]
≡ E[{q^ – E(q^) + E(q^) – q}²]
≡ E[{q^ – E(q^)}²] + E[{E(q^) – q}²] + 2E[{q^ – E(q^)}·{E(q^) – q}]
≡ Var(q^) + {E(q^) – q}² + 2E[{q^ – E(q^)}·{E(q^) – q}]

  25. Now, E[{q^ – E(q^)}·{E(q^) – q}] ≡ {E(q^) – E(q^)}·{E(q^) – q} ≡ 0·{E(q^) – q} = 0, so
MSEq^ ≡ Var(q^) + {E(q^) – q}²
MSEq^ ≡ Var(q^) + (bias)².
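
A Monte Carlo sketch of this MSE decomposition, using a deliberately biased estimator of a normal mean (the true value, sample size and shrinkage factor are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
theta = 5.0                                  # true parameter (assumed for illustration)
n, reps = 20, 100_000

# A deliberately biased estimator: shrink the sample mean toward zero
samples = rng.normal(theta, 2.0, size=(reps, n))
theta_hat = 0.9 * samples.mean(axis=1)

mse = np.mean((theta_hat - theta)**2)
var = theta_hat.var()
bias = theta_hat.mean() - theta
print(mse, var + bias**2)                    # the two quantities agree
```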

  26. If q^ is unbiased, that is, if E(q^) – q = 0, then we have MSEq^ ≡ Var(q^).
An unbiased estimator q^ of a parameter q is efficient if and only if it has the smallest variance of all unbiased estimators. That is, for any other unbiased estimator p of q, Var(q^) ≤ Var(p).

  27. An estimator q^ is said to be consistent if it converges in probability to q. That is,
Lim n→∞ Prob(|q^ – q| > e) = 0 for every e > 0.

  28. When the above condition holds, q is said to be the probability limit of q^, that is, plim q^ = q.
Sufficient conditions for consistency: if the mean of q^ converges to q and Var(q^) converges to zero (as n approaches ∞), then q^ is consistent.

  29. That is, q^n is consistent if it can be shown that
Lim n→∞ E(q^n) = q and
Lim n→∞ Var(q^n) = 0.
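
An illustrative sketch of these sufficient conditions using the sample mean as the estimator q^n (true mean and scale are assumed values): its simulated expectation stays at q while its variance shrinks toward zero as n grows.

```python
import numpy as np

rng = np.random.default_rng(2)
theta = 3.0                                  # true mean (assumed for illustration)

for n in (10, 100, 1000, 10000):
    draws = rng.normal(theta, 1.0, size=(1000, n))
    est = draws.mean(axis=1)                 # q^_n = sample mean
    print(n, est.mean(), est.var())          # mean stays near theta, variance -> 0
```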

  30. The Regression Model with TWO Variables
The Model: Y = b1 + b2X + e
Y is the DEPENDENT variable; X is the INDEPENDENT variable.
In observation form: Yi = b1X1i + b2X2i + ei

  31. Yi = b1X1i + b2X2i + ei
Here X1i ≡ 1 for all i and X2 is nothing but X. The OLS estimates b^1 and b^2 are sample statistics used to estimate b1 and b2 respectively.

  32. Assumptions about X2:
(1a) X2 is non-random (chosen by the investigator).
(1b) Random sampling is performed from a population of fixed values of X2.
(1c) Lim n→∞ (1/n)Σx2i² = Q > 0 [where x2i ≡ X2i – X2MEAN]
(1d) Lim n→∞ (1/n)ΣX2i = P > 0

  33. Assumptions about the disturbance term e:
(2a) E(ei) = 0 for all i.
(2b) Var(ei) = σ² for all i (homoskedasticity).
(2c) Cov(ei, ej) = 0 for i ≠ j (the e values are uncorrelated across observations).
(2d) The ei all have a normal distribution.

  34. Result: b^2 is linear in the dependent variable Yi.
Proof: b^2 = Covariance(X, Y) / Variance(X) = Σ(Yi – YMEAN)(Xi – XMEAN) / Σ(Xi – XMEAN)²

  35. b^2 = ΣYi(Xi – XMEAN) / Σ(Xi – XMEAN)² + K = ΣCiYi + K, where the Ci and K are constants.
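
A quick numerical confirmation (illustrative data) that b^2 is a weighted sum ΣCiYi with weights Ci = (Xi – XMEAN)/Σ(Xi – XMEAN)²; with these particular weights the extra constant happens to be zero:

```python
import numpy as np

X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # illustrative data
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])

Ci = (X - X.mean()) / np.sum((X - X.mean())**2)    # weights depending only on X
b2_weighted = np.sum(Ci * Y)                       # linear function of the Yi

b2_hat = np.cov(X, Y, bias=True)[0, 1] / np.var(X)
print(np.isclose(b2_weighted, b2_hat))             # the weighted sum reproduces b^2
```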

  36. Therefore, b^2 is a linear function of Yi. Since Yi = b1X1i + b2X2i + ei, b^2 is a linear function of ei and hence is normally distributed.

  37. Similarly, b^1 is a linear function of Yi (and hence of ei) and is normally distributed. Both b^1 and b^2 are unbiased estimators of b1 and b2 respectively. That is, E(b^1) = b1 and E(b^2) = b2.

  38. Each of b^1 and b^2 is an efficient estimator of b1 and b2 respectively. Thus, each of b^1 and b^2 is a Best (efficient), Linear (in the dependent variable Yi), Unbiased Estimator of b1 and b2 respectively. Also, each of b^1 and b^2 is a consistent estimator of b1 and b2 respectively.

  39. Var(b^1) = σ²(1/n + X2MEAN²/Σx2i²)
Var(b^2) = σ²/Σx2i²
Cov(b^1, b^2) = –σ²X2MEAN/Σx2i²
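
A Monte Carlo sketch (assumed true parameters and a fixed X design) comparing the simulated sampling variances and covariance of b^1 and b^2 with the formulas above:

```python
import numpy as np

rng = np.random.default_rng(3)
b1, b2, sigma = 2.0, 0.5, 1.0               # assumed true values for illustration
X = np.linspace(1, 10, 20)                  # fixed X across replications
x = X - X.mean()                            # x2i = X2i - X2MEAN
n = X.size

b1_hats, b2_hats = [], []
for _ in range(20_000):
    Y = b1 + b2 * X + rng.normal(0, sigma, n)
    b2_hat = np.sum(x * Y) / np.sum(x**2)
    b1_hat = Y.mean() - b2_hat * X.mean()
    b1_hats.append(b1_hat)
    b2_hats.append(b2_hat)

print(np.var(b1_hats), sigma**2 * (1/n + X.mean()**2 / np.sum(x**2)))       # Var(b^1)
print(np.var(b2_hats), sigma**2 / np.sum(x**2))                             # Var(b^2)
print(np.cov(b1_hats, b2_hats)[0, 1], -sigma**2 * X.mean() / np.sum(x**2))  # Cov(b^1, b^2)
```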

  40. Lim n→∞ Var(b^2) = Lim n→∞ σ²/Σx2i² = Lim n→∞ (σ²/n)/(Σx2i²/n) = 0/Q [using assumption (1c)] = 0

  41. Because b^2 is an unbiased estimator of b2 and Lim n→∞ Var(b^2) = 0, b^2 is a consistent estimator of b2.

  42. The variance of the random term, σ², is not known. To perform statistical analysis, we estimate σ² by σ^2 ≡ RSS/(n – 2). This is because σ^2 is an unbiased estimator of σ².
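
A final sketch (same assumed simulation setup as above) showing that RSS/(n – 2) averages out to σ² across replications, consistent with unbiasedness:

```python
import numpy as np

rng = np.random.default_rng(4)
b1, b2, sigma = 2.0, 0.5, 1.0               # assumed true values for illustration
X = np.linspace(1, 10, 20)
x = X - X.mean()
n = X.size

s2_hats = []
for _ in range(20_000):
    Y = b1 + b2 * X + rng.normal(0, sigma, n)
    b2_hat = np.sum(x * Y) / np.sum(x**2)
    b1_hat = Y.mean() - b2_hat * X.mean()
    RSS = np.sum((Y - (b1_hat + b2_hat * X))**2)
    s2_hats.append(RSS / (n - 2))

print(np.mean(s2_hats), sigma**2)           # close to sigma^2: the estimator is unbiased
```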
