430 likes | 648 Vues
Topic4 Ordinary Least Squares. Suppose that X is a non-random variable Y is a random variable that is affected by X in a linear fashion and by the random variable e with E( e ) = 0 That is, E(Y) = b 1 + b 2 X Or, Y = b 1 + b 2 X + e. Y. Observed points.
 
                
                E N D
Topic4 Ordinary Least Squares
Suppose that X is a non-random variable • Y is a random variable that is affected by X in a linear fashion and by the random variable e with E(e) = 0 That is, E(Y) = b1 + b2X Or, Y = b1 + b2X + e
Y . . . Observed points . . O X
Y Actual Line . . . . . Y= b1 + b2x O X
Y . Actual Line . . . . Y= b1 + b2x O X
Y . Actual Line . . . . Y= b1 + b2x O X
Y . Actual Line . . . . Y= b1 + b2x O X
Y . Actual Line . . . Y= b1 + b2x . O X
Y . . Actual Line . . Y= b1 + b2x . O X
Y= b1 + b2x Fitted Line Y . . . Actual Line C . B . A . Y= b1 + b2x . BC is an error of Estimation AC is an effect of the random factor O X
The Ordinary Least Squares (OLS) estimates are obtained by minimising the sum of the squares of each of these errors. • The OLS estimates are obtained from the values of X and the actual Y values (YA) as follows:
Error of estimation (e)  |YA–YE | where YE is the estimated value of Y. Se2S [YA–YE ]2 Se2S [YA–(b1 + b2 X)]2 dSe2/db12S[YA–(b1 + b2X)] (-1) =0 dSe2 /db22S [YA–(b1 + b2X)] (-X) = 0
S [Y–(b1 + b2X)] (-1) = 0 -NYMEAN + N b1 + b2NXMEAN = 0 b1= YMEAN – b2XMEAN ….. (1)
dSe2/db2 2S [Y–(b1+ b2X)] (-X) = 0 S [Y–(b1 + b2X)] (-X) = 0 b1SX –b2SX2 = SXY ………..(2) b1= YMEAN - b2XMEAN ….. (1)
These estimates are given below (with the superscripts for Y dropped). b^1 = (∑Y)(ΣX2) – (∑X)(∑XY) N∑ X2 - (∑X)2 b^2 = N∑YX – (∑X)(∑Y) N∑ X2 - (∑X)2
Alternatively, b^1= YMEAN - b^2XMEAN b^2 = Covariance(X,Y) Variance(X)
Two Important Results (a) ei(Yi– YiE) = 0 and (b) X2ieiX2i(Yi– YiE) = 0 where YiEis the estimated value of Yi. X2i is the same as Xi from before Proof: (Yi– YiE) =S(Yi– b^1 - b^2 X2i) =SYi–Sb^1 - Sb^2 X2i = nYMEAN – nb^1 - nb^2 XMEAN = n(YMEAN – b^1 - b^2 XMEAN) = 0 [ since b^1 = YMEAN - b^2XMEAN ]
See the lecture notes for a proof of part (b) Total sum of squares (TSS) (Yi– YMEAN )2 Residual sum of squares (RSS)  (Yi– YiE)2 Explained sum of squares (ESS)  (YiE– YMEAN )2
To prove that TSS = RSS + ESS TSS ≡ S(Yi– YMEAN)2 = S{(Yi– YiE + YiE– YMEAN)}2 = S(Yi– YiE)2 + S(YiE– YMEAN)}2 +2S(Yi– YiE)(YiE– YMEAN) = RSS + ESS +2S(Yi– YiE)(YiE– YMEAN)
S(Yi– YiE)(YiE– YMEAN) • S(Yi– YiE)(YiE ) -YMEANS(Yi– YiE) • S(Yi– YiE)(YiE ) [by (a) above] S(Yi– YiE)(YiE ) =S(Yi– YiE)( b^1 + b^2 Xi) =b^1 S(Yi– YiE) + b^2 Xi(Yi– YiE) = 0 [by (a) and (b) above]
R2 ≡ ESS/TSS Since TSS = RSS + ESS, it follows that 0 R2  1
Topic 5 Properties of Estimators
In the discussion that follows, q^ is an estimator of the parameter of interest, q Bias of q^ ≡ E(q^) - q q^ is unbiased if Bias of q^ = 0. q^ is negatively biased if Bias of q^ < 0. q^ is positively biased if Bias of q^ > 0.
Mean Squared Errors (MSE) of estimation for q^ is given as MSEq^ ≡ E[(q^-q)]2 MSEq^ ≡ E[(q^-q)2] ≡ E[{q^-E(q^) +E(q^)- q}2] ≡ E[{q^-E(q^)}2] + E[{E(q^)- q}2] + 2E[{q^-E(q^)}*{E(q^)- q}] ≡ Var(q^) + {E(q^)- q}2+ 2E[{q^-E(q^)}*{E(q^)- q}]
Now, E[{q^-E(q^)}*{E(q^)- q}] ≡ {E(q^)-E(q^)}*{E(q^)- q}] ≡ 0*{E(q^)- q}] = 0 MSEq^≡ Var(q^) + {E(q^)- q}2 MSEq^ ≡ Var(q^) + (bias)2 .
If q^ is unbiased, that is, if E(q ^)- q = 0. then we have, MSEq^ ≡ Var(q^) An unbiased estimator q^ of a parameter qis efficientif and only if it has the smallest variance of all unbiased estimators. That is, for any other unbiased estimator p of q, Var(q^)≤ Var(p)
An estimator q^is said to be consistentif it converges inprobability toq. That is, Limn  Prob(|q^-q | > e) = 0 for every e> 0.
When the above condition holds, q^ is said to be the probability limit of q, that is, plim q^ = q Sufficient conditions for consistency: If the mean of q^converges to q and var(q^) converges to zero (as n approaches ) then q^is consistent.
That is, q^n is consistent if it can be shown that Lim n  E(q^n) = q And Lim n  Var(q^n) = 0
The Regression Model with TWO Variables The Model :: Y = b1 + b2X + e Y is the DEPENDENT variable X is the INDEPENDENT variable Yi= b1X1i+ b2X2i+ ei
Yi= b1X1i+ b2X2i+ ei Here X1i ≡ 1 for all i and X2 is nothing but X . The OLS estimates b^1 and b^2 are sample statistics used to estimate b1andb2respectively
(1a)X2 is non-random (chosen by the investigator) Assumptions about X2: (1b) Random sampling is performed from a population of fixed values of X2 . (1c) : Lim (1/n) S(x22i) = Q > 0 n [ where x2i X2i – X2MEAN.] (1c) : Lim (1/n)S(X2i) = P > 0 n 
2a. E() = 0 Assumptions about the disturbance term e 2b. Var(ei) = 2 for all i. Homoskedasticity 2c. Cov(ei, ej ) = 0 for i  j. (The  values are uncorrelated across observations). 2d. The ei all have a normal distribution
Proof: b^2 = Covariance(X,Y) Variance(X) Result :b^2 is linear in the dependent variable Yi b^2 = S(Yi–YMEAN )(Xi–XMEAN ) S(Xi–XMEAN )2
b^2 = SYi(Xi–XMEAN ) + K S(Xi–XMEAN )2 =S CiYi + K where the Ci andK are constants
Therefore, b^2 is a linear function of Yi Since, Yi= b1X1i+ b2X2i+ ei b^2 is a linear function of ei and hence is normally distributed
Similarly, b^1 is a linear function of Yi (and hence ei ) and is normally distributed Both b^1 and b^2 are unbiased estimates of b1 and b2 respectively. That is, E( b^1 ) = b1 and E( b^2 ) = b2
Each of b^1 and b^2 is an efficientestimators of b1 and b2 respectively. Thus, each of b^1 and b^2 is a Best (efficient) Linear (in the dependent variable Yi ) Unbiased Estimatorof b1 and b2 respectively. Also, Each of b^1 and b^2 is a consistent estimator of b1 and b2 respectively.
Var(b^1 ) = s2 (1/n +X 2mean2/Sx2i2) Var(b^2 ) = s2 /Sx2i2) . Cov(b^1, b^2 ) = -s2 X 2mean/Sx2i2
LimVar(b^2 ) n  = Lim s2/Sx2i2 n  = Lim s2/n/Sx2i2/n n  = 0/Q [using assumption (1c)] = 0
Because b^2 is an unbiased estimator of b2 and LimVar(b^2 ) = 0 n  b^2 is a consistent estimator of b2
The variance of the random term, s2, is not known To perform statistical analysis, we estimate s2by s^2  RSS/(n-2) This is because s^2 is an unbiased estimator of s2