1 / 18

LT5: Review

LT5: Review. Sam Marden s.h.marden@lse.ac.uk. 1. Working with summary data. Parts (a) and (b) describe the data. For parts (c) and (d) you will need to recall that the t-statistic is calculated as

urian
Télécharger la présentation

LT5: Review

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. LT5: Review Sam Marden s.h.marden@lse.ac.uk

  2. 1. Working with summary data • Parts (a) and (b) describe the data. • For parts (c) and (d) you will need to recall that the t-statistic is calculated as • You can use the fact that in a large sample (as we have here) the t-statistic converges to the normal distribution and so can use normal critical values

  3. 2. More Stats Refresher (a) Now the standard deviation is known so z-scores are given by X~N(2,1) what is the z-score associated with • x1=-1 • x2=2.5 (b)

  4. 3. Panel Data (a) We are trying to learn whether the Aid to Families With Dependant Children program (which provided block grants to states to support programs targeted at low income women with children) effected birth weights. You run the OLS regression: LowBirthWeight = a + b*AFDCPct + u Where AFDCPct is the share of the states population on AFDC supported welfare programs and LowBirthWeight is the percentage of children born with low birth weight • What do you expect b_hat to be? Why? • Do you think this is likely to be the causal effect of the welfare program?

  5. 3. Panel Data (a) We are trying to learn whether the Aid to Families With Dependant Children program (which provided block grants to states to support programs targeted at low income women with children) effected birth weights. You run the OLS regression: LowBirthWeight = a + b*AFDCPct + u Where AFDCPct is the share of the states population on AFDC supported welfare programs and LowBirthWeight is the percentage of children born with low birth weight • What do you expect b_hat to be? Why? • Causal effect – maybe weakly negative. But OVB, in particular correlates with poverty and cov(poverty, lowbirthweight)>0 and cov(poverty, AFDCPct)>0 so will be biased upwards. Bias probably stronger than causal effect. • Do you think this is likely to be the causal effect of the welfare program?

  6. 3. Panel Data (b) Like a boss, you add some controls for doctors per capita, hospital beds per capita and income. • What is the (likely) causal effect of each of these variables? • How is your estimate of b_hat likely to change when you control for each of these factors • What would need to be true for the new estimates of b_hat to be a consistent estimator of the programs effect? • Suppose you add state fixed effects. What problem do they help solve? What would you expect to happen to b_hat when you include state FE?

  7. 3. Panel Data (b) Like a boss, you add some controls for doctors per capita, hospital beds per capita and income. • What is the (likely) causal effect of each of these variables? • How is your estimate of b_hat likely to change when you control for each of these factors • What would need to be true for the new estimates of b_hat to be a consistent estimator of the programs effect? • Suppose you add state fixed effects. What problem do they help solve? What would you expect to happen to b_hat when you include state FE? Parts I common sense. Part ii think about OVB. Part iii cov(x,e)=0 (what does this mean. Part 4, takes care of all time invariant differences between states  identify only off ‘within’ variation. Not clear what the direction of the change should be.

  8. Question 4: The Wald Estimator (a) What is the meaning of: E[yic|T] E : yic: T :

  9. Question 4: The Wald Estimator (a) What is the meaning of: E[yic|T] E : the expectation of – the ‘population’ mean yic: test scores for school iif it were treated T :conditional on being part of the treated group So, E[yic|T] is the expected average test score of an school in the treated group, had it not got the treatment.

  10. Question 4: The Wald Estimator (b) What does Ḕ[yic|C] mean? What is the value of Ḕ[yic|C] ?

  11. Question 4: The Wald Estimator (b) What does Ḕ[yic|C] mean? It’s the sample analogue of, “the expected test score of an individual in the control group, had they not got the treatment.” What is the value of Ḕ[yic|C] ? 60

  12. Question 4: The Wald Estimator (c) Wald Estimator: So, the Wald Estimator is…

  13. Question 4: The Wald Estimator (d) With random assignment of schoolbooks within the treatment and the control group we obtain the ATE (think about why this is true). How would our estimates of ATE • Be biased if only the control schools with books were a non-random sample (within the control group)? • Be biased if only the ‘treated’ schools without books were a non-random sample (within the treatment group)? • What is the overall bias. .

  14. Question 4: The Wald Estimator (d) With random assignment of schoolbooks within the treatment and the control group we obtain the ATE (think about why this is true). How would our estimates of ATE • What would be the bias if only the control schools with books were a non-random sample? • These are probably the schools in the control group that benefit most. Thus the effect size captured by the reduced form () will be smaller than under random assignment. • What would the bias be if only the ‘treated’ schools without books were a non-random sample? • These are probably the schools in the treatment group that would’ve benefited least. Thus the effect size captured by the reduced form () will be larger than under random assignment. • What is the overall bias. • It’s impossible to say. Note, while it’s possible to tell a story that results in the bias being in the opposite direction in case 1 & 2, it is harder to come up with a story where they are in the same direction.

  15. Question 5 We run a regression and use it to predict house prices. It turns out that our predictions are too low for the most expensive houses and too high for the cheapest. What gives?

  16. Question 6 (a) We obtain the following regression results: DaysIlli = 2.2 + 0.2*FluShoti • What is the interpretation of the coefficients? • What is the biggest problem with interpreting things causally?

  17. Question 6 (b) and (c) 4b. Is HMO membership a good instrument for getting a flu shot? 4c. Is being visited by a health worker who talks about flu and flu shots a good intrument for getting a flu shot?i

  18. Question 6 (b) and (c) Take 3. 4b. Is HMO membership a good instrument for getting a flu shot? 4c. Is being visited by a health worker who talks about flu and flu shots a good intrument for getting a flu shot? Conditions of a good instrument (1) relevance, (2) exogeneity, Both probably satisfy relevance. We can check this anyway. Neither probably satisfy exogeneity e.g. • There is selection into HMO’s – people may be poor sicker whatever. Also, HMO’s focus on preventative care which may affect days sick other than through the flu shot. • The health worker talks about the risk of flu. People may be more careful e.g. washing their hands. This will also effect the number of days sick

More Related