1 / 13

Data Extraction and Analysis in IPUMS: A Workshop Overview

This workshop, held from June 23-27, 2008, focuses on extracting and analyzing data using the IPUMS platform. Participants will learn about model building, regression analysis, and data recoding while utilizing familiar statistical packages such as SAS, SPSS, and Stata. The workshop covers selection strategies for variables, including those for poverty and income data, and how to generate final outputs like codebooks and setup statements. By the end, participants will understand how to navigate and manipulate the data from various years (1850-2006) for their research purposes.

enrico
Télécharger la présentation

Data Extraction and Analysis in IPUMS: A Workshop Overview

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. IPUMS Extract Exercise Lisa Neidert Poverty/American Community Survey Workshop June 23-27, 2008

  2. Purpose of extracting data • Modeling • Regressions, etc. • Recodes • In familiar statistical package

  3. IPUMS • Same data we have used with PDQ-Explore • Concatenated file • 1850-2006 • Can select one year or many • Final product is: • Data (compressed) • Codebook • Set-up statements

  4. Strategy • Do simple extract • Download • Uncompress data • Modify set-up statement • Produce means and regression in statistical package of choice • SAS, SPSS, stata • Go back and produce ‘real’ extract • Take time choosing variables

  5. Variables • http://usa.ipums.org/usa/VariableGroups.do • Variables arranged by Subject • General version • Detailed version

  6. Variables included in my extract • hhwt (default) • perwt (default) • age [25-54] • sex [males] • race [white, black] • incearn • poverty

  7. Sample selections • Exclude incearn < 1

  8. Recodes • L_Earn = Log(Incearn) • White (dummy variable) • A2 = Age*Age • Interaction terms (with white) • W_Age • W_Age2

  9. Download/Unzip data • If zipped file does not have an icon associated with it, your PC is missing software • Solution • Use my PC • Copy to gift flash drive

  10. Modify program • SAS wants you to identify: • Work area • Data location • Desktop location as a path is: • 'c:\Documents and Settings\lisan\Desktop’ • 'c:\Documents and Settings\lisan\ Desktop\lisan_umich_edu_078.dat'

More Related