Agenda

Agenda • Survey approaches to hypothesis testing • The logic and methods of sampling

Alternatives to randomization • Matched-group quasi-experiments • Selecting variables for group matching • Before-and-after quasi-experiments • Using each group as its own control • Non-experimental survey approaches • Using statistical analysis to address threats

Matched Group Quasi-Experiment X O X O Matching O O

Before-and-After Quasi-Experiment O1 O2 O3 X O4 O5 O6

Cross-Sectional Survey correlation X O Z 1 Z2 X Y Z3 Z4

If Z explains the X-Y relationship, then controlling for Z should eliminate the correlation X Y Newspaper reading Public affairs knowledge Education Whenhigh Z Xhigh Yhigh When Zmedium Xmedium Ymedium When Zlow Xlow Ylow

Surveys • Ubiquitous • Many (not all) cross-sectional • Causal inferences open to multiple threats • Selection bias on independent variable • Temporal order not established • Major strength: Representation of large populations

Why survey? • To infer the characteristics of populations • Populations have parameters • Samples give us parameter estimates • Central tendencies (e.g., average age, typical rates of TV viewing) • Variations (e.g., range of incomes) • Relationships (e.g., correlation of income with TV viewing)

Some key terms • Population vs. frame vs. sample Sample Population (Census) Sampling Frame

Generalization • Are sample parameter estimates biased? • Bias depends upon: • Sampling method • Response rate • Measures used

Sample designs • Availability (convenience) • Quota • Snowball • Purposive • Probability

Probability sampling • Units selected by chance • Permits estimation of sampling error • Design options • Simple random • Systematic random • Stratified random • Cluster or multi-stage random

Logic of probability sampling Example: What percentage of households have broadband connection? Community of 10,000 households • Randomly select 25 • Randomly select another 25 • Randomly select another 25 …

Results of 100 Samples of N = 25 25 20 15 10 5 Std. Dev = 9.07 Mean = 47 0 N = 100.00 25 35 45 55 65 75 SAMPL25 95 our of 100 of surveys within ~18% of 47 One survey estimated 74% Two-thirds of surveys within ~9% of 47 Three surveys estimated 54% Mean

Results of 100 Samples of N = 100 25 20 15 10 5 Std. Dev = 4.29 Mean = 45 0 N = 100.00 25 35 45 55 65 75 SAMPL100

Results of 100 Samples of N = 500 25 20 15 10 5 Std. Dev = 2.39 Mean = 45 0 N = 100.00 25 35 45 55 65 75 SAMPL500

Results of 100 Samples of N = 1000 25 20 15 10 5 Std. Dev = 1.81 Mean = 45 0 N = 100.00 25 35 45 55 65 75 SAMP1000

Confidence intervals • Express our statistical confidence in a parameter “45 percent plus or minus 3 percent” “The 95% confidence interval is from 42 to 48 percent” Meaning: In 100 purely random samples of this size, approximately 95 of the intervals from 42 to 48 percent include the population value.

Confidence intervals • Only possible with probability samples • Mainly a function of sample size • Big boosts in accuracy as samples grow to 1,000 or 1,500 respondents (where intervals are usually +/- 3%) • Do NOT depend on population size* • Will be larger when subgroups are examined

Confidence intervals • Estimated based solely on sampling theory • Assume high response rates • Do not take into account other sources of noise or bias • Questionnaire design • Question wording

For Thursday • Review assignment three • Implementing survey research designs • Schutt, Ch. 8 • Lavrakas on telephone surveys • Focus on basic design and sampling -- not questionnaire design

Agenda

Agenda

Presentation Transcript

Agenda

Agenda

Agenda

Agenda

Agenda

Agenda

Agenda

Agenda

Agenda

Agenda

Agenda:

Agenda

Agenda

AGENDA