1 / 22

Education 793 Class Notes

Education 793 Class Notes. Descriptive Statistics, Central Tendency and Variability 10 September 2003. Today’s agenda. Lab details finalized Your announcements Chapter 3: Frequency Distributions Chapter 4: Central Tendency and Variability. Data Matrix.

Télécharger la présentation

Education 793 Class Notes

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Education 793 Class Notes Descriptive Statistics, Central Tendency and Variability 10 September 2003

  2. Today’s agenda • Lab details finalized • Your announcements • Chapter 3: Frequency Distributions • Chapter 4: Central Tendency and Variability

  3. Data Matrix Data Matrix is defined as a matrix in which subjects are listed in the rows and variables corresponding to each subject are listed in the columns Example

  4. Basic Terms and Concepts • Frequency Distribution: Orders values from lowest to highest and gives the number or percent of subjects with each value. • Can be presented with: • tabular form • histogram • frequency polygon • stem and leaf plot

  5. Tables and Polygons

  6. Histograms and Bar Charts

  7. Stem and Leaf

  8. Shape of Frequency Distributions • Symmetric: when two halves of the distribution mirror each other

  9. Rectangular Bimodal Unimodal Multimodal Modality • Modality: the number of relative peaks the distribution exhibits

  10. Skew A. B. Left (Negative) Skew Right (Positive) Skew

  11. Kurtosis (Peakedness) Flat Peak, No Tails High Peak, Long Tails

  12. Central Tendency and Variability 1) Central tendency Mode Median Mean 2) Variation Range Semi-interquartile Range Variance Standard deviation

  13. Central Tendency Mode   Most frequently occurring score in a distribution Median   Point on the distribution below which one-half (50%) of the scores fall Mean Arithmetic average of scores within a distribution Special properties: Sum of deviations of scores mean is zero Sum of squares of deviations is at a minimum

  14. Mean, Median, Mode A distribution of GRE scores: 340 450 510 580 580 600 620 660 670 710 Mode: Median: Mean: 572  A second distribution of GRE scores: 340 450 510 580 580 600 620 660 670 1710* Mode: Median: Mean: 672 *Heidi’s score

  15. Box Plots

  16. Shapes and Statistics A. B.

  17. Variability Identifying the middle of a distribution as revealed by measures of central tendency is of limited value unless one also knows how much the scores in the distribution differ from each other. As such, measures of how much scores differ from each other should always accompany measures of central tendency. These measure of how much the scores differ from each other are called measures of "spread“ or variability.

  18. Basic Measures • Range The range is the simplest measure of spread: It is equal to the difference between the largest and the smallest values. The range can be a useful measure of spread primarily because it is so easily understood, but it is otherwise seldom used in real statistical practice. It is very sensitive to extreme scores, however, because it is based on only two values. • The semi-interquartile range Computed as one half the difference between the 75th percentile (often called Q3) and the 25th percentile (Q1), or (Q3 - Q1) / 2. Because half the scores in a distribution lie between Q3 and Q1, semi-interquartile range is 1/2 the distance needed to cover 1/2 the scores. In a symmetric distribution, an interval stretching from one semi-interquartile range below the median to one semi- interquartile above the median will contain 1/2 of the scores. This will not be true for a skewed distribution, however.

  19. Variance and Standard Deviation The variance is a widely used measure of spread. It is computed as the average squared deviation of each number from its mean. The formula (in summation notation) for the variance in a population in which M is the mean and N is the number of scores is: The standard deviation is the square root of the variance. It is the most commonly used measure of spread. An important attribute of the standard deviation as a measure of spread is that it is possible to compute the percentile rank associated with any given score if the mean and standard deviation of a normal distribution are known.

  20. Calculating Standard Deviation Or to calculate by hand there is a shortcut formula

  21. Numbers =13.5

  22. Next Week • Coursepack • Say it with Figures Chapter7: • The Cross-Tabulation Refines • and Available through JSTOR at www.lib.umich.edu

More Related