Frequency Distributions in Statistics
Learn about frequency, percentage, and cumulative distributions in statistics, with practical examples and graphing techniques. Explore the importance of normal curves and data interpretation.
Frequency Distributions in Statistics
E N D
Presentation Transcript
Statistics • Ordering & Grouping of Information • N = 50, a test score of 83% • where does it fit in the class?? • N = 25, a GRF value of 27 N/kg • where does it fit in the sample??
Nomenclature • Frequency: number of cases or subjects or occurrences • represented with f • ie f = 12 for a score of 25 • 12 occurrences of 25 in the sample
Nomenclature • Percentage: number of cases or subjects or occurrences expressed per 100 • represented with P or % • ie f = 12 for a score of 25 when n = 25 • P = 12/25*100 = 48%
Caveat (Warning) • Should report the f when presenting percentages • ie 80% of the elementary students came from a family with an income < $25,000 • different interpretation if n = 5 compared to n = 100 • report in literature as • f = 4 (80%) OR • 80% (f = 4) OR 80% (n = 4)
Numerator Monster Headline: State Farm distributes more than $1 billion in repayments to policyholders More than $34 million in Illinois Pantagraph, 6/13/00
Numerator Monster Pantagraph, 6/13/00
Numerator Monster Pantagraph, 6/13/00
Frequency Distribution of Test Scores • 40 items on exam • Most students scored >34 • skewed • more scores at one end of the scale
Cumulative frequency • Cumulative Frequency: how many subjects in and below a given score (SPSS???)
Cumulative percentage • Cumulative Percentage: what % of subjects in and above a given score
Cumulative percentage • Cumulative Percentage: what % of subjects in and below a given score
Eyeball data check: Graphing with SPSS • Stem and Leaf plot (quick viewing of data distribution) • Bar Chart: each score separate bar • Histogram: malleable bar graph • Polygon: line graph (not available on SPSS)
Fast look at shape of distribution shows f numerically & graphically stem is value, leaf is f Stem and Leaf(SPSS: Explore command) Frequency Stem & Leaf 2.00 Extremes (=<25.0) 2.00 28 . 00 2.00 29 . 00 1.00 30 . 0 1.00 31 . 0 3.00 32 . 000 1.00 33 . 0 6.00 34 . 000000 3.00 35 . 000 4.00 36 . 0000 8.00 37 . 00000000 Stem width: 1 Each leaf: 1 case(s)
Format of Bar Chart Y axis (ordinate) f X axis (abcissa) scores/categories
Rules for Frequency Distributions • 10 to 20 groups of score intervals
Rules for Frequency Distributions • 10 to 20 groups of score intervals • intervals of the same size
Rules for Frequency Distributions • 10 to 20 groups of score intervals • intervals of the same size • score interval size calculated as (Max - Min) / # score intervals
Rules for Frequency Distributions • 10 to 20 groups of score intervals • intervals of the same size • score interval size calculated as (Max - Min) / # score intervals • Need more information about calculating by hand?
Format of Histogram Y axis (ordinate) f Can be manipulated X axis (abcissa) Scores / categories
Shapes of Distributions • Scores arranged from low to high creates a distribution • with large n, distribution becomes smoother (a curve) • characteristic curves are named
Normal Curve (bell-shaped) f high low scores
Bell Curve Importance f high low scores Height or weight or VO2 or GRF or Vision or Skill
Bell-curve importance • Natural data typically follows the normal curve • height, weight, strength, aerobic capacity, vitamin intake • The Happiness Meter • Basis of inferential statistics
Skewed Distribution f Positive Skewness high low scores
Skewed Distribution f Negative Skewness high low scores
Interesting Distribution Assessment f Expected distribution of agent-paid claims (State Farm Ins.) high low $$ amount
Interesting Distribution Assessment f Observed distribution of an agent-paid claims (suspicious.) high low $$ amount
Table 4.1 data as Histogram Normal Curve
Always plot out data as a first step in the analysis
Percentiles(Percentile Ranks) • Percentile: Values above and below which certain percentages of cases fall. • ie Doctors • 25% earn < $100,000 • 25% earn > $225,000 • How to divide scores into these ranks?
Percentiles (Quartiles) Data from Table 4.1