
MBA 7025 Statistical Business Analysis Displaying Data – Charts & Graphs Jan 20, 2015


viviana


Presentation Transcript


  1. MBA 7025 Statistical Business Analysis: Displaying Data – Charts & Graphs, Jan 20, 2015

  2. Agenda • Basic Concepts • Displaying Data – Charts & Graphs

  3. Basic Concepts in Data Analysis • Data, Information, and Knowledge • Populations and Samples • Variables and Observations • Types of Data: Categorical and Numerical • Types of Data: Cross Sectional and Time Ordered

  4. Data, Information, and Knowledge [Diagram: Data → Information → Knowledge; diagram labels: Processing, Analysis, Reports, Application, Meaning, Relevance]

  5. Populations and Samples • Population: the collection of all possible entities (observation units). Data on the whole population is usually not available (UNKNOWN). Parameters are used to describe populations; these are constants for a population. • Sample: a subset of the collection of all possible entities (observation units). Data on the sample is what is available (KNOWN). Statistics are used to describe samples; these can vary across samples. • Statistical Inference is the art and science of drawing inferences/conclusions about a population of interest from sample data.
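The distinction between a parameter (constant for a population) and a statistic (varies across samples) can be illustrated with a small simulation. This is only a sketch: the population below is simulated and hypothetical, not data from the course, and the sample size of 30 and the seed are arbitrary choices.

```python
import random
import statistics

# Hypothetical population: 10,000 simulated values (not from the slides).
rng = random.Random(7025)
population = [rng.gauss(50, 10) for _ in range(10_000)]

# Parameter: one fixed number for this population (usually unknown in practice).
population_mean = statistics.mean(population)

# Statistics: computed from samples, so they change from sample to sample.
sample_means = [statistics.mean(rng.sample(population, 30)) for _ in range(5)]

print(f"population mean (parameter): {population_mean:.2f}")
for i, m in enumerate(sample_means, start=1):
    print(f"sample {i} mean (statistic): {m:.2f}")
```

Each sample mean is an estimate of the same population mean; statistical inference is about judging how far such estimates can be trusted.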

  6. Variables and Observations [Figure labels: VARIABLES, OBSERVATIONS, Measurement]

  7. Types of Data: Categorical and Numerical [Figure labels: Categorical, Numerical]

  8. Types of Data: Cross-sectional and Time Ordered. Questions: • What was the absenteeism at Plant 1 in Jan. 2008? • Was the annual absenteeism the same for all plants? • Was absenteeism stable at Plant 1 during 2008?

  9. Agenda • Basic Concepts • Displaying Data – Charts & Graphs

  10. Frequency Tables. A frequency table showing a classification of the AGE of attendees at an event.

      Class              Frequency   Relative Frequency   Percentage
      10 but under 20        3             .15                15
      20 but under 30        6             .30                30
      30 but under 40        5             .25                25
      40 but under 50        4             .20                20
      50 but under 60        2             .10                10
      Total                 20            1                  100
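The relative frequency and percentage columns of the age table follow directly from the class frequencies. A minimal sketch, using the frequencies from the slide (the variable names are illustrative):

```python
# Class labels and frequencies from the age frequency table.
classes = [
    "10 but under 20",
    "20 but under 30",
    "30 but under 40",
    "40 but under 50",
    "50 but under 60",
]
frequencies = [3, 6, 5, 4, 2]

total = sum(frequencies)

# Relative frequency = class frequency / total; percentage = 100 * relative.
relative = [f / total for f in frequencies]
percentage = [100 * r for r in relative]

for label, f, r, p in zip(classes, frequencies, relative, percentage):
    print(f"{label:<16} {f:>3} {r:>6.2f} {p:>5.0f}")
print(f"{'Total':<16} {total:>3} {sum(relative):>6.2f} {sum(percentage):>5.0f}")
```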

  11. Frequency Histograms: a graphical display of the distribution of frequencies. How about cumulative frequency? When do we apply cumulative frequency?
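One common use of cumulative frequency is answering "how many observations fall below a given value?". The sketch below reuses the class frequencies from the age table, prints a rough text histogram, and accumulates the counts; the text rendering is only an illustration, not the chart from the slide.

```python
from itertools import accumulate

# Upper class limits and frequencies from the age frequency table.
upper_limits = [20, 30, 40, 50, 60]
frequencies = [3, 6, 5, 4, 2]

# Running total of the frequencies: the number of attendees under each upper limit.
cumulative = list(accumulate(frequencies))

for upper, f, c in zip(upper_limits, frequencies, cumulative):
    bar = "#" * f  # one '#' per observation in the class
    print(f"under {upper}: {bar:<6} frequency={f} cumulative={c}")
```

Reading the last column, 14 of the 20 attendees are under 40.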

  12. Developing Frequency Tables and Histograms • Sort Raw Data in Ascending Order: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 • Find Range: 58 - 12 = 46 • Select Number of Classes: 5 (usually between 5 and 15) • Compute Class Interval (width): 10 (range/classes = 46/5 = 9.2, then round up) • Determine Class Boundaries (limits): 10, 20, 30, 40, 50 • Compute Class Midpoints: 15, 25, 35, 45, 55 • Count Observations & Assign to Classes
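The steps on this slide can be sketched in code using the same 20 ages. Two details are assumptions rather than something the slide states: the width is rounded up with `math.ceil`, and the first class starts at the largest multiple of the width not exceeding the minimum (which gives 10, matching the slide).

```python
import math

ages = [12, 13, 17, 21, 24, 24, 26, 27, 27, 30,
        32, 35, 37, 38, 41, 43, 44, 46, 53, 58]

# Step 1: sort the raw data in ascending order.
data = sorted(ages)

# Step 2: range = maximum - minimum.
data_range = data[-1] - data[0]

# Step 3: choose the number of classes (usually between 5 and 15).
num_classes = 5

# Step 4: class width = range / classes, rounded up (46 / 5 = 9.2 becomes 10).
width = math.ceil(data_range / num_classes)

# Step 5: lower class limits; the starting point is an assumption (see lead-in).
start = (data[0] // width) * width
lower_limits = [start + i * width for i in range(num_classes)]

# Step 6: class midpoints.
midpoints = [lo + width / 2 for lo in lower_limits]

# Step 7: count observations; each class covers "lower but under lower + width".
counts = [0] * num_classes
for x in data:
    counts[(x - start) // width] += 1

for lo, mid, c in zip(lower_limits, midpoints, counts):
    print(f"{lo} but under {lo + width}: midpoint={mid:.0f}, frequency={c}")
```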

  13. Bar and Pie Charts: Displaying Categorical Data

      Investment Category   Amount (in thousands $)   Percentage
      Stocks                        46.5                 42.27
      Bonds                         32                   29.09
      CD                            15.5                 14.09
      Savings                       16                   14.55
      Total                        110                  100

      Pie chart labels: Stocks 42%, Bonds 29%, Savings 15%, CD 14%
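The percentages shown for each investment category are each amount's share of the total. A short sketch using the amounts from the slide, with a text bar chart standing in for the graphic (the scale of one `#` per 2 percentage points is arbitrary):

```python
# Investment amounts in thousands of dollars, from the slide.
amounts = {"Stocks": 46.5, "Bonds": 32.0, "CD": 15.5, "Savings": 16.0}

total = sum(amounts.values())

# Share of the total for each category, as a percentage with two decimals.
percentages = {name: round(100 * value / total, 2) for name, value in amounts.items()}

for name, pct in percentages.items():
    bar = "#" * round(pct / 2)  # one '#' per 2 percentage points
    print(f"{name:<8} {pct:>6.2f}% {bar}")
```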

  14. Categorical Bivariate Data: Side-by-Side Charts

  15. Scatter Plot for bivariate numerical data. Shows the relationship between two variables. Can one be used to predict the other? Time-Series and Regression Analysis are used to predict one variable's value based on the other. Correlation analysis is used to measure the strength of the linear relationship between two variables.
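The strength of a linear relationship is commonly summarized by the Pearson correlation coefficient, which ranges from -1 to 1. The slide does not name a specific measure, so treat this as one standard choice. The `x` and `y` values and the `pearson_r` helper below are hypothetical, made up for illustration.

```python
import math

# Hypothetical paired observations (not from the slides).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]


def pearson_r(xs, ys):
    """Pearson correlation: co-variation of xs and ys scaled by their spreads."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    sxx = sum((a - mean_x) ** 2 for a in xs)
    syy = sum((b - mean_y) ** 2 for b in ys)
    return sxy / math.sqrt(sxx * syy)


r = pearson_r(x, y)
print(f"r = {r:.4f}")
```

A value near 1 or -1 indicates a strong linear relationship, and a value near 0 indicates a weak one; it does not by itself say which variable should be used to predict the other.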
