
Submit Predictions

Goal. Predict who survived the Titanic disaster. Hypothesis. Women and children first. Get Data. Read the dataset into Excel, R, etc. Data Management. Some age data are missing, so analyze gender only. Statistics & Analysis. Survival rate: 74% for women, 19% for men. Submit Predictions. 320 / 418 = 76.5%.

dieter

Presentation Transcript


  1. Goal: Predict who survived the Titanic disaster. Hypothesis: Women and children first. Get Data: Read the dataset into Excel, R, etc. Data Management: Some age data are missing, so analyze gender only. Statistics & Analysis: Survival rate of 74% for women, 19% for men. Submit Predictions: 320 / 418 = 76.5%.
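The gender-only workflow on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the presenter's actual analysis: the rows in `train` are invented, and the helper names (`survival_rate_by_group`, `predict_by_majority`, `accuracy`) are hypothetical. Only the ratio 320 / 418 comes from the slide.

```python
from collections import defaultdict

# Toy rows (sex, survived) invented for illustration; NOT the real Titanic data.
train = [
    ("female", 1), ("female", 1), ("female", 1), ("female", 0),
    ("male", 0), ("male", 0), ("male", 0), ("male", 0), ("male", 1),
]

def survival_rate_by_group(rows):
    """Return {group: fraction survived} for (group, survived) rows."""
    totals = defaultdict(int)
    survivors = defaultdict(int)
    for group, survived in rows:
        totals[group] += 1
        survivors[group] += survived
    return {g: survivors[g] / totals[g] for g in totals}

def predict_by_majority(rates, group):
    """Predict survival (1) when the group's survival rate exceeds 50%."""
    return 1 if rates[group] > 0.5 else 0

def accuracy(predictions, truth):
    """Fraction of predictions that match the true labels."""
    correct = sum(p == t for p, t in zip(predictions, truth))
    return correct / len(truth)

rates = survival_rate_by_group(train)
preds = [predict_by_majority(rates, sex) for sex, _ in train]
train_accuracy = accuracy(preds, [s for _, s in train])

# The slide's reported score is the same kind of ratio: correct / total.
reported_score = 320 / 418
```

With survival rates as lopsided as those on the slide, the majority rule reduces to "predict that every woman survived and every man did not".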

  2. Predictor Variables

  3. Age • All: N = 891 • Data: N = 714 • Missing: N = 177
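The counts on this slide are simple bookkeeping: rows with a recorded age plus rows without one add up to the full sample. A small sketch, assuming missing ages are represented as `None` (the `ages` list is invented; only 891 and 714 come from the slide):

```python
# Hypothetical age column; None marks a passenger with no recorded age.
ages = [22.0, 38.0, None, 35.0, None, 54.0, 2.0, None]

n_all = len(ages)
n_missing = sum(a is None for a in ages)
n_data = n_all - n_missing
missing_share = n_missing / n_all

# The same bookkeeping applied to the slide's counts.
slide_all, slide_data = 891, 714
slide_missing = slide_all - slide_data
slide_missing_share = slide_missing / slide_all  # roughly one passenger in five
```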

  4. Decision Trees • Dependent variable (Y): continuous or categorical • Independent variables (X's): continuous or categorical • At each node, the decision tree looks for the split of the sample that leads to the most differentiation on Y
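To make "the split that leads to the most differentiation on Y" concrete, here is a minimal sketch for one continuous X and a binary Y. It scores candidate thresholds by Gini impurity, which is an assumption chosen for illustration; the slides do not say which criterion was used at this point. The data and the function names are invented.

```python
def gini(labels):
    """Gini impurity of a list of 0/1 labels (0.0 means a pure node)."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def best_threshold(values, labels):
    """Find the threshold on a continuous X that minimizes weighted impurity.

    Returns (threshold, weighted_impurity); rows with value <= threshold go left.
    The threshold is None when no split improves on the unsplit node.
    """
    pairs = sorted(zip(values, labels))
    distinct = sorted({v for v, _ in pairs})
    best = (None, gini(labels))  # leaving the node unsplit is the baseline
    for lo, hi in zip(distinct, distinct[1:]):
        threshold = (lo + hi) / 2.0  # midpoint between neighboring values
        left = [y for v, y in pairs if v <= threshold]
        right = [y for v, y in pairs if v > threshold]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(pairs)
        if score < best[1]:
            best = (threshold, score)
    return best

# Invented toy data: the children survive, most adults do not.
ages = [2, 4, 8, 25, 30, 40, 55, 60]
survived = [1, 1, 1, 0, 0, 1, 0, 0]
threshold, impurity = best_threshold(ages, survived)
```

On this toy sample the search separates the three children from the adults, because that split leaves one child node that is pure and one adult node that is mostly non-survivors.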

  5. Age

  6. Decision Trees • Splits are chosen to maximize the data likelihood (equivalently, to minimize deviance)
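For a binary outcome, one common way to write the deviance of a node is minus twice the log-likelihood of its labels under the node's own survival rate, so a split that raises the likelihood lowers the deviance. The sketch below assumes that binomial form; the function names and the toy labels are illustrative only.

```python
import math

def node_deviance(labels):
    """Binomial deviance: -2 * log-likelihood of 0/1 labels under the node's own rate."""
    n = len(labels)
    k = sum(labels)
    if n == 0 or k == 0 or k == n:
        return 0.0  # a pure (or empty) node fits its data perfectly
    p = k / n
    log_likelihood = k * math.log(p) + (n - k) * math.log(1.0 - p)
    return -2.0 * log_likelihood

def deviance_reduction(parent, left, right):
    """How much a split lowers total deviance; larger is better."""
    return node_deviance(parent) - (node_deviance(left) + node_deviance(right))

# Invented labels: the split sends all survivors left and all non-survivors right.
parent = [1, 1, 1, 0, 0, 0, 0, 1]
left, right = [1, 1, 1, 1], [0, 0, 0, 0]
gain = deviance_reduction(parent, left, right)
```

A perfect split like this one removes all of the parent's deviance, which for eight passengers at a 50% survival rate is 16 ln 2.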

  7. Prediction and Missing Values • Is Age correlated or associated with other variables?
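If Age is associated with another variable, that variable can be used to fill in missing ages. The sketch below shows one simple option, group-median imputation. Both the choice of passenger class as the grouping variable and the rows themselves are assumptions for illustration; the slide only raises the question.

```python
from collections import defaultdict
from statistics import median

# Invented rows: (passenger_class, age); None marks a missing age.
rows = [(1, 40.0), (1, 50.0), (1, None), (3, 20.0), (3, 22.0), (3, 30.0), (3, None)]

def impute_by_group_median(rows):
    """Replace each missing age with the median age of that row's group.

    Falls back to the overall median when a group has no known ages.
    """
    known = defaultdict(list)
    for group, age in rows:
        if age is not None:
            known[group].append(age)
    medians = {g: median(v) for g, v in known.items()}
    overall = median([a for _, a in rows if a is not None])
    return [(g, a if a is not None else medians.get(g, overall)) for g, a in rows]

filled = impute_by_group_median(rows)
```

Known ages pass through unchanged; only the `None` entries are replaced.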

  8. Goal: Predict who survived the Titanic disaster. Hypothesis: Women and children first. Get Data: Read the dataset into Excel, R, etc. Data Management: Some age data are missing, so analyze gender only. Statistics & Analysis: Survival rate of 74% for women, 19% for men. Submit Predictions: 320 / 418 = 76.5%.

  9. Gender

  10. Gender and Age • The tree grows by optimizing only the split at the current node rather than optimizing the entire tree • The tree stops growing when further splits become ineffective
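The two points on this slide, greedy node-by-node growth and stopping when a split no longer helps, can be sketched as a small recursive builder. This is an illustrative toy, not the software used in the presentation: the Gini criterion, the `min_gain` and `max_depth` stopping rules, the 0/1 encoding of gender as `is_female`, and the passenger rows are all assumptions.

```python
def gini(labels):
    """Gini impurity of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def best_split(rows, labels):
    """Greedy step: best (feature, threshold, impurity) for this node only."""
    best = None
    for feature in rows[0]:
        values = sorted({r[feature] for r in rows})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [y for r, y in zip(rows, labels) if r[feature] <= t]
            right = [y for r, y in zip(rows, labels) if r[feature] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if best is None or score < best[2]:
                best = (feature, t, score)
    return best  # None when no feature has two distinct values

def grow(rows, labels, min_gain=0.01, depth=0, max_depth=3):
    """Grow a tree; stop when the best split barely helps or depth runs out."""
    majority = 1 if 2 * sum(labels) > len(labels) else 0
    split = best_split(rows, labels) if depth < max_depth else None
    if split is None or gini(labels) - split[2] < min_gain:
        return {"predict": majority}  # leaf: further splitting is ineffective
    feature, t, _ = split
    left_idx = [i for i, r in enumerate(rows) if r[feature] <= t]
    right_idx = [i for i, r in enumerate(rows) if r[feature] > t]
    return {
        "feature": feature,
        "threshold": t,
        "left": grow([rows[i] for i in left_idx], [labels[i] for i in left_idx],
                     min_gain, depth + 1, max_depth),
        "right": grow([rows[i] for i in right_idx], [labels[i] for i in right_idx],
                      min_gain, depth + 1, max_depth),
    }

def predict(tree, row):
    """Walk from the root to a leaf and return its majority class."""
    while "predict" not in tree:
        tree = tree["left"] if row[tree["feature"]] <= tree["threshold"] else tree["right"]
    return tree["predict"]

# Invented passengers: all women survive; among men, only the young boys do.
passengers = [
    {"is_female": 1, "age": 30}, {"is_female": 1, "age": 35},
    {"is_female": 1, "age": 40}, {"is_female": 1, "age": 50},
    {"is_female": 0, "age": 5}, {"is_female": 0, "age": 7},
    {"is_female": 0, "age": 28}, {"is_female": 0, "age": 30},
    {"is_female": 0, "age": 45}, {"is_female": 0, "age": 60},
]
survived = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0]
tree = grow(passengers, survived)
```

On this toy sample the root splits on gender, the male branch then splits on age, and the female branch stops immediately because it is already pure. Each split is chosen without looking ahead, which is what makes the procedure greedy.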

  11. Prediction: Gender + Age

  12. Goal: Predict who survived the Titanic disaster. Hypothesis: Women and children first. Get Data: Read the dataset into Excel, R, etc. Data Management: Some age data are missing, so analyze gender only. Statistics & Analysis. Submit Predictions.

  13. Goal: Predict who survived the Titanic disaster. Hypothesis: Women and children first. Get Data: Read the dataset into Excel, R, etc. Data Management: Age + Gender. Statistics & Analysis. Submit Predictions.

  14. Kitchen Sink

  15. Kitchen Sink

  16. Decision Trees • Popular implementations • CART: Classification And Regression Tree • CHAID: CHi-squared Automatic Interaction Detector • CHAID allows multiway splits, which gives a wider tree • CART uses binary splits
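As its name indicates, CHAID is built around the chi-squared statistic. The sketch below computes the Pearson chi-squared statistic for a contingency table and applies it to one three-way split and one binary split of the same invented counts. It illustrates the statistic and the multiway versus binary contrast only; it is not an implementation of CHAID or CART, and CART implementations typically score binary splits with an impurity measure rather than chi-squared.

```python
def chi_squared(table):
    """Pearson chi-squared statistic for a contingency table (rows of counts)."""
    row_totals = [sum(r) for r in table]
    col_totals = [sum(c) for c in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the row variable and the outcome were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat

# Invented counts: rows = passenger class 1, 2, 3; columns = (died, survived).
by_class = [[20, 30], [25, 25], [40, 10]]

# Multiway split: one branch per class, giving a wider tree.
multiway_stat = chi_squared(by_class)

# Binary split: classes {1, 2} merged against class {3}.
binary_stat = chi_squared([[45, 55], [40, 10]])
```

A larger statistic means the branches differ more in survival than independence would predict. The two values are not directly comparable without accounting for their different degrees of freedom.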
