1 / 41

Barnan Das

Addressing Machine Learning Challenges to Perform Automated Prompting. Barnan Das. PhD Preliminary Exam. November 8, 2012.

Télécharger la présentation

Barnan Das

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Addressing Machine Learning Challenges to Perform Automated Prompting Barnan Das PhD Preliminary Exam November 8, 2012 ***Self-portraits by William Utermohlen, an American artist living in London, after he was diagnosed with Alzheimer’s disease in 1995. Utermohlen died from the consequences of Alzheimer’s disease in March 2007.

  2. 36 million Worldwide Dementia population Actual and expected number of Americans >=65 year with Alzheimer’s $200 billion Payment for care in 2012 15 million Unpaid caregivers Source: World Health Organization and Alzheimer’s Association.

  3. Automated Prompting Help with Activities of Daily Living (ADLs)

  4. Existing Work • Rule-based (temporal or contextual) • Activity initiation • RFID and video-input based prompts for activity steps Our Contribution • Learning-based • Sub-activity level prompts • No audio/video input

  5. System Architecture Published at ICOST 2011 and Journal of Personal and Ubiquitous Computing 2012.

  6. Outline of Work

  7. Outline of Work

  8. prompt Off-line Classification of Activity Steps no-prompt

  9. Data Collection

  10. Class Distribution Total number of data points 3980

  11. Imbalanced Class Distribution

  12. Existing Work • Preprocessing • Sampling • Over-sampling minority class • Under-sampling majority class • Oversampling minority class • Spatial location of samples in Euclidean feature space

  13. Proposed Approach • Preprocessing technique • Oversampling minority class • Based on Gibbs sampling Attribute Value Markov Chain Node Submitted at Journal of Machine Learning Research, 2012.

  14. Proposed Approach Markov Chains Minority Class Samples Majority Class Samples

  15. (wrapper-based)RApidlyCOnvergingGibbs sampler: RACOG & wRACOG • Differ in sample selection from Markov chains • RACOG: • Based on burn-in and lag • Stopping criteria: predefined number of iterations • Effectiveness of new samples is not judged • wRACOG: • Iterative training on dataset, addition of misclassified data points • Stopping criteria: No further improvement of performance measure (TP rate)

  16. Experimental Setup Implemented Gibbs sampling, SMOTEBoost, RUSBoost

  17. Results (RACOG & wRACOG) Geometric Mean (TP Rate, TN Rate) TP Rate

  18. Results (RACOG and wRACOG) ROC Curve

  19. Outline of Work

  20. Overlapping Classes

  21. Overlapping Classes in Prompting Data 3D PCA Plot of prompting data

  22. Existing Work • Discard data of the overlapping region • Treat overlapping region as a separate class

  23. Tomek Links

  24. Cluster-Based Under-Sampling(ClusBUS) Form clusters Under-sampling interesting clusters Published in IOS Press Book on Agent-Based Approaches to Ambient Intelligence, 2012.

  25. Experimental Setup

  26. Results (ClusBus)

  27. Outline of Work

  28. Outline of Work

  29. s1 s2 Unsupervised Learning of Prompt Situations on Streaming Sensor Data s4 s1 s3 s2

  30. Motivation • Several hundred man-hours to label activity steps • High probability of inaccuracy • Needs activity-step recognition model

  31. Knowledge Flow

  32. Data Collection

  33. Modeling Activity Errors Abnormal Occurrence Delayed Occurrence

  34. Modeling Delayed Occurrence Elapsed Time Sensor Frequency

  35. Predicting Errors At every sensor event evaluate: Likelihood of sensor si occurrence for participant pj Probability of elapsed time for current nth occurrence of sensor si Probability of all sensor frequency for current nth occurrence of sensor si

  36. Preliminary Experiments Elapsed Time No observable trend Sensor Frequency No observable trend

  37. Current Obstacles • Noisy data • Unwanted sensor events, specifically, object sensors • Erroneous activity sequences not suitable for model evaluation

  38. Proposed Plan • Identifying suitable distributions for modeling sensor frequency and elapsed time • Finding out additional statistical measures that can model the errors better • Building generalized prompt model for all six ADLs (if at all possible(?)) • Need data to evaluate proposed model • Synthetically generate erroneous sequences from normal sequences(?) • Collect more data if necessary

  39. Publications

More Related