1 / 9

Voice Recognition for Wheelchair Control

Voice Recognition for Wheelchair Control. Theo Theodoridis , Xin Liu, and Huosheng Hu. Contents. Introduction Speech Recognition Structure Microsoft Speech SDK Testing. Introduction. Task: To use voice recognition for controlling a wheelchair

acton
Télécharger la présentation

Voice Recognition for Wheelchair Control

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Voice Recognition for Wheelchair Control Theo Theodoridis, Xin Liu, and Huosheng Hu

  2. Contents • Introduction • Speech Recognition Structure • Microsoft Speech SDK • Testing

  3. Introduction • Task: To use voice recognition for controlling a wheelchair • Purpose: To aid people with limited physical capability • Software: The Microsoft Speech SDK • Hardware: The Essex robotic wheelchair • Experimentation: The Essex robotic arena

  4. Start Speech Recognition Structure Sampling real-time signals • Driving components: - Start: Capture voice command - Sampling: Sample voice signal in real-time - Calculate energy: Validate signal’s presence - Calculate zero-crossing rate: Validate signal’s changes - Calculate entropy: Validate signal’s utterance - Speech recognition by parser: Microsoft Speech SDK - Driving: Execute the commands (Forw, Back, Left, Right, Stop) Calculate energy Calculate zero-crossing rate Calculate entropy Speech recognition by parser Driving Speech Recognition flow chart

  5. Microsoft Speech SDK • Features: ∙ Developed by Microsoft’s Speech Technologies Group ∙ Aims to recognize audio speech and perform text-to-speech synthesizing ∙ This API can be used on common programming languages including C++ • FFTW Core: ∙ FFTW is a ready-made library for computing discrete Fourier transform (DFT) ∙ Developed using the C++ language by MIT ∙ Can be used for increasing the running speed • Recognition Accuracy: ∙ Four commands are employed for control ∙ Exceptional recognition accuracy ∙ Adequate real-time control

  6. Microsoft Speech SDK • Recognition Tests: ∙ Three noisy background cases applied: (a) Silent – no noise (66db) (b) Music – noisy background (76db) (c) Live Singing – very noisy background (76db) ∙ Accuracies achieved: (a) Silent – 91% at 66db (b) Music – 84% at 66-76db (c) Live Singing – 58% at 66-76db Overall accuracy: 77.7%

  7. Testing • Experimental Environment 1 Environment: A simple corridor with no obstacles Task: Reach destination at the same horizontal coordinate as the origin • Experimental Environment 2 Environment: An open area with two obstacles Task: Avoid obstacles in a zigzag fashion and return back to the origin

  8. Testing

  9. Thank You

More Related