1 / 12

Phone Reader Project

Phone Reader Project. Presenter: Marilyn Bihina Supervisor: James Connan. Presentation. Intro Related work Design Implementation Problems encountered Functionalities of the system Current results Questions. Introduction - Reminder . What is a phone reader?.

gayle
Télécharger la présentation

Phone Reader Project

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Phone Reader Project • Presenter: Marilyn Bihina • Supervisor: James Connan

  2. Presentation • Intro • Related work • Design • Implementation • Problems encountered • Functionalities of the system • Current results • Questions

  3. Introduction - Reminder • What is a phone reader? • The user takes a picture of a text with his phone and selects a language • The user listens to the text extracted from the picture

  4. The importance • Blind people • Visually impaired • Non-native speakers

  5. Related work • CapturaTalk: Android application designed in UK in 2008 • knfbREADER(1000 USD): First Cell Phone that Reads to the Blind and Dyslexic (USA) in 2008 • Google goggles : visual search system does not automatically read

  6. Design of the system User takes a photo with his phone The photo is uploaded to the server The image is processed OCR performed + post-processing Text translated Text to Speech performed The user hears the text being read

  7. Implementation • Tools - Ubuntu - smart phone - Android SDK - web server (Lamp server) - java (Eclipse) - Imagemagick - Tesseract OCR - translation program - Text-To-Speech (espeak / Android TTS)

  8. Problems encountered • Design a system for blind people (testing) • Provide accurate results (OCR)

  9. Progression until today • Image pre-processing • Web server • Optical Character Recognition • Text to Speech • Translation • Client interface (just starting)

  10. Functionalities not developed yet • Post processing: correct spelling mistakes • User interface for the server • Testing and improving the system • Final documentation

  11. Current results • Original picture -> processed picture

  12. ???????????? ???????????? ????????????

More Related