Phone Reader Project

This presentation outlines the Phone Reader project, which helps users read and understand text captured in a photo. It targets non-native speakers, people with dyslexia, blind people, and people who cannot read. A photo taken with the phone is pre-processed (cropping, despeckling, deskewing, and related steps), passed to Tesseract for OCR, and the recognized text is converted to speech. Future work includes displaying special characters correctly, adding a spell checker, and supporting more languages. The presentation includes a demo and test results.

Presentation Transcript


  1. Phone Reader Project • Presenter: Marilyn Bihina • Supervisor: James Connan

  2. Overview of the presentation • Intro - Statement of the problem • Description of the Phone Reader project • Design of the system • Implementation • Demo • Tests • Future work • Questions

  3. Statement of the problem • Reading and understanding a text can be difficult for: • Non-native speakers • People with dyslexia and blind people • People who cannot read • The Phone Reader project aims to help users read and understand text from a picture.

  4. Description of the Phone Reader project • What is a phone reader? A phone application that takes a photo of text and converts the text to speech.

  5. Design of the system

  6. Implementation

  7. Implementation • Pre-processing methods used with ImageMagick: • Crop • Despeckle • Grayscale • Threshold • Deskew • Unsharp
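The six pre-processing steps listed on this slide could be chained in a single ImageMagick call. The following is a minimal sketch, not the project's actual script: it assumes the steps run in the order listed, and the function names, the crop geometry, and the parameter values (`50%`, `40%`, `0x1`) are illustrative assumptions rather than values taken from the presentation.

```python
import subprocess
from typing import List, Optional


def build_convert_command(
    src: str,
    dst: str,
    crop: Optional[str] = None,
    executable: str = "convert",  # assumption: ImageMagick 6; use "magick" on version 7
) -> List[str]:
    """Build an ImageMagick command applying the slide's steps in listed order."""
    cmd = [executable, src]
    if crop is not None:
        # Crop to the text region (geometry such as "800x600+50+40");
        # +repage resets the canvas offset left behind by the crop.
        cmd += ["-crop", crop, "+repage"]
    cmd += [
        "-despeckle",            # remove isolated noise pixels
        "-colorspace", "Gray",   # convert to grayscale
        "-threshold", "50%",     # binarize (illustrative value)
        "-deskew", "40%",        # straighten tilted text (illustrative value)
        "-unsharp", "0x1",       # sharpen character edges (illustrative value)
    ]
    cmd.append(dst)
    return cmd


def preprocess(src: str, dst: str, crop: Optional[str] = None) -> None:
    """Run the command; requires ImageMagick to be installed."""
    subprocess.run(build_convert_command(src, dst, crop), check=True)
```

Separating command construction from execution makes it easy to inspect or log the exact options before running them, which helps when tuning values against a difficult photo.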

  8. Demo • Video 1 • Video 2

  9. Tests (figure: four panels numbered 1 to 4)

  10. Tests with good picture quality (figure: original image and processed image)

  11. Tesseract results from a good picture
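Results like these come from running Tesseract on the processed image. A minimal sketch of that step using the Tesseract command-line tool follows; the function names are hypothetical, and how the project itself invokes Tesseract is not stated in the slides.

```python
import subprocess
from typing import List


def build_tesseract_command(image: str, lang: str = "eng") -> List[str]:
    """Build a Tesseract CLI call that prints the recognized text."""
    # Passing "stdout" as the output base prints the text instead of writing a file.
    # "-l" selects the language model, e.g. "eng" or "fra".
    return ["tesseract", image, "stdout", "-l", lang]


def recognize_text(image: str, lang: str = "eng") -> str:
    """Run OCR on an image file; requires Tesseract to be installed."""
    result = subprocess.run(
        build_tesseract_command(image, lang),
        capture_output=True,
        check=True,
    )
    # Decode explicitly as UTF-8 so accented characters survive intact.
    return result.stdout.decode("utf-8")
```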

  12. Tests with a bad-quality picture

  13. Tesseract results from the unprocessed bad-quality picture

  14. Processed bad-quality picture

  15. Tesseract results from the processed bad-quality picture

  16. Future work • Display special characters (é, ï, ç) correctly in the translated text • Include a spell checker to improve the final text • Allow more languages to be recognized and translated
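The spell checker mentioned above could, for instance, replace each unrecognized word in the OCR output with its closest dictionary entry. The sketch below is one possible approach using Python's standard `difflib` module, not the project's design; the function name, the tiny vocabulary, and the `0.75` similarity cutoff are assumptions chosen for illustration.

```python
import difflib
import re
from typing import Iterable


def correct_text(text: str, vocabulary: Iterable[str], cutoff: float = 0.75) -> str:
    """Replace words missing from the vocabulary with their closest match, if any."""
    words = sorted({w.lower() for w in vocabulary})
    known = set(words)

    def fix(match: "re.Match[str]") -> str:
        word = match.group(0)
        if word.lower() in known:
            return word  # already a valid word
        candidates = difflib.get_close_matches(word.lower(), words, n=1, cutoff=cutoff)
        if not candidates:
            return word  # nothing similar enough; leave the OCR output untouched
        best = candidates[0]
        # Preserve a leading capital letter from the original word.
        return best.capitalize() if word[0].isupper() else best

    # [^\W\d_]+ matches runs of letters, including accented ones such as é or ç.
    return re.sub(r"[^\W\d_]+", fix, text)


vocab = ["the", "phone", "reads", "a", "picture"]
print(correct_text("The pbone reads a picturc.", vocab))
```

A real dictionary would be far larger, and the cutoff would need tuning: too low and valid but unlisted words get rewritten, too high and common OCR confusions go uncorrected.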

  17. Conclusion • Main challenge: taking a good picture!

  18. Questions?
