LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition

Department of Computer Science & Engineering The Chinese University of Hong Kong LYU0203Smart Traveller with Visual Translatorfor OCR and Face Recognition Supervised by Prof. LYU, Rung Tsong Michael Prepared by: Wong Chi Hang Tsang Siu Fung

Outline • Introduction • System Architecture • Korean OCR • Friend Reminder • Conclusion • Acknowledgement

Introduction – What is VTT? • Smart Traveller with Visual Translator (VTT) • Mobile Device which is convenient for a traveller to carry • Mobile Phone, Pocket PC, Palm, etc. • Recognize and translate the foreign text into native language • Detect and recognize the face into name

Introduction – Objective • Two main features: • Korean to English Visual Translation • Remind Somebody’s Information with Face Image

Introduction – Objective (Cont.) • Real Life Examples • Sometimes we lose the way, we need to know where we are. • Sometimes we forget somebody we met before.

User Request Response GUI Output Request Request Request Response Request Response Data Korean OCR Face Recognizer Camera API Result Query Result Query Update Data Request Stroke Database & Dictionary Face Database Camera System Architecture

Korean OCR (KOCR) • Usage • Visual Translator from Korean to English • Procedure for using KOCR • Text Area Detection • Character Identification • Translation

Initialization Capture Image Text Segmentation Recognition Translation KOCR – Program Flow

Horizontal Projection Threshold Vertical Projection KOCR – Text Area Detection • Edge Detection using Sobel Filter • Horizontal Projection and Vertical Projection • Find Potential Text Area by threshold

KOCR – Text Area Detection (Cont.)

KOCR – Character Identification • Features on Stroke • Extracted by Labeling Connected Component algorithm • Proposed Feature Extraction • Five rays each side • Difference of adjacent rays (-1 or 0 or 1) • Has holes (0 or 1) • Dimension ratio of Stroke (width/height) (-1 or 0 or 1)

KOCR – Character Identification (Cont.)

KOCR – Translation • Dictionary • Korean to English • About 1000 Korean Words • Matching • Longest Match from left to right

KOCR – Translation (Cont.)

KOCR – Evaluations • OCR Correctness • Training Set (3327 – 30% of all Character) • Testing Set (7845 – Others) • Result (64%) • Suggestion • Train all Korean characters

KOCR – Evaluations (Cont.) • Text Segmentation Correctness • 45 Captured Images • 99 Characters • Result • Segment 83% characters correctly • Segment 71% image correctly • Acceptable Result

KOCR – Evaluations (Cont.) • OCR Correctness • 45 Captured Images • 99 Characters • Result • 79% Characters correctly Recognized • 69% Images correctly Recognized

Initialization Capture Image Face Segmentation Recognition Show Profile Friend Reminder – Program Flow

Friend Reminder (FR) • Usage • Show the Profile of Friend by capturing a photo • Procedure for using FR • Face Segmentation • Face Identification • Friend’s Profile

FR – Face Segmentation • Eye Detection • Algorithm • Gabor Wavelet Feature • Log-Polar Sampling • Manual Selected (Suggest) • Selected Eyes and Mouth Positions

FR – Face Segmentation

FR – Face Identification • EigenFace • By using Principal Component Analysis (PCA) • Project the input face into the eigenvectors that pre-learned • Find the difference between the projection and the faces in database • Face determined to be ‘NEW’ if the difference is larger than a threshold

FR – Friend’s Profile

FR – Evaluations • Eye Detection Correctness • 40 Images • Result • 22.5% Image Successfully Detected • Non-acceptable • Suggestion • Manually Select Eyes and Mouth Positions

FR – Evaluations • Face Identification • Evaluation Information • 26 Test Persons’ Faces • 16 faces is in database • 10 faces is not in database • 3 faces Trained per person • 8 persons in face database • Result • 77% Successfully Identified • 63% Successfully Identified as person in database • 100% Successfully Identified as person not in database

Conclusion • Combined Modern Equipments • Digital camera • Personal Data Assistant (PDA) • Techniques Learned • Image Processing • Optical Character Recognition • Face Recognition Techniques • VTT Integrated • VTT for Korean to English OCR • VTT for Friend Reminder

Acknowledgement • Thanks Professor Michael Lyu,Project Supervisor • Give us valuable advice • Provide us necessary equipments • Thanks Edward Yau,Technical Manager of VIEW project • Give us many ideas

~The End~

LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition

LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition

Presentation Transcript

Face Recognition

Face Recognition

Smart Traveller with Visual Translator for OCR and Face Recognition

Face Recognition

Face Recognition

Face Recognition

The Smart Traveller

Face Recognition

Face Recognition

FACE RECOGNITION

Face Recognition

Face Recognition

Face Recognition

FACE RECOGNITION

Face Recognition

Face Recognition

LYU0203 Smart Traveller with Visual Translator for OCR and Face Recognition

Smart Traveller with Visual Translator

Face recognition

Face Recognition

Face Recognition

Face Recognition