1 / 29

A Gradient Vector Flow-Based Method for Video Character Segmentation

A Gradient Vector Flow-Based Method for Video Character Segmentation. Trung Quy Phan , Palaiahnakote Shivakumara , Bolan Su and Chew Lim Tan. Agenda. Introduction Proposed method Experimental results Summary. Agenda. Introduction Proposed method Experimental results Summary.

juana
Télécharger la présentation

A Gradient Vector Flow-Based Method for Video Character Segmentation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Gradient Vector Flow-Based Method for Video Character Segmentation TrungQuyPhan, PalaiahnakoteShivakumara, Bolan Su and Chew Lim Tan

  2. Agenda • Introduction • Proposed method • Experimental results • Summary

  3. Agenda • Introduction • Proposed method • Experimental results • Summary

  4. Introduction • Text extraction from video frames video search and retrieval • Graphics text:artificially added • Scene text:part of the scene

  5. Introduction • Character segmentation: split text line into individual characters • Important preprocessing step to improve recognition rate

  6. Character Segmentation • Challenges • Low resolution • Complex background • Unconstrained fonts & colors • Touching characters

  7. Character Segmentation • Previous methods • Document analysis methods binary • Projection analysis [Huang09] sensitive Image from [Lienhart02]

  8. Character Segmentation • Previous methods • Document analysis methods binary • Projection analysis [Huang09] sensitive • Neural networks [Saidane08] vertical • Proposed method: gray, curved, less sensitive

  9. Agenda • Introduction • Proposed method • Experimental results • Summary

  10. Steps • Cut candidate identification • “Seed” pixels • Path finding • Least cost path based on seed pixels • Path verification • Eliminate false positives

  11. 1. Cut Candidate Identification • Good cut: far from character edges • Gradient vector flow • Propagate gradient information into background • Mainly used for registration and tracking Edge Gap

  12. 1. Cut Candidate Identification • Candidate cut pixel GVF field g(x, y) = (u(x, y), v(x, y))

  13. 1. Cut Candidate Identification • Gap pixels • “Medial” pixels: middle of character strokes

  14. 1. Cut Candidate Identification • Gap pixels • “Medial” pixels: middle of character strokes • Medial pixels lead to transitions between background and characters large intensity variations handled by cost function in the next step

  15. 2. Path Finding col x-1 x x+1 • Input image = graph • Nodes: pixels • Edges • Cost function • Penalize paths that go through medial pixels (higher cost) • Small penalty for diagonal moves row y row y+1

  16. 2. Path Finding • Cut = least cost path from top row to bottom row • Dynamic programming • Multiple cuts: path finding multiple times • Starting point every h / 4 pixels h

  17. 3. Path Verification • Starting points in same gap converge to the same end point end points are more reliable • Backward path finding from each end point • False cuts switch to the sides of the characters

  18. Agenda • Introduction • Proposed method • Experimental results • Summary

  19. Experiments • Text lines from TRECVID videos • Compare with [Kopf05] • Path finding based on intensities • Proposed method: gradient, verification

  20. Results – Horizontal Text Kopf’s method Proposed method

  21. More Results – Proposed Method • Non-horizontal scene text • Logo text

  22. Performance – English Text • Similar recall • Proposed method has much better precision • Criterion for “good” cuts

  23. Performance – Chinese Text • Proposed method • Precision  for Chinese: sub-components

  24. Recognition Accuracy • Segmentation helps to improve binarization? • Binarization method: [Su10] • Reported to outperform classical methods, e.g. Otsu’s and Niblack’s Line level ‘TO’ Character level ‘TONIGH §’

  25. Recognition Accuracy • Tesseract OCR engine on English text • With segmentation: 7.5% increase

  26. Agenda • Introduction • Proposed method • Experimental results • Summary

  27. Summary • Segmentation: important preprocessing for recognition • GVF, path finding, verification gray, curved • Future: end-to-end video text recognition

  28. References Huang et al. (2009). A new video text extraction approach. In Proc. ICME. Kopf et al. (2005). Robust Character Recognition in Low-Resolution Images and Videos. Technical report. University of Mannheim. Lienhart et al. (2002). Localizing and Segmenting Text in Images and Videos. CSVT. Saidane et al. (2008). An Automatic Method for Video Character Segmentation. Image Analysis and Recognition. Su et al. (2010). Binarization of historical document images using the local maximum and minimum. In Proc. DAS.

  29. Thank You

More Related