1 / 28

LYU 0102 : XML for Interoperable Digital Video Library

LYU 0102 : XML for Interoperable Digital Video Library. Recent years, rapid increase in the usage of multimedia information, New approach: DIGITAL VIDEO LIBRARY Automated video and audio indexing Navigation, visualization Search and retrieval Video segmentation and summarization.

amy-oliver
Télécharger la présentation

LYU 0102 : XML for Interoperable Digital Video Library

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. LYU 0102 : XML for Interoperable Digital Video Library • Recent years, rapid increase in the usage of multimedia information, • New approach: DIGITAL VIDEO LIBRARY • Automated video and audio indexing • Navigation, visualization • Search and retrieval • Video segmentation and summarization

  2. Video Information • Integration of speech , language, and image processing • Text processing • Audio processing • Image processing • Video processing

  3. Digital Video Library System Overview

  4. Techniques to segment data

  5. Techniques we may apply • VOCD • Scene changes • Text processing • Face detection • Storage as XML

  6. Techniques to be discussed • VOCR • Scene changes • Storage and editing with XML

  7. Video OCR for Digital News

  8. Detection of Text Region • Video news program comprises huge numbers of frames • Roughly detect text region • Increase processing speed • Reduce processing cost

  9. Detection of Text Region • Typical text region can be characterized as a horizontal rectangular structure • With clustered sharp edges • Regions of high contrast against the background

  10. Image Enhancement • Sub-pixel Interpolation: • To magnify the text area • To increase the resolution of caption • Multi-frame Integration: • Video motion of non-caption areas, caption relatively stable • To reduce the variability on background

  11. Character Segmentation • Vertical project profile • Character segmentation

  12. Character Recognition • Binarize the character image with threshold • Filter the binary image with morphological filter • Filter the character image with connected component filter

  13. Post-Processing • Further improve the recognition rate • Using the words of dictionary to refine the character • Integrate the recognition result of multiple frames

  14. Scene change • detection technique • effective method for segmenting a video sequence into significant components

  15. Existing Method • Image difference method • Histogram Difference Method • Histogram Difference Method using DC Coefficient Image • Our Method • histogram difference method with a dynamic threshold

  16. Scene change • grasp scene from the video for every 0.05 second • grasped scenes are 24-bit image, 8 bits for each color (red R, green G, blue B) • check each pixel with the most 2 significant bits • classify them into 64 different classes • build a color histogram

  17. Scene change • Compared the histogram with the pervious scene • For each column of the histogram, calculate the difference • Sum all the difference • If (total difference) > threshold => scene change • Use the first frame as key frame

  18. XML • Extensible Markup Language • Create its own mark-up language for describing the contents • Look like a big database

  19. Advantages of using XML • Platform and system independent • Create your own tag • Adopt Unicode • Universal format • Easy to search

  20. Design schema • Starts with choosing a vocabulary • Words and phrases that are able to describe extracted video information content and therefore can be used as tag name • Show relationship between vocabulary entries

  21. XML Parser • A parser is a interface between an XML document and the application program • Document Object Model (DOM)

  22. How to present XML • Tree model becomes very similar to an XML schema • Represented as nodes that show element/attribute names or the text content and their relative places within the XML

  23. OUR TOOL

  24. OUR TOOL

  25. OUR TOOL

  26. OUR TOOL

  27. COMING • EXTRACT SECONDARY INFORMATION

  28. THE END

More Related