1 / 24

Integrating Web Engineering with Natural Language Processing for Voice Interface Design

This paper discusses the intersection of web engineering and natural language processing, focusing on the development of vocal interfaces. It highlights key technologies such as VoiceXML, text-to-speech (TTS), and automatic speech recognition (ASR). The presentation of voice-driven interfaces includes components such as the PBTG mediator, which links web applications with voice portals. Additionally, the paper reviews related work and concludes with insights on the future of voice-driven interface technology in web applications.

sharis
Télécharger la présentation

Integrating Web Engineering with Natural Language Processing for Voice Interface Design

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Web engineering meets natural language processing: a vocal interface generation practice 2006/12/11 江文成

  2. Reference • Hendrik Macedo, Jacques Robin, Roberto Barros • Proceedings of the 11th Brazilian Symposium on Multimedia and the web WebMedia '05 • ACM , December, 2005

  3. Introduction • Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ●PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  4. Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ● PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  5. Overview PSTN ︰公共交換電話網

  6. Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ● PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  7. VoiceXML • 1999年3月,由Motorola、Lucent、AT&T和IBM四家公司聯合發起成立了VoiceXML論(http://www.voicexml.org) • 可擴展標記語言(XML)的一種擴展 • 為電話和移動設備提供一種便捷的訪問Internet網路,獲取服務和資訊的手段。

  8. VoiceXML(example)

  9. Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ● PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  10. Voice Portals • TTS ■Text-To-Speech: 文字轉語音 • ASR ■Automatic Speech Recognition:自動語音識別 • Voice Portal application architecture

  11. Voice Portals

  12. Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ● PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  13. Voice-Driven interface • PBTG mediator ■a artificial intelligence system ■To connect a Web application and a Voice Portal ■consist of a MPC and a COLEC • MPC ■Message Production Component • COLEC ■Content Organizationand Linquistic Expression Component ■ COLEC pipeline (10 stages)

  14. Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ● PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  15. PBTG mediator

  16. PBTG mediator

  17. MPC • Recommendation Message ■ “The system recommends Matrix Reloaded to you?” • Feature Message ■ Matrix Reloaded stars Keanu Reeves? ■ Matrix Reloaded is a science-fiction moive? • Feature Asking Message ■ Would you like more information about this movie?

  18. COLEC ( pipeline )

  19. COLEC ( pipeline ) - CLTT • CoganateLexicalizedThematicTree(CLTT) • Example: Matrix Reloaded is a science-fiction movie?

  20. COLEC ( pipeline ) - FLST • FullyLexicalizedSyntacticTree(FLST) • Input source : form CLTT ( output of the last step) Matrix Reloaded is a science-fiction movie?

  21. Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ● PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  22. Related Work • DCIE ■ proxy-based interactive service ■ browse dynamically generated audio renditions of both e-mail and WWW documents ■ In April,1997 • WIRE voice browser ■ for car radio ■ to access e-mail and WWW ■ in Octobeer ,1998

  23. Overview • VoiceXML • Voice Portals ● TTS ● ASR • Voice-Driven interface ● PBTG mediator ● MPC ●COLEC • Related work • Conclusions

  24. Conclusions Thanks!!

More Related