html5-img
1 / 24

Synthetic Agents that Speak and Listen

Synthetic Agents that Speak and Listen. Talking with Highbrow Avatars on Your Cell Phone Prof. Matthew Nickerson, Southern Utah University. Automated Audio Tours . Audio cassette player. Analog. Audio CD Player. Digital audio player. Multimedia player. Digital.

Télécharger la présentation

Synthetic Agents that Speak and Listen

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Synthetic Agents that Speak and Listen Talking with Highbrow Avatars on Your Cell Phone Prof. Matthew Nickerson, Southern Utah University

  2. Automated Audio Tours Audio cassette player Analog Audio CD Player Digital audio player Multimedia player Digital

  3. Research issues • Frustration and complications • Player damage, loss, or theft. • Patron anxiety • Updates and changes • Outdoor venues can be problematic. • Patrons with limited mobility.

  4. Automated Audio Tours Audio cassette player Analog Audio CD Player Digital audio player B Y O P Multimedia player Digital

  5. Voice Extensible Markup Language VXML is an XML-based markup language designed specifically to implement interactive voice dialogs. Web Server VXML Digital Sound Content User Cell Phone Voice / Telephony Gateway

  6. Historical photograph exhibit A gallery exhibit featuring historic photographs covering 100 years of theater history in Cedar City, Utah. 1900-2000

  7. Benefits to developer • Low upfront costs, start slow • No check out/in, maintenance, personnel • Easily updated • Real-time usage statistics • Powerful evaluation tool

  8. Benefits to users • Familiar device • No check-in or collateral required • Avoid hygiene concerns • BYOD

  9. Platform? Work with a Vendor Do it yourself

  10. Partner with a vendor Web Server VXML Digital Sound Content User Cell Phone Voice / Telephony Gateway

  11. User Cell Phone Bridging Worldwide Networks TELEPHONY INTERNET Voice Server Web Server - VXML Digital Sound Content

  12. Built in VXML tools • Voice or DTMF input • Prerecorded or computer generated output • Audio system event handlers • Interrupt • Capture audio input

  13. Multiple Languages

  14. Virtual conversation A gallery exhibit featuring historic photographs covering 100 years of local theater history, 1900-2000.

  15. Building Synthetic Agents • Voice or DTMF input • Prerecorded or computer generated output • Audio system event handlers • Interrupt • Capture audio input

  16. Do you want to know more about General Lee? What artistic period are you interested in? What area are you currently exploring? Limiting Response Options • Ask questions

  17. Limiting Response Options • Ask questions • Create grammars <rule id = “destination” scope = “public” > <one-of> <item> <tag> “new york” </tag> new york </item> <item> <tag> “new york” </tag> new york city </item> <item> <tag> “new york” </tag> big apple </item> </one-of> </rule>

  18. PROXIMITY GEOGRAPHY SUBJECT Limiting Response Options • Ask questions • Create grammars • Point of contact

  19. Location, location, location WiFi, GPS, Bluetooth

  20. Challenges to Cultural Heritage Applications… and others • Current policies • Photography • Limiting phone calls/conversations • No speaker phones, please! • Reception

  21. Choosing a voice Battle of the sexes among synthetic agents and avatars BMW, Unisys, GMVoices

  22. Modulated human voice… ? Some swear that synthetic agents are better… others just swear. Clifford Nass, Stanford University; Sprint PCS

  23. Nuance in “virtual” conversation Affective interpretation of metaphorical utterances Catherine Smith, et al. School of Computer Science, University of Birmingham

  24. … and still more to come.

More Related