80 likes | 192 Vues
The Microsoft Research Signal Processing Group has been pioneering advancements in multimedia technology since 1998. This document explores the challenges faced in multimedia teleconferencing (MTC), including audio-visual fidelity, technology anxiety, and operational hurdles. Key improvements like enhanced codecs for Windows Media, ClearType for better display readability, and advanced noise reduction algorithms have been introduced. Despite these advancements, many still find MTC unsatisfactory compared to in-person interactions. The document outlines potential technological enhancements needed to bridge the gap between current capabilities and user expectations.
E N D
Standards First!™
Breaking the Mass Market Barrier Technology Aspects H. MalvarMgr., Signal Processing Group Microsoft Research Standards First!™
Quick intro • Signal Processing Research at MSR • Group formed in 1998 (MSR founded in 1991) • 16 people • Helped bring several technologies to MS • New codecs for Windows Media – e.g. WMA • ClearType for more readable displays • Enhanced echo cancellation and noise reduction for Windows Messenger in XP • More to come…
Why don’t we like multimedia teleconferences (MTC)? • Because we like to travel • That has changed recently… • Because we like to interact • A dinner meeting via teleconference? • Because technology is not yet matching our expectations • we’re spoiled by “Star Trek” & “The Jetsons” • media fidelity is not yet enough? • MTC equipment still intimidates
Annoyances • Engineers usually forget little annoyances • “make sure our address shows through the window” • “no domain controller available” • “this program performed an illegal operation” • Most of us struggle with technology • Even connecting laptops to displays can be confusing… • Who will operate the VC machine? • Where are the special pens for the e-board? • Even when we have good technology, bad deployment can get in the way…
Technology hurdles • Delay – 100 ms annoys, 500 ms hurts • Incorrect gaze • Support for side conversations? • Video fidelity – can’t catch your mood • Audio fidelity – can’t hear you well • Quick sharing of annotated material • Tele-touch?
Improving technology(a biased view) • Better audio • Wideband – less tiring • Microphones arrays – less noise • Localized sound – matches video • Lower delay • Better video • Camera arrays – fix gaze • Lower delay • Better annotations • Efficient capture of regular whiteboards