This report details progress on Workpackage 3 of the TESSA project, which covers face-to-face transactions. The deliverables are a constrained speech-to-sign system (delivered), a free speech input system, and a free speech input system with limited recognition of signs. A new avatar has been integrated into TESSA together with approximately 150 newly recorded signs. Remaining problems with posture and sign blending need Televirtual's help. After an evaluation of commercial speech recognition packages, IBM ViaVoice was selected for its accuracy, training, and ActiveX controls. Next steps include recording at least another 200 signs and running a demo system in a real Post Office.
Workpackage 3: Face-to-face transactions
STATUS REPORT
Stephen Cox
WP3 Deliverables
• Deliverable 1, July 2000
  • “Constrained” speech to sign system (delivered)
• Deliverable 2, June 2001
  • Free speech input to sign system
• Deliverable 3, December 2002
  • Free speech input with limited recognition of signs
New Avatar
• Several versions tested over the last six months
• Integrated into TESSA
• Model rendering and features improved
• Problems with static position, posture and blending of signs
• Static position largely corrected, but the other problems need Televirtual's help
New Signs
• ~150 signs recorded between September and November
• All processed and integrated into TESSA with the new avatar
• No hand-crafting done
• Need to record at least another 200 signs, possibly many more
Speech recognition software
• Entropic software is no longer supported
• Some commercial packages have been evaluated:
  • IBM ViaVoice
  • Dragon NaturallySpeaking
  • Microsoft system
• ViaVoice selected for development because:
  • Accurate
  • Training is best
  • ActiveX controls are best
Open system
• PO are sending us transcripts of how clerks would phrase various transactions
• Unknowns:
  • vocabulary
  • language model
  • variable amounts
• SJC now has experience of free speech phrase techniques after a period at Nuance
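As a rough illustration of how the three unknowns above might be explored once the transcripts arrive, the sketch below counts the vocabulary and word-pair (bigram) frequencies of a set of phrases, collapsing every amount into a single placeholder token. It is not the TESSA implementation: the function names, the `<amount>` token, and the two sample phrases are all hypothetical, and real transcripts would need more careful normalisation.

```python
import re
from collections import Counter

# Hypothetical pattern: an optional pound sign followed by a number.
AMOUNT = re.compile(r"£?\d+(?:\.\d+)?")


def tokenize(phrase):
    """Lower-case a phrase and replace each amount with one placeholder token."""
    normalised = AMOUNT.sub(" <amount> ", phrase.lower())
    return re.findall(r"<amount>|[a-z']+", normalised)


def vocabulary_and_bigrams(transcripts):
    """Return word counts and bigram counts over a list of phrases."""
    vocab = Counter()
    bigrams = Counter()
    for phrase in transcripts:
        # <s> and </s> mark the start and end of a phrase for the bigram counts.
        tokens = ["<s>"] + tokenize(phrase) + ["</s>"]
        vocab.update(tokens[1:-1])
        bigrams.update(zip(tokens, tokens[1:]))
    return vocab, bigrams


# Invented sample phrases, standing in for the PO transcripts.
transcripts = [
    "That will be £3.50 please",
    "That will be 20 pounds please",
]
vocab, bigrams = vocabulary_and_bigrams(transcripts)
```

Treating amounts as one slot keeps the vocabulary small and lets the bigram counts generalise over any sum of money, which is one common way of handling variable quantities in a phrase-based language model.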
Update on TESSA project
• Judy Tryggvason has joined the project
• Demo system in a real Post Office in April (120 transactions, ?? phrases)
• PO interpreters are examining the list of phrases to see how many new signs are required
• Use PO interpreters to assist with recordings?
• A “talking head” avatar (French and Welsh) by July