Universal Attribute Characterization for Enhanced Spoken Language Recognition

Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Outline • Introduction • UAR-FrondEnd • VSM-BackEnd • Experiment

Introduction • Here we focus on the token-based. • Propse an alternative universal acoustic characterization of spoken languages based on acoustic phonetic feature. • The advantage of using attribute-based unit is they can be define universally across all language.

System Overview

UAR-FrondEnd • The frond-end processing module tokenize all spoken utterances into sequences of speech unit using a universal attribute recognizer. • Two phoneme-to-attribute table are created that are phoneme-to-manner and phoneme-to-place.

VSM-BackEnd • Each transcription is converted into a vector-based representation by applying LSA.

Experiment • The OGI-TS corpus is used to train the articulatory recognizer. This corpus has phonetic transcriptions for six language.

Experiment • CallFriend corpus is used for training the back-end language models. • Test are carried out on the NIST 2003 spoken language evaluation material.

Experiment

Universal Attribute Characterization for Enhanced Spoken Language Recognition

Universal Attribute Characterization for Enhanced Spoken Language Recognition

Presentation Transcript

Spoken Language

Evaluating Spoken Language Skills the leading test of spoken language

COMMON FEATURES OF SPOKEN LANGUAGES

Spoken Language Structure

Spoken or written language

Spoken Language Processing

Spoken Language

spoken language

The language of Spoken Discourse:

Spoken Language difficulties:

Phonetics and Spoken Language

SPOKEN LANGUAGE CORPUS PROJECT

SPOKEN LANGUAGE COMPREHENSION

Spoken Language Understanding

Spoken Language

Studying spoken language

Wold's Most Spoken language

Spoken Language Understanding

Spoken Language Processing

Spoken Language Processing:Summing Up

Spoken Vs Written Language

Spoken Language Translation