1 / 34

Determining query types

Determining query types. by analysing intonation. Overview. Using prosodic features of utterances Generating set of prosodic labels with which test utterances are annotated Trying to determine which class the utterances belong to action, problem, connect, who, info, other. Contents.

tameka
Télécharger la présentation

Determining query types

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Determining query types by analysing intonation

  2. Overview • Using prosodic features of utterances • Generating set of prosodic labels with which test utterances are annotated • Trying to determine which class the utterances belong to • action, problem, connect, who, info, other

  3. Contents • Motivation • Corpus • Prosody • System architecture • pitch extraction • segmentation • prosodic labelling • label sequences (n-grams) • Results • Conclusions

  4. Motivation • Linguists (Crystal, Searle) found relationship between • prosody and utterance type (question, command…) • prosody and attitude • Edinburgh maptask group (Taylor, Wright) found prosody help distinguish utterance types

  5. British Telecom corpus • Callers dial 100, requesting • alarm calls, collect calls • codes, numbers • connection problems • … • 8000 calls: first utterance only • Annotation • call types: by BT • prosody: by me

  6. Call types in BT corpus

  7. Prosody • má mà (lexical tone) • yés yès (word-level intonation) • Now is the time for | all good men to | come to the | aid of the | party

  8. pitch extractor / octave error correction segmenter clustering utterance classifier [1,9,4,3] [2,6,5,4,4] [4,5,1,7,5,6] ... centroid LM draw layers thingy! Simplified architecture

  9. Pitch extraction “Yes, Manchester please”

  10. Octave error correction

  11. pitch extractor / octave error correction segmenter clustering utterance classifier [1,9,4,3] [2,6,5,4,4] [4,5,1,7,5,6] ... centroid LM draw layers thingy! Simplified architecture

  12. Data points for one segment showing line of best fit duration penalty prevents very short segments also minimum and maximum segment lengths

  13. Varying the duration penalty

  14. Minimum segment length Yeah, could I book a wake-up call please

  15. pitch extractor / octave error correction segmenter clustering utterance classifier [1,9,4,3] [2,6,5,4,4] [4,5,1,7,5,6] ... centroid LM draw layers thingy! Simplified architecture

  16. Assigning labels • Each segment in training corpus has features • duration • gradient • mid-point frequency • Clustering algorithm (K-means) places segments in feature space • Prosodic labels assigned to segments, based on cluster membership

  17. 2-D data points arranged in 15 clusters

  18. Label trajectories on to clustering now: discretization

  19. More trajectory schemes no maximum with normalization 40 clusters 10 clusters

  20. 70 clusters

  21. pitch extractor / octave error correction segmenter clustering utterance classifier [1,9,4,3] [2,6,5,4,4] [4,5,1,7,5,6] ... centroid LM draw layers thingy! Simplified architecture

  22. Label sequences • N-gram collocation model used • 台中 vs 台 and 中 • label sequence e.g. [4;11;13;1] statistically more useful than individual labels • Association of label sequences with each class in training data computed • Then estimate test data classes using maximum likelihood model

  23. Results • Correct classification around 1/3 • correct classification by chance around 1/4 • But changing parameters does affect results • Some optimum parameters • 20 clusters (prosodic labels) • only label sequences seen 4 times used • sequences of 4 labels best, performance degrades with 5-grams

  24. Conclusions • Psycholinguistic experiment showed humans find same task difficult • Prosody cannot be used by itself to classify utterances • But, in combination with a lexical model, could be of use

  25. Introducing Linguistics • What do linguists do? • Grammar, and other aspects of language • Relationships between languages • How is linguistics used in the real world?

  26. What do linguists do? • They don’t necessarily “learn languages” • Linguist and 語言學 are confusing terms • They are often interested in the structure of languages. They might • specialize in one language, or a group of languages • compare different languages • study features shared by all languages

  27. Many linguists study grammar • Syntax • the way words are arranged to make sentences • John had lunch / *John lunch had • Morphology • the way words are modified to fit the circumstances • John had lunch / *John have lunch • Linguists study • what people actually say • not what they “should” say!

  28. The sort of things linguists look at in syntax • Syntax (the way words are arranged to make sentences) • John saw the girl with the telescope • 爸爸給小明買鹹蛋超人 • Me and Dad went to the toyshop • Dad bought an Ultraman for John and I

  29. And in morphology… • Affixation: hardly used in Chinese • My son has 73 Ultramen • 我(?的)兒子有73只鹹蛋超人(*們) • Compounding • rare in English: greenhouse, blackbird • productive in Chinese • Verb-object compounds: 開車, 幫忙 • Resultative compounds: 來得及, 跑不掉 • Stump compounds: 交大

  30. Phonology: the sounds of a language • How good is ㄅㄆㄇㄈ at representing the sounds of Chinese? • 雄 is xiong in 韓愈拼音, vsㄒㄩㄥ. • 嗯 and 恩 are the same inㄅㄆㄇㄈ, n vs en in Pinyin • Has 台灣國語 lost the sounds ㄓㄔㄕ? • Why do we sometimes hear 禮拜ㄕ?

  31. Historical linguistics • How languages are related • Language families • Indo-European, Sino-Tibetan… • Areal linguistics • Greek, Bulgarian • Mostly borrowed words; also shared grammatical features • Chinese, Korean, Japanese • How language changes over time • sounds: poor vs paw, suit. • vocab: 咖啡, 颱風. Calque: 摩天大樓, skyscraper, gratte-ciel • grammar: Did you eat yet? Adversative passive 被

  32. Sociolinguistics • Diglossia: “high” and “low” prestige languages • The role of Mandarin and Taiwanese in a bilingual society • The changing role of English in Taiwan society: borrowing, or showing off? • case and size: code-switching, or lexicalized Chinese words? Ta-hsüeh-shih-ching Ta-hsüeh-shih-ching Ta-hsüeh-shih-ching

  33. Applications for linguistics • Speech disorders • Forensic linguistics • Accent detection • Style verification (eg police style) • Language teaching • Computational applications • Machine translation • Speech recognition and synthesis • Language identification

More Related