1 / 104

Emanuela Cresti LABLITA- University of Florence Lisbon, May 5 2003

Emanuela Cresti LABLITA- University of Florence Lisbon, May 5 2003 For a corpus based linguistics of spoken language Intonation and morpho-syntax in an italian corpus. http://lablita.dit.unifi.it. “Teoria della lingua in atto” Theoretical frame: Speech acts theory (Austin 1962)

dunn
Télécharger la présentation

Emanuela Cresti LABLITA- University of Florence Lisbon, May 5 2003

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Emanuela Cresti LABLITA- University of Florence Lisbon, May 5 2003 For a corpus based linguistics of spoken language Intonation and morpho-syntaxin an italian corpus http://lablita.dit.unifi.it

  2. “Teoria della lingua in atto” • Theoretical frame: Speech acts theory (Austin 1962) • locutionary act • Speech act illocutionary act simoultaneously performed • perlocutionary act • If and expression accomplishes an illocutionary act it is an utterance • Every expression which can be pragmatically interpreted is an utterance

  3. The primary function of intonation • The relationship between the pragmatic entity “linguistic act” and the linguistic entity “utterance” is marked by the intonation. • Illocutionary act • Intonationinterface • Locutionary act

  4. An intonation pattern marks the accomplishment of an illocutionary act in the speech flow • Intonation provides perceptually relevant feature : • llocutionary criterion. • A criterion for the identification of utterances by means of the recognition of tonal patterns • The “linguistic stuff” which corresponds to this sound stream is an utterance. • Every expression intonated by a well formed intonation pattern (terminal contour) is an utterance (illocutionary criterion). No syntax involved.

  5. Is it possible to identify an illocutionary act ? • We verified on a large corpus of spontaneous spoken Italian the correspondence between streams of speech identified by an intonation pattern and the accomplishment of an illocutionary act • Pragmatic features of illocutionary force conventionally codified

  6. Conventional pragmatic features of illocutionary force • a) Semiological • l’apertura o meno del canale di comunicazione tra i parlanti; • condivisione o meno dell’orizzonte attenzionale e/o del focus attenzionale; • b) Pragmatic • modalità dell’azione comunicativa ; • i caratteri di operabilità della situazione extralinguistica; • c)     Perceptual • i caratteri percettivi dell’oggetto contestuale di riferimento (vicino, lontano, fermo, in movimento, noto); • la distanza e la prossemica dei parlanti; • d)    Social • la disponibilità, abilità d’intervento, le conoscenze, dei parlanti; • le credenze e gli stati emozionali del parlante; • età, sesso, ruolo e distanza sociale dei parlanti. • Textual • breaf/long;simple/ complex; contextually bound/creative; •  .

  7. CLASSES OF ILLOCUTIONARY ACTS RIFUSAL ASSERTION DIRECTIVE EXPRESSIVE RITUALS

  8. Illocutionary classes in the LABLITA Sampling of the Spontaneous Speech Corpus (Dati V.Firenzuoli, Tesi PhD 2003) 10h 9300 utterances Ratio 15%

  9. Second main function of intonation • Intonation marks the inner organization of the illocutionary act • Illocutionary act • Pragmatic pattern: field, parenthetic, force, integration, fatic … • Intonation • Intonation pattern: prefix, parenthetic, root, suffix, fatic … • Utterance • Information pattern: topic, parenthetic, comment, appendix, fatic … .

  10. An intonation pattern marks the accomplishment of different pragmatic functions • The intonation pattern identifies, in a quasi isomorphic way with the pragmatic pattern, the information pattern of an utterance, that can be compound by many information units • The “linguistic stuff” which corresponds to this sound stream is an information unit. • The information structure of the utterance doesn’t depend on the relation given vs new • Information structure: the relation among information units carrying illocutionary force vs other pragmatic functions • What accomplishes an illocutionary act is necessarily new

  11. How can we identify an information unit? • a)     information function • b)    distribution • c)     intonation • Inventory of information units (9): • Basic information units: comment, topic, appendix • Meta-linguistic information unit:parenthetic, locutive introducer • Dialogic information unit:*incipit, allocutive, fathic, conative • LABLIA-Sampling • Roughly 8.000 utterances more than 50% patterned utterances

  12. Comment: COM (100%) • Function: it accomplishes the illocutionary force • Distribution: it is necessary for the accomplishment of the act • Intonation: a) its intonation corresponds to a root unit; • b)there are many root intonation units, codified for specific illocutionary forces (32 detected for Italian in current researches) • *EST: palestra ?(pursuing) • [gym] • *SIC: quindi gli devi mettere nero// (conclusion) • [therefore you must put some black in it]

  13. Illocutionary root forms Inventory

  14. Morpho-syntactic features • Description and distributions of verb-less comments • Informal Family-Private corpus C-ORAL-ROM, (Scarano, 2003) • 82.409 words, 13.403 utterances • 5108 verb-less utterances, 38% • 1917 Adverbial (37,5 %) • 1274 Noun phrase (25%) • 1011 Interjections (19,5%) • 422 Prepositional phrases (8,2%) • 382 Adjectival phrases (7,47%) • 102 Nominal sentential utterance (1,9 %)

  15. *ELA: ma te / discreta // (evaluation) • [and you, polite] • (Topic / Coomment //) • *GAB: Lecce / bellissima // (exclamation) • [Lecce, wonderfull] • (Topic / Comment) • *RIN: ah // (approval) • *MAX: ferma // (order) • [stop you]

  16. Description of verbal comment • Informal private corpus LABLITA • 3000 utterances (V.Firenzuoli, 1999) • two verbs (less then 10%) • sentences (4,7%) • (most part of verbal comment have no subject, a few post verbal subjects) • 4,5%right dislocated subjects, in appendix unit • Information about verbal mood and time • (80% present-indicative, V. Firenzuoli, 2000) • person ( Cresti, 2003)

  17. *PAT: le fragole / dove sono ? (partial question) • [strawberries, where they are?] • (topic/ comment) • *ELA: lavora tutti i giorni ? (total question) • [do you work every day ?] • *SMN: con chi esci ? (partial question) • [with whom are you going ?] • *MAE: cerca di ricordartelo // (advice) • [try to figure it out] • *GNO: [<] <no / più che fatto / l' ho visti fare / ecco> // [specification] • [no, I do not did it, I saw other people doing it/ that’s it] • (Fatico / Topic / Comment / Fatico) • *SIC: quindi gli devi mettere nero// (conclusion) • [therefore you must put some black in it]

  18. Modality • Bally’s definition: speaker’s attitude toward the referential and cognitive content of his own locution (Modus on Dictum). • two strong implications: •   it doesn’t exist any utterance without a modality; •   only one modality at time can occur. • Logical definition of modality • logic operators, It can be there or not and can be repeated • (many modalities can be embedded one in an other in the semantic configuration of a same sentence).

  19. Three major modal attitudes (Bally) • Alethic • Epistemic • Deontic • Well differentiated and opposed from a logical point of view. • A locution which is judged deontic, cannot be judged at the same time also alethic or epistemic, and viceversa.

  20. a) Alethicmodality: a state of affairs is evaluated true (or false) on the base of its logic or factual evidence. • Swans are white • Notes • the more common alethic attitude is that wich presents a state of affairs without a special engagement , as if there were a factual ascertainement (Yesterday I have been to the moovie, Carlo is twentee years old, the pen is on the table). • In this case there are no specific modal indexes to express modality. • Other kinds of truth, given as logically necessary (A is different from non A), or definition (Whales are mammalian), or nomic truth (Brothers knives), or scientific truths (Water boils at 100 °) show syntactic indexes as non-reportive present form and definite or abstract subjects, or they are nomic noun sentences.

  21. b) epistemic modality a state of affairs can be considered known or believed and differentiated on the degree and the nature of the engagement on its truth Necessary – probable – more or less probable or impossible Carlo should have left (probable) Carlo may have left (possible) Notes The epistemic attitude, which is the best identified by lexical and morpho-syntactic indexes, (epistemic verbs, adjectives, adverbs, verbal times and moods), shows a locutive content under the perspective of probability, that is of the subjective judgement ( not of the perceptual evidence nor logical), neither in the perspective of libitum (pleasure, desire, will, belief)

  22. c) deontic modality state of affair is considered as requested, allowed or forbidden, following a particular logic of obligation. obligatory vs non-obligatory Carlo must leave (need) permission vs prohibition Carlo can leave (permission)  Notes the deontic attitude must be broaden to the world of what is desired or porsued or commented in a way to express interior states, sentiments and believes, which have nothing in common with factual or logic evidence (alethic modality) or with degree of knowledge and probability.(epistemic modality).

  23. Comment modality is determined by the illocutionary class directive and expressive illocutions select epistemic and deontic modality *ELA: lavora tutti i giorni ?total question – epistemic modality [do you work every day ?] *PAT: le fragole / dove sono ? partial question (alethic topic)- epistemic modality [strawberries, where they are?] *MAE: cerca di ricordartelo // [try to figure it out] order – deontic modality *PIS: no la su moglie // [not his wife] mildening - deontic modality *SMN: con chi esci ? [with whom are you going ?] courtesy question - epistemic modality *MAX: ferma // [alt] order - deontic modality *EST: palestra ?pursuing - deontic modality

  24. Assertive illocution mostly selects alethic modality, *PIL: sono stata a prendere Christian a scuola // [I went to pick up Christian at school] declarative – alethic modality *GNO: [<] <no / più che fatto / l' ho visti fare / ecco> // [no, I did not relly did it, I saw other people doing it, that’s it] specification - alethic modality *RIN: ah // approval - alethic modality

  25. Assertive illocution allows epistemic and deontic modality, through lexical and morpho-syntactic indexes. *SIC: quindi gli devi mettere nero // [hence you must put black in there] conclusion – deontic modality *POZ: anche perché / se mi consenti / la cosa spetterebbe alla critica d’arte // [because, you must agree, that thing is up to fine art critics] hypothesis - (epistemic parenthetic) epistemic modality MED: ma lei / invece/ avrebbe bisogno di mangiare // [but she, on the contrary, should need to eat] ascertainment -(alethic parenthetic) deontic modality

  26. Topic: TOP (19%) Function: it expresses the field of the illocutionary force, the relation field-force allows the utterance to be distantiated from the context Distribution: a) it must precede the comment, but it has not to be necessarily contiguous; b) it can be repeated (max three times); Intonation: a) its intonation corresponds to a prefix unit; b) there are three types of prefix intonation units

  27. Complex utterances 40,2% Utterances 17,8% Complex utterances: 3595 Simple 1055 Complex 272 CORPUSLABLITA sampling (S.Signorini, 2001) TOPIC 1327 • utterances: 8093 Topic: 1327

  28. *LUC: perché / la minestra di pane / è questa // [because, the soup of bread, is this one] *CLA: come lei va via la sera / nell’ascensore ‘un c’è più luce // [when she goes away in the night, in the lift there is no more light] *SAB: però / se ti viene uno davanti alto / non vedi nulla // [But, is somebody tall comes in front of you, you can’t see anything

  29. TIPO 1 *SAB: però se ti viene uno davanti alto/top non vedi nulla //com

  30. ne pa la mi ne stra di TIPO 2 *LUC: perché /inpla minestra di pane /top è questa //com

  31. come ra lei se va via la TIPO 3 *CLA: come lei va via la sera /top nell’ ascensore ‘un c’è più luce //com

  32. Frequency of forms on a Corpus sampling 105 topic on 1055 (1 / 10 ratio) 55,2% 23,8% 21%

  33. Morpho-syntactic characters of Simple topic (1055) NOMINAL: 798 (76%) *LUC:la minestra di pane /topè questa //com [The soup of bread, is this one] VERBAL : 257 (24%) *SAB:quando siamo entrati /topc’ era un sacco di gente//com [when we came in, a lot of people was there]

  34. NOMINAL TOPIC (798) • NP :574(71,9%) *DAN: la fodera /topgliela forniscono //com [The lining , do they sell it to you ] - PP: 168 (21%) *DAN:per la fine dell’ anno /topsarà dura//comby the end of the year, is going to be hard] -Avv:42 (5,3%) *ANG:stasera /topl’ ho regolato com’ era //com [this night, I put it as it was before] -Adg:14 (1,8%) *KAT: magari /inpcrude /topriempian di più //com [may be, row, they fill you up much more]

  35. *DAN: la fodera /gliela forniscono ? [The lining , do they sell it to you ] *DAN: per la fine dell’ anno / sarà dura // [by the end of the year, is going to be hard] *ANG: stasera / l’ ho regolato com’era // [this night, I put it as it was before] *KAT: magari / crude / riempian di più // [may be, row, they fill you up much more] *SAN: non so / questa sede / sarà oggetto di tributi commerciali / immagino // [I do not know, this society, may be object of commercial fees / I guess]

  36. VERBAL TOPIC (257) - Finite Forms : 193 (75,1%): *ALB: se c’ è le donne /topsi viene //com [If we can find womens, we come] *CAA:attraversò la piazza /top t’ avre’ fatto vedere //com [he crossed the square, you should have seen him] Non-Finite Forms : 64(24,9%): *LID:essendo bianca /topci tengo la sopracoperta //com [given that it is white, I keep the cover on it] *LUC:a vedella /topinsomma /inx ti fa effetto //com [when you see it, so, you get impressed]

  37. *SAB: però / se ti viene uno davanti alto / non vedi nulla // [But, is somebody tall comes in front of you, you can’t see anything] *LID: essendo bianca / ci tengo la sopracoperta // [given that it is white, I keep the cover on it] *LUC: a vedella / insomma / ti fa effetto // [when you see it, so, you get impressed] *CAA: attraversò la piazza / t’ avre’ fatto vedere // [he crossed the square, you should have seen him]

  38. In the Topic Unit modality is only alethic or epistemic • (never deontic) • * DAN: la fodera /gliela forniscono ?alethic- epistemic • [The lining , do they sell it to you?] • *GUG: la Marretti / chi l’è ? • [Miss. Marietti, who is she?] alethic-epistemic • *GUG: conoscendo la Sandra / avrà senz’altro cambiato il modo di parlare // • [you know Sandra, she will for sure change her accent ] alethic- epistemic • *PAT: le fragole / dove sono ? • [The strawberries, where they are] alethic-epistemic • *CAA: attraversò la piazza / t’ avre’ fatto vedere // alethic- deontic • [he crossed the square, you should have seen him] • *SAN: e guardi / questo / io glielo stampo // alethic- deontic • [look, this, I’m going to print it out to you]

  39. *PZ1: se ‘un ci si vede prima di Natale / buon Natale // [just in case we are not going to meet each other again before Christmas, happy Christmas] epistemic – deontic *KAT: magari / crude / riempian di più // epistemic- alethic [may be, row, they fill you up much more] *LUC: a vedella / insomma / ti fa effetto // epistemic-alethic [when you see her, so, you get impressed] *PAZ: mangiare / mangio // epistemic –alethic [as far as food is concerned, I eat]

  40. *ANG: stasera / l’ ho regolato com’era // alethic-alethic [this night, I put it as it was before] *SAB: quando siamo entrati / c’ era un sacco di gente // alethic- alethic [when we came in, a lot of people was there] *LID: essendo bianca / ci tengo la sopracoperta // alethic – alethic [given that it is white, I keep the cover on it]

  41. Topic frequently preceedes a negative focalized comment *CLA: come lei va via la sera / nell’ascensore ‘un c’è più luce // [when she goes away in the night, in the lift there is no more light] epistemic- alethic (negative) *SAB: però / se ti viene uno davanti alto / non vedi nulla // [But, is somebody tall comes in front of you, you can’t see anything] epistemic- alethic (negative) *PAR: e l’uomo / non inventa nulla // alethic- alethic (negative) [the man, do not invent anything] *DAR: le case del comune / non possono essere vendute // [the houses of the Town, cannot be on sale] alethic- deontic (negative)

  42. Appendix: APP (10%) • Function: it integrates only the locution, that is the linguistic content, of the comment (frequently) or topic (rarely) • Distribution: a) it follows a topicor the comment • b) it can be repeated • Intonation: its intonation corresponds to a suffix unit • *VAL: Sotto i tre anni / contano tanto // ma anche / insomma / da tre a diciotto / contano // • [under three years, they are relevant. But also, I mean, from three to eighteen, are relevant.] • *GCM: qual’ era / che doveva uscire ? • [what was, which should come?]

  43. Description of the suffix intonation unit; F0: D - movement (flat falling) Alignment : no alignment with the stress structure Length: short Speed: in the average Start: low with no temporal break with the previous unit

  44. Morpho-syntactic features; • Informal and formal corpus LABLITA (C. Ferri) • 8609 utterances, 552 appendix of comment (6,4%) • 73,4% verb-less (405) • 50,1 % SP; • 28,1 SN; • 7,4% ADV (missing adv.judgments); • 5,6% Pro; • 3,7% Adj (qualificative, missing evaluative); • 4,9% SN, SP, Adj coordinate with “and” or “or.” • 26,6% verbal (147) • 74,8% clauses without conjunction, • 25,2% clauses preceded by conjunction (25% introduced by “e”; 20% introduced by “che”)

  45. *LIV: ha riusato il costume / dell’anno scorso // [he reused the mask / of the last year] *TIZ: sì sì sì / dell’anno scorso // [yes yes, of the last year] *TAM: ce l' ho io / la ricetta // [it is me who own, the recept] *VAL: ‘un è mica sposata / l’Ornella // [it is not married, Ornella] *ALM: ma ero dispettosa / dispettosa // [but I was nasty, nasty]

  46. *CAR: invece manca Schwarz / manca // [on the contrary it lacks Schwarz / it lacks] *INS: con mio fratello piccino / era un fenomeno / era // [my jung brother, was a phenomenon, he was ] *UTE: no / la liquidazione / te lo dico / quando te l’hanno data // [no, the money, I’m going to tell you, when they gave that to you] *LUC: io non lo so / come sono venuti in contatto // [I do not know, how they coulg get in touch] *SAN: e qui indicate il vostro codice contribuente / che è lo stesso // [here you put your code, which is the same] *LAN : noi / ci s’andette in macchina da noi / quando s’andette a Barcellona // [we, we went there all alone by car, when we went to Barcelona]

  47. Semantic tiypologies: 1) revival, repetition, echo *VAL: Sotto i tre anni / contano tanto // ma anche / insomma / da tre a diciotto / contano // [under three years, they are relevant. But also, I mean, from three to eighteen, are relevant.] *CAR: invece manca Schwarz / manca // [on the contrary it lacks Schwarz / it lacks] *ALM: ma ero dispettosa / dispettosa // [but I was nasty, nasty] *INS: con mio fratello piccino / era un fenomeno / era // [my young brother, was a phenomenon, he was ] *LAN : noi / ci s’andette in macchina da noi / quando s’andette a Barcellona // [we, we went there all alone by car, when we went to Barcelona] *PZ1: e lì nel pagliaio / c’erano quattro persone / facevano il pagliaio // [there in the pagliaio, there where four people, they were doing the pagliaio] *LIV: ha riusato il costume / dell’anno scorso // *TIZ: sì sì sì / dell’anno scorso // [he reused the mask / of the last year] *FLP: mamma / te l’ho detto / e non te lo ridico // [mama, I told you, and I’m not going to tell it to you again.

  48. 2 - anaphora explicitation: *TAM: ce l' ho io / la ricetta // [it is me who own, the receipt] *HED: sì / le ho fatte / le analisi // [yes,I did it, the analysis] *VAL: ‘un è mica sposata / l’Ornella // [it is not married, Ornella] *UTE: no / la liquidazione / te lo dico / quando te l’hanno data // [no, the money, I’m going to tell you, when they gave that to you] *LUC: io non lo so / come sono venuti in contatto // [I do not know, how they coulg get in touch] *INS: ma lo sai / che non me lo ricordo // [you know, I do not remember]

  49. 3 – redundancy integration *GCM: qual’ era / che doveva uscire ? [what was, which should come?] *ELT: io sono un’elettrice / di Sesto Fiorentino // [I’m a citizen, of Sesto Fiorentino] *MIL: e ci siamo ripassati quest’anno / da quelle strade lì // [we went there again, around that roads] *SAN: e qui indicate il vostro codice contribuente / che è lo stesso // [here you put your code, which is the same] *OTT: puoi riassumere i parametri / che abbiamo esplicitato ? [can you sum up the parameters, we referred to? *IMP: almeno un certo periodo / per restituire un po’ di soldi // [at least some time, to give some money back] *FLP: te l’ha detto / e non te lo ricordi // [he told it to you, and you forgot it] *BAA: ti giri / e poi dorme lei // [you turn on the other side, and then she sleeps]

  50. Appendix hasn’t a proper modality, but it inherits that of the information unit it integrates • lack of modal expression in the appendix units

More Related