1 / 106

Natural Language Processing >> Morphology <<

Natural Language Processing >> Morphology <<. winter / fall 2012/2013 41.4268. Prof. Dr. Bettina Harriehausen-Mühlbauer Univ. of Applied Science, Darmstadt, Germany https://www.fbi.h-da.de/organisation/personen/harriehausen-muehlbauer-bettina.html Bettina.Harriehausen@h-da.de. content.

gavivi
Télécharger la présentation

Natural Language Processing >> Morphology <<

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Natural Language Processing>> Morphology << winter / fall 2012/201341.4268 Prof. Dr. Bettina Harriehausen-Mühlbauer Univ. of Applied Science, Darmstadt, Germany https://www.fbi.h-da.de/organisation/personen/harriehausen-muehlbauer-bettina.html Bettina.Harriehausen@h-da.de

  2. content • 1 morphemes • 2 compounds / concatenation • 3 idiomatic phrases • multiple word entries (MWE) • spell aid • regular expressions • Finite State Automata (FSA) - Natural Language Systems - Harriehausen

  3. content • 1 morphemes • 2 compounds / concatenation • 3 idiomatic phrases • multiple word entries (MWE) • spell aid • regular expressions • Finite State Automata (FSA) - Natural Language Systems - Harriehausen

  4. definition • Morphemes • morpheme = smallest possible item in a language that carries meaning • lexeme (man, house, dog,...) • inflectional affixes (dog-s, want-ed,...) • other affixes (pre-/in-/suff-): unwanted, atypical, antipathetic,... • esp. in technical language (-itis = „infection“, gastro = stomach...gastroenteritis) - Natural Language Systems - Harriehausen

  5. morphemes - Natural Language Systems - Harriehausen

  6. morphemes free morphemes : stand-alone, carry lexical and morphological meaning (e.g. house= sing, neuter, nominative ; case/number/gender) bound morphemes : legal wordformonly in combination with another morpheme, stand-alone, carry lexical and morphological meaning.Various combinations exist: bound + free: e.g. un-happy, all bound: e.g. gastro-enter-itis - Natural Language Systems - Harriehausen

  7. morphemes inflectional morphemes : create words and carry morphological meaning (e.g. dogs, laughed, going derivational morphemes : create wordforms and carry morphological meaning ( happily, intellectually, instruction, instructor, insulator, the pounding, limpness, blindness...) • Question: which string (~morpheme) do we include in our dictionary ? • full form dictionary vs. • base form dictionary (lemmas) - Natural Language Systems - Harriehausen

  8. content • 1 morphemes • 2 compounds / concatenation / decompounding • 3 idiomatic phrases • multiple word entries (MWE) • spell aid • regular expressions • Finite State Automata (FSA) - Natural Language Systems - Harriehausen

  9. compounds / concatenation Definition: a compound is a lexeme that consists of more than one stem. Compounding or composition is the word formation that creates compound lexemes (= compounds). There is no clear upper limit in number of roots allowed in English compounds. It usually doesn‘t exceed 3 morphemes, but it is clearly a stylistic issue. Some compounds are written as one word: blackbird. Some are written with hyphens: mother-in-law. Most are written as separate words: smoke screen.  Typically not spelling, but stress and word-internal sound rules distinguish compounds from non-compounds: Compare white house with White House. Question:What do we put into our dictionary ? - Natural Language Systems - Harriehausen

  10. compounds / concatenation Compounding follows rules. e.g. from chemical compounds. (http://www.chem.qmul.ac.uk/iupac/) Substitutive nomenclature This naming method generally follows established IUPAC organic nomenclature. E.g.: Hydrides of the main group elements (groups 13–17) are given -ane base names, e.g. borane (BH3), oxidane (H2O), phosphane (PH3) . The compound PCl3 would be named substitutively as trichlorophosphane. Additive nomenclature This naming method has been developed principally for coordination compounds. An example of its application is: [CoCl(NH3)5]Cl2 pentaamminechloridocobalt(III) chloride - Natural Language Systems - Harriehausen

  11. Example of a chemical compoundComponents of Phane Parent Names bicyclo[8.6.0]hexadecaphane • The prefix "bicyclo" indicates that there are two rings (bi-cyclo). • The bridge descriptor describes the ring structure in terms of a sixteen-membered main ring [8 + 6 + 2 (the bridgehead nodes)] with a bridge consisting of a bond, i.e., zero nodes, which divides the main ring into an eight-membered and a ten-membered ring. • The numerical term "hexadeca" denotes the presence of sixteen skeletal nodes.and • the term "phane" indicates that at least one node represents a multiatomic (cyclic) structural unit. [http://www.chem.qmul.ac.uk/iupac/phane/PhI2.html] - Natural Language Systems - Harriehausen

  12. - Natural Language Systems - Harriehausen

  13. Example of a medical compound Medical compounds are usually composed of a prefix + root + suffix, where neither of the components can be used stand-alone. nephritis: inflammation of the kidney supra-renal: situated above the kidneys nephrologist: a kidney doctor gastroenteritis : inflammation of stomach and intestines nephr- 2 roots: Greek (νεφρός nephr(os)) , Latin (ren(es)). = kidney gastr- ancient Greek γαστήρ (gastēr), γαστρ- = stomach, belly -o- linking 2 body parts (linguistically) enter- ancient Greek ἔντερον (énteron) = intestine -itis = inflammation supra- = above - ologist = person studying a certain body part - Natural Language Systems - Harriehausen

  14. - Natural Language Systems - Harriehausen

  15. compounds / concatenation • formation of compounds: synthesis and agglutination • Compound formation rules vary widely across language types. • Examples of formation processes (usually linked to the language type): • synthesis (typically with synthetic languages, i.e. languages with a high morpheme-per-word ratio): e.g. German:Kapitänspatent = Kapitän (sea captain) + Patent (license) joined by an -s- (originally a genitive case suffix); „patent of a sea captain“Latin:paterfamilias = pater (father) + familias (genitive of the lexeme familia (family)); „father of a family“ - Natural Language Systems - Harriehausen

  16. compounds / concatenation formation of compounds: It can get more difficult: (German -> English) Aufsichtsratsmitgliederversammlung => Auf = onsicht+s =view + “Fuge-s“rat+s = council +„genitive-s“mit =with glied + er = link + „plural“ ver = „completion“ samml (stem = sammeln) = collect ung = „noun“ On-view-council-with-link-collect ?????????????????? = "meeting of members of the supervisory board" Notice:"with" and "link" form a derivation that is the German word for "member"; "completion", "collect" and "noun" form a derivation that means "meeting" - Natural Language Systems - Harriehausen

  17. compounds / concatenation • formation of compounds: synthesis and agglutination • agglutination (usually with agglutinative languages, which tend to create very long words with derivational morphemes), e.g. • German • Farbfernsehgerät = color television set • Funkfernbedienung = radio remote control • Donaudampfschifffahrtsgesellschaftskapitänsmütze = Danube steamboat shipping company Captain's hat • Finnish • hätä-uloskäytävä = emergency exit • Lentokone-suihku-turbiini-moottori-apu-mekaanikko-aliupseeri-oppilas • = Airplane jet turbine engine auxiliary mechanic non-commissioned officer student • Swedish • rörelseuppskattningssökintervallsinställningar = Motion estimation search range settings - Natural Language Systems - Harriehausen

  18. Samples for long compounds in German • die Armbrust • die Mehrzweckhalle • das Mehrzweckkirschentkerngerät • die Gemeindegrundsteuerveranlagung • die Nummernschildbedruckungsmaschine • der Mehrkornroggenvollkornbrotmehlzulieferer • der Schifffahrtskapitänsmützenmaterialhersteller • die Verkehrsinfrastrukturfinanzierungsgesellschaft • die Feuerwehrrettungshubschraubernotlandeplatzaufseherin • der Oberpostdirektionsbriefmarkenstempelautomatenmechaniker • das Rindfleischetikettierungsüberwachungsaufgabenübertragungsgesetz • die Donaudampfschifffahrtselektrizitätenhauptbetriebswerkbauunterbeamtengesellschaft Wolkenkratzer 'skyscraper': wolken 'clouds', + kratzer 'scraper' Eisenbahn 'railway': Eisen 'iron', + bahn 'track' Kraftfahrzeug 'automobile': Kraft 'power', + fahren/fahr 'drive', + zeug 'machinery' Stacheldraht 'barbed wire': stachel 'barb/barbed', + draht 'wire' Rinderkennzeichnungs- und Rindfleischetikettierungsüberwachungsaufgabenübertragungsgesetz: literally, Cattle marking and beef labeling supervision duties delegation law - Natural Language Systems - Harriehausen

  19. Samples for long compounds in different languages(see: http://en.wikipedia.org/wiki/Compound_%28linguistics%29) Chinese(Cantonese Jyutping): 學生 'student': 學 learn + 生 grow 太空 'universe': 太 t great + 空 emptiness 摩天樓 'skyscraper': 摩 touch + 天 sky + 樓 building (with more than 1 storey) 打印機 'printer': 打 strike + 印 stamp/print + 機 machine 百科全書 'encyclopaedia': 百 100 + 科 (branch of) study + 全 entire/complete + 書 book Dutch: Arbeidsongeschiktheidsverzekering 'disability insurance': arbeid 'labour', + ongeschiktheid 'inaptitude', + verzekering 'insurance'. Rioolwaterzuiveringsinstallatie 'wastewater treatment plant': riool 'sewer', + water 'water', + zuivering 'cleaning', + installatie 'installation'. Verjaardagskalender 'birthday calendar': verjaardag 'birthday', + kalender 'calendar'. Klantenservicemedewerker 'customer service representative': klanten 'customers', + service 'service', + medewerker 'worker'. Universiteitsbibliotheek 'university library': universiteit 'university', + bibliotheek 'library'. Doorgroeimogelijkheden 'possibilities for advancement': door 'through', + groei 'grow', + mogelijkheden 'possibilities'. - Natural Language Systems - Harriehausen

  20. Samples for long compounds in different languages(see: http://en.wikipedia.org/wiki/Compound_%28linguistics%29) Samples for long compounds in different languages Finnish: sanakirja 'dictionary': sana 'word', + kirja 'book' tietokone 'computer': tieto 'knowledge, data', + kone 'machine' keskiviikko 'Wednesday': keski 'middle', + viikko 'week' maailma 'world': maa 'land', + ilma 'air' rautatieasema 'railway station': rauta 'iron' + tie 'road' + asema 'station' suihkuturbiiniapumekaanikkoaliupseerioppilas: 'Jet engine assistant mechanic NCO student' atomiydinenergiareaktorigeneraattorilauhduttajaturbiiniratasvaihde: some part of a nuclear plant Korean:안팎 anpak 'inside and outside': 안 an 'inside' + 밖 bak 'outside‚ Spanish: Ciempiés 'centipede': cien 'hundred', + pies 'feet' Ferrocarril 'railway': ferro 'iron', + carril 'lane' Paraguas 'umbrella': para 'to stop, stops' + aguas '(the) water' - Natural Language Systems - Harriehausen

  21. Samples for long compounds in different languages(see: http://en.wikipedia.org/wiki/Compound_%28linguistics%29) Samples for long compounds in different languages Icelandic: járnbraut 'railway': járn 'iron', + braut 'path' or 'way' farartæki 'vehicle': farar 'journey', + tæki 'apparatus' alfræðiorðabók 'encyclopædia': al 'everything', + fræði 'study' or 'knowledge', + orða 'words', + bók 'book' símtal 'telephone conversation': sím 'telephone', + tal 'dialogue' Italian: Millepiedi 'centipede': mille 'thousand', + piedi 'feet' Ferrovia 'railway': ferro 'iron', + via 'way' Tergicristallo 'windscreen wiper': tergere 'to wash', + cristallo 'crystal, glass' Japanese: 目覚まし(時計) mezamashi(dokei) 'alarm clock': 目 me 'eye' + 覚まし samashi (-zamashi) 'awakening (someone)' (+ 時計 tokei (-dokei) clock) お好み焼き okonomiyaki: お好み okonomi 'preference' + 焼き yaki 'cooking' 日帰り higaeri 'day trip': 日 hi 'day' + 帰り kaeri (-gaeri) 'returning (home)' 国会議事堂 kokkaigijidō 'national diet building': 国会 kokkai 'national diet' + 議事 giji 'proceedings' + 堂 dō 'hall' - Natural Language Systems - Harriehausen

  22. compounds / concatenation formation of compounds and their structure: Most compounds are 2-root-compounds, but they come with a number of different structures: Nouns – Adjectives - Verbs A.  Nouns (see: http://public.wsu.edu/~gordonl/S05/256/compounds.htm) In each of these cases, the syntactic class of the compound is the same as the syntactic class of the final element of the compound. - Natural Language Systems - Harriehausen

  23. compounds / concatenation formation of compounds and their structure: In each of these cases, the syntactic class* of the compound is the same as the syntactic class of the final element of the compound. * syntactic class = part-of-speech, such as noun, verb, adjective,… - Natural Language Systems - Harriehausen

  24. compounds / concatenation • formation of compounds and their structure: • In each of these cases, the syntactic class of the compound is the same as the syntactic class of the final element of the compound. • Rule: • Germanic languages (e.g. English, German) are left-branching (the modifiers come before the head). Schoolteacher = teacher of a school, bluebird = bird of blue color • Romance languages ( e.g. French, Spanish) are usually right-branching; i.e. they are often formed by left-hand heads with prepositional components inserted before the modifier:chemin-de-fer = railway (lit. 'road of iron')moulin à vent = windmill (lit. 'mill (that works)-by-means-of wind') - Natural Language Systems - Harriehausen

  25. compounds / concatenation formation of compounds and their structure: B.  Adjectives (see: http://public.wsu.edu/~gordonl/S05/256/compounds.htm)In each of these cases, the syntactic class of the compound is the same as the syntactic class of the final element of the compound. - Natural Language Systems - Harriehausen

  26. compounds / concatenation • formation of compounds and their structure: • B.  Adjectives : hardworking • The internal structure may be complex: • hard + work + ing -> hardwork + ing OR hard + working • ing is typically the aspect-suffix that gets added to the verb (root): e.g. play-ing, laugh-ing, ask-ing,… • As a rule, we can form other wordforms (inflections, due to different tenses) from those roots, following the same inflectional pattern, i.e. verbal root + tense-marking-suffix, or insertion of modal verb: • Simple Present: He play-s. He laugh-s. He ask-s. • Simple Past: They play-ed. They laugh-ed. They ask-ed. • Simple Future: I will play. I will laugh. I will ask. - Natural Language Systems - Harriehausen

  27. compounds / concatenation formation of compounds and their structure: B.  Adjectives : hardworking The internal structure may be complex: hard + work + ing -> hardwork + ing OR hard + working * He hardworks. * They hardworked. * I will hardwork. -> hardwork + ingi.e. hardwork is not a verb by itself(see: http://public.wsu.edu/~gordonl/S05/256/compounds.htm) - Natural Language Systems - Harriehausen

  28. compounds / concatenation formation of compounds and their structure: B.  Adjectives : hardworking The internal structure may be complex: hard + work + ing -> hardwork + ing OR hard + working * He hardworks. * They hardworked. * I will hardwork. -> hardwork + ing(see: http://public.wsu.edu/~gordonl/S05/256/compounds.htm) Adj Adv Adj verb suffix hard work ing - Natural Language Systems - Harriehausen

  29. compounds / concatenation formation of compounds and their structure: C.  Verbs (see: http://public.wsu.edu/~gordonl/S05/256/compounds.htm) In each of these cases, the syntactic class of the compound is the same as the syntactic class of the final element of the compound. - Natural Language Systems - Harriehausen

  30. semantics of compounds • Semantic classification : it it common to classify compounds into 4 types: • endocentric description: A+B denotes a special kind of B • exocentric • copulative • appositional • Endocentric compounds consist of a head and modifiers, which restrict this meaning. Endocentric compounds tend to be of the same part of speech (word class) as their head. • Examples:- doghouse, where house is the head and dog is the modifier; i.e. a house intended for a dog • darkroom, where dark modifies room; i.e. a type of a room (usually used in photography) - Natural Language Systems - Harriehausen

  31. semantics of compounds • Semantic classification : it it common to classify compounds into 4 types: • endocentric • exocentric description: (one) whose B is A • copulative • appositional • Exocentric compounds have an unexpressed semantic head (e.g. a person, a plant, an animal...), and their meaning is often not transparent from its constituent parts. Examples: ●white-collar is neither a kind of collar nor a white thing, but the collar's colour is a metaphor for socioeconomic status ● red-neck only indirectly refers to a neck, but refers to a working person (e.g. farmer) • ● skinhead, may refer to a bald head but also refers to a certain group of people • ● paleface, native American Indians call the White Man a paleface - Natural Language Systems - Harriehausen

  32. semantics of compounds • Semantic classification : it it common to classify compounds into 4 types: • endocentric • exocentric • copulative description: A+B denotes 'the sum' of what A and B denote • appositional • Copulative compounds are compounds which have two semantic heads. • Examples: • bittersweet; having both tastes • sleepwalk; sleeping while walking OR walking in your sleep - Natural Language Systems - Harriehausen

  33. semantics of compounds • Semantic classification : it it common to classify compounds into 4 types: • endocentric • exocentric • copulative • appositional description: A and B provide different descriptions for the same referent; the meaning of which can be characterized as 'a AS WELL AS'. • Appositional compounds refer to lexemes that have two (contrary) attributes which classify the compound. • Examples: • actor-director; an actor who also plays the role of the director • maidservant; a maid who is also a servant OR a servant who is also a maid • Player-coach; someone who is a player as well as a coach - Natural Language Systems - Harriehausen

  34. semantics of compounds (ambiguities) When - in Germanic languages (e.g. German, English) - compound words are formed by prepending a descriptive word in front of the main word, the description or meaning between the components may be ambiguous. This is a problem for decompounding or translation. -> the orange bowl problem - Natural Language Systems - Harriehausen

  35. semantics of compounds (ambiguities) Can you please bring me the orange bowl ? ? bowl of orange colour ? bowl filled with oranges ? ? bowl that was formerly / usually filled with oranges bowl having the shape of an orange ? bowl with an orange pattern - Natural Language Systems - Harriehausen

  36. compounding - decompounding decompounding -> follows rules principles / rules: FANO rule: „the analysis is unambiguous, when a morpheme is not the beginning of another morpheme“ (= principle of longest match) e.g. but / butter (Orthographic) Ambiguities in segmentation : horseshoe: horses – hoe (?) vs. horse-shoe (the FANO rule would lead to the incorrect/unlikely segmentation) Segmentation has to be done recursively in order to find all possibilities: - Natural Language Systems - Harriehausen

  37. compounding - decompounding English: petshopping: pet-shopping vs. pets-hopping egg roll: Chinese food vs. rolling egg a green ´house vs. a ´greenhouse The white ´house vs. The ´White House - Natural Language Systems - Harriehausen

  38. compounding - decompounding German: Staubecken: Stau-becken = a reservoir Staub-ecken = dusty corners Wachstube: Wach-stube = die Stube einer Wache (the room of a guard) Wachs-tube = eine Tube, in der Wachs aufbewahrt wird (a tube filled with wax) Gelbrand: Gelb-rand = gelber Rand (a yellow border) Gel-brand = Brand eines Gels (burning of a gel) Tonerkennung: Toner-kennung = die Kennung eines Toners (the identifier of a toner) Ton-erkennung = das Erkennen von Tönen (the identification of tones) Lachen: Lache-n = mehrere Pfützen (multiple puddles of water) Lachen = eine menschliche Lautäußerung wie Gelächter (laughter) Druckerzeugnis: Druck-erzeugnis = Gedrucktes (printed matter) Drucker-zeugnis = Zeugnis für einen Drucker (certificate for a printer) beinhalten : bein-halten vs. be-inhalten (imagine: Beinhalten….) Abteilungen : Abtei-lungen vs. Abteil-ungen - Natural Language Systems - Harriehausen

  39. compounding - decompounding context or stress (in spoken language) is needed for disambiguation - Natural Language Systems - Harriehausen

  40. (problems with )concatenation • Summary • Structural as well as semantic challenges with compounds: • ambiguities in meaning (orange bowl) • ambiguities in hyphenation points (Staubecken) • not all morphemes can form a compound (sheepchops)-> - Natural Language Systems - Harriehausen

  41. (problems with )concatenation - Natural Language Systems - Harriehausen

  42. compounds -> MWE -> idiomatic phrases In addition to the compounds that have one of the four descriptions (endocentric, exocentric, copulative, appositional), i.e. stick to the original lexical meaning of at least one of its components, we need to consider „multiple morpheme strings / multi word expressions (MWE)“ (fixed phrases) that have „lost“ the original lexical meaning of its components. Those MWE are called idiomatic phrases or idioms. increasing the idiomatic rigidity increasing the formal complexity • compounding: combination of lexical meanings: carseat, houseboat, cellar door,... • compounding: not a combination of the lexical meanings: starfish, paperback, ladybug,... • depending on the context: bite the dust, lose face, kick the bucket,... = - Natural Language Systems - Harriehausen

  43. content • 1 morphemes • 2 compounds / concatenation • 3 idiomatic phrases • multiple word entries (MWE) • spell aid • regular expressions • Finite State Automata (FSA) - Natural Language Systems - Harriehausen

  44. idiomatic phrases (http://www.geo.de/GEOlino/mensch/redewendungen/englisch) • Out of the blue • To be on Cloud Nine • A leopard cannot change its spots • Head over heels • Fair Play • As cool as a cucumber • The early bird catches the worm • As fit as a fiddle • Beat about the bush • The Big Apple • The apple of my eye • Wet behind the ears • A bird in the hand is worth two in the bush • It's raining cats and dogs - Natural Language Systems - Harriehausen

  45. idiomatic phrases (http://www.geo.de/GEOlino/mensch/redewendungen/deutsch) • Wie bei Hempels unterm Sofa • Schmetterlinge im Bauch • Jemanden übers Ohr hauen • Ein Bäuerchen machen • Mit jemandem durch dick und dünn gehen • Seine Pappenheimer kennen • Jemandem die Würmer aus der Nase ziehen • Die Arschkarte ziehen • Mit jemandem Pferde stehlen können • Sich aus dem Staub machen • Hummeln im Hintern haben • Im siebten Himmel sein • Viele Wege führen nach Rom • Mit einem lachenden und einem weinenden Auge • Nah am Wasser gebaut haben • Da ist der Bär los • Nachtigall, ick hör dir trapsen • Mein lieber Scholli! - Natural Language Systems - Harriehausen

  46. idiomatic phrases (http://www.geo.de/GEOlino/mensch/redewendungen/deutsch) • Jemandem einen Denkzettel verpassen • Sich auf den Schlips getreten fühlen • Alles für die Katz • Wo drückt denn der Schuh? • Gegen den Strich gehen • Den Faden verlieren • Etwas ausbaden müssen • Einen Stein im Brett haben • Bahnhof verstehen • Der springende Punkt • Der Sündenbock sein • Einen Ohrwurm haben • Das ist doch zum Mäusemelken! • Schmiere stehen • Den Teufel an die Wand malen • Auf dem Holzweg sein • Eselsbrücke • In der Kreide stehen - Natural Language Systems - Harriehausen

  47. idiomatic phrases (http://www.geo.de/GEOlino/mensch/redewendungen/deutsch) • Die Ohren steif halten • Auf Vordermann bringen • Um die Ecke bringen • Hals- und Beinbruch • Auf dem Kerbholz haben • Eine Schlappe einstecken • Frosch im Hals • Es zieht wie Hechtsuppe • Jemandem einen Bärendienst erweisen • Damoklesschwert • Tomaten auf den Augen haben • Jemandem raucht der Kopf • Für 'n Appel und 'n Ei • Etwas an die große Glocke hängen • Das ist Jacke wie Hose • Etwas aus dem Ärmel schütteln • Ein X für ein U vormachen • Jemandem nicht das Wasser reichen können - Natural Language Systems - Harriehausen

  48. idiomatic phrases (http://www.geo.de/GEOlino/mensch/redewendungen/deutsch) • Alles im grünen Bereich • Die Hand ins Feuer legen • Das kann kein Schwein lesen! • Auf Draht sein • Sein blaues Wunder erleben • Der hat es faustdick hinter den Ohren • Mein Name ist Hase, ich weiß von nichts • Aus dem Stegreif • Der Groschen ist gefallen • Einen Vogel haben • Den Kürzeren ziehen • Bis in die Puppen • Etwas hinter die Ohren schreiben • Ins Fettnäpfchen treten • Beleidigte Leberwurst • Jemanden auf dem Kieker haben • Ich verstehe immer nur Bahnhof! • Die Katze im Sack kaufen - Natural Language Systems - Harriehausen

  49. idiomatic phrases (http://www.geo.de/GEOlino/mensch/redewendungen/deutsch) • Bekannt wie ein bunter Hund • Den Kopf in den Sand stecken • Mit dem ist nicht gut Kirschen essen • Aller guten Dinge sind drei • Lampenfieber • Das kommt mir spanisch vor • Schwein haben • Das hast du dir selbst eingebrockt • Seinen Senf dazugeben • Jemandem ist eine Laus über die Leber gelaufen • Kalte Füße bekommen • Im Stich lassen • Schwedische Gardinen • Alles in Butter • Geld auf den Kopf hauen • Das Handtuch werfen • Sich mit fremden Federn schmücken - Natural Language Systems - Harriehausen

  50. idiomatic phrases – and their morpho-syntax • Idiomatic expressions are extremely rigid, in that morpho-syntactic modifications are not allowed (without a change in meaning) : • GERMAN • Singular - Plural • Bekannt wie ein bunter Hund • ??? Bekannt wie bunte Hunde. • * Bekannt wir 2 bunte Hunde. • adjectival modification • Den Kopf in den Sand stecken. • Den Kopf in den weichen Sand stecken. - Natural Language Systems - Harriehausen

More Related