1 / 50

Oplæg: Vidensrepræsentation af fagligt stof: Er kulturviden nødvendig?

Oplæg: Vidensrepræsentation af fagligt stof: Er kulturviden nødvendig?. Birthe Mousten MA, Ph.D. 12. Oktober 2016. Kobling af faglig viden og kultur. Myte 1: Faglig viden er tværkulturel Gælder kun grundviden inden for kemi, fysik, matematik

amalie
Télécharger la présentation

Oplæg: Vidensrepræsentation af fagligt stof: Er kulturviden nødvendig?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Oplæg: Vidensrepræsentation af fagligt stof: Er kulturviden nødvendig? Birthe Mousten MA, Ph.D. 12. Oktober 2016

  2. Kobling af faglig viden og kultur • Myte 1: Faglig viden er tværkulturel Gælder kun grundviden inden for kemi, fysik, matematik De anvendte videnskaber, teknikker og metoder er kulturelt bundne og systembundne. Man udfører forskellige funktioner på forskellige måder.

  3. Kobling af faglig viden og kultur • Myte 2: Fagsproglig viden er tværkulturel Gælder kun kernebegreber inden for kemi, fysik, matematik De anvendte videnskaber, teknikker og metoder er kulturelt bundne og systembundne. De forskellige funktioner giver forskelligt fagsprog, som ikke altid er sammenlignelige på tværs af sprog.

  4. Kobling af fag og kultur Eksempler: VELUX vinduer Køleskabe Biler Medicin Og alt muligt andet Fagfolk Fagsproglig viden Faglig viden Sprogfolk

  5. Hvordan kan vi så bruge korpuslingvistik? • Kortlægning af fagsprog • Kortlægning af genretræk, sprogstrukturer mv. • Kortlægning af faglig viden • Kortlægning af kulturens særlige træk • …og meget mere.

  6. Workshop: Korpuslingvistik og fagsprog: AntConc som korpusredskab til kortlægning af viden og sprog.

  7. Beskrivelse: Korpuslingvistik: Primært kendt fra megakorpora. • fx Oxford’s og Longman’s ordbøger • Fx Webster • Fx Duden og LaRousse. • Fx Den Danske Ordbog online => Altså som online søgeredskaber

  8. Beskrivelse: • Dansk retskrivning er registreret i Retskrivningsordbogen. • Udvalget af fagord sparsomt. • Også et problem for fx engelsk og tysk. • Fagsprogsordbøgerne heller ikke dækkende. • Det ser vi især inden for teknisk sprog.

  9. Beskrivelse: • En professionel sprogbruger kan uddrage både faglig og fagsproglig viden fra et korpus – typisk større korpora • Metoder: leksikografi, terminologi og genrekarakteristika • Et ad-hoc-korpus med fagtekster • Kvik-værktøj til at skrive eller oversætte en fagtekst • Korpusprogram • Freeware-programmet AntConc

  10. Korpuslingvistik – Plan for workshop • Artikler om ad-hoc korpuslingvistik • Mega-korpora • Ad-hoc-korpora • Traditionel korpusbrug • Tværsproglig korpusbrug • Fagsprogskorpora • Afprøvning af AntConc – freeware-program

  11. Corpus linguistics - articles Ourhumble start:

  12. Corpus linguistics - Tutorial

  13. Corpus linguistics – Case study

  14. Mega-corpora American National Corpora: http://corpus.byu.edu/ British National Corpus: http://www.natcorp.ox.ac.uk/ Wordschatz: Http://wortschatz.uni-leipzig.de/ KorpusDK: http://ordnet.dk/

  15. Overview American National Corpora

  16. British National Corpus

  17. German National Corpus

  18. Danish National Corpus

  19. Task later: Knowledge about insulin analogs • Youwillbeasked to work with insulin analogslater, so why not test that term right now? • Wewillsearch the American megacorpora for insulin analog and seewhere it getsus. • Let ustry the American megacorpus first.

  20. Result: 11 hits = 11 texts; at a closer look maybeonly 6, of whichthreeare not American English, but probably Indian English, and one is international English, and a press release, and twoothertexts art from the three hits from FDA, whichareprobably from onlyonetext. So in practice, from an American point of view, there is the FDA text and some more dubiousones. American NOW corpus Not very impressive: Shows the need for a lack of further search to work with the area. However, why not take the FDA text for our corpus now that we have it. So we click the text and…

  21. The first Reference renders this: ….getthistext, whichwecopy for ourtext corpus.

  22. Our corpus text NO. 1 The same textcopied to Word.

  23. Startingourcollection for the corpus • Copytext • Open Word • Savingtext in Word in a file in a folder: - Save as –> source/title/date (or yourchosen parameters) - Save as .txt by choosingplaintext => (all codes from html removed)

  24. Save as My chosen folder Source text date Plain text

  25. Building up the corpus • I have to searchelsewhere for text, and why not the largest big-data corpus in the world: Google. • I use Google advancedsearch – it is easier, and quicker. • But a small step beforethat – I readmy wiki • https://www.google.ca/advanced_search

  26. Wikipedia To helpmemake a veryprecisesearch on Google, I sneak peak in Wikipedia to seewhether it knowsanythingabout insulin analog: • An insulin analog is an altered form of insulin, different from any occurring in nature, but still available to the human body for performing the same action as human insulin in terms of glycemic control. Through genetic engineering of the underlying DNA, the amino acid sequence of insulin can be changed to alter its ADME (absorption, distribution, metabolism, and excretion) characteristics. Officially, the U.S. Food and Drug Administration (FDA) refers to these as "insulin receptor ligands", although they are more commonly referred to as insulin analogs. • These modifications have been used to create two types of insulin analogs: those that are more readily absorbed from the injection site and therefore act faster than natural insulin injected subcutaneously, intended to supply the bolus level of insulin needed at mealtime (prandial insulin); and those that are released slowly over a period of between 8 and 24 hours, intended to supply the basal level of insulin during the day and particularly at nighttime (basal insulin). The first insulin analog approved for human therapy (insulin Lispro rDNA) was manufactured by Eli Lilly and Company.

  27. Then Google Advanced I wantthese parameters My keyconcept Most recent year

  28. Ok – let’sget to it then Give yourself 10 minutes to copypasteintoyour folder. Google Advanced results

  29. Ten minutesafter I have my corpus Then I am ready for my corpus work

  30. Now --- our task We just got a task from a company, say Eli Lilly or Novo Nordisk aboutwriting or translation somethingabout insulin analog

  31. Typical corpus search • How can a corpus helpyou? • Register • Collocations • Definitions • Synonyms • Knowledge! • Etc.

  32. Please join mehere! Getting the program Find Laurence Anthony’s website – Just google search the name. By the way, the address is here: http://www.antlab.sci.waseda.ac.jp/software.html#antpconc Press: AntConc 3.4.4 (or the Mac or other version that is compatible with your computer) (NB: The languagecode must be Western Latin 1! (Check under Global Settings – Language encoding – set it to Western Latin 1)

  33. Loadingyour corpus into the program • Guide: • Press AntConc 3.4.4 (sprogudgave skal være Latin 1) • Load your folder (Windows explorermethod) • Thenyouareready to search.

  34. AntConcopened – but empty AntConc is now open. Press File Open Directory 3) Load your corpus folder in the Windows way

  35. Your Corpus list

  36. Press Word List

  37. Lantus – what is that? Onlythreesources: -> Product name? Check in texts.

  38. Scrolldown -> differentsourcesuse the word => ESP word What is hypoglycemia If youpresssome of the words, yougetdirectlyinto the texts.

  39. The or trick Finding: Content knowledge Alternatives Synonyms (tryalso Aka Referred to Known as (

  40. The ( trick – findingparenthetical info Which kind of info wouldyou find in a parenthesis and whatdoesthat data tellyou?

  41. Findingcollocations - right Set the sorting parameters at the bottom. 1R 2R 3 R Findings: Metabolicchanges Metaboliccontrol Metabolicdecompensation Metabolicdeterioration All of themconcepts in theirown right.

  42. Findingcollocations - left Set the sorting parameters at the bottom. 1L 2L 3L Findings: Good metabolic Poormetabolic Rapid metabolic … ..but for instance not bad metabolic!

  43. Statistics & collocation – for the nerds Shows ranking, frequencyleft, frequency right, statistics and the collocate. Note for instanceregimen, canbeused with dosing a L and R collocate. The same meaning?

  44. File View – for bettercontext

  45. Concordance plot for precautions Shows in whichtexts a wordexists and how the worduse is distributedthroughout the text. Precautions has a strong front tendency in the texts. A possible genre tool?

  46. Yourturn • Let yourphantasyloose. • What do you find out? • Whatcan it beused for? • Is it useful in the firstplace?

  47. My opinion – Good The program is intuitive in use – I never learned it from anyone. General Microsoft processeswork in AntConc. Good as an ad-hoc writing and translation tool. Good for register work. Good for collocations. Good for proof of whatyouaredoing. Forgetaboutyour intuitive ideas and check how it works.

  48. My opinion – Good Define a problem and devise yourown solution method. Even 10 texts as in our case can provide a wealth of information. Must nowadaysnecessarilyreplaceanyoldfashioned pencil-and-paperwork.

  49. My opinion - limitations • The quality of the findingsdepends on the user Quality in => quality out • You have got to getstarted to like it • You have to learnapprox. fiveshortcuts in order not to tire out

  50. Good luck • Thankyou for joiningme in this. • If youwant to contactme, please writeme: • bmo@cc.au.dk / bmo@expo-com.dk

More Related