Computational Linguistic Resources


Presentation Transcript


    1. Computational linguistic resources / NLP applications for the Romanian language. Institutul de Cercetari pentru Inteligenta Artificiala (Research Institute for Artificial Intelligence)

    2. 1. Corpora. 1.1 Manually validated:
       NAACL 2003: English-Romanian parallel corpus of approximately 1.6 million segmented tokens across the two languages.
       Orwell, 1984: English-Romanian parallel corpus of approximately 250,000 segmented tokens across the two languages.
       Plato, The Republic: French-Romanian parallel corpus of approximately 250,000 segmented tokens.
       Newspapers: corpus compiled from various articles in Evenimentul Zilei.
       ROCO: Romanian corpus of journalistic material, approximately 7.1 million segmented tokens.