120 likes | 253 Vues
The Semantic Assistants Wiki (SAW) under the ETC project aims to enhance collaborative content creation by integrating Natural Language Processing (NLP) into the wiki environment. This self-aware wiki supports users in information analysis, allowing them to manage and enrich content effectively. Leveraging NLP capabilities like entity recognition, the system facilitates biomedical literature curation and other domains. By automating tasks such as data enrichment and cross-linking multilingual entries, SAW significantly reduces manual curation efforts, fostering smarter wiki interactions.
E N D
Semantic Assistants Wiki (SAW) In the Context of the ETC Project
SAW Intro • Wiki • Users collaboratively develop and organize content • Information Analysis is up to the user • Goal: • “Self-aware wiki that can develop and organize its content” • Support users in information analysis • Requires NLP to handle majority of content • Semantic Assistants Wiki: Integration of NLP with Wikis
SAW in Action …Mary won the first prize... wiki.org/Mary “…Mary won…” “…Mary won…” … [[hasType::Person|Mary]] won… XML XML Wiki Wiki-NLP Integration Semantic Assistants GATE NLP Pipeline:Names Entity Recognition
Example: Biomedical Literature Curation • GenWiki: Filled with full text research papers • Entity Recognition: Enzymes, Organisms • Additional semantic information • Systematic name • Link to enzyme database entry • Wiki is enriched with additional data e.g. using Semantic MediaWiki markup [[hasType::Enzyme] • Time required to curate research papers reduced
Example: Wiktionary • Automatically populate the wiki using computational linguistics • Cross-link between different language entries • Manual work can be reduced
SAW for ETC • Charaparser …abaxialfaces, without septatetrichomes... etc-project.org/wiki/fna19 “…abaxial faces, without…” “…abaxial faces, without…” Wiki Wiki-NLP Integration Semantic Assistants GATE NLP Pipeline:Charaparser
SAW for ETC <structure id="o0" name="face" constraint="abaxial“/> <structure id="o1" name="trichome"> <character name="architecture" value="septate"/> </structure><relation id="r0" name=“without" negation="false" from="o0" to="o1"/> • Charaparser Faces * [[hasConstraint::Abaxial]] * [[without::Trichome]] XML Trichome * [[hasArchitecture::septate]] etc-project.org/wiki/fna19/superstructure/faces etc-project.org/wiki/fna19/superstructure/trichome Wiki Wiki-NLP Integration Semantic Assistants Charaparser NLP Pipeline
SAW for ETC • Charaparser • GATE compliant NLP pipeline • Charaparser output to wiki markup translation module • Wiki as ‘User Interface’ of Charaparser • Logic Reasoning, Information Theory, Ontology building can • Read and query charaparser results from wiki (RDF triplets) • Be possibly integrated in wiki
Open Questions • Charaparser as GATE compliant pipeline; Effort? • Alternatives to Semantic Assistant Wiki? Apache Stanbol? • Representation of Structure, Character, Relation in Wiki (e.g. duplicate structure names)
References BaharSateli and René Witte. Natural Language Processing for MediaWiki: The Semantic Assistants Approach. WikiSym2012. BaharSateli, Marie-Jean Meurs, Greg Butler, Justin Powlowski, Adrian Tsang, René Witte. IntelliGenWiki: An Intelligent Semantic Wiki for Life Sciences. NETTAB 2012. René Witte and Thomas Gitzinger. Connecting Wikis and Natural Language Processing Systems. WikiSym 2007. http://www.semanticsoftware.info/first-open-source-release-semantic-assistants-wiki-nlp