1 / 10

Andreas Becks

Aachen. St.Augustin. Roma, 24 novembre 2005. Visual Text Mining with SWAPit Detection of semantic relationships among text documents and associated data sources. Andreas Becks Fraunhofer-Institute of Applied Information Technology Sankt Augustin & Aachen, Germany.

fell
Télécharger la présentation

Andreas Becks

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Aachen St.Augustin Roma, 24 novembre 2005 Visual Text Mining with SWAPitDetection of semantic relationships among text documents and associated data sources Andreas Becks Fraunhofer-Institute of Applied Information TechnologySankt Augustin & Aachen, Germany

  2. Lost in the Ocean of Text Documents? • Text Mining helps to explore and analyse natural-language texts uncover relationships, recognize trends group, condense pieces of knowledge categorize text information • A huge amount of organisational knowledge is stored in text documents 85 to 90 percent of all corporate data according to Merrill Lynch and Gartner studies • Even when DMS and desktop search are used, a huge amount of time is necessary to find important information 80% of companies and 40% of public administrations need more than one day [Zylab survey]

  3. SWAPit Helps You to Navigate Through Your Text Data The tool visualises semantic relationships among text documents... X-ray view for document archives

  4. SWAPit Integrates Text and Data Mining ... and allows to navigate, search, browse and analyse text documents and associated data and metadata related structured data associations Fact View Similarity View text documents categorization catalogue oftext categories Tools for analysis and search Category View

  5. Application Example: Document Management Document similarity helps to create ‘fascicoli’ and find misclassified documents Protocollazione Project selection New text documents Information about type, AOO/UO, ‘Fascicoli’, etc. DL-based categorization Titolario

  6. SWAPit as a Single Point of Access From scattered information... ...to integrated information intuitive, user-centred access multi-schema databases, distributed & data-centred access text documents Virtual Integrated Database DL-based categorization DL-based integration user-specific schema & integrated access operational databases

  7. Monitoring Documents with SWAPit and DL From information overflow... ...to information overview  3 news in 1 minute  1 document map per day intuitively structured text documents conceptually filtered, relevant text documents unfiltered and unstructured text documents DL-based filter DL-based catalogue builder

  8. XML XML XML XML XML XML XML XML XML Displaying XML Documents in SWAPit From complex, machine-readable documents... ...to a human-oriented presentation data with technically rich structural annotation customized, task-oriented view metadata (selected attributes and elements) text content from specified attributes and elements web ontology ontology-context of specified elements

  9. Conclusion: Visual and Intuitive Text Mining with SWAPit • SWAPit combines views on text documents and associated data sources on a single sreen • Overview instead of overflow • Improves quality of text access tasks • Leverages knowledge sources • Flexible architecture • Designed to integrate Semantic Web technology • Derives additional power from integration of DL technologies • Can be integrated easily into existing infrastructures or company portals • Can be tailored to specific needs of different market segments • Long-standing experience in research and practical applications • Document Management, Business Intelligence, Customer Relationship Management, ... • Main sectors: Insurance, Textile, Engineering, Social Science • Technology has been extended in a joint project with Maurizio Lenzerini (SEWASIE)

  10. Grazie dell’attenzione!

More Related