1 / 15

Visualizing Text Loretta Auvil UIUC February 25, 2011

Visualizing Text Loretta Auvil UIUC February 25, 2011. Overview. Architecture Overview. Knowledge Discovery Process. Knowledge Discovery Infrastructure Benefits . Provides access to data management tools Selecting/Loading data from databases, flat files or repositories

olisa
Télécharger la présentation

Visualizing Text Loretta Auvil UIUC February 25, 2011

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Visualizing Text Loretta Auvil UIUC February 25, 2011

  2. Overview

  3. Architecture Overview

  4. Knowledge Discovery Process

  5. Knowledge Discovery Infrastructure Benefits • Provides access to data management tools • Selecting/Loading data from databases, flat files or repositories • Integrates data mining algorithms • Supports an extensible interface for creating one’s own algorithms • Provides means for building and applying models • Provides integrated visualizations components • Provides capability to build custom applications • Provides access for local or distributed computation • Provides the ability to share components and applications

  6. Meandre Services from Firefox Plugin Tag Cloud Analysis Readability Analysis Network Analysis Date Entity to Simile Timeline Automatic Summarization Location Entity to Google Map Example: Zotero, SEASR, Protovis, Google Maps, Simile

  7. Mashups & Dashboards • Html driven dashboard • 6 services • 4 different tag cloud views • 2 relationship views Example: SEASR, Protovis

  8. Large Blue circles are Locations extracted from Tweets Orange circles are the Top Words found in these tweets Locations & Top Words from Tweets Example: Twitter, SEASR, Protovis

  9. Concept Mapping Example: SEASR, Flare

  10. From Silos to Mashups • Definition: Mashup is a web page or application that uses and combines data, presentation or functionality from two or more sources to create new services • Why do we want this? • Enable out services in many applications and on a variety of devices (laptop, high-res display wall, ipad, iphone or the others) • Share and reuse is a good thing • Reach communities with our tools and their data!!! • What can we do to change this? • We can think and create data driven solutions so that they can be mashed up with other tools. • We can build web services that can be deployed or accessed. • We can create API’s to be used.

  11. Mashup Framework Visualizations User Interfaces Apps Plugins Web Apps Services Meandre Workbench Repositories Data Analysis Components Flows Meandre Data-Intensive Flows Components Developer Tools Data Analytics Visualization Component Repository Component Discovery Meandre Infrastructure Virtualization Infrastructure Computational Resources

  12. Meandre Workbench • Web-based UI • Components and flows are retrieved from server • Additional locations of components and flows can be added to server • Create flow using a graphical drag and drop interface • Change property values • Execute the flow Components Flows Locations

  13. Meandre for Mashups • Major Capabilities • Dataflow execution • Semantic technology (using RDF for storing meta info) • Web-Oriented • Supports publishing services for data, analytics and visualization • Modular components • Encapsulation and execution mechanism • Promotes reuse, sharing, and collaboration • Cloud-friendly infrastructure • Implements MapReduce for parallelization • Note: Trading off some performance for reuse, flexibility and modular components… with option to parallelize components to improve performance

  14. Components • Analytics • Unsupervised Learning • Clustering • Frequent Pattern Analysis (Rule Association) • Supervised Learning • Naïve Bayesian • Support Vector Machines (Weka) • Decision Trees (c4.5) • Optimization Approaches • Genetic Algorithm • Text Analysis (NLP, Entity Ext) • OpenNLP • Stanford NER • OpenMary (NLP, Text-Speech) • Visualization • Geographic (Google Maps) • Temporal (Simile) • Network Graphs – Link Nodes and Arcs (Protovis) • Parallel Coordinates (Protovis) • Stacked Area Chart (Flare) • Tag Cloud Maker • Decision Tree (Applet D2K) • Naïve Bayes (Applet D2K) • Rule Association (Applet) • Dendogram (GWT)

  15. Links • www.seasr.org • www.seasr.org/meandre

More Related