Linked Data Visualizations for Eurostat Linked Data

Linked Data Visualizations for Eurostat Linked Data. Dr. Brand Niemann Director and Senior Data Scientist Semantic Community http://semanticommunity.info/ http://datacommunitydc.org/blog/2013/08/cloud-soa-semantics-and-data-science-conference/ October 28, 2013


Presentation Transcript


  1. Linked Data Visualizations for Eurostat Linked Data Dr. Brand Niemann Director and Senior Data Scientist Semantic Community http://semanticommunity.info/ http://datacommunitydc.org/blog/2013/08/cloud-soa-semantics-and-data-science-conference/ October 28, 2013 http://semanticommunity.info/European_Union_Open_Data_Portal

  2. Background • I have noticed that the Eurostat web site provides many pages to find and download data, seven visualization applications (one of which, CubeViz, does not seem to work), and linked data (data that can be retrieved by queries to build new tables of related data). • This raises the question: • Why not integrate all three functions so one can see how the original data become visualizations that are linked and linkable to other data and visualizations, and so on? • I think I can do that with the tools in the Semantic Community Platform, as I did for a previous story on Eurostat.

  3. Previous Dashboard Purpose: CKAN and others have recently discussed data catalogs and linked open data for "humans," because both are difficult for most people to work with without special training and tools. More people would understand them if there were some relationship between them and conventional data tables and relational database work. The author is piloting this with Excel spreadsheets and Spotfire. The concept is to provide data in a "human context" and "an interoperability interface" as follows: Information (Topics and Sub-topics) and Data (Tables and Data Elements), so one could more easily do data integration across high-quality statistical and environmental data within the US, within the EU, and across the two. To do this with confidence, a decision-maker requires the kind of information and tools provided in this pilot. Created by: Brand Niemann, May 16, 2011 (in process) https://silverspotfire.tibco.com/us/library#/users/bniemann/Public?EUUSData-Spotfire

  4. Eurostat – Linked Data http://eurostat.linked-statistics.org/

  5. Goals and Process • A broader question that always comes to mind when I look at Web pages is: • What is at all of those URLs in a Web page, and what if you could look at an index that summarizes what is at those URLs and see which have actual data (Excel, Access, etc.) or could become data through screen scraping, conversion, etc. (HTML, PDF, etc.)? • And what if this index were in RDF triple format (Subject, Predicate, and Object), so that it was both human readable and machine readable, and perhaps consisted of triples about triples, about triples, and so on? • This is why I decided to call it Linked Data Visualizations for Eurostat Linked Data. • Hopefully this will help me and others who find using RDF for Data to be more work than using PDF for Data, as I concluded in my previous story: • With data in PDF you can try to use the Adobe utility that converts PDF to Excel, or screen scrape it by hand, but with RDF I have not been able to figure out an equivalent, and those working with RDF do not always provide the source data in Excel, Access, etc. to easily go back to. • The process is to start with the Eurostat Bulk Download, convert it to a knowledge base of Linked Data in MindTouch and Excel spreadsheets, and visualize those linked data sets in Spotfire. • The goal is to be able to go from data elements to data tables to databases that are statistically sound and semantically interoperable with both conventional and Semantic Web standards and technologies.
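The URL index described in this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used in the presentation: the extension groups, the predicate names `linksTo` and `hasKind`, and the `example.org` URLs are all hypothetical, and the "triples" are plain Python tuples in (subject, predicate, object) order rather than real RDF.

```python
from html.parser import HTMLParser
from pathlib import PurePosixPath
from urllib.parse import urljoin, urlparse

# Hypothetical extension groups (assumptions, not taken from the slides).
DATA_EXTS = {".xls", ".xlsx", ".csv", ".tsv", ".mdb", ".accdb"}
CONVERTIBLE_EXTS = {".html", ".htm", ".pdf"}


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))


def classify(url):
    """Label a URL by file extension: actual data, convertible, or unknown."""
    ext = PurePosixPath(urlparse(url).path).suffix.lower()
    if ext in DATA_EXTS:
        return "data"
    if ext in CONVERTIBLE_EXTS:
        return "convertible"
    return "unknown"


def index_page(html, base_url):
    """Return (subject, predicate, object) tuples describing a page's links."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    triples = []
    for url in collector.links:
        triples.append((base_url, "linksTo", url))
        triples.append((url, "hasKind", classify(url)))
    return triples


# Made-up sample page for illustration only.
SAMPLE_HTML = """
<a href="/files/table.xlsx">Table</a>
<a href="report.pdf">Report</a>
<a href="http://example.org/about">About</a>
"""
TRIPLES = index_page(SAMPLE_HTML, "http://example.org/data/")
```

Classifying by file extension is the simplest choice and will miss data served from extensionless URLs; checking the response content type would be more reliable, but it requires fetching every link.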

  6. Eurostat Bulk Download: Directory and Metadata http://epp.eurostat.ec.europa.eu/NavTree_prod/everybody/BulkDownloadListing http://epp.eurostat.ec.europa.eu/NavTree_prod/everybody/BulkDownloadListing?sort=1&dir=metadata

  7. Knowledge Base: MindTouch My Note: The entire platform can be searched. My Note: The Semantic Community Platform includes MindTouch (Wiki), Spotfire (Analytics), and Be Informed (Be Free). This page can be searched with Google Chrome Find. http://semanticommunity.info/European_Union_Open_Data_Portal#Story_2

  8. Knowledge Base: Spreadsheet My Note: All of these spreadsheets can be searched. My Note: The Semantic Community approach is consistent with the ISA Recommended URI Design and Management Principles. http://semanticommunity.info/@api/deki/files/26772/EuroStatsCD2013.xlsx

  9. Knowledge Base: Spotfire • Find Data Set in the Navigation Tree • Select Row and See Details-on-Demand • Find the Code in Bulk Download Tables • Then Find in Codes • Then Download and Visualize EUMIDA Spreadsheet My Note: These Visualizations Are Dynamically Linked. https://silverspotfire.tibco.com/ViewAnalysis.aspx?file=/users/bniemann/Public/EuroStatsLD2013-spotfire.dxp
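The lookup steps in this slide (select a data set, then find its codes) amount to joining two tables on a shared code. The following is a minimal sketch of that idea outside Spotfire, assuming hypothetical column names and made-up placeholder rows; it does not reproduce the layout of the actual Eurostat workbook.

```python
import csv
import io

# Placeholder tables for illustration only; the column names and every
# value below are assumptions, not real Eurostat data.
DATASETS_CSV = """code,title
ds_a,Example data set A
ds_b,Example data set B
"""

CODES_CSV = """dataset,dimension,code,label
ds_a,geo,XX,Example country
ds_a,unit,PC,Example unit
ds_b,geo,YY,Another example country
"""


def load(text):
    """Parse CSV text into a list of dict rows keyed by the header line."""
    return list(csv.DictReader(io.StringIO(text)))


def details_on_demand(dataset_code, datasets, codes):
    """Return the selected data set's title plus its matching code rows."""
    row = next((d for d in datasets if d["code"] == dataset_code), None)
    if row is None:
        return None
    return {
        "code": row["code"],
        "title": row["title"],
        "codes": [
            (c["dimension"], c["code"], c["label"])
            for c in codes
            if c["dataset"] == dataset_code
        ],
    }


DETAILS = details_on_demand("ds_a", load(DATASETS_CSV), load(CODES_CSV))
```

With real data, the same function could be pointed at CSV exports of the data set listing and the code lists, provided the column names are adjusted to match.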

  10. Conclusions and Recommendations • The goal is to be able to go from data elements to data tables to databases that are statistically sound and semantically interoperable with both conventional and Semantic Web standards and technologies. • The process is to start with the Eurostat Bulk Download, convert it to a knowledge base of Linked Data in MindTouch and Excel spreadsheets, and visualize those linked data sets in Spotfire, where more linked data tables and their visualizations can be created. • The knowledge base is both human readable and machine readable and consists of triples about triples, about triples, and so on. • For example, the Code Lists contain 168,314 rows and are a definitive semantic data asset repository at the data element level. • Essentially, I found only one data set in the Linked Data Report on the Publication of European Institutions Data that did not have to be screen scraped. • Again, I find using RDF for Data to be more work than using PDF for Data.