Meeting on PSI Portals, Luxembourg, September 25th, 2009. PSI: Activities undertaken in Regione Piemonte. Marta Garabuggio, Andrea Muraca
Agenda • The regulatory framework • Regione Piemonte • PA Databases • Data Sharing policies • Re-use and licences • Information Directory • The catalogue as a PSI asset list • Future developments
The regulatory framework • European Directive 2003/98/EC on the re-use of public sector information • Italy: D.Lgs. 36, 24 January 2006, implementing the European Directive
Promotion policy • Promotion policy for public data reuse • Key points of the EU Commission’s promotion policy: • to draft non-discriminatory and user-friendly standard licences • to provide a complete and well-equipped metadata catalogue, with search and consultation tools on the web (metadata is the key!) • to plan suitable supply classes, in order to ease the input of public data into private information systems • Regione Piemonte is working on all fronts
Piemonte Region • Located in the north-west of Italy, bordering France and Switzerland • Capital city: Turin • Population: 4.5 million • 8 Provinces • 1,206 Municipalities/Towns • 8.1% contribution to national GDP • 7.0% unemployment rate • Data as of March 2009
Public Sector Information Heritage - Piemonte • Geographic databases: about 1,400 (raster, vector, orthophotos, digital landscape models, satellite remote-sensing data) • Administrative and statistical databases: about 1,500 (170 statistical); administrative data warehouse • Asset catalogue: books, culture, nature, archives • Multimedia databases: Video Community On-Line (Provincia Torino); multimedia information sharing on the environment (Regione Piemonte and ARPA, the Regional Agency for the Environment); Piemonte Digital Library; Photo-Archive (Provincia Torino) • Databases of laws and administrative bills: Regional Laws (Arianna, about 2,000 laws from 1971) • Data volumes shown in the diagram: 1700 GB, 2100 GB, 15 GB, 292 MB, 7.3 GB, 17 GB • CSI-Piemonte supports data sharing and integration of public bodies’ information systems
Institutional framework • In Piemonte • DGR n. 11/1161 “Agreement Protocol on sharing, developing and disseminating Regional Information Heritage” (July 2005), art. 4: “The Public Administrations, …, to create favourable and competitive market conditions, commit to define the form of sharing to economic operators” • DGR 31 - 11679, 29 June 2009: approval of the document “Guidelines on Regional Information Heritage reuse” and of the standard licence model for reuse. This is the first PA initiative of this kind in Italy.
Data Sharing policies: undertaken actions • Public Sector Information sharing: PSI interchange in Piemonte; access and transparency of public data; sharing to the private sector (re-use of public data) • Undertaken actions: Agreement Protocol among Local Governments; development of data access services (e.g. DWH); definition of licences
Data Reuse Licences: key issues • Licences are the "contract" that regulates data use • A licence defines: • commercial/non-commercial use • price • allowed reuse • use conditions (integration, new services, etc.) • source acknowledgment and logo use • DGR 31 - 11679 (29 June 2009) approved the guidelines and the licence model. The guidelines set the principles and general policies for reuse in the Piemonte region, while the licence model is a regional management tool that eases the definition of individual supply licences.
Information Directory • It is the metadata catalogue of Piemonte’s PA resources • it contains information related to databases, applications and products of: • Piemonte Region • Province of Turin • Municipality of Turin • Regional Council • users can look for information by means of a search engine service (free search) or browsing methods (thematic channels search)
Information Directory: back-office [Diagram: Back-office, Metadata, Front-office]
Information Directory: back-office • Metadata operators, who describe the information resources, use the back-office to: • insert and update objects’ metadata (databases, applications) • classify objects and define their keywords • relate objects to other resources • publish objects • authorize users to read or modify metadata • import metadata (including in XML format) from external systems
Information Directory: metadata • The heart of InfoDir is the metadata repository • The main types of metadata used are: • Descriptive metadata, such as title, description, author • Structural metadata, which indicate e.g. relations and coverage • Administrative metadata, which provide information about how the resources are created and who can access them • Technical metadata, such as server name and type of database • The repository implementation is based on the Qualified Dublin Core standard http://www.dublincore.org/ • The whole textual content of the repository is indexed
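The four metadata types listed above can be sketched as a single record. This is a minimal illustration, not InfoDir's actual schema: the `record` and `technical` wrapper elements, the `serverName` and `databaseType` fields and all sample values except the registry title are assumptions. The `dc:` and `dcterms:` names (`title`, `description`, `creator`, `relation`, `coverage`, `accessRights`) are standard Dublin Core terms; Dublin Core has no technical elements, so those are modelled here as a local extension.

```python
import xml.etree.ElementTree as ET

# Standard Dublin Core namespaces.
DC = "http://purl.org/dc/elements/1.1/"
DCTERMS = "http://purl.org/dc/terms/"
ET.register_namespace("dc", DC)
ET.register_namespace("dcterms", DCTERMS)


def build_record(descriptive, structural, administrative, technical):
    """Build one catalogue record from the four metadata groups."""
    record = ET.Element("record")  # hypothetical wrapper element
    # Descriptive and structural metadata map onto simple DC elements.
    for name, value in {**descriptive, **structural}.items():
        ET.SubElement(record, f"{{{DC}}}{name}").text = value
    # Administrative metadata uses qualified (dcterms) properties.
    for name, value in administrative.items():
        ET.SubElement(record, f"{{{DCTERMS}}}{name}").text = value
    # Technical metadata is not covered by Dublin Core: local extension (assumed).
    tech = ET.SubElement(record, "technical")
    for name, value in technical.items():
        ET.SubElement(tech, name).text = value
    return record


record = build_record(
    descriptive={
        "title": "Accommodation Facilities Registry",
        "description": "Sample description (placeholder)",
        "creator": "Regione Piemonte",
    },
    structural={"coverage": "Piemonte"},
    administrative={"accessRights": "public"},
    technical={"serverName": "db.example.org", "databaseType": "relational"},
)
xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)
```

Keeping the technical fields in their own sub-element makes it easy to leave them out of the public front-office view while still indexing the Dublin Core part.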
Information Directory: front-end • Users use the search engine to look for information. Information Directory provides different types of search, i.e. different ways to find relevant information: • Simple: by keyword OR classifications • Combined: by keyword AND classifications • Advanced: using filters
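The difference between the three search types can be sketched over an in-memory catalogue. This is only an illustration of the OR / AND / filter semantics; the `Resource` class, the function names and the sample entries are hypothetical, and the real Information Directory runs on an indexed repository rather than a Python list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    """Hypothetical catalogue entry."""
    title: str
    kind: str                   # e.g. "database" or "application"
    keywords: frozenset
    classifications: frozenset


def simple_search(resources, keyword, classification):
    """Simple search: keyword OR classification matches."""
    return [r for r in resources
            if keyword in r.keywords or classification in r.classifications]


def combined_search(resources, keyword, classification):
    """Combined search: keyword AND classification must both match."""
    return [r for r in resources
            if keyword in r.keywords and classification in r.classifications]


def advanced_search(resources, **filters):
    """Advanced search: every attribute filter must match exactly."""
    return [r for r in resources
            if all(getattr(r, name) == value for name, value in filters.items())]


def titles(hits):
    return [r.title for r in hits]


CATALOGUE = [
    Resource("Hotels registry", "database",
             frozenset({"tourism", "hotels"}), frozenset({"Tourism"})),
    Resource("School list", "database",
             frozenset({"schools"}), frozenset({"Education"})),
    Resource("Tourism portal", "application",
             frozenset({"tourism"}), frozenset({"Services"})),
]

print(titles(simple_search(CATALOGUE, "schools", "Tourism")))
print(titles(combined_search(CATALOGUE, "tourism", "Tourism")))
print(titles(advanced_search(CATALOGUE, kind="application")))
```

With this sample data the simple search returns two entries (one matched by keyword, one by classification), while the combined search narrows the result to the single entry that satisfies both conditions.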
Information Directory: front-end • For each information resource it is possible: • to consult descriptive metadata • to explore related resources and documents • Users can also: • explore the structure of databases (tables and columns) • access free services that provide public domain information [Diagram: Database, Table 1, Table 2, Table n, Columns]
PSI Policy: international principles • ePSIplus Recommendations • R1. Progress on implementation of the Directive • R2. Channels for redress • R3. Discriminatory practices • R4. Access to PSI • R5. Stimulating the private sector to act • R6. The Economic Case • R7. Specific provisions of the Directive • 4.2 Practical initiatives to create ‘asset registries’ or other PSI infrastructures supporting re-use should be supported at national level and where cross-border in nature, at European level. Initiatives of this kind should incorporate rights expression facilities including the ability for the user to identify applicable licences and should build on the potential gains being made in Semantic Web and Web 2.0 technologies.
Information Directory: history • 2000: Information Directory was developed as a metadata catalogue for Regione Piemonte’s information resources (on the intranet) • 2008: Information Directory as a PSI Asset Register (on the internet) • Added to the asset list of SEMIC.eu
Information Directory as a PSI asset list • PSI Asset Register • Starting point: InfoDir • selection of databases • possibility of searching for reusable databases only • selection of metadata with end users in mind • possibility of consulting the associated licence (reusable databases only) • http://www.sistemapiemonte.it/innovazionetecnologia/infodir/index.shtml
Information Directory: organisational view • Back-office (metadata entry and update): CSI metadata operator and Public Sector metadata operator, via Web application + XML; automatic update procedures • Metadata DB • DocDigger Controller and Indexer; indexed contents on file system • Web server • Web application Information Directory: multi-access, multi-layout, public and private • Piemonte Region: 1. selects the data to publish in the internet catalogue 2. selects the re-use databases • Re-use data licences • Citizens and firms; Piemonte Region
New release • “Licenze” application • Development of the “Licenze” application, which plugs into Information Directory and allows registered users to accept a dynamically composed licence (according to the user’s profile and the type of reuse) and to access a private area for downloading regional data in an open, standard format
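How a licence might be composed dynamically from the user's profile and the type of reuse, and how acceptance could gate the download area, can be sketched as follows. Everything here is an assumption made for illustration: the profile names, the reuse types, the clause wording and the `DownloadArea` class are hypothetical and are not taken from the “Licenze” application or from the approved licence model.

```python
# Placeholder clauses, not the text of the regional licence model.
COMMON_CLAUSES = ["The source of the data must be acknowledged."]

CLAUSES_BY_PROFILE = {
    "citizen": ["Licensee: private individual."],
    "firm": ["Licensee: company."],
}

CLAUSES_BY_REUSE = {
    "non_commercial": ["Reuse is limited to non-commercial purposes."],
    "commercial": ["Commercial reuse is allowed, including integration into new services."],
}


def compose_licence(profile, reuse_type):
    """Assemble the clause list for one user profile and one type of reuse."""
    if profile not in CLAUSES_BY_PROFILE:
        raise ValueError(f"unknown profile: {profile}")
    if reuse_type not in CLAUSES_BY_REUSE:
        raise ValueError(f"unknown reuse type: {reuse_type}")
    return COMMON_CLAUSES + CLAUSES_BY_PROFILE[profile] + CLAUSES_BY_REUSE[reuse_type]


class DownloadArea:
    """Private area: downloads are allowed only after the licence is accepted."""

    def __init__(self):
        self._accepted = set()

    def accept(self, user_id, profile, reuse_type):
        licence = compose_licence(profile, reuse_type)
        self._accepted.add(user_id)
        return licence

    def can_download(self, user_id):
        return user_id in self._accepted


area = DownloadArea()
for clause in area.accept("user-1", "firm", "commercial"):
    print("-", clause)
print(area.can_download("user-1"), area.can_download("user-2"))
```

Building the licence from small clause tables means a new profile or reuse type only needs a new table entry, which matches the idea of a licence model used to derive individual supply licences.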
Licences and accommodation facilities • First regional data released for reuse • First case of raw data released for reuse by means of a designated licence: the Accommodation Facilities Registry • The supply: daily updated XML files for about 5000 hotels and other accommodation facilities, with about 500 features and services for each • Currently, Piemonte Region intends to disseminate this data free of charge for every re-use purpose
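A re-user could load such a daily XML file into their own information system roughly as sketched below. The element and attribute names (`facilities`, `facility`, `name`, `feature`) and the sample content are invented for the example; the actual regional XML schema is not shown on the slide and will differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample file; the real regional schema is not reproduced here.
SAMPLE = """<facilities>
  <facility id="1" type="hotel">
    <name>Example Hotel</name>
    <feature name="parking">yes</feature>
    <feature name="rooms">20</feature>
  </facility>
  <facility id="2" type="campsite">
    <name>Example Campsite</name>
    <feature name="parking">no</feature>
  </facility>
</facilities>"""


def parse_facilities(xml_text):
    """Turn the XML supply into a list of plain dictionaries."""
    root = ET.fromstring(xml_text)
    facilities = []
    for element in root.findall("facility"):
        facilities.append({
            "id": element.get("id"),
            "type": element.get("type"),
            "name": element.findtext("name"),
            # Each facility carries a variable set of features and services.
            "features": {f.get("name"): f.text for f in element.findall("feature")},
        })
    return facilities


for facility in parse_facilities(SAMPLE):
    print(facility["name"], facility["features"])
```

Storing features as a name-to-value mapping keeps the loader unchanged when a facility reports only some of the possible features and services.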
Future Developments • Increasing the flow of public data supplied for reuse, each supply equipped with a licence and its own regional standard XML schema for data dissemination (for example, PS address list, schools, museums, local public transport) • Developing a proper reuse portal, with information, news and tools for interactive activities and automatic updating • Monitoring the success and use of data released for reuse
Future Developments • Application of Data Quality certification to InfoDir, in order to link quality information to data set supply • Design and implementation of a Data Quality Firewall, in order to manage data quality in a service architecture • Introduction of an ontology-based semantic layer, to improve information search and comprehension • Participation in CALL5 - Intelligent Information Management, together with Milan Bicocca University and other partners, with the RIMMEL4US project proposal
Future Developments • Promoting activities • promotion (together with other PAs) of a regional model for public data reuse (best practice): • at local level (dissemination of the model to Provinces, Municipalities, ...) and national level (reuse portal) • at international level (for example on the topic of building the database catalogue; the general impression is that Piemonte is ahead in this respect, in both technology and processes) • promotion of the available tools to private operators, by means of public presentations, use and comparison
Thank you for your attention! Marta.Garabuggio@regione.piemonte.it Andrea.Muraca@csi.it