1 / 14

Patstat beyond Europe

Patstat beyond Europe. By Gianluca Tarasconi Madrid, 9/12/2010. An insight into Patstat data from patent authorities other than EP O. What is PATSTAT.

enye
Télécharger la présentation

Patstat beyond Europe

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. PatstatbeyondEurope By Gianluca Tarasconi Madrid, 9/12/2010 An insight into Patstat data from patent authorities other than EPO 1

  2. What is PATSTAT PATSTATstandsforEPO Worldwide PatentStatistical Database.Contains a snapshot of the EPO master documentation database (DOCDB) which contains data of about 90 national and international patent offices with different degree of coverage.Data include bibliographic data, citations and family links. This database is designed to be used for statistical research and requires the data to be loaded in the customer's own database. http://www.epo.org/patents/patent-information/raw-data/test/product-14-24.html http://forums.epo.org/epo-patstat-faqs/ 2

  3. Non EPO data vs APE-INV Name Game Data from other patent authorities may help in: Validate algorithms against other spellings/conventions; Fill missing/correct data (FI address/city) using data from equivalents; Use Patent Family(1) data to improve algorithms using other data to give a similarity score; (1) For a list of patent family definitions see : C. Martinez Insight into Different Types of Patent Families, STI Working Paper 2010/2 3

  4. Example (I): inpadoc family # 75, Mr Roberts 6 different spellings for name, 3 different addresses In this case name and city are better parsed in US equivalent patent data; 4

  5. Example (II): inpadoc family # 88, Mr Newman WO patent data confirm that correct address is 43111 Robbins street US patent tells us A. stand for Antony 5

  6. What countries (I) Patstat contains 92 application authorities; 45 are inside Europe; 47 are outside Europe; Contains regional/international authorities (WIPO; ARIPO…); Contains also ‘terminated’ authorities (DDR, URSS) 6

  7. What countries (II) 7

  8. What dimensions are relevant A) data coverage (% of coverage by year) Are data from patent authority X 100% included into Patstat from year W to year Z ? B) Data transmission delays How long does it take a non EPO patent to reach in PATSTAT? C) Completeness of geographic data How is quality (and coverage) of address / city / country code ? 8

  9. Data coverage (I) EPO gives partial informations http://www.epo.org/patents/patent-information/data-quality.html http://www.epo.org/patents/patent-information/raw-data/useful-tables.html Total number of applications is given but not the % of total (EPO gives what it gets) 9

  10. Data coverage (II): example on India In patstat are reported from EPO 66219 Indian applications Indian Patent office reports 28.882 applications filed only for 2006 10

  11. Data Transimission delays (I) We study time series 2003- 2008 for BR, CN, JP, DE, KR and IN compared to EP; Graph differences suggest publication lags and data transmission lags differ from country to country; Timeseries may also highlight ‘holes’ or changes of population (FI USPTO from 2000 onward) 11

  12. Data Transimission delays (II) 12

  13. Completeness of geographic data Table for the TOP 20 by inventor count; 13 authorities have more than 80% of records with no country code; 12 authorities have 0% of address/city; Anyway in many cases address data are inside first name field (FI: DE) (data from patstat 09/2009) 13

  14. Conclusions Non EPO havecoverage, quality and ‘spelling’ thatmaychange a lotfrompatent authority topatent authority; Data can beusedasaddictional source of information butnotasmain source (BONUS not MALUS); EPO couldprobablyimprovequalityofthis data, especiallyadd more addresses (FI in april 2011 willrelease WO address data) is up tousersdemand more on thistopic. 14

More Related