1 / 18

Inventor Mobility Index

Inventor Mobility Index. Thorsten Doherr Zentrum für Europäische Wirtschaftsforschung Center of Economic Research, Mannheim Germany. Mission. Problem:. Two inventors with the same name are not neccessarily the same person

sivan
Télécharger la présentation

Inventor Mobility Index

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Inventor Mobility Index Thorsten Doherr Zentrum für Europäische Wirtschaftsforschung Center ofEconomic Research, Mannheim Germany

  2. Mission Problem: • Twoinventorswiththe same nameare not neccessarilythe same person • Defining an inventor only by its name results in too much false mobility especiallyforinventorswithcommonnames • Restricting the definition too much (i.e.: name and home address) will cancel any mobility Mission: Youhavetodecidewethertwopatentsfrominventorswiththe same nameareactuallyfromthe same personorfrom different personsthatsharethe same name Tools: The complete patent data

  3. Plausibility Rules Twoinventorswiththe same namearethe same person… • if they are inventing for the same applicant • if they have the same home address • if they are working with the same co-inventors • if one is citing the other • if they have patents in the same area of technology (ipc) Inventor: A singleinventorentry in a patent document Person: All inventors with a specific name that are linked by at least one plausibility rule

  4. HarmonizationofApplicants SearchEngine The SearchEngine is an in-house developed software package specialized in company address matching. It implements the following steps: • Normalizing of the search fields (company name, address fields) by transforming them to uppercase, replacing special letters to their common (phonetic) representation (i.e.: Ü  UE, ß  SS), compressing abbreviations (i.e.: S.P.A.  SPA) and replacing special characters with blanks • Creating a dictionary containing all the words of the search fields along with their occurrence. To preserve the context, every search field has its own chapter. The occurence is the base for the heuristic search algorithm. There are also supporting tables that link the dictionary entries back to the company table. • The search algorithm separates a search term into words. Each word is associated with the occurrence counter of the appropriate dictionary entry. The occurrence reflects the identification potential of the word. A low occurrence has a high identity, because the resulting list of potential hits is small.

  5. HarmonizationofApplicants ExampleoftheSearchEngineAlgorithm

  6. HarmonizationofApplicants Finalization • A cutofflimitfortheidentityisappliedtofilter all results (i.e. 90%) • The resulting list of matching pairs is not symmetric: A can be linked to B but it is not required that B is linked to A  linked pairs create a network • Network Analysis: if A is linked to B and B is linked to C, the analysis identifies the group A,B,C • Re-iteration of the network analysis for too large groups with an increased cutoff limit for their members.

  7. Harmonizationof Inventor Names • The SearchEngine is of limited use because… • it is most efficient with search terms consisting of multiple words • the main problem are typing errors and misspellings • Creating phonetic representations of the name using the Metaphone algorithm by Lawrence Philips, 1990 • Phoneticalgorithmscreateuniquerepresentationsforsimilarsoundingwords (names) andcanbeindexed directdatabaseaccess • Originallytheresultstheydeliveredweremanuallyvalidatedbecauseoftheir strong tendencyforfalse positives automatedmatchingrequires an automatedvalidationprocess • Automated comparison of the retrieved names with the searched name • The function is based on the least relative character position deltas and requires two words as parameters  can not beusedforindexbaseddirectaccess • Needs phoneticindexingtoquicklygenerate a listof potential candidates • Tolerancefortypingerrorsincreaseswiththelengthofthewords longerwordsaremorepronetotypingerrors

  8. Harmonizationof Inventor Names Example for the Metaphone Search

  9. Harmonizationof Inventor Names ExamplefortheLeast Relative Character Position Deltas + + + + 1.875 + + + + = = + = 1-

  10. Plausibility Rules Twoinventorswiththe same namearethe same person… • if they are inventing for the same applicant. • if they have the same home address. • if they are working with the same co-inventors. • if one citing the other. • if they have patents in the same area of technology (ipc). Inventor: A singleinventorentry in a patent document. Person: All inventors with a specific name that are linked by at least one plausibility rule.

  11. All Patents of an Inventor Name 9 8 11 10 7 12 3 15 14 6 2 16 13 4 17 18 5 1 22 19 21 20

  12. The Same Applicant Rule 9 8 11 10 7 12 3 15 14 6 2 16 13 4 17 18 5 1 22 19 21 20

  13. The Same Home Address Rule 9 8 11 10 7 12 3 15 14 6 2 16 13 4 17 18 5 1 22 19 21 20

  14. The Co-Inventor Rule 9 8 11 10 7 12 3 15 14 6 2 16 13 4 17 18 5 1 22 19 21 20

  15. The Citation Rule 9 8 11 10 7 12 3 15 14 6 2 16 13 4 17 18 5 1 22 19 21 20

  16. The IPC Rule 9 8 11 10 7 12 3 15 14 6 2 16 13 4 17 18 5 1 22 19 21 20

  17. Italian Inventor Mobility Index 123356 patents from Italian applicants and inventors 49101 different harmonized inventor names 60268 nodes after applying the same applicant rule 53316 nodes after applying the same home address rule 53572 nodes after applying the co-inventor rule 52504 nodes after applying the citation rule 50276 nodes after applying the ipc rule Main Database: Espace Bulletin (March 2010), EPO Citations: Patstat(September 2010), OECD Development: Microsoft Visual FoxPro 9.0

  18. Traversal of a Network Table 8 7 3 6 2 4 5 1

More Related