
Patent Track @ CLEF

Patent Track @ CLEF. John Tait, Chief Scientific Officer, IRF. The IRF Mission: to bridge the gap between information retrieval research and the world of professional search, especially in patents and intellectual property; to promote open research on very large scale information retrieval.


Presentation Transcript


  1. Patent Track @ CLEF. John Tait, Chief Scientific Officer, IRF

  2. The IRF Mission • To bridge the gap between information retrieval research and the world of professional search, especially in patents and intellectual property • To promote open research on very large scale information retrieval • To make available a facility that enables large-scale information retrieval and in-depth processing of patent and other complex data.

  3. Patents – General. Intellectual Property (IP): Across the world there are about 60 million patents, and the number is growing rapidly. Patent documents form the most important shared information pool: • Knowledge and research • Innovative capacity and commercial strength • Legal information. 80% of the world's technical-scientific knowledge can be found in patent documents; in some branches of industry the share is significantly higher still.

  4. Patents – Commercial importance. Intangible Assets: • Innovation improves competitiveness, creates jobs, promotes growth and secures prosperity. • The only valid and binding instrument to protect innovation • An important commercial asset: a monopoly on the use of an invention • The issuing of licences has become a significant revenue source for many companies

  5. Distinctive Patent Search Characteristics • High Recall: a single missed document can invalidate a patent • Session based: a single search may involve days of cycles of results review and query reformulation • Defendable: process and results may need to be defended in court

  6. Matrixware • Established in 2005 • Headquarters in Vienna • Has over 70 employees: an expert team of software developers, technicians, mathematicians, language experts and other specialists • Field of activity: Information Retrieval in the segment of Intellectual Property • Products: innovative solutions for searching and categorising patent data

  7. Committed to providing • Sample from Alexandria patent database • Leonardo • Eclipse-based IR open development platform • Populated with various tools • General IR • NLP • MT • UI • But not necessarily in time for CLEF 2009

  8. Patent Retrieval Distinctive Problems

  9. Patent Process

  10. Types of Patent Search • Patentability • Validity • Clearance (Freedom to Operate) • Infringement • State of the Art • Patent Landscape • Types 1-3 depend on prior art search

  11. Very High Recall • Any prior publication will invalidate a patent • Other patents including lapsed • Scientific Publication • Comics ???!!!
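The recall requirement on the slide above can be made concrete with a small calculation. The following is a minimal sketch, not part of the original talk: the document identifiers are hypothetical, and the helper names `recall` and `missed` are assumptions introduced for illustration. It computes the fraction of known relevant prior-art documents that a search returned, and lists the ones it missed.

```python
def recall(retrieved, relevant):
    """Fraction of the relevant documents that appear in the retrieved set."""
    relevant = set(relevant)
    if not relevant:
        raise ValueError("recall is undefined when there are no relevant documents")
    return len(set(retrieved) & relevant) / len(relevant)


def missed(retrieved, relevant):
    """Relevant documents the search failed to return, sorted for stable output."""
    return sorted(set(relevant) - set(retrieved))


# Hypothetical document identifiers, for illustration only.
relevant_prior_art = {"P1", "P2", "P3", "P4"}
search_results = ["P1", "P3", "P4", "X9"]

print(recall(search_results, relevant_prior_art))
print(missed(search_results, relevant_prior_art))
```

In a patent setting the list of missed documents arguably matters more than the ratio itself, since a single overlooked publication can be enough to invalidate a patent.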

  12. Session Based • Patent Professionals Searching • Often spend 2 or more days on one search • May review more than 1000 results • Work with other professionals (lawyers, chemical engineers, chemists, marketing etc.) • Have to record and defend search process to clients and courts

  13. Classification • All patents are classified • IPC • Automatic Classification Possible • People search for Gaps

  14. Multilingual • A Russian patent can invalidate a British patent • Complex and changing patterns of filing language • Patents come in families • Same idea: different jurisdictions and languages • MT already widely used
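The idea of patent families (one invention filed in several jurisdictions and languages) can be sketched as a simple grouping step. This is an illustrative assumption rather than anything shown in the talk: the record layout, the publication identifiers and the family identifiers are all hypothetical.

```python
from collections import defaultdict

# Hypothetical records: (publication id, family id, filing language).
records = [
    ("GB-001", "FAM-A", "en"),
    ("RU-002", "FAM-A", "ru"),
    ("JP-003", "FAM-B", "ja"),
]


def group_by_family(records):
    """Collect the publications belonging to each family, in input order."""
    families = defaultdict(list)
    for pub_id, family_id, language in records:
        families[family_id].append((pub_id, language))
    return dict(families)


families = group_by_family(records)
for family_id, members in families.items():
    print(family_id, members)
```

Grouping results this way lets a searcher treat a Russian and a British filing of the same invention as one hit instead of reviewing them separately.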

  15. Filing Languages • English continues to be the dominant language • Chinese is the most rapidly growing language and may surpass English shortly (China now bigger than US) • Activity in India is growing rapidly but looks set to be English dominated • Cyrillic languages, especially Russian, are also rising rapidly • Japanese and Korean are very important • German and French are important but declining relatively • Spanish is underrepresented relative to its number of speakers worldwide • “Minor” European languages are declining rapidly

  16. PAIR 08 • CIKM Workshop • http://www.ir-facility.org/events/pair08 • Includes the proposed TREC Chemistry Track and a proposal from Erik Graf and Leif Azzopardi from Glasgow on automatic test collection creation

  17. Break Out Session • Meeting Room 1 • Through tunnel at end of corridor • 118 Mødelokale 1.1 • Areas for Discussion • Test Collection Creation • Task(s) • Evaluation Methodologies • Organizational Issues • Future Developments

  18. Thank you for your attention. Any questions? Mailing List Subscription: http://tinyurl.com/clef-ip • www.ir-facility.org • www.matrixware.com
