1 / 13

Open Relevance Project

In the beginning…. Open Relevance Project. Lucene Connector Framework. SF Lucene Meetup. Agenda. Announcements and Introductions Solr 1.4 and Data Discovery – Hoss Tips and Tricks with Analyzers – Tom Hill Optimizing Solr for performance and User Experience – Brian Doll

landon
Télécharger la présentation

Open Relevance Project

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. In the beginning… Open Relevance Project Lucene Connector Framework

  2. SF Lucene Meetup

  3. Agenda • Announcements and Introductions • Solr 1.4 and Data Discovery – Hoss • Tips and Tricks with Analyzers – Tom Hill • Optimizing Solr for performance and User Experience – Brian Doll • Spatial Search with Lucene and Solr - Grant

  4. Introductions and Announcements • Intro • Haiti Fundraiser/Raffle • http://forms.lucidimagination.com/go/lucidimagination/donate2haiti • Lucene Connector Framework in Incubator • http://incubator.apache.org/connectors

  5. Spatial Search

  6. Introduction • What is it? • Use Cases • Background Knowledge • Implementing • Status • Resources

  7. What is It? • Spatial search is the merging of unstructured content with spatial information to produce search results specific to one or more locations • In other words • “Find me all the Starbucks selling double espressos within 5 miles of my current location” • Lucene/Solr solve specific spatial problems, but doesn’t provide a general purpose GIS solution

  8. Use Cases • Asset Management • “Dude, where’s my stuff?” • Social Networking • Find all friends near me • Targeted, local search results and ads • “restaurants in Austin Texas” • “Starbucks, 55313” • Business Intelligence • Restrict doc set for analysis by location

  9. Spatial Search Concepts • Spatial Data Types • Points (latitude/longitude) • Lines • Shapes • Maps and overlays • Streets, POI • Integration with unstructured text • Metadata, descriptions, user reviews, etc. http://www.openstreetmap.org/?lat=44.9744&lon=-93.2484&zoom=14&layers=B000FTFT

  10. Implementation Needs • Indexing support for data types • Query Parsing • Efficient distance calculations • Euclidean, Great Circle (Haversine), Vincenty’s • Filtering • Bounding Box • Sort by Distance • Relevance Enhancement • Faceting • Advanced: shape intersections, routes

  11. Things you can do now in Lucene/Solr • Distance Functions • Solr has: haversine (great circle), Euclidean, Manahattan (others) • Filter • Frange, Cartesian Tier/Tile coming soon, Lat/Long Range • Boost Score • http://localhost:8983/solr/select/?q=_val_:"recip(dist(2, store, point(34.0232,-81.0664)),1,1,0)"&fl=*,score • Sort • SOLR-1297 added support for sorting by Function (any function) • Contrib/Spatial has similar support

  12. What’s next/needed • Faceting • “Generic” Filtering support • Function Query results as fields in output • Query Parsing Support • Geocoding, etc. • Shapes, lines, intersections?

  13. Resources • https://issues.apache.org/jira/browse/SOLR-773 • Several issues in Lucene for improvements • http://wiki.apache.org/solr/SpatialSearch • http://www.nsshutdown.com/projects/lucene/whitepaper/locallucene_v2.html • http://www.ibm.com/developerworks/opensource/library/j-spatial/index.html • http://www.lucidimagination.com/search/?q=spatial

More Related