510 likes | 683 Vues
SharePoint 2010 Search. SharePoint 2010 Search Overview. The SharePoint Strategy. Web Portals (Intranet and Internet). Divisional Solutions. User Base. Departmental Solutions. Collaboration Sites (Team Sites). Personal Sites (My Sites). Employee Empowerment.
E N D
The SharePoint Strategy Web Portals (Intranet and Internet) • Divisional Solutions User Base Departmental Solutions Collaboration Sites (Team Sites) Personal Sites(My Sites) Employee Empowerment
Enterprise Search Product Portfolio in SharePoint 2010 Solutions for Internet Business Solutions for Business Productivity FAST Search for SharePoint Internet Sites FAST Search for SharePoint Integrated with SharePoint SharePoint Server for Internet Sites SharePoint Server FAST Search For Internet Business FAST Search for Internal Applications Stand-alone Search Server Entry-Level Solutions Search Server Express
SharePoint Search and FAST Search SharePoint Server Search FAST Search for SharePoint + Great core search experience • Unique social search features • Broad content reach including BDC Industrial strength platform Strong developer experience • Conversational and rich user experience • Precise control of the user experience • Enhanced content processing capabilities • Deep platform flexibility and scale • Highly configurable and extensible Key Capabilities + OOB intranet + site search OOB people + expertise search Custom search verticals + Target Applications • High end (custom) search applications APIs and developer experience End user and site administrator enablement SharePoint platform integration Social search features and integration Admin & deployment capabilities Operations advantages (SCOM, scripting) Common Developer and IT Pro foundation
New Capabilities: End User and IT • User experience • Social search • Deployment and manageability • Customization and extensibility
End-User UI • One-stop Search Center • Scopes, web parts, refinements, best bets, top answers , advanced search • Query federation • Core search experience • Metadata, taxonomy and social tags based results refinement • Did you mean • Query suggestions • View in browser • Improved query syntax • Improved relevance ranking • Enhanced multi-lingual support • New form factors
Great Search Experience OOB • Get morerelevantresults • Find informationfaster • Search from anywhere Win7 Connector Related searches Launch in Office Web Apps Refinement panel Federated results
Social Search • People finding experience • Front door to the Office social network • Better expertise & interest search • Email mining to bootstrap profiles with interests and colleagues • “Address book style” search • Phonetic name matching • Nickname matching • Relevance models tuned specifically for people search • Metadata refinement, better hit highlighting, recently authored content • Social behavior drives search quality • Relevance ranking • Query suggestions • Social tagging influence
Amplify the Impact of Knowledge & Expertise • Connect with expertise • Improve relevance with use • Find people Recently authored content Refine by focus, expertise, and other attributes Expertise identification Phonetic and nickname matching
FAST Search in SharePoint • Extends user experience • Conversational • Visual • Refined • Contextual • Flexible • Scalable • …FAST!
Search Experience • Visual cues and calls to action • Thumbnails • Visual Best Bets • Open result in Browser • Exploration and Navigation • Refiners • Sort flexibility • Search similar • Did you mean • Context • User Context • Multiple relevance ranking
How Does FAST Add Value • Strong linguistic coverage • Configurable content pipeline • Property extraction creates metadata
FAST Search Differentiators • Features Unique to FAST Conversational User Experience Precise Control Content Processing Platform • Visual Cues • Thumbnails & Previews • Visual Best Bets • Exploration • Deep refiners • Similarity Search • Multilevel Sorting • …on any property • Context-Based Results • Promote/Demote • User Context • Relevance Control • Multiple profiles • Word/Term control • Custom/Extensibility • Property extraction • Prebuilt • Custom/Extensibility • Content coverage • 400+ document types • 82 languages • Advanced Linguistics • Offensive content filter • Extreme scale • in 3 dimensions • at best footprint • Query Expressiveness (FQL) • Relevance and sorting • Symbol & proximity search Features Provide the best, most productive, user experience Maximize productivity through control Make the most of all your content Tackle the most demanding search challenges Benefit The best search from Microsoft Conversational, Visual, Actionable Extracted metadata -> better search Social Search (Integrated w/ SharePoint) One platform for many applications Scale as you grow with no few limits
Richer Manageability • Consolidated administration UI dashboard • Automated service password management • PowerShell support • Built-in system health monitoring • Built-in and extensible search analytics
Connectors - Query and Index • Common strategy • Indexing connectors • Query federation connectors • Indexing connectors • SharePoint, Web/HTTP, File share, Exchange PF, Lotus Notes DBs supported out of box • Business Connectivity Services (BCS) framework for adding indexing connectors including LOB • Deprecating protocol handlers • Query federation
Connector Framework • Support for attachments • Item level security • Crawl through entity associations • Inline caching and Batching for better citizenship • Richer crawl options
Search page points to Profile Page or custom page to show search result Search results page – results based on metadata Search user queries Connector architecture Search Engine/Index BCS Profile Pages or External Lists Search BCS Runtime Execution call routing through Business Data Connectors using model file Fetch data from repository Databases WCF Protocol Handlers .NET Assembly Connector Custom Connector
Search Extensibility: UI and Federation • Change Web Part properties – No Code • Extend OOB Web parts programmatically • Extend Search Center with vertical experiences • Query Federation
“Out of the Box” Federation • People Search and Related Searches • Bing and other OpenSearch providers • Windows 7 and IE8 searching SharePoint 2010 • Rich RSS feeds and federation generator
Architecture and Design • Deployment and management • Scale-Out architecture • Introduction to concepts • Scale-out features and options • Other engine enhancements
Deployment and Management • Industrial Strength Search at High Scale • Simplified Installation, Topology Design, and Deployment • Easier Administration • Better Monitoring
Scale-Out Architecture • 2010 Core Engine tenets: • Sub-second query latencies at large scale • Fresher indexes • Better resiliency/higher availability • Basic philosophy • Componentize the system • Remove system bottleneck through scale-out
Search Technology Concepts Concepts Search Center - UI for users to issue queries and interact with results OpenSearch Source Query Object Model Query Servers- Accept query requests from users and return results Query Servers Query Federation - Return results from non-SharePoint Indexes Index Partition Indexing - Extract information from items to enable efficient matching Indexer Index Partition - Subsett of the overall index Scaling Crawling - Traverse URL space to record items in search catalog Crawler Connectors - Know how to process different content sources Content Sources - Host the content we want to return in main results Content • Content • Content
SharePoint Scale-out Multiple Indexers Stateless Crawlers Crawl Distribution Query Mirroring Query Partitioning Multiple Property DBs Admin Database + Admin Component
Content Distribution • Crawl Distribution • Built-in load balancer • Crawlers crawl content that is covered by crawl database • Default configuration can be overwritten using host distribution rules • Query Distribution • Low query latency • Distribution by hash of Document Id • Crawlers partition indexed data and propagate to query servers
Industrial Strength Resiliency • 2007 style mirroring for index partitions • Ability to add multiple crawlers to minimize crawl downtime • Native support for SQL mirroring
Content Processing Flow End Users OpenSearch Source • Data moves from content source to end user queries • It gets crawled, processed and refined, an index is created • User executes queries and retrieves data, metadata, and federated search results Content Query Processor Indexer Content Processor Crawler Federation Index Partition Search Center Profiles User Context Relevance Control Metadata Indexing Connectivity
FAST Search for SharePoint Scale Out • Scale-out in different “dimensions” • Query Volume • Content Volume • Processing power • Indexing freshness • Redundancy options • Search • Indexing • Performance targets* • 30 mDocs/node • 50 QPS/node • 35 docs/sec * Dependent on document and hardware characteristics
Content Gathering • Document Processing • Search FAST Search CPU Usage • Administrative services • Enterprise Crawler • Content distributor • Index/Search • Administrative • Name service • Web Analyzer • Content processor(s) • Query result processor • Security access module (SAM) • Indexing dispatcher • Database Connector Key Heavy Moderate Light
Content Gathering • Document Processing • Search FAST Search Memory Usage • Administrative services • Enterprise Crawler • Index/Search • Content distributor • Administrative • Name service • Web Analyzer • Content processor(s) • Query result processor • Security access module (SAM) • Indexing dispatcher • Database Connector Key Heavy Moderate Light
Content Gathering • Document Processing • Search FAST Search Disk Usage • Administrative services • Enterprise Crawler • Content distributor • Index/Search • Administrative • Name service • Web Analyzer • Content processor(s) • Query result processor • Security access module (SAM) • Indexing dispatcher • Database Connector Key Heavy Moderate Light
Server Planning • You can use your current 64-bit hardware • Enterprise class servers: • 2U > 6 disks SAS • 2xQuad Core CPUs (mid-range OK) • Upgrade the memory to 32GB or more for servers hosting virtual environments • Upgraded RAID controller: • Battery Backed Write Cache will improve performance and reliability
Disk Arrays • Direct attached storage provides the best performance for the cost • 100GB+ • IE: 2x146GB SAS drives RAID 1 • 1TB, 2000 IOPS • IE: 8x146GB 10K RPM SAS drives RAID 5 • 3TB, 3000 IOPS • IE: 12x300 GB 10K RPM SAS drives RAID 5/50
SharePoint 2010 Search – Small Topology Web Server Query Server Web Server Query Server INDEX PARTITION 1 QUERY COMPONENT 1 QUERY COMPONENT 1M Crawler Server Administration Crawler Database Server Crawl db Property db Search Admin db
SharePoint 2010 Search – Medium Topology Web Server Query Server Web Server Query Server INDEX PARTITION 1 QUERY COMPONENT 1 QUERY COMPONENT 1M INDEX PARTITION 2 QUERY COMPONENT 2 QUERY COMPONENT 2M Crawler Server Crawler Crawler Server Admin Crawler Database Server Other farm Dbs Database Server Crawl db Property db Search Admin db