SharePoint 2007: Search Deep Dive Erik Mau Inetium
Agenda • Search Overview • User Experience • Extensibility • Administration • Scenarios • Q&A
Topics • Topology • High-level Architecture • Performance • WSS vs. MOSS Search Comparison • Search Server 2008
Topology • Single Server • Separate Index Server with a Web Front End (WFE) playing the Query Role • Separate Index Server and separate Query Server • Index Server can crawl itself or a specific WFE
Architecture • Index Engine • Query Engine • Protocol Handlers • IFilters • Content Index • Property Store • Search Configuration • Wordbreakers
Performance • Change Log / Security Change Only Crawl • Continuous Propagation • Tens of millions of documents per indexer • WSS sites use MOSS Search
WSS vs. MOSS Search Comparison [comparison table] * Knowledge Network is a separate product ** BDC requires the Enterprise CAL
Topics • Search Center • Search Usability Features • Power User Features • Demo
Search Center • Site Template • Tabbed Interface • Multiple Entry-points • Keywords, Definitions, and Best Bets • Extensible
Search Usability Features • Hit Highlighting • “Did you mean” feature • Duplicate collapsing
“Power User” Features • Property-based Searches • Customer:Acme (find all items where Customer metadata = Acme) • Author:"Sally Jones" (find all items authored by Sally Jones) • ContentType:task (find all items of a specific content type) • Site:http://www.microsoft.com (find all items within a specific site) • Keyword Searches • +Solution (keyword inclusion) • -Plumtree (keyword exclusion) • Quotes for Exact Match (e.g., "User Group")
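The query syntax above can also be assembled in code before it is handed to the search box or the Search API. The following is a minimal sketch, not part of the SharePoint API: `KeywordQueryText`, `Quote`, and `Build` are hypothetical names chosen for illustration, and the sketch only concatenates the property filters, inclusions, and exclusions shown on the slide.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper (not a SharePoint class): builds keyword query text
// from the building blocks listed on the slide.
public static class KeywordQueryText
{
    // Wraps a property value in double quotes when it contains a space,
    // so that Author:"Sally Jones" is treated as a single value.
    public static string Quote(string value)
    {
        return value.IndexOf(' ') >= 0 ? "\"" + value + "\"" : value;
    }

    public static string Build(
        IDictionary<string, string> propertyFilters,
        IEnumerable<string> includedTerms,
        IEnumerable<string> excludedTerms)
    {
        List<string> parts = new List<string>();

        // Property-based filters: Property:value
        foreach (KeyValuePair<string, string> filter in propertyFilters)
        {
            parts.Add(filter.Key + ":" + Quote(filter.Value));
        }

        // Keyword inclusion: +term
        foreach (string term in includedTerms)
        {
            parts.Add("+" + term);
        }

        // Keyword exclusion: -term
        foreach (string term in excludedTerms)
        {
            parts.Add("-" + term);
        }

        return string.Join(" ", parts.ToArray());
    }
}
```

Called with the filter Author = Sally Jones, the included term Solution, and the excluded term Plumtree, `Build` returns `Author:"Sally Jones" +Solution -Plumtree`.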
Topics • Search Web Parts • Search Tabs • Search API • Demo
Browser-Based Extensibility Options • Search Web Parts • Search Box / Advanced Search • Search Summary • Search Statistics • Search Paging • Search Actions • Search Best Bets / High Confidence Results • Search Results • Search Tab Pages
Search Input Web Parts • Search Box • Scope options • Additional query terms • Advanced Search Page • Search Results Page • Advanced Search • Input types • Properties (metadata) • Search Results Page
Search Utility Web Parts • Search Summary • “Did you mean” • “You searched for” • Search Statistics • Search Paging • Next / Prev Text • Next / Prev Images • Search Action Links • Determine what links to display
Search Best Bets / High Confidence Results • Same web part, different configuration • XSLT for display • Show Keywords? • Show Best Bets? • Show High Confidence Matches • Best Bet Limit
Search Core Results • Results per Page • Enable removal of duplicates • Enable stemming • Selected Columns • Configurable XSLT • Scope
Search Tabs • Based on lists in the Search Center site • Tabs in Search Pages • Tabs in Search Results • Process: • Create Entry Page • Create Results Page • Configure Web Parts on Entry and Results Page • Add Entry Page to Tabs in Search Pages • Add Results Page to Tabs in Search Results
Search API • KeywordQuery • Allows basic property-based and free-text queries • Namespace: Microsoft.Office.Server.Search.Query • Example:
    // "site" is assumed to be an SPSite that has already been opened
    KeywordQuery query = new KeywordQuery(site);
    query.SelectProperties.Add("Customer");
    query.QueryText = "Author:Erik SharePoint";
    ResultTableCollection results = query.Execute();
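The KeywordQuery example can be expanded into a complete method that also reads the results. This is a sketch under stated assumptions: it must run on a server in a MOSS 2007 farm with references to Microsoft.SharePoint.dll and Microsoft.Office.Server.Search.dll, the site URL is a placeholder supplied by the caller, and `Customer` is the custom managed property from the slide, so it has to exist in the search schema. The class and method names are illustrative only.

```csharp
using System.Data;
using Microsoft.SharePoint;
using Microsoft.Office.Server.Search.Query;

public static class KeywordQuerySample
{
    // siteUrl is a placeholder: pass the URL of a site collection in the farm.
    public static DataTable Run(string siteUrl)
    {
        using (SPSite site = new SPSite(siteUrl))
        {
            KeywordQuery query = new KeywordQuery(site);

            // Ask for the ranked result set.
            query.ResultTypes = ResultType.RelevantResults;
            query.QueryText = "Author:Erik SharePoint";

            // Managed properties to return for each hit.
            query.SelectProperties.Add("Title");
            query.SelectProperties.Add("Path");
            query.SelectProperties.Add("Customer"); // custom managed property (assumed to exist)
            query.RowLimit = 10;

            ResultTableCollection results = query.Execute();
            ResultTable relevant = results[ResultType.RelevantResults];

            // ResultTable is an IDataReader, so it can be loaded into a DataTable.
            DataTable table = new DataTable();
            table.Load(relevant, LoadOption.OverwriteChanges);
            return table;
        }
    }
}
```

Each row of the returned table is one search hit, with a column per selected property.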
Search API • FullTextSqlQuery • Allows highly customized queries written in the Microsoft full-text search SQL syntax • Namespace: Microsoft.Office.Server.Search.Query • Example:
    // "site" is assumed to be an SPSite that has already been opened
    FullTextSqlQuery query = new FullTextSqlQuery(site);
    query.QueryText = "SELECT Customer, Path FROM Scope() WHERE …";
    ResultTableCollection results = query.Execute();
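A complete FullTextSqlQuery call might look like the sketch below. The WHERE clause here is an illustration and not the one elided on the slide: it assumes a free-text match against the default properties, restricted to the out-of-the-box "All Sites" scope. The class and method names are hypothetical, and the code needs the same MOSS 2007 assembly references as any other Search API code.

```csharp
using System.Data;
using Microsoft.SharePoint;
using Microsoft.Office.Server.Search.Query;

public static class FullTextSqlQuerySample
{
    // site: an SPSite opened by the caller; searchTerm: free text to look for.
    public static DataTable Run(SPSite site, string searchTerm)
    {
        FullTextSqlQuery query = new FullTextSqlQuery(site);
        query.ResultTypes = ResultType.RelevantResults;

        // Double any single quotes so the term stays inside the SQL string literal.
        string safeTerm = searchTerm.Replace("'", "''");

        // Illustrative query: the scope name 'All Sites' is an assumption.
        query.QueryText =
            "SELECT Title, Path, Author FROM Scope() " +
            "WHERE FREETEXT(DefaultProperties, '" + safeTerm + "') " +
            "AND \"scope\" = 'All Sites' " +
            "ORDER BY Rank DESC";

        ResultTableCollection results = query.Execute();
        ResultTable relevant = results[ResultType.RelevantResults];

        DataTable table = new DataTable();
        table.Load(relevant, LoadOption.OverwriteChanges);
        return table;
    }
}
```

Compared with KeywordQuery, the SQL form makes the selected columns, the scope restriction, and the sort order explicit in one statement.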
Topics • SSP Administration Features • SSP Administration Demo • Site Collection Administration Features • Site Collection Administration Demo
Search Administration SSP Administration
SSP Administration Overview • Search is managed at the Shared Services Provider (SSP) level • Provides management of: • Content Sources (What is being indexed) • Managed Properties (How users want to perform advanced searches) • Scopes (How users want to search) • Relevance (What content sources are more relevant) • Health (What are users searching for and how are my crawls performing)
Content Source Management • What can we index? • SharePoint Sites (v2, v3) • Web Sites • File Shares • Exchange Public Folders • Lotus Notes • Business Data (BDC)
Business Data Catalog • MOSS 2007 (Enterprise CAL) gives users access to business data • Data is accessed through the Business Data Catalog • Databases • Web Services • Exposed to many features in SharePoint • Web Parts (Lists and Details) • Metadata Columns
Business Data Catalog • Search can easily index the data • No need to write code • Highly customizable results • Launching pad into other web-based, querystring-addressable applications • Integrated search results in the search center: • Documents • Web Content • Business Data
Metadata Property Mappings • Crawled Properties • SharePoint “discovers” properties during crawl • Organized in categories • Can be included/excluded in the index • Managed Properties • Out-of-the-box Managed Properties • Custom-defined Properties (e.g., CustomerName) • CustomerName:Acme • One Managed Property can be mapped to one or more Crawled Properties
Scope Management • Logical organization of crawled content • Rule-based • Web Address / Folder • Property Query (Author = John Doe) • Specific Content Source • Supports multiple rules (joined via AND / OR) • Supports content exclusion • Site Collections “opt-in” to SSP-defined scopes
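The way multiple scope rules combine can be pictured with a small model. This is a simplified sketch, not SharePoint code: all type and member names are hypothetical, rules are reduced to tests on an item URL, and it assumes the three rule behaviors behave as OR (Include), AND (Require), and AND NOT (Exclude). The real evaluation happens inside the search service.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical model of scope rule behaviors.
public enum RuleBehavior { Include, Require, Exclude }

public class ScopeRule
{
    public readonly RuleBehavior Behavior;
    public readonly Predicate<string> Matches; // tests an item URL

    public ScopeRule(RuleBehavior behavior, Predicate<string> matches)
    {
        Behavior = behavior;
        Matches = matches;
    }
}

public static class ScopeModel
{
    // An item is in scope when it matches at least one Include rule (if any exist),
    // every Require rule, and no Exclude rule.
    public static bool IsInScope(string url, IEnumerable<ScopeRule> rules)
    {
        bool hasInclude = false;
        bool anyIncludeMatched = false;

        foreach (ScopeRule rule in rules)
        {
            bool hit = rule.Matches(url);
            switch (rule.Behavior)
            {
                case RuleBehavior.Exclude:
                    if (hit) return false;   // exclusion always wins
                    break;
                case RuleBehavior.Require:
                    if (!hit) return false;  // every Require rule must match
                    break;
                case RuleBehavior.Include:
                    hasInclude = true;
                    if (hit) anyIncludeMatched = true;
                    break;
            }
        }

        return !hasInclude || anyIncludeMatched;
    }
}
```

For example, a scope with two Include rules (a sales site and a legacy file share) and one Exclude rule (an archive folder) accepts items from either location unless they sit under the archive folder.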
Relevancy • Authoritative Pages • Three levels of “authority” • Ability to demote sites • Relevancy API • API Available to change the ranking of managed properties
Crawl Log • Administrative interface for viewing crawl history • Summary View • Details View
Search Usage Reporting • Out-of-the-box usage reporting in MOSS • Query volume trends, top queries, click-through rates, queries with zero results, etc. • At both SSP and site collection levels • Export data for extended reporting in Excel
Search Administration SSP Administration Demo
Search Administration Site Collection Administration
Site Collection Administration Overview • Site Collection Scopes • Scope Visibility • Search Visibility Settings • Keywords / Best Bets Management • Site Collection Search Usage Reports
Site Collection Scopes • Similar interface to SSP Scope Management • Organize by Display Group • Same flexibility for rules
Scope Visibility • Define Display Groups • Groups Control Visibility on Search Web Parts
Search Visibility • Control if a site shows up in search results • Sites are already security trimmed • Example: Search Center Set to “No”
Keywords & Best Bets • Use query reporting to help define keywords • Use keywords to provide access to content inaccessible to the crawler
Search Administration Site Collection Administration Demo
Scenario 1: Creating a Search Tab The sales department at Litware, Inc. would like easier access to their documents in SharePoint, documents on a legacy file share, and information on one of their competitor sites. Solution: • New Content Sources • New Scope • New Search Center Tab