SharePoint Search: An Enterprise Contender?
February 2008, by Jean Graef
Is the search component of Microsoft’s SharePoint suite a viable option for enterprise search? Some of our members have already chosen it, some have tried and rejected it, and many more are considering it as a serious contender. Gartner lists it along with Google in its “Challenger” category.(1) The reason is that, with the 2007 release of the product (now called Microsoft Office SharePoint Server or MOSS), SharePoint search has most of the basic features we’ve come to expect in enterprise search, along with low cost and tight integration with existing SharePoint installations and other Microsoft applications. As one person put it, “It isn’t the best in class, but it’s good enough.”
Whether you deploy MOSS for enterprise search depends on your technology strategy and budget, how much you’ve invested in metadata and taxonomies, and how you plan to search multiple content repositories. If you use SharePoint for collaboration and content management but choose another product for enterprise search, you’ll need to consider two kinds of complementary products:
Either way, you’ll need a strategy that integrates SharePoint’s bottom-up (decentralized) publication and management model with the top-down (centralized) enterprise search deployment model. You want users to be able to find resources – documents, Web sites, people – regardless of company location or technology, yet not be overwhelmed by the minutiae of documents generated by local collaboration.
MOSS 2007: A big improvement
In addition, MOSS 2007 search has two new features:
MOSS vs Google
The search manager also reported that MOSS 2007 is easier to administer and maintain, though he said that the index update process is still too time-consuming. He liked the variety of usage reports, especially the one that shows the most popular search terms that have no Best Bets assigned to them (i.e., terms for which the editors have not selected one or more documents or sites to display at the top of the results list).
Room for improvement
Many, if not all, of these features are available through third-party add-ons from vendors such as Coveo and Mondosoft Ontolica. Unlike other search engine vendors, who provide new features exclusively through the upgrade process, Microsoft encourages its customers to purchase enhancement packages created by independent developers. These add-ons, however, increase the total cost of MOSS search deployment.
Influence of strategy and budget
Moreover, companies selecting MOSS tend to look at search as part of a single system in which:
In other words, MOSS search is well suited to organizations that have standardized on the Microsoft technology platform, use SharePoint for collaboration, have a decentralized organization structure, and are in knowledge-intensive industries (e.g., R&D, software consulting).
Investments in metadata and taxonomies
The MOSS search crawler will discover metadata embedded within documents, then use it to filter search results and display options in Advanced Search. However, the administrator must first map the crawled metadata elements to “managed properties” (attributes such as author, title, and URL that can be used in search scopes and queries). The Dublin Core metadata library comes with MOSS out of the box.
Some common metadata elements are mapped by default, but it’s also possible to create new managed properties for such attributes as customer name, customer service rep, or customer service region. Managed properties can be incorporated into document and site templates to make it easier to add metadata values at creation time, but MOSS provides no auto-categorization program to add metadata retrospectively to an existing document collection.
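The mapping step described above can be pictured with a small sketch. It is illustrative Python only, not the MOSS administration interface: the crawled property names, the `MANAGED_PROPERTY_MAP` table, and both helper functions are hypothetical, chosen to mirror the customer name and customer service region examples. The point it tries to show is that crawled metadata only becomes usable for filtering once it has been mapped to a managed property.

```python
# Hypothetical illustration only: in MOSS this mapping is configured by a
# search administrator, not written as code like this.

# Crawled property name (as found in documents) -> managed property name.
MANAGED_PROPERTY_MAP = {
    "dc:creator": "Author",
    "dc:title": "Title",
    "CustName": "CustomerName",
    "CSRegion": "CustomerServiceRegion",
}


def to_managed_properties(crawled):
    """Keep only crawled metadata that has been mapped to a managed property."""
    managed = {}
    for name, value in crawled.items():
        target = MANAGED_PROPERTY_MAP.get(name)
        if target is not None:
            managed[target] = value
    return managed


def filter_results(documents, **criteria):
    """Return documents whose managed properties match every criterion."""
    matches = []
    for doc in documents:
        props = to_managed_properties(doc)
        if all(props.get(key) == value for key, value in criteria.items()):
            matches.append(doc)
    return matches


docs = [
    {"dc:title": "Renewal contract", "CustName": "Contoso", "CSRegion": "West"},
    {"dc:title": "Support log", "CustName": "Fabrikam", "InternalTag": "x"},
]
west = filter_results(docs, CustomerServiceRegion="West")
print([d["dc:title"] for d in west])
```

In this sketch the unmapped `InternalTag` value is crawled but never reaches the managed properties, so it cannot be used as a filter until someone adds a mapping for it.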
Using a thesaurus with MOSS search
In MOSS you can also associate definitions with keywords. It’s not possible to simply import a traditional thesaurus into the MOSS thesaurus XML format because they’re two different animals. For one thing, a search thesaurus (i.e., a list of synonyms) should contain words that real users will type in the search box (from search logs) – not terms created by a professional indexer (though there will be some overlap). For another, a traditional thesaurus may contain phrases such as “packaging law & legislation,” while a search thesaurus should contain single words or, at most, two-word phrases. Finally, there’s no way to show broader/narrower relationships in search results (e.g., as “see also” links or an expandable hierarchy of related topics). At least two organizations we know of have bumped into size and performance limitations with the MOSS thesaurus (Microsoft says there’s a 10 MB limit).
Changing the order of search results
With Best Bets, MOSS administrators can associate keywords with specific Web pages or sites. When a user types the keyword into the search box, MOSS displays those sites designated as Best Bets at the top of the results list (or in a sidebar) and marks them with an icon, such as a star (see below).
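As a rough picture of how Best Bets behave, here is a minimal Python sketch. It is not MOSS code: the `BEST_BETS` table, the example URLs, and both functions are hypothetical. The first function places editor-chosen pages ahead of the engine's own ranking and flags them so the interface could mark them with an icon; the second imitates the usage report mentioned earlier by counting searched terms that have no Best Bet assigned.

```python
# Hypothetical illustration of Best Bets promotion; not MOSS code.

# Keyword -> pages an editor has designated as Best Bets (made-up URLs).
BEST_BETS = {
    "benefits": ["http://intranet.example/hr/benefits"],
    "expenses": ["http://intranet.example/finance/expenses"],
}


def apply_best_bets(query, ranked_urls):
    """Return (url, is_best_bet) pairs with Best Bets listed first."""
    promoted = BEST_BETS.get(query.strip().lower(), [])
    results = [(url, True) for url in promoted]
    # Keep the engine's own ranking for everything else, skipping duplicates.
    results += [(url, False) for url in ranked_urls if url not in promoted]
    return results


def terms_without_best_bets(query_log):
    """Count searched terms that have no Best Bet, most frequent first."""
    counts = {}
    for query in query_log:
        term = query.strip().lower()
        if term not in BEST_BETS:
            counts[term] = counts.get(term, 0) + 1
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


organic = [
    "http://intranet.example/news/1",
    "http://intranet.example/hr/benefits",
]
for url, starred in apply_best_bets("Benefits", organic):
    print("*" if starred else " ", url)
```

Matching here is a case-insensitive exact match on the whole query, which is a simplifying assumption; the article does not describe how MOSS matches keywords.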
With Authoritative Sites, administrators increase or decrease the relevance of content within search results by assigning one of four levels to a Web page or site: most authoritative, second-level authoritative, third-level authoritative, or sites to demote in the ranking. Sites that are not assigned an Authoritative Page level are weighted based on their “click distance” from an authoritative site. Click distance is the number of links a user must follow to get from an authoritative page to the content item.
So, while it’s possible to tweak MOSS search results using a variety of techniques along with some data from an existing thesaurus, it’s a labor-intensive endeavor. For this reason, some organizations with large, complex taxonomies opt to purchase third-party thesaurus management software that integrates with SharePoint – an approach that Microsoft endorses. Examples of MOSS-compatible taxonomy management tools include Factiva Synaptica, Data Harmony Machine-Aided Indexer, SchemaLogic SchemaServer, and Interse I-box.
A consistent search experience
Within MOSS, an administrator can create a Shared Services Provider (SSP) and instruct it to crawl all the content sources deemed necessary for a particular business function. Sources can include SharePoint content, the company intranet, database applications such as SAP and Oracle, and external information services such as FindLaw. The crawl results are stored in a single index, which makes the search relatively fast and efficient.
However, large organizations typically have multiple SSPs. To allow a user to search all of them from a single user interface, you can purchase a third-party application such as Mondosoft’s Ontolica (see the federated search option on the Ontolica Web site). Or, you can select an enterprise search engine that can crawl and index SharePoint content. Examples include Autonomy, FAST, Longitude (BA-Insight), Oracle, Recommind, Vivisimo, and others.
Is MOSS 2007 right for you?
Several of our members have mentioned the effort needed to customize MOSS search and set up interfaces to other business applications through the MOSS Business Data Catalog. Added to that is the cost of purchasing third-party programs for enhanced search features and taxonomy management. We suspect that for many organizations, the question is not “Should we use MOSS as our enterprise search engine?” but rather “What’s the best way to integrate our non-Microsoft enterprise search engine with MOSS?”
Created on February 6, 2008 | Updated on January 4, 2010