Archive for the Intranet Search Category

Social Bookmarking helps Search

In the previous post, we mentioned that intranets generally lack the hyperlink network found on the Internet. When users collaboratively tag good intranet resources, the result is a hyperlink structure whose keywords are highly specific to the enterprise and whose URLs are highly relevant to a specific topic. When an intranet search engine crawls this structure, it can associate relevant documents with the right keywords in its inverted index. A social bookmarking system therefore plays an important role in helping search engines find relevant data for users’ queries.
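As a rough illustration of the idea, the sketch below builds a tiny inverted index from bookmark records and ranks URLs by how many users applied a given tag. It is only a minimal sketch: the record layout, the sample URLs and the names `build_tag_index` and `search` are assumptions made for this example, not part of WSS, MOSS 2007 or any real bookmarking product.

```python
from collections import defaultdict

# Hypothetical bookmark records: (user, url, tags). All values are invented.
bookmarks = [
    ("alice", "http://intranet/hr/leave-policy.docx", ["leave", "hr"]),
    ("bob", "http://intranet/hr/leave-policy.docx", ["leave", "vacation"]),
    ("carol", "http://intranet/it/vpn-setup.pdf", ["vpn", "remote"]),
]


def build_tag_index(records):
    """Map each tag to the URLs bookmarked with it, counting one vote per user record."""
    index = defaultdict(lambda: defaultdict(int))
    for _user, url, tags in records:
        # set() avoids counting a tag twice if a user repeated it on one bookmark
        for tag in set(tags):
            index[tag.lower()][url] += 1
    return index


def search(index, keyword):
    """Return URLs tagged with the keyword, most frequently tagged first."""
    hits = index.get(keyword.lower(), {})
    return sorted(hits, key=lambda url: (-hits[url], url))


tag_index = build_tag_index(bookmarks)
print(search(tag_index, "leave"))
```

A real search engine would merge these tag counts with the terms it extracts from the documents themselves; here the tags alone act as the keywords.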

We have started a new section on social bookmarking and will illustrate the steps for extending a WSS site into a social bookmarking site. We will also show how to set the social bookmarking site as an authoritative site for the MOSS 2007 search engine on the intranet to improve relevance.

Difference between Intranet and Internet Search

Will Google or other Internet search engines (e.g. live.com or yahoo.com) work as well on an intranet as they do on the Internet? The answer is no, because intranets and the Internet differ widely. The differences are explained below:

  1. Hyperlinks - The algorithms of Internet search engines, e.g. Google’s PageRank, are specific to the Internet because the Internet has far more links (anchor or <a href> tags) connecting its pages, so pages referenced by more links are considered more “relevant” or “popular” than those referenced by fewer links. Intranets, however, generally lack this kind of link structure.
  2. File Type - On the Internet, most pages are HTML pages. An intranet has many PDF, Word, Excel and PowerPoint files, pages from SharePoint or other portal software, and other corporate data files.
  3. Security - Internet search engines usually crawl publicly accessible pages, but intranet search engines have to deal with document permissions.
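To make the hyperlink point concrete, here is a simplified, textbook-style PageRank iteration. It is a sketch under stated assumptions, not the algorithm any particular search engine actually runs: the damping factor of 0.85, the fixed iteration count, the handling of pages without outgoing links and the sample graphs are all choices made for this example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank by power iteration.

    links maps each page to the list of pages it links to.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with the same small baseline score.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for p in pages:
            targets = links.get(p, [])
            if targets:
                # Split this page's score among the pages it links to.
                share = damping * rank[p] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # A page with no outgoing links spreads its score evenly.
                share = damping * rank[p] / n
                for q in pages:
                    new_rank[q] += share
        rank = new_rank
    return rank


# Hypothetical linked pages: "c" is referenced by both "a" and "b".
linked = pagerank({"a": ["c"], "b": ["c"], "c": ["a"]})

# Hypothetical intranet documents with no links between them.
unlinked = pagerank({"x": [], "y": [], "z": []})
```

In the linked graph, the page referenced by two others ends up with the highest score. In the unlinked graph every document receives the same score, so link analysis alone gives the search engine nothing to rank by.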
