<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Beaconfire Wire &#187; Search Engines</title>
	<atom:link href="http://www.beaconfire.com/blog/category/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.beaconfire.com/blog</link>
	<description></description>
	<lastBuildDate>Mon, 23 Jan 2012 14:28:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Urban Land Institute Google Search Implementation</title>
		<link>http://www.beaconfire.com/blog/2011/04/urban-land-institute-google-search-implementation/</link>
		<comments>http://www.beaconfire.com/blog/2011/04/urban-land-institute-google-search-implementation/#comments</comments>
		<pubDate>Wed, 13 Apr 2011 14:15:14 +0000</pubDate>
		<dc:creator>Sara Franco</dc:creator>
				<category><![CDATA[Project Management]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=3000</guid>
		<description><![CDATA[Beaconfire was asked to assist Urban Land Institute (ULI) implement the Google Search Appliance (GSA) to replace the Endeca search engine which had been integrated with Sitecore. ULI is the preeminent, multidisciplinary real estate organization, facilitating an open exchange of ideas, information, and experience among local, national, and international industry leaders and policy makers dedicated [...]]]></description>
			<content:encoded><![CDATA[<p>Beaconfire was asked to assist <a href="http://www.uli.org" target="_blank">Urban Land Institute</a> (ULI) implement the Google Search Appliance (GSA) to replace the Endeca search engine which had been integrated with Sitecore. ULI is the preeminent, multidisciplinary real estate organization, facilitating an open exchange of ideas, information, and experience among local, national, and international industry leaders and policy makers dedicated to creating better places.</p>
<p>But the members who counted on ULI’s website weren’t always able to find what they were looking for.  ULI’s approximate 60 websites had different implementations (many not on Sitecore) and the current search did not effectively allow the user to search across all ULI related sites. Beaconfire implemented the Google GSA search application, giving ULI the robust functionality of Google Search. Part of this implementation included giving the user an option to search on the specific site or across all ULI and created a common advanced search and search results page across all ULI sites.  Beaconfire also implemented a number of Sitecore updates which allowed the search results to display certain data, such as secured pages and the update date.   You can try out the new search functionality at  <a href="http://www.uli.org/">http://www.uli.org/</a> or <a href="http://urbanland.uli.org/">http://urbanland.uli.org/</a>.</p>
<p>Now, the ULI websites offer a consistent and reliable search experience and results for ULI members and site users.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2011/04/urban-land-institute-google-search-implementation/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google&#8217;s new social search: +1</title>
		<link>http://www.beaconfire.com/blog/2011/03/googles-new-social-search-1/</link>
		<comments>http://www.beaconfire.com/blog/2011/03/googles-new-social-search-1/#comments</comments>
		<pubDate>Thu, 31 Mar 2011 20:48:36 +0000</pubDate>
		<dc:creator>Jo</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=2965</guid>
		<description><![CDATA[To your cadre of awkwardly-named social media &#8220;likers&#8221; and &#8220;tweeps&#8221;, you may soon want to start adding &#8220;+1&#8242;ers&#8221;. Google just announce their rollout of the new +1 button, a way for searchers to promote links to their Google connections.  Instead of just sharing cool links on Facebook and Twitter, Google wants you to start marking [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" style="margin-left: 5px; margin-right: 5px;" title="+1 button" src="http://www.google.com/+1/button/images/icon.png" alt="+1 button" width="64" height="54" />To your cadre of awkwardly-named social media &#8220;likers&#8221; and &#8220;tweeps&#8221;, you may soon want to start adding &#8220;+1&#8242;ers&#8221;.</p>
<p>Google just announce their <a href="http://googleblog.blogspot.com/2011/03/1s-right-recommendations-right-when-you.html">rollout of the new +1 button</a>, a way for searchers to promote links to their Google connections.  Instead of just sharing cool links on Facebook and Twitter, Google wants you to start marking useful links in your search results, so your friends can see them.</p>
<p>The web is <a href="http://www.huffingtonpost.com/brad-hill/google-plusone_b_842793.html">a-buzz with criticism of Google&#8217;s latest foray into social networking</a>, and whether it will be more successful than Buzz. End users <a href="http://www.huffingtonpost.com/2011/03/31/google-plus-1-page-rank_n_843169.html">don&#8217;t need to worry about it yet</a>: it&#8217;s being rolled out slowly, and you need to create a public profile before you can use it. But, if you&#8217;re a webmaster or do online marketing, you should be aware of how it might affect you:</p>
<ul>
<li>Your +1&#8242;s will affect your <strong>organic</strong> search rankings &#8211; they will be added as a new factor in Google&#8217;s ranking algorithms. Just as external links to your site are important to cultivate, you should cultivate +1&#8242;s as well.</li>
<li>+1&#8242;s may appear next to your <strong>Adwords ads</strong> as well as organic search results &#8211; <a href="http://adwords.blogspot.com/2011/03/1-button-adwords.html">another way for ads to help boost your influence</a>.</li>
<li>There isn&#8217;t one yet (at least not officially), but Google will soon release a +1 button that can live on your website, next to your &#8220;like&#8221; and &#8220;tweet&#8221; buttons. This may be the most common place that people &#8220;+1&#8243; your content, since I would imagine most people won&#8217;t recommend content from search results if they haven&#8217;t visited it yet.</li>
</ul>
<p>This move from Google has been a long time coming, as they try to find a way to bring social recommendations into their standard results, and to compete with Facebook as the place that web users go for interesting content.  We&#8217;ll have to wait and see whether it takes off, but to me, the fact that +1 is tied to search rank means that we&#8217;ll at least start seeing the button on most websites. Whether people will use it is another question.</p>
<p>Can you see yourself using +1 as you browse the web?  Do you think you&#8217;ll add it to your site? If you&#8217;ve tried it out already, I&#8217;d love to hear what you think so far!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2011/03/googles-new-social-search-1/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>5 SEO Tips for Non-Profits</title>
		<link>http://www.beaconfire.com/blog/2011/02/5-seo-tips-for-non-profits/</link>
		<comments>http://www.beaconfire.com/blog/2011/02/5-seo-tips-for-non-profits/#comments</comments>
		<pubDate>Fri, 04 Feb 2011 19:29:29 +0000</pubDate>
		<dc:creator>Rebecca</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[Nonprofits]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://beaconfire.com/blog/?p=2672</guid>
		<description><![CDATA[There are plenty of great SEO guides out there, but few address unique considerations for non-profits. Here are 5 specific ways to optimize your non-profit&#8217;s site: 1. Encourage links from supporters One of the most important factors for SEO is having incoming links from reputable external web sites. As a non-profit, you have an advantage [...]]]></description>
			<content:encoded><![CDATA[<p>There are plenty of great SEO guides out there, but few address unique considerations for non-profits. Here are 5 specific ways to optimize your non-profit&#8217;s site:</p>
<p><strong>1. Encourage links from supporters</strong></p>
<p>One of the most important factors for SEO is having incoming links from reputable external web sites. As a non-profit, you have an advantage in this area, because you likely have supporters who are happy to promote your cause with a link. Take advantage of this &#8212; if your organization has a blog or makes announcements online, make sure that content is easily shareable. Provide logos, badges, or widgets that your supporters can easily plug into their own web sites.</p>
<p>Beware of linked badges/graphics that don&#8217;t give search engines enough information. From <a href="http://www.nten.org/blog/2008/07/23/search-engine-optimization-and-paid-search-for-nonprofits">NTEN&#8217;s Blog</a>:</p>
<blockquote><p>Badges are an excellent way to get your supporters linking to you, but too many nonprofits make a common mistake: they fail to incorporate text links into their badges. Search engines prefer text links to graphics, as they signal which keywords are related to the linked page. (Remember, search engines can&#8217;t read graphics. Yet.) So, ask your supporters for links and be sure to provide them with SEO-friendly badges.</p></blockquote>
<p><strong>2. Don’t forget about pages hosted elsewhere</strong></p>
<div id="attachment_2674" class="wp-caption alignright" style="width: 272px"><a href="http://beaconfire.com/blog/wp-content/uploads/2011/02/sitelinks1.png"><img class="size-full wp-image-2674 " title="sitelinks" src="http://beaconfire.com/blog/wp-content/uploads/2011/02/sitelinks1.png" alt="Screenshot of Google Sitelinks" width="262" height="97" /></a><p class="wp-caption-text">If your donation form isn&#39;t hosted on your site, it won&#39;t appear in your Google Sitelinks.</p></div>
<p>The pages stored in your content management system may already be search engine optimized &#8212; but if some of your most important pages are hosted  elsewhere (through your CRM system, for example), they may not be optimized in the  same way.</p>
<ul>
<li>The externally hosted page may have a different domain name than the rest of your site,   which makes it difficult for search engines to associate it with your  organization.  For example, a donation form hosted on a separate domain  cannot be  listed as one of your Google Sitelinks. However, if you  create a  “Donate” page on your own site that directs users to your  donation form,  that <em>can</em> appear as a Google Sitelink.</li>
<li>You may not have as much control over the format, URL, title, heading usage, and meta tags for pages hosted elsewhere. Take the time to identify workarounds or dig deeper into the customization options for your highest priority pages.</li>
</ul>
<p><strong>3. Evaluate the calls-to-action throughout your site</strong></p>
<ul>
<li>Identify the pages on your site that already receive plenty of traffic from search engines, then make sure your most important calls-to-action are featured or at least accessible from those pages.</li>
<li>Make your calls-to-action SEO friendly by incorporating text links containing strong keywords and using alt tags for banners/images.</li>
</ul>
<p><strong>4. Manage your online identity<br />
</strong></p>
<ul>
<li>Which is more recognizable to your target audience: your organization&#8217;s full name or its acronym? Or maybe neither &#8212; are people most likely to be searching for a well-publicized project or campaign you&#8217;ve run recently? Does it depend on the audience? Account for these variations in your SEO strategy.</li>
<li>Be sure that your website is listed where it should be, both in broad non-profit listings sites (Idealist, Charity Navigator) and in topic or geographically-focused resources.</li>
</ul>
<p><strong>5. Do your keyword homework</strong></p>
<ul>
<li>When branding new campaigns and projects, do keyword research. Does the title of your new campaign align with the terms that your audience is searching for?</li>
<li>Identify the keywords that are most specific to your cause and be able to differentiate yourself.</li>
</ul>
<p>Do you have other SEO ideas that non-profits should have on their radar? Post them in the comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2011/02/5-seo-tips-for-non-profits/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Care and Feeding of your Website: 3 Free Keyword Tools</title>
		<link>http://www.beaconfire.com/blog/2010/10/care-and-feeding-of-your-website-3-free-keyword-tools/</link>
		<comments>http://www.beaconfire.com/blog/2010/10/care-and-feeding-of-your-website-3-free-keyword-tools/#comments</comments>
		<pubDate>Tue, 19 Oct 2010 21:20:59 +0000</pubDate>
		<dc:creator>Rebecca</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Cool Tools and Tips]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=2320</guid>
		<description><![CDATA[If you’ve read anything about search engine optimization, you’ve probably heard about the importance of keywords: Make sure your keywords are in your page titles! In your URLs! In your headings! In your alt tags!&#8230; But how do you decide which keywords to focus in on? Here are a few free tools to help you [...]]]></description>
			<content:encoded><![CDATA[<p>If you’ve read anything about search engine optimization, you’ve probably heard about the importance of keywords: Make sure your keywords are in your page titles! In your URLs! In your headings! In your alt tags!&#8230;</p>
<p>But how do you decide which keywords to focus in on?</p>
<p>Here are a few free tools to help you out:</p>
<ul>
<li>
<p><strong><a href="http://www.google.com/analytics">Google Analytics</a> / <a href="http://www.google.com/webmasters/tools">Webmaster Tools</a><br />
Understand what keywords are driving your site traffic today</strong></p>
<p>
If you use Google Analytics, check out the following reports:</p>
<ul>
<li><strong><em>Traffic Sources &gt; Keywords</em></strong> will show you keywords  used on external search engines that lead visitors to your site, then  show you what happens once they enter your site.</li>
<li><strong><em>Content &gt; Site Search &gt; Search Terms</em></strong> will show  you keywords used for your internal search, which are often not specific  enough to target as keywords for public search engines, but will still  give you valuable insight into your visitors&#8217; goals and vocabulary.</li>
</ul>
<p>And in Google Webmaster Tools:</p>
<ul>
<li><em><strong>Your Site on the web &gt; Search queries</strong></em> is similar to the Google Analytics&#8217; <em>Traffic Sources &gt; Keywords</em> report, but reveals more about what&#8217;s happening on the search engine  side of the equation. How far down on the page does your page appear?  What is the click through rate?</li>
</ul>
<p><strong>Nice perk:</strong> Both of these tools make it easy to monitor keyword  performance over time. In Webmaster Tools, you can &#8220;star&#8221; certain terms  that you want to keep an eye on. In Analytics, you can set up reports  to arrive in your email inbox as a reminder to check back periodically.
</p>
<p><strong>Also see:</strong> If your CMS provides reporting, check for reports similar to these.</p>
</li>
<li>
<p><strong><a href="https://adwords.google.com/select/KeywordToolExternal">Google Adwords Keyword Tool</a><br />
Understand the larger keyword universe, and how you fit into it</strong></p>
<p>Don’t be misled by the title: Even if you aren’t using Adwords, you can benefit from this tool. For any keywords you enter, it will return the relative popularity of the phrase (in # of searches) and give you an idea of how competitive it is. It will also give you tons of additional ideas to add to your list.<strong> </strong></p>
<p><strong>Nice perk:</strong> You can download the data as a spreadsheet and work with it offline.</p>
<p><strong>Also see:</strong> <a href="https://freekeywords.wordtracker.com/">WordTracker</a>, <a href="http://www.wordstream.com/keywords/">WordStream<br />
</a></p>
</li>
<li>
<p><strong><a href="http://www.seomoz.org/term-extractor">SEOMoz’s Term Extractor</a><br />
Analyze specific pages of your site<br />
</strong></p>
<p>Use this tool to get the search engine’s view of important keywords on a specific page. Do they match the keywords you’re targeting? If not, you can make changes to page elements (Title, URL, heading tags, and alt tags, among others), then try again until you are satisfied with the relative weight of the keywords for the page. This is a great before-and-after test as you optimize pages throughout your site.</p>
<p><strong>Nice perk:</strong> You can add this as a browser button/bookmark to analyze the page you’re currently browsing with a single click.</p>
<p><strong>Also see:</strong> If you use <a href="http://www.google.com/webmasters/tools">Google Webmaster Tools</a>,  go to <em>Your site on the web &gt; Keywords</em>. This report gives you the equivalent of the Term Extractor tool, but on a site-wide rather than page-specific level.</p>
</li>
</ul>
<p>And lastly, if I lost you at &#8220;if you&#8217;ve read anything about search engine optimization,&#8221; Google&#8217;s <a href="http://www.google.com/webmasters/docs/search-engine-optimization-starter-guide.pdf">SEO Beginner&#8217;s Guide (PDF)</a> is a good place to start.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2010/10/care-and-feeding-of-your-website-3-free-keyword-tools/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Care and Feeding of Your Website: Google Webmaster Tools</title>
		<link>http://www.beaconfire.com/blog/2010/02/care-and-feeding-of-your-website-google-webmaster-tools/</link>
		<comments>http://www.beaconfire.com/blog/2010/02/care-and-feeding-of-your-website-google-webmaster-tools/#comments</comments>
		<pubDate>Mon, 15 Feb 2010 22:37:14 +0000</pubDate>
		<dc:creator>Rebecca</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Cool Tools and Tips]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=1664</guid>
		<description><![CDATA[So, you have a website. It looks good, works well, and is kept updated &#8212; great! But what else should you be doing on a regular basis to ensure its success? Google Webmaster Tools is a free service provided by Google that will help you to see the &#8220;big picture&#8221; of how your site exists [...]]]></description>
			<content:encoded><![CDATA[<p>So, you have a website. It looks good, works well, and is kept updated &#8212; great! But what else should you be doing on a regular basis to ensure its success?</p>
<p><a href="https://www.google.com/webmasters/tools">Google Webmaster Tools</a> is a free service provided by Google that will help you to see the &#8220;big picture&#8221; of how your site exists on the web and how you can improve both its presence and performance. To get started, <a href="https://www.google.com/webmasters/tools">log in</a> with any Google Account. You&#8217;ll need to add your site and complete a brief verification process. Then you&#8217;re ready to go.<a href="http://www.google.com/webmasters/tools/"><img class="alignright size-full wp-image-1668" title="google_webmaster_tools" src="http://www.beaconfire.com/blog/wp-content/uploads/2010/02/google_webmaster_tools.jpg" alt="Google Webmaster Tools" width="234" height="65" /></a></p>
<p>Here are a few things you can accomplish through Webmaster Tools:</p>
<p><strong>Learn about Incoming Traffic</strong></p>
<p>Understanding the intentions of incoming visitors will help you to make sure they have a positive experience on your site. <strong><br />
</strong></p>
<ul>
<li><strong>View Top Search Queries</strong> shows when and how your web site is appearing in Google search results. Which keywords display your site the most? Which result in click-throughs? Learning the answers to these questions (and identifying the differences between them) will help you to ensure that your site&#8217;s content meets the needs of incoming traffic. If certain keywords are missing from the list or ranked lower than you would expect, you may need to add more content to your site related to that keyword and ensure that others are linking to it.
<p>More highlights after the jump.<span id="more-1664"></span></li>
<li> <strong>Links to Your Site</strong> shows which pages on your site are being linked to from other sites, and how many visits these links are generating. Why is this important? Maybe one of your most linked-to pages is in desperate need of an overhaul. Or maybe you&#8217;ll discover that most new visitors are not entering your site through the homepage, so they&#8217;re missing the important announcement you&#8217;ve posted there. <strong>Links to Your Site</strong> will also show you which sites visitors are coming from, which will give you even more insight into what they&#8217;re looking for.</li>
<li> <strong>Keywords</strong> is a list of the most common keywords found by Google when crawling your site. Ideally, this list should reflect your site&#8217;s content. If it doesn&#8217;t, you should confirm that your site is being crawled successfully and then evaluate your content strategy and use of keywords.</li>
</ul>
<p><strong>Guide Google Through Your Site</strong></p>
<p>Prevent Google from indexing specific pages and direct it to the ones that matter.<strong><br />
</strong></p>
<ul>
<li><strong>Sitemaps</strong> will allow you to submit a <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=156184">sitemap</a> to Google and detect problems associated with it. You won&#8217;t need to re-submit the map every time you change it, but it is worth revisiting this page to catch any problems that Google detects.</li>
<li> <strong>Crawler access</strong> guides you through the process of creating and testing a robots.txt file, which tells search engines which pages or directories of your site to ignore.</li>
</ul>
<p><strong>Catch Problems Early</strong></p>
<p>By monitoring your site proactively, you can avoid problems that result in unhappy visitors or losses in traffic.</p>
<ul>
<li><strong>Site Performance</strong> (found under Labs) will show you how fast your site&#8217;s pages load and how this compares to other sites. It provides a high-level timeline chart, a list of specific pages and their load times, and then gives tips on how to speed up those pages.</li>
<li> <strong>HTML Suggestions</strong> (found under Diagnostics) will identify duplicate or missing page titles and meta descriptions, which both have an impact on how visitors find your page through Google search. What makes a good page title and description? <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=35264">Google provides some tips on that, too</a>.</li>
</ul>
<p>Although these are some good starting points, this isn&#8217;t an exhaustive list of everything available in Webmaster Tools. To keep up with the latest features and best practices, visit the <a href="http://googlewebmastercentral.blogspot.com/">Google Webmaster Blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2010/02/care-and-feeding-of-your-website-google-webmaster-tools/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is Online Knowledge? How can OpenCalais help create better Online Knowledge?</title>
		<link>http://www.beaconfire.com/blog/2009/11/what-is-online-knowledge/</link>
		<comments>http://www.beaconfire.com/blog/2009/11/what-is-online-knowledge/#comments</comments>
		<pubDate>Sun, 08 Nov 2009 18:11:10 +0000</pubDate>
		<dc:creator>Rahul Singh</dc:creator>
				<category><![CDATA[Accessibility]]></category>
		<category><![CDATA[Business Strategy and Process]]></category>
		<category><![CDATA[Cool Tools and Tips]]></category>
		<category><![CDATA[Information Architecture]]></category>
		<category><![CDATA[Knowledge Management]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[Content Management]]></category>
		<category><![CDATA[Information retrieval]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[natural language processing]]></category>
		<category><![CDATA[online content management software]]></category>
		<category><![CDATA[Online Knowledge]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[Web search engine]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=1442</guid>
		<description><![CDATA[Much has changed since humanity acknowledged the word knowledge and started to classify the various subject matters into categories and taxonomies of learned disciplines. The definition of knowledge is outside the scope of this article because of simple reasons. I am not as qualified as the university professors, or librarians who pour their blood, toil, [...]]]></description>
			<content:encoded><![CDATA[<p>Much has changed since humanity acknowledged the word knowledge and started to classify the various subject matters into categories and taxonomies of learned disciplines.<br />
The definition of knowledge is outside the scope of this article because of simple reasons. I am not as qualified as the university professors, or librarians who pour their blood, toil, trouble, and tears into the understanding of knowledge and wisdom.</p>
<p>What I do know about is what knowledge is online. Since <a href="http://www.w3.org/People/Berners-Lee/">Sir Tim Berners-Lee</a> (<a href="http://news.bbc.co.uk/2/hi/technology/3357073.stm">Yes. He was knighted.</a>) created the <a href="http://www.google.com/search?hl=en&amp;client=firefox-a&amp;rls=org.mozilla:en-US:official&amp;hs=KGX&amp;defl=en&amp;q=define:world+wide+web&amp;ei=CAj3Sp_4HdLDlAfbiLnxCg&amp;sa=X&amp;oi=glossary_definition&amp;ct=title&amp;ved=0CAkQkAE">World Wide Web</a> to link documents together on the then nascent Internet, knowledge became more than monolithic documents or books that were linked loosely via citations and references. Instead of specifying in APA, MLA, Chicago, or Turabian style where the source of a particular knowledge was, one could directly link it using something called &#8220;HyperText&#8221;, or what some know as &#8220;Hyper Text Markup Language&#8221;. Today, all websites that you see online are built with a combination of HTML, some JavaScript, and possibly some Flash or Java.<br />
<span id="more-1442"></span><br />
Never mind what the knowledge is built with. Let&#8217;s examine very briefly the trend of tagging and keywording content. &#8220;Tagging&#8221; or &#8220;keywording&#8221; existing content can help web site users browse through indexes of such tags or keywords similar to how a book may be indexed by the editors and publishers. What was the first representation of such archaic systems on the Internet? Search Engines. Search engines were of two varieties. Directories such as Yahoo or Excite and Search Indexes such as Alta Vista. Both had their place. If you wanted to browse through a human edited directory, you had good reason to. People had taken the time to classify and organize the websites using their God given intellect. If you wanted to search, you also had good reason to.</p>
<p>Since the computer was able to search all of the content in a set of knowledge automatically, search indexes could be created using complex algorithms. One of the reasons Google succeeded in the search indexing industry was because they figured that relevant pages would link to each other and the more relevant websites would have higher density of connections. Similar sites around similar sets of knowledge would coalesce using the network effect to create centers of knowledge. Further more, content that is properly tagged and keyworded can heighten the quality of the search because the search engine has the power to correlate keyword usage and the keywords themselves.</p>
<h2>What is OpenCalais?</h2>
<blockquote><p>The Calais Web Service:  The Calais web service automatically attaches rich semantic metadata to the content you submit. Using natural language processing, machine learning and other methods, Calais categorizes and links your document with entities (people, places, organizations, etc.), facts (person &#8220;x&#8221; works for company &#8220;y&#8221;), and events (person &#8220;z&#8221; was appointed chairman of company &#8220;y&#8221; on date &#8220;x&#8221;). (<a href="http://www.opencalais.com">OpenCalais 2009</a>)</p></blockquote>
<p>Now that you have a bird’s eye view of how knowledge is stored and sought online, let&#8217;s talk about OpenCalais. OpenCalais is a toolkit that can be used by Content Management systems to automatically find the best set of keywords to represent a particular piece of content. Instead of having to pour your blood, toil, sweat, and tears, you can have a machine do it for you. OpenCalais is not for consumers and must be integrated into an existing piece of online content management software, or be integrated into a custom solution.<br />
If you have a 5-10 page website, you probably don&#8217;t need to use OpenCalais. If you have a catalog of any kind, be it books, products, article, or news you should look into integrating this tool so that your website becomes easier to search and navigate on your website as well as through external search engines. OpenCalais looks at your content and finds out if there are references to people, cities, companies, industry terms, and a number of other existing knowledge centers.</p>
<p style="text-align: center;">
<div id="attachment_1443" class="wp-caption alignleft" style="width: 530px"><a href="http://viewer.opencalais.com"><img class="size-full wp-image-1443 " title="OpenCalais Document Viewer" src="http://www.beaconfire.com/blog/wp-content/uploads/2009/11/calais_document_viewer.jpg" alt="A test using the OpenCalais Document Viewer" width="520" height="349" /></a><p class="wp-caption-text">A test using the OpenCalais Document Viewer</p></div>
<p>To demonstrate what OpenCalais can extract from your content, I submitted the content of this article above this sentence to get this screenshot. I&#8217;m only showing the highlight of one found term. Similarly, the other terms that were found are connected to vast amounts of knowledge already exist intent on the internet. Those knowledge centers have relevant information and hence if your content is relevant, it is given a high content relevancy rating for each word that it finds. Apparently, my article on &#8220;Online Knowledge&#8221; only gets a rating of 35%. It&#8217;s better than nothing. I&#8217;ll try again next time. You can try yourself using their <a href="http://viewer.opencalais.com">Document Viewer</a>. If you want help in implementing this great tool, let us know by contacting us through our website&#8217;s <a href="http://beaconfire.com/contact/index.php">Contact Us</a> page.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2009/11/what-is-online-knowledge/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Search Facelift</title>
		<link>http://www.beaconfire.com/blog/2009/09/the-search-facelift/</link>
		<comments>http://www.beaconfire.com/blog/2009/09/the-search-facelift/#comments</comments>
		<pubDate>Fri, 18 Sep 2009 22:24:54 +0000</pubDate>
		<dc:creator>Alan Gallauresi</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[80/20]]></category>
		<category><![CDATA[facelift]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=1265</guid>
		<description><![CDATA[Sometimes a client needs a change.  When you&#8217;re talking about websites, that change can be the whole enchilada: a new (or first time!) content management system, a comprehensive content audit with audience matrices and  IA work, a big build with tons of new functionality.  Or it might just be a creative design refresh with some [...]]]></description>
			<content:encoded><![CDATA[<p>Sometimes a client needs a change.  When you&#8217;re talking about websites, that change can be the whole enchilada: a new (or first time!) content management system, a comprehensive content audit with audience matrices and  IA work, a big build with tons of new functionality.  Or it might just be a creative design refresh with some minor tweaks and enhancements,  resulting in the appearance of a major revision at a lower cost and quicker implementation &#8212; the website facelift.</p>
<p>A website&#8217;s search can be similar.  Many clients we talk to are convinced that their site search doesn&#8217;t work and needs to be torn out &#8212; and sometimes that&#8217;s true.   We can gut the whole works, put in a new Google Mini or MS Search Server, spend hours dividing your site into collections, setting up separate searches based on audiences or site sections, but that&#8217;s not always necessary.  As a counterpart to the website facelift, the site search facelift seeks the same result: more for less.  How? By applying lessons from your search metrics, particularly what the top search terms of people seeking information at your site are and then making specific corrections to content.</p>
<p>While a site search facelift has several techniques to improve results, one of the easiest steps you can take to improve search results is to review frequent search terms and provide recommended links through your search engine for those terms.  The term used is dependent on the search engine, but is commonly referred to as &#8220;best bets&#8221; or &#8220;suggested results&#8221;.</p>
<p>Perhaps your search engine doesn&#8217;t make those frequently searched for terms reports available? Don&#8217;t have that data in the first place? Don&#8217;t be so sure &#8211; if you&#8217;re using Google analytics, most of the time that <a href="http://analytics.blogspot.com/2007/11/site-search-now-available.html">information is available to you</a> irrespective of your search engine technology.</p>
<p>The best thing about this method is that it generally follows the <a href="http://en.wikipedia.org/wiki/Pareto_principle">80/20 rule</a> where 80% of your searches are coming from 20% of the search terms people use.  Start with the terms most frequently searched form and you are automatically handling the search results in the order that will gain the most benefit from clear, recommended content suggestions.  And while there&#8217;s a lot more to a full search facelift, this is a quick way to get some significant results.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2009/09/the-search-facelift/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bing + Yahoo = Plan B for SEO?</title>
		<link>http://www.beaconfire.com/blog/2009/09/bing-yahoo-plan-b-for-seo/</link>
		<comments>http://www.beaconfire.com/blog/2009/09/bing-yahoo-plan-b-for-seo/#comments</comments>
		<pubDate>Fri, 11 Sep 2009 11:36:06 +0000</pubDate>
		<dc:creator>Alan Gallauresi</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[bing]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[yahoo]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=1256</guid>
		<description><![CDATA[There&#8217;s no doubt that Google is the search engine for most of the Internet.  Over the past few years, the big G has taken almost all of the growth in the search business (90% according to the linked source). And when vendors speak of SEO (Search Engine Optimization), they are recommending practices that are tantamount [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s no doubt that Google is <strong><em>the</em> </strong>search engine for most of the Internet.  Over the past few years, the big G has taken almost <a href="http://www.techcrunch.com/2009/01/28/google-gobbled-up-90-percent-of-all-us-search-growth-in-2008/">all of the growth</a> in the search business (90% according to the linked source). And when vendors speak of SEO (Search Engine Optimization), they are recommending practices that are tantamount to Google Engine Optimization.  And why not? Most website administrators looking at their search referrals can plainly see why that&#8217;s where to put your effort in.</p>
<p>With the <a href="http://www.techspot.com/news/35626-microsoft-bing-to-power-yahoo-under-10year-search-deal.html">recent agreement</a> between Yahoo and Microsoft making MS Bing the search platform for all of Yahoo&#8217;s sites, there&#8217;s now a new wrinkle in the SEO game.  Take the two numbers in your referral logs for Bing and Yahoo, combine them together &#8212; big enough to matter now? Especially with MS pushing the technology everywhere it can?  Right now you&#8217;ll continue to have traffic from both Bing and Yahoo, but moving forward, most sites will only have to optimize for a single additional search engine in the #2 position &#8212; Bing.</p>
<p>If you decide Bing is worth watching, <a href="http://www.bizzia.com/articles/bing-seo-tips-or-should-it-be-deo-tips/">try out these tips on optimization</a>.  Remember &#8211; SEO is always a bit of black magic rather than hared science, but the rewards can definitely be worth the effort.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2009/09/bing-yahoo-plan-b-for-seo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>URL shorteners: how to stay out of trouble</title>
		<link>http://www.beaconfire.com/blog/2009/05/url-shorteners-how-to-stay-out-of-trouble/</link>
		<comments>http://www.beaconfire.com/blog/2009/05/url-shorteners-how-to-stay-out-of-trouble/#comments</comments>
		<pubDate>Tue, 26 May 2009 17:15:48 +0000</pubDate>
		<dc:creator>Jo</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Information Architecture]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=1002</guid>
		<description><![CDATA[Recently, there&#8217;s been a proliferation of url shortening services, driven largely by Twitter&#8217;s 140 character limit and the need to keep links as short as possible to fit in your tweets.  They are run on websites with names like bit.ly and tr.im, as well as the classic tinyurl.com.  As a user of the internet, I [...]]]></description>
			<content:encoded><![CDATA[<p>Recently, there&#8217;s been a proliferation of url shortening services, driven largely by Twitter&#8217;s 140 character limit and the need to keep links as short as possible to fit in your tweets.  They are run on websites with names like bit.ly and tr.im, as well as the classic tinyurl.com.  As a user of the internet, I have a serious dislike of these cryptic little URLs, and I went searching to find out if they came with other problems I hadn&#8217;t thought of, especially problems for SEO.  It turns out that <a href="http://navigator.cision.com/The_Skinny_On_URL_Shorteners.aspx">they&#8217;re not as bad as I suspected</a>, but they do have some serious downsides (<a href="http://www.techcrunch.com/2009/04/06/are-url-shorteners-a-necessary-evil-or-just-evil/">some even say they&#8217;re evil</a>).  If you&#8217;re trying to make a name for your nonprofit in social media, these are issues you&#8217;ll want to watch out for.</p>
<p>Shortening urls provides some important benefits to the person sharing the link, which is why they&#8217;ve become so popular.  Aside from taking up less space (thus making them more tweetable), many of the services provide tracking data, giving you details about how your content is being used.  That can be really fun for the casual user, and valuable for organizations.  Some people think they look tidier than long urls.</p>
<p>But there are two (or maybe three) other parties affected by the shortened url, and they aren&#8217;t so well served by it.<span id="more-1002"></span></p>
<p>The reader, who comes across a shortened url in a post or tweet, may have no idea where the link is supposed to go.  All they get is the context of the link, which may be as little as &#8220;check out this cool link&#8221;.  Without clicking on it, they have no way to know where it goes.  (Not totally true &#8211; there exist Firefox plugins that will expand them for you &#8211; but it is true for most users.)  From a shortened url, you can&#8217;t tell whether the destination is the New York Times, a non-profit, a personal blog, or something far less reputable.  For you, as a user, all the clues you use to decide whether to click on a link may be missing.  That&#8217;s a problem for the person posting the content &#8211; if you don&#8217;t trust them, you&#8217;re much less likely to click.  Furthermore, it&#8217;s less likely to catch your interest.</p>
<p>The benefit of longer, SEO-conscious urls is that you should be able to tell what you&#8217;ll find before you click: you know the domain, the page name, and possibly other information.  Take this post, for example: <span id="sample-permalink">http://www.beaconfire.com/blog/2009/05/26/url-shorteners-how-to-stay-out-of-trouble tells you that a company called Beaconfire is providing the content (even if you&#8217;re not familiar with who we are), that it&#8217;s a post from our blog, that it was published in May 2009, and that the topic is url shorteners.  Armed with that information, you can decide whether you&#8217;re interested enough to click on it.  Run it through a service like bit.ly, and all that information is lost &#8211; but at least the link is short?</span></p>
<p>There&#8217;s also the content provider &#8211; the site that&#8217;s being linked to.  At the very least, they&#8217;re losing their branding, since their domain is no longer displayed.  A lot of SEO experts are also concerned that the shortening services themselves &#8211; which parse the link and redirect you to the destination &#8211; may rob the destination content of its search ranking, that search bots may not follow the link and give credit to the destination.  Is this a serious concern?  It&#8217;s an open question, and every service works a little bit differently.  Some do much better than others, so it&#8217;s worth doing your homework on <a href="http://searchengineland.com/analysis-which-url-shortening-service-should-you-use-17204">which url shortener will best preserve your SEO</a>.  (In all cases, you lose the benefit of having keywords in the link text, which is typically just the url.)</p>
<p>The third party affected by shortened urls is the internet as a whole.  Joshua Schachter has written a fantastic and detailed explanation of <a href="http://joshua.schachter.org/2009/04/on-url-shorteners.html">why shortened urls are hurting the web</a> &#8211; the gist is that these services are middlemen in an already complex system.  They are numerous, fragile, and unreliable, and if they someday disappear, we&#8217;ll have a lot of links that no longer point to their content, and the internet will be broken.  If that seems like an overly philosophical concern, consider how quickly sites often come and go on the internet.</p>
<p>If it were up to me, I&#8217;d say these services aren&#8217;t worth the trouble.  But, unless the whole world wakes up tomorrow and decides that Twitter is just no longer cool, I think shortened urls will be around for a while.</p>
<p>So, how can you embrace social media without the downsides of shortened urls?</p>
<p>Most importantly, don&#8217;t use them when you don&#8217;t need them.  If you&#8217;re twittering to promote your own campaign, create a friendly url that will fit easily in a tweet, and you&#8217;ll promote recognition for your site.  This is especially true if you&#8217;re not using any tracking features from the service.</p>
<p>If you do need to use them, look for a service that lets you customize the url to some extent, as well as following good SEO practices with their redirects.  <a href="http://searchengineland.com/analysis-which-url-shortening-service-should-you-use-17204">Check this list for an easy feature comparison.</a> Be sure to provide plenty of context about why users should click through.</p>
<p>One service, kl.am, even <a href="http://raven-seo-tools.com/blog/393/klam-url-link-shortener-now-supports-google-analytics-campaign-source-medium-and-name-variables">lets you append Google Analytics campaign variables for your short url</a> so you can track them by source in Analytics.  I can&#8217;t speak to kl.am&#8217;s other features, but this seems like a nice one &#8211; often, appending source codes is part of what makes urls long and unsightly.</p>
<p>The biggest lesson, to me, is not to use shortened urls frivolously.  If you don&#8217;t need them, don&#8217;t use them.  Period.  If you do need to keep your url short, consider your readers and your brand, and use these services wisely.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2009/05/url-shorteners-how-to-stay-out-of-trouble/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Tech Quick Take: MS Search Server 2008 vs Google Mini</title>
		<link>http://www.beaconfire.com/blog/2008/11/tech-quick-take-ms-search-server-2008-vs-google-mini/</link>
		<comments>http://www.beaconfire.com/blog/2008/11/tech-quick-take-ms-search-server-2008-vs-google-mini/#comments</comments>
		<pubDate>Wed, 12 Nov 2008 20:40:44 +0000</pubDate>
		<dc:creator>Alan Gallauresi</dc:creator>
				<category><![CDATA[Nonprofits]]></category>
		<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[comparison]]></category>
		<category><![CDATA[google mini]]></category>
		<category><![CDATA[ms search server]]></category>
		<category><![CDATA[strongman competitions]]></category>

		<guid isPermaLink="false">http://www.beaconfire.com/blog/?p=887</guid>
		<description><![CDATA[A short feature comparison between the Google Mini appliance and Microsoft's Search Server 2008.]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal" style="text-align: left;"><a href="http://www.beaconfire.com/blog/wp-content/uploads/2008/11/clip_image00111_1.jpg"><img class="size-medium wp-image-888 alignright" style="float: right;" title="Search Server 2008" src="http://www.beaconfire.com/blog/wp-content/uploads/2008/11/clip_image00111_1.jpg" alt="" width="300" height="235" /></a>Microsoft quietly launched their free <a href="http://www.microsoft.com/enterprisesearch/serverproducts/searchserver/default.aspx">Microsoft Search Server 2008</a> product earlier this year as a product that heavily leverages the same Enterprise Search technology found in their Sharepoint 2007 product. That quiet introduction belies the fact that Search Server is directly poised to take on the de-facto market leader in low cost website spidering and searching, the <a href="http://www.google.com/enterprise/mini/">Google Mini</a>. When the Mini was first launched it had a great price-point, the Google name, and a slew of glaring deficiencies that have now been largely patched out of existence. Search technology, which used to be the largest feature gap our company had to account for in implementations for content management systems without their own search provider, had started to become a new brainer – sometimes clients without the slightest idea what sort of technology to use for a website redesign were coming to us with a Mini already purchased. That is, until MS Search Server 2008 arrived, immediately becoming the Mini’s foremost competitor in the low and mid-tier market and presenting a compelling case for those clients already heavily invested in Microsoft technology.</p>
<p class="MsoNormal" style="text-align: left;">Right now, we’re in the midst of a website redesign that utilizes Search Server 2008. It’s a pretty common type of redesign for us at Beaconfire – a website built on a content management system that doesn’t have a built-in search technology, or at least not one we want to use. The client’s search needs are also pretty typical for a lot of site builds:</p>
<ol>
<li>The search is primarily targeted at the site being redesigned – federated searching of external content might be a plus but takes a back seat to getting good search results for the site the user is searching on.</li>
<li>The  focus in on public HTML content, not documents or database items, and there&#8217;s little in the way of heavily role-based content restrictions</li>
<li>The need for advanced filtering is minimal but needs to be expandable for the future</li>
<li>The search is fully integrated into the main website. The search and the results returned are displayed seamlessly in pages on the site instead of shuttling users off to another server.</li>
<li>Emphasis is placed on returning the best results possible through keywords and “best bet” mechanisms</li>
</ol>
<p class="MsoNormal" style="text-align: left;">Those requirements result in what is <em>decidedly not</em> an Enterprise level of search, but a very good example of the type of search plenty of customers want in a redesign – not a fancy search, just a particularly smooth and relevant one.</p>
<p class="MsoNormal" style="text-align: left;">With those requirements in mind, here’s the my quick take on how the two technologies stack-up based on my impressions with both, right after the jump.<span id="more-887"></span></p>
<p class="MsoNormal" style="text-align: left;"><strong>Price:</strong></p>
<p class="MsoNormal" style="text-align: left;">A quick and easy win for Microsoft. You can’t beat free, and the circumstance where you need to buy a license (load-balanced search servers) is uncommon enough that those who need it can afford the minor software fee.  For many organizations, the Google Mini’s rather reasonable cost to purchase the box ($2999 for the standard model with 2 years of support) is less of a burden than the hosting costs associated with racking it up.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong>Administering Indexing:</strong></p>
<p class="MsoNormal" style="text-align: left;"><a href="http://www.beaconfire.com/blog/wp-content/uploads/2008/11/27srxid_strongest_man_2_08-27-2006_0k8cfev.jpg"><img class="alignright size-full wp-image-890" title="Where\'s Magnuson?" src="http://www.beaconfire.com/blog/wp-content/uploads/2008/11/27srxid_strongest_man_2_08-27-2006_0k8cfev.jpg" alt="" width="336" height="240" /></a>I’ve had far too much experience with Minis to still be as baffled as I am by its indexing interface. It’s only become more complex over time, with various settings in different places for toggling the frequency of particular URLS to get indexed (which only applies in certain crawling modes), the feature to tell it to index a page as soon as possible (as in NOW) without seeing much effect, and the predictive ability to tell you what it plans to index in the next hour that rarely seems to match reality.  Starting a Google crawl is like a strongman competition for pulling a dump truck with a nylon rope and your own teeth – incredibly tough to get started, slow going at the best of times, and very painful to stop. MS Search, on the other hand, is brilliant –check a box and it starts, crawling the site extremely quickly.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong>Development to Deployment:</strong></p>
<p class="MsoNormal" style="text-align: left;">Search engines are a particular pain to network because they are usually within the same network segment as the website you’re deploying, yet they are set to spider content as if they were an external user looking at your site. Combine that with the issue of switching a DNS entry on a redesign launch and you get a ton of small but significant problems related to hostnames and networking. Every time we launch, there are tons of tasks for switching the search settings, reindexing immediately and testing. Search 2008 makes it a lot easier with a simple mechanism – integrated URL mapping. Just replace your staged website with the URL of the soon-to-be-live site and you can worry about reindexing later, when things aren’t so crazy. Google, take note – I want this in the Mini.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong>Reporting</strong></p>
<p class="MsoNormal" style="text-align: left;">Microsoft’s approach is simple – a bunch of reasonable, pre-made reports. While Google has a couple prepared reports like content type stats, most of the appliance uses a byzantine method of creating a report, naming it, refreshing the page to see that it already finished by the time you named it, and then viewing, is terribly unfriendly by comparison.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong>Result Summaries</strong></p>
<p class="MsoNormal" style="text-align: left;">Given that we do full integrations of both search interface and results display into our site pages, we want a result set that can be easily shoehorned into whatever interface the designers want, from basic results to AJAX-y showpieces. Both engines return text summaries that show the term hits in the context of the page and string it all together into a hit highlighted abstract. Google’s hit highlighting is basic, but good enough for most purposes, as it surrounds hit terms with &lt;b&gt; tags – like a lot of the HTML Google produces, it’s lowest common denominator (as anyone who has ever looked at their &lt;FONT&gt; tag ridden XSLT stylesheets can attest to).  Microsoft uses a numbered hit tag system (c0, c1, etc), perhaps easy enough to understand once <a href="http://blogs.msdn.com/arbindo/archive/2008/04/04/test.aspx">various</a> <a href="http://www.sharepointblogs.com/zimmer/archive/2007/09/01/moss-2007-customizing-the-search-results-page-xslt-part-2-add-highlighting.aspx">helpful</a> <a href="http://geekswithblogs.net/shehan/Default.aspx">bloggers</a> explain it, but still a <strong>bad idea</strong>. Note to Microsoft – don’t return your hit results with tags that Firefox and Chrome can style natively with CSS, but your own Internet Explorer ignores, or requires <a href="http://msdn.microsoft.com/en-us/library/ms584121.aspx">an XSL transformation </a> when a span with classes or ids would do fine, and don’t number the tag names so each one has to be enumerated separately <strong>which is why you typically don&#8217;t see tags with numbers in the name</strong>.  Devs, be prepared to do some wholesale text replacement on Search 2008 results if you want to format them nicely, like placing ellipses between term hits.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong>Meta Data</strong></p>
<p class="MsoNormal" style="text-align: left;">A feature list examination would give Microsoft a clear edge here –defined crawled properties for many different types of documents,  which can be interactively mapped into managed properties in a chosen order, so properties can gracefully fall back onto less relevant ones if the most targeted ones aren’t available. That’s fabulous, especially compared to Google’s need-to-know approach of “we’ll tell you what metadata properties we indexed when you search and not before.” But Google is also understanding of some of the basics – just about anything you index is going to be a string, except for numbers and dates. There’s a good chance you’re going to want to use dates for range filtering or sorting, and Google is also extremely tolerant at guessing dates from a variety of formats sources like  meta fields or server headers. MS Search 2008, on the other hand, <a href="https://forums.microsoft.com/TechNet/ShowPost.aspx?PostID=4090271&amp;SiteID=17">seems to think just about every HTML property is a text field</a>, and won’t let you map crawled text properties to date type managed properties, simultaneously segregating HTML content away from nicely indexed documents like PDFs or Word files and forcing workarounds for dealing with ranges.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong>Indexing Redux:</strong></p>
<p class="MsoNormal" style="text-align: left;">You might recall I loved the instant gratification of the MS Search indexing. Less lovable is the fact that Search Server will happily replace a perfectly good index with whatever it just saw in its latest spidering attempt, even if it found absolutely nothing because, say, your entire site was down for a few minutes or some networking hiccup prevented the search from seeing your site.  I’d love to say the Mini handles this situation very well – older versions had prominent toggles for a minimum number of crawled pages and specific page urls that must have been crawled before placing a staged index into production – but this feature is either gone or extremely well hidden in modern Minis.</p>
<p style="text-align: center;"><img class="alignnone size-medium wp-image-889" title="Google Appliance" src="http://www.beaconfire.com/blog/wp-content/uploads/2008/11/01425i18024900.gif" alt="" width="275" height="163" /></p>
<p class="MsoNormal" style="text-align: left;"><strong>Best Bets, Keywords and Synonyms</strong></p>
<p class="MsoNormal" style="text-align: left;">On the surface, both engines seem capable in this regard, and each has some unique features. Both have keymatching or “best bet” functionality, both allow synonyms but not fancy stemming through fuzzy logic. MS lets you specify authoritative pages. Google lets you import keymatches and related queries from text documents. Etc.  Best bets (defining a single “featured” link for a common search term) and synonyms (equating a search for one term to be the same as another term) are particularly useful in the scenario where organizations want to push users to special events or articles. Unfortunately Search Server 2008 has a fine-print limitation on their synonym feature – synonyms are only relevant when coupled with an already existing best bet match, meaning that it’s basically just a way to keep from having to type the same best bet over again with a different term. I’d really love to see a true synonym system similar to Google’s.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong>Error Handling</strong></p>
<p class="MsoNormal" style="text-align: left;">To my mind, Google has this right – it is tolerant of mistakes or “bad” queries to the point that it’s nearly impossible to cause an actual error. The Microsoft querying web service has the heavy handed approach of throwing an exception for just about everything – from an unrecoverable network problem, to a developer configuration error when requesting a property the search doesn’t know about, to a user search of something like the words “and the”. That last query returns “<span class="error">Your query included only common words and / or characters, which were removed. No results are available.</span> Try to add query terms. ” They could have added “Keep trying at that internet thing, you’ll figure it out eventually.”  Yes, each of those conditions should be handled, but they’re not remotely in the same class. Developer exception trapping shouldn’t be a substitute for reasonable behavior from a search engine.</p>
<p class="MsoNormal" style="text-align: left;">
<p class="MsoNormal" style="text-align: left;"><strong> </strong></p>
<p class="MsoNormal" style="text-align: left;"><strong>Results and Ranking</strong></p>
<p class="MsoNormal" style="text-align: left;">This is the most crucial feature, and in my opinion, the Mini wins this hand down. Google’s search technology is widespread for a very good reason – their “magic” algorithms do a great job of returning relevant results. Things have only improved over time as they’ve added features to the Mini borrowed from its big brother Google.com search, like automatically filtering similar results to drastically reduce the number of results displayed. Meanwhile, MS lacks some important features like partial page indexing (excluding content within a page from being indexed) which results in matches for keywords found in common navigation areas, etc. Google <a href="http://www.geekzilla.co.uk/ViewC8614968-56ED-4729-9C12-F01677DAC412.htm">has had those features (googleoff:index) forever </a>but has improved the relevancy to the point where they hardly need to be implemented any more.  Google has also begun opening up their sacrosanct black box of assigning hit relevance by letting administrators apply bias to results using date and site section as a factor.</p>
<p class="MsoNormal" style="text-align: left;">So there’s my quick take. I’ve skipped over a lot of functionality, and there are areas like federated searching and role-based authentication where MS clearly shines. Then there is the full-fledged Google Search Appliance whose features I haven’t touched on.  So tell me where I’m right, tell me where I missed things, and tell me where I’m just plain wrong in the comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.beaconfire.com/blog/2008/11/tech-quick-take-ms-search-server-2008-vs-google-mini/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

