<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:evnet="http://www.mscommunities.com/rssmodule/"><channel><title>Entries tagged with search - MIX Online</title><image><url>http:///images/10logo_100.jpg</url><title>Entries tagged with search - MIX Online</title><link>http:///tags/Search/</link></image><description>MIX</description><link>http:///tags/Search/</link><language>en-us</language><pubDate>Tue, 24 Jun 2008 20:55:21 GMT</pubDate><generator>EvNet (EvNet, Version=1.0.0.0, Culture=neutral, PublicKeyToken=null)</generator><item><title>SearchMonkey is Disruptive</title><description>&lt;img alt="" border="0" align="left" src="http://visitmix.com/images/blogs/smlogo.jpg /&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;OK, after being initially skeptical, I’ve become convinced that Yahoo! SearchMonkey has the potential to really change the game in search.  The evidence is mounting that they have really thought this through, and that they intend to disrupt the existing order.  The plan is somewhat crazy, but this just might work.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;When SearchMonkey launched, about 6 weeks ago, it seemed that the news was primarily lauded by proponents of RDF who believed Yahoo!’s endorsement of RDF would resurrect their beloved but anemic Semantic Web (with a big “S”) standard and give it legs to finally dethrone the small-“s” semantic web technologies like tags and microformats.  To understand why they were so excited, you need to understand that it is the search engines who strangled RDF in the first place.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;7 years ago (when Google was still a serious underdog), &lt;a href="http://www.google.com/search?hl=en&amp;q=%22making+a+semantic+web%22+joshua+allen"&gt;I argued  that the search engines completely control the fate of “semantic web” standards&lt;/a&gt;, and explained that the major search engines have very little business incentive to support such standards.  You can read the whole whitepaper, but the summary is simple:&lt;/p&gt;
&lt;p style="margin: 0in 0in 0pt 0.5in"&gt;1)      Search engines are the dominant way that people query for entities on the web, and it’s nearly impossible to get authors to add the semantics and bootstrap the semantic web if search engines ignore the semantics or promote competing semantics.&lt;/p&gt;
&lt;p style="margin: 0in 0in 0pt 0.5in"&gt;2)      When it is difficult to extract semantics from documents, it gives advantage to incumbents with massive scale data centers who can extract semantics from natural language.  It creates barriers to entry for new competitors.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt 0.5in"&gt;3)      As a top search engine, you want the most useful semantic information stored in a format that your competitors cannot utilize.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;For 7 years, my thesis held.  At the time, I lobbied both Google and Microsoft to start indexing RDF (and later microformats).  My hope was that their desire to disrupt the (then) dominant Yahoo! search position would lead to a more open web.  But for 7 years, no search engine was crazy enough to truly adopt open standards for semantics.  In fact, Google even &lt;i&gt;dropped&lt;/i&gt; support for meta tag’s rudimentary semantics during that time period.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;Then came SearchMonkey.  For the reasons outlined above, indexing RDF and microformats is a pretty crazy underdog disruptive strategy, so I was skeptical.  At first, my skepticism seemed to be justified:&lt;/p&gt;
&lt;p style="margin: 0in 0in 0pt 0.5in"&gt;1)      At first, they supported only a handful of partners.  See point #3 above.&lt;/p&gt;
&lt;p style="margin: 0in 0in 0pt 0.5in"&gt;2)      The functionality was totally opt-in by consumers, and Yahoo! was doing nothing to evangelize it to average users.  It looked like a silly PR stunt to curry favor with the RDF and microformats camps, and clearly Yahoo! was not putting any wood behind it.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt 0.5in"&gt;3)      Semantics can only be added by document owners, on their own subdomain.  This immediately favors large incumbents.  See the whitepaper for a description of why author-created metadata is a very weak form of semantics.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;In the past 2 weeks, however, the first two reasons for my initial skepticism have been obliterated.  The &lt;a href="http://gallery.search.yahoo.com"&gt;SearchMonkey gallery&lt;/a&gt; has expanded, and there are a number of interesting services already available.  It appears that Yahoo! is promoting services which are not necessarily created by the site authors, which is huge.  Check out the &lt;a href="http://gallery.search.yahoo.com/application?smid=knb"&gt;Wikipedia Topics&lt;/a&gt; entry, for example.  And the PHP API entry is a perfect example of why opt-in by default was a good choice – I may want my search results to show &lt;a href="http://gallery.search.yahoo.com/application?smid=EJK"&gt;PHP API entries&lt;/a&gt;, but most people do not.  In addition, Yahoo! has started to promote the gallery from the home page of &lt;a href="http://search.yahoo.com"&gt;search.yahoo.com&lt;/a&gt;, under the customize button.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;This isn’t a PR stunt.  These guys are serious.  Yahoo! took the single thing that drives publisher behavior (search engine exposure) and tied it squarely to open semantic standards.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;Now, let’s contrast this with the Google approach.  Google were the very first to offer “blended” search results, and much was made of the fact that Google Maps returns microformats on search results page.  But spitting up microformats from your proprietary index is the opposite of consuming microformats to enrich your index.  And the mechanism by which Google attaches semantics to the “&lt;a href="http://www.mattcutts.com/blog/new-google-ui-feature-plus-box/"&gt;plus box&lt;/a&gt;” is notoriously opaque.  Watching people &lt;a href="http://www.mattcutts.com/blog/new-google-ui-feature-plus-box/#comments"&gt;beg Matt Cutts for information&lt;/a&gt;, insinuate that blended results on SERP amounts to paid placement, or &lt;a href="http://blumenthals.com/blog/2008/03/06/google-plus-box-where-does-the-wrong-data-come-from/"&gt;speculate about the algorithm&lt;/a&gt; as it changes under their feet (did Google “plus box” &lt;i&gt;really&lt;/i&gt; just start scraping hCard?) makes you appreciate the way that Yahoo! does it out in the open.&lt;/p&gt;
&lt;img border="1" alt="Google SERP grabbed the hCard?" src="http://visitmix.com/images/blogs/hcardserp.jpg /&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;Google did pay lip service to “out in the open”, when they launched Google Base to much fanfare and started integrating Google Base results into the main search results page.  But Google Base still required publishers to store their content in Google’s servers, and the prominent listing on the search results page quickly became a distant memory and Google Base a black hole with little influence on the main search page.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;I think people were a bit confused when Yahoo! claimed that SearchMonkey is a “long tail” strategy.  But the discussion of Google’s contrasts should have made it clear by now that they are right.  Yahoo!’s model of user opt-in makes room for both the default mass-appeal plugins (like Flickr) and the more niche plugins like PHP APIs.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;Overall, this is very strong progress in just 6 weeks.  To keep up the momentum, Yahoo! needs to continue promoting to end-users, and should be more aggressive about influencing search results ordering when SearchMonkey plugins are installed.  For example, I have opted-in to the Yelp plugin, but perfectly good Yelp tresults often get pushed off of the page by CitySearch and others.  Random samplings of users who haven’t tried any customizations should be shown enhanced search results pages and offered the chance to customize.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;In addition, Yahoo! should allow SearchMonkey plugins to customize results for other pages.  For example, I should be able to see the IMDB information next to a search result for a blog page that reviews a movie.  This would truly bootstrap the use of microformats, since adding a microformat to your page would automatically make it more useful to anyone using Yahoo!’s search engine.  Google tried something similar, with less than stellar results, when they started using scraped addresses from around the web to enhance their map “plus box”.  When Google scraped restaurant addresses from old and outdated sites, the search results page “plus box” started directing diners to the wrong location, leaving restaurant owners bewildered as they tried to figure out where the wrong data was coming from.  Yahoo!’s approach mitigates against this, since people opt-in to the provider, and they know where the data is coming from.  In Google’s approach, you get whatever plugins Google gives you, and you have no idea where they are getting the data.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt;If Yahoo! share stabilizes or increases, I would expect Google to respond by being more aggressive with their “plus box”, and perhaps embracing and extending, with an eye to extinguishing SearchMonkey.  SearchMonkey will encourage the greatest proliferation of microformats yet seen on the Internet, and as more microformats are available, Google will certainly start to leverage this information more in building their index.&lt;/p&gt;
&lt;p style="margin: 0in 0in 10pt"&gt; &lt;/p&gt;&lt;img src="http://visitmix.com/blogs/Joshua/SearchMonkey-is-Disruptive/AggBug.aspx" height="1" width="1" alt="" /&gt;</description><comments>http://visitmix.com/blogs/Joshua/SearchMonkey-is-Disruptive/</comments><link>http://visitmix.com/blogs/Joshua/SearchMonkey-is-Disruptive/</link><pubDate>Tue, 24 Jun 2008 18:07:00 GMT</pubDate><guid isPermaLink="true">http://visitmix.com/blogs/Joshua/SearchMonkey-is-Disruptive/</guid><evnet:views>1475</evnet:views><evnet:viewtrackingurl>http://visitmix.com/blogs/Joshua/SearchMonkey-is-Disruptive/AggBug.aspx</evnet:viewtrackingurl><evnet:previewtext>OK, after being initially skeptical, I’ve become convinced that Yahoo! SearchMonkey has the potential to really change the game in search.  The evidence is mounting that they have really thought this through, and that they intend to disrupt the existing order.  The plan is somewhat crazy, but this&amp;#8230;</evnet:previewtext><media:group /><dc:creator>Joshua Allen</dc:creator><slash:comments>0</slash:comments><wfw:commentRss>http://visitmix.com/blogs/Joshua/SearchMonkey-is-Disruptive/RSS/</wfw:commentRss><trackback:ping>http://visitmix.com/blogs/Joshua/1114/Trackback.aspx</trackback:ping><category>Search</category><category>Yahoo!</category></item><item><title>A Data Center Built With Shipping Containers</title><description>Yesterday, Microsoft became the first in the industry to announce a data center designed entirely with shipping container-based server modules.  It's a sizable data center, supporting somewhere between 150,000 an 440,000 servers.  &lt;a href="http://datacenterlinks.blogspot.com/2008/04/miichael-manos-keynote-at-data-center.html"&gt;Mike Manos revealed some specifics about the Chicago data center&lt;/a&gt; at a talk at Data Center World.&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://perspectives.mvdirona.com/2008/04/02/FirstContainerizedDataCenterAnnouncement.aspx"&gt;James Hamilton has some commentary&lt;/a&gt; on his blog.  James has been evangelizing the advantages of shipping container-based data centers for years, inside the company and within the industry.  As he points out, several of the largest equioment companies are now providing containers, and you can be sure that we are not the only company buying them.  But this is the first time anyone has talked publicly about a real industry-scale data center designed from the ground up for containers.  I'm particularly interested by the fact that the containers use angle parking instead of being arranged at right angles.&lt;br /&gt;
&lt;br /&gt;
This is a big accomplishment, and no doubt a sign of where the rest of the industry will be moving in coming years.  Congratulations to Mike, James, and the rest of the team!&lt;img src="http://visitmix.com/blogs/News/A-Data-Center-Built-With-Shipping-Containers/AggBug.aspx" height="1" width="1" alt="" /&gt;</description><comments>http://visitmix.com/blogs/News/A-Data-Center-Built-With-Shipping-Containers/</comments><link>http://visitmix.com/blogs/News/A-Data-Center-Built-With-Shipping-Containers/</link><pubDate>Wed, 02 Apr 2008 20:06:00 GMT</pubDate><guid isPermaLink="true">http://visitmix.com/blogs/News/A-Data-Center-Built-With-Shipping-Containers/</guid><evnet:views>4845</evnet:views><evnet:viewtrackingurl>http://visitmix.com/blogs/News/A-Data-Center-Built-With-Shipping-Containers/AggBug.aspx</evnet:viewtrackingurl><evnet:previewtext>Yesterday, Microsoft became the first in the industry to announce a data center designed entirely with shipping container-based server modules.  It's a sizable data center, supporting somewhere between 150,000 an 440,000 servers.  Mike Manos revealed some specifics about the Chicago data center at a&amp;#8230;</evnet:previewtext><media:group /><dc:creator>Joshua Allen</dc:creator><slash:comments>0</slash:comments><wfw:commentRss>http://visitmix.com/blogs/News/A-Data-Center-Built-With-Shipping-Containers/RSS/</wfw:commentRss><trackback:ping>http://visitmix.com/blogs/News/1020/Trackback.aspx</trackback:ping><category>Cloud</category><category>Search</category></item><item><title>Greg Linden: Trends in Collective Intelligence and Centralization</title><description>&lt;img src="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_small_mix.jpg" border="0" /&gt;&lt;p&gt;Rich user experiences are going mainstream on a wider variety of devices and putting pressure on web standards.  This trend is changing the face of the web as we know it, and we've covered this trend extensively here and at the MIX conferences.  But there is &lt;em&gt;another&lt;/em&gt; trend that is also changing the web forever, and Ray Ozzie's keynote at MIX08 was a shot across the bow regarding Microsoft's response to this seismic shift.&lt;br /&gt;
&lt;br /&gt;
In his keynote, Ray talked about the "mesh", and the inexorable shift of services into large centralized data centers.  We announced some cloud storage services, and discussed our philosophy of keeping control at the edges.  This is really just the beginning, and we'll be having a deep conversation with the industry over the next year.&lt;br /&gt;
&lt;br /&gt;
To kick off the conversation here on MIX Online, I asked Greg Linden to share his broad industry perspective about some of these topics.  Greg led development of Amazon's ground-breaking recommender systems, created Findory, and recently joined Microsoft to work on some top-secret incubation projects.  He continues to run the popular "&lt;a href="http://glinden.blogspot.com/"&gt;Geeking with Greg&lt;/a&gt;" blog, where he riffs on large-scale centralized computing, data mining, and "collective intelligence".&lt;br /&gt;
&lt;br /&gt;
Just a few of the topics we talked about:&lt;/p&gt;
&lt;ul&gt;
    &lt;li&gt;Is MapReduce/Hadoop really as good as SQL? &lt;/li&gt;
    &lt;li&gt;What are the limits of social search? &lt;/li&gt;
    &lt;li&gt;What good is collective intelligence, anyway? &lt;/li&gt;
    &lt;li&gt;Will all of the world get sucked into one or two datacenters? &lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;/p&gt;&lt;img src="http://visitmix.com/blogs/Joshua/Greg-Linden/AggBug.aspx" height="1" width="1" alt="" /&gt;</description><comments>http://visitmix.com/blogs/Joshua/Greg-Linden/</comments><link>http://visitmix.com/blogs/Joshua/Greg-Linden/</link><pubDate>Tue, 18 Mar 2008 22:02:00 GMT</pubDate><guid isPermaLink="true">http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_mix.wmv</guid><evnet:views>4903</evnet:views><evnet:viewtrackingurl>http://visitmix.com/blogs/Joshua/Greg-Linden/AggBug.aspx</evnet:viewtrackingurl><evnet:previewtext>Rich user experiences are going mainstream on a wider variety of devices and putting pressure on web standards.  This trend is changing the face of the web as we know it, and we've covered this trend extensively here and at the MIX conferences.  But there is another trend that is also changing the&amp;#8230;</evnet:previewtext><media:group><media:content url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_mix.mp4" expression="full" duration="1183" fileSize="63994402" type="video/mp4" medium="video" /><media:content url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_mix.mp3" expression="full" duration="1183" fileSize="9472128" type="audio/mp3" medium="audio" /><media:content url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_mix.mp4" expression="full" duration="1183" fileSize="63994402" type="video/mp4" medium="video" /><media:content isDefault="true" url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_mix.wma" expression="full" duration="1183" fileSize="9584871" type="audio/x-ms-wma" medium="audio" /><media:content isDefault="true" url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_mix.wmv" expression="full" duration="1183" fileSize="75235979" type="video/x-ms-wmv" medium="video" /><media:content url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_2MB_mix.wmv" expression="full" duration="1183" fileSize="370533043" type="video/x-ms-wmv" medium="video" /><media:content url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_Zune_mix.wmv" expression="full" duration="1183" fileSize="93868663" type="video/x-ms-wmv" medium="video" /><media:content url="mms://mschnlnine.wmod.llnwd.net/a1809/d1/mix/6/6/9/GregLinden_s_mix.wmv" expression="full" duration="1183" fileSize="189" type="video/x-ms-asf" medium="video" /></media:group><media:thumbnail url="http://visitmix.com/Link/47e57fd9-c533-44ee-9a54-24d5af5e9574/" height="240" width="320" /><media:thumbnail url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_small_mix.jpg" height="64" width="85" /><enclosure url="http://mschnlnine.vo.llnwd.net/d1/mix/6/6/9/GregLinden_mix.wmv" length="75235979" type="video/x-ms-wmv" /><dc:creator>Joshua Allen</dc:creator><slash:comments>0</slash:comments><wfw:commentRss>http://visitmix.com/blogs/Joshua/Greg-Linden/RSS/</wfw:commentRss><trackback:ping>http://visitmix.com/blogs/Joshua/966/Trackback.aspx</trackback:ping><category>Cloud</category><category>Data Mining</category><category>Search</category><category>Semantics</category><category>social browsing</category></item></channel></rss>