<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>WhatClinic.com Blog &#187; Web/Tech</title>
	<atom:link href="http://blog.whatclinic.com/category/webtech/feed" rel="self" type="application/rss+xml" />
	<link>http://blog.whatclinic.com</link>
	<description>Sharing Tech, Marketing &#38; Health 2.0 information</description>
	<lastBuildDate>Fri, 14 Oct 2011 08:55:30 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>The Perils Of Publishing Too Often</title>
		<link>http://blog.whatclinic.com/2011/09/the-perils-of-publishing-too-often.html</link>
		<comments>http://blog.whatclinic.com/2011/09/the-perils-of-publishing-too-often.html#comments</comments>
		<pubDate>Thu, 15 Sep 2011 08:00:35 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Mashable]]></category>
		<category><![CDATA[News aggregators]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[SEOMOZ]]></category>
		<category><![CDATA[Webmaster Central]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1761</guid>
		<description><![CDATA[Recently I was forced to reset my Google Reader account thanks to Google&#8217;s recent account clean up. The upside of this was that I had to resubscribe to any RSS feed I still wanted to read regularly. A few essentials were added first: Google&#8217;s Webmaster Central, SEOMoz, Distilled and so on. Then a few other favourites like Fred [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1763" class="wp-caption alignnone" style="width: 435px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/09/newspapers.jpg"><img class="size-full wp-image-1763" title="Newspapers" src="http://blog.whatclinic.com/wp-content/uploads/2011/09/newspapers.jpg" alt="Newspapers" width="425" height="282" /></a><p class="wp-caption-text">Do you have too much to read?</p></div>
<p>Recently I was forced to reset my Google Reader account thanks to Google&#8217;s recent account clean up. The upside of this was that I had to resubscribe to any RSS feed I still wanted to read regularly. A few essentials were added first: Google&#8217;s <a title="Google webmaster central blog" href="http://googlewebmastercentral.blogspot.com/">Webmaster Central</a>, <a title="SEOMoz blog" href="http://www.seomoz.org/blog">SEOMoz</a>, <a title="Distilled blog" href="http://www.distilled.net/blog/">Distilled</a> and so on. Then a few other favourites like Fred Wilson&#8217;s <a title="Fred Wilson's blog" href="http://www.avc.com/">AVC blog</a> and Mark Suster&#8217;s <a title="Mark Suster's blog" href="http://www.bothsidesofthetable.com/">Both Sides Of The Table</a> were added.</p>
<p>What really struck me though was the number of subscriptions I didn&#8217;t want to keep up with any more. This was largely for three reasons:</p>
<ol>
<li>I was no longer that interested in the topic</li>
<li>The content was too repetitive</li>
<li>The content volume and quality had spiralled out of control</li>
</ol>
<p>For the first reason there was nothing the publisher could have done to keep me as a subscriber. For the second reason, it was possible but unlikely. Some of the topics were quite niche and there wasn&#8217;t a lot new to say on a regular basis. However the third reason is completely within every publisher&#8217;s control.</p>
<p><strong>Straying From The Original Plot</strong></p>
<p>I&#8217;ll pick out <a title="Mashable" href="http://mashable.com/">Mashable</a> as an example. It&#8217;s quite a regular occurrence that when I open my reader in the morning there are 30 or 40 stories in the Mashable folder. Buried in there somewhere are the one or two that I might still find interesting, but there is no way I&#8217;m going wade through the rest to find them, especially considering that another more focused blog is bound to reblog it, or someone in my Twitter stream will tweet about it.</p>
<p>Mashable, along with a growing number of other web properties, seem to be obsessed with growing visitor numbers at the expense of focus and even quality control, and in doing so they publish so often and on so many topics that I&#8217;m no longer interested in what Mashable has to offer. The same can be said for a growing number of blogs that are looking to grow visitors numbers by growing the number of articles they publish per day.</p>
<p><strong>A Pivot From Niche To Mainstream</strong></p>
<p>Mashable is supposed to be about Social Media News and Web Tips according to it&#8217;s own homepage &lt;title&gt; tag. So why is it publishing a story today about the new HTC Sensation XE with Beats Audio? And what about its article on Expanding Your Startup To International Markets? Or even the new version of VMware Fusion? Quite simply they know that these articles will gather traffic because they know they can rank easily. But should they publish them in the first place?</p>
<p>I guess the question comes down to this. Do Mashable just want as much traffic as they can get their hands on, or do they want to be the go to source for social media news? Do they want to be mainstream or niche? The answer in this particular case seems pretty clear to me. Maybe they will succeed in becoming the next Wired, and if that&#8217;s what they want, good luck to them. But in the meantime, when I want some social media news, I&#8217;ll go to a social media specialist source.</p>
<p>My tip? Unless you&#8217;re trying to be come a broad news aggregator stick to what you know, and what your readers want, and make sure you have something to say. If that means publishing less often, so be it, but at least you know that the resulting relevant traffic will be because of the quality of your content and not just because of the volume of articles you publish.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/09/the-perils-of-publishing-too-often.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Using Varnish To Speed Up WhatClinic.com</title>
		<link>http://blog.whatclinic.com/2011/09/using-varnish-to-speed-up-whatclinic-com.html</link>
		<comments>http://blog.whatclinic.com/2011/09/using-varnish-to-speed-up-whatclinic-com.html#comments</comments>
		<pubDate>Mon, 05 Sep 2011 08:00:30 +0000</pubDate>
		<dc:creator>David Roe</dc:creator>
				<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[cache]]></category>
		<category><![CDATA[iis]]></category>
		<category><![CDATA[varnish]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1755</guid>
		<description><![CDATA[[Here’s a bit of background about our previous cache setup. Skip ahead to “Using Varnish To Cache WhatClinic.com” if you want to jump straight into the Varnish section.] Our main website is built on Microsoft’s IIS and we have been using its built-in page and component level caching to serve html pages for several years. [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1756" class="wp-caption alignnone" style="width: 310px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/08/varnish.png"><img class="size-full wp-image-1756" title="varnish" src="http://blog.whatclinic.com/wp-content/uploads/2011/08/varnish.png" alt="Varnish Web Cache" width="300" height="300" /></a><p class="wp-caption-text">Varnish makes websites fly... once you iron out a few issues.</p></div>
<p>[Here’s a bit of background about our previous cache setup. Skip ahead to “Using Varnish To Cache WhatClinic.com” if you want to jump straight into the Varnish section.]</p>
<p>Our main website is built on <a href="http://www.iis.net/">Microsoft’s IIS</a> and we have been using its built-in page and component level caching to serve html pages for several years. This built-in caching is easy to setup and quite flexible, but it is very memory hungry.</p>
<p>The memory issue isn’t much of a problem on small static websites with only a couple of hundred pages. Unfortunately though, WhatClinic.com is a dynamic site with potentially millions of individual pages to serve. Typically we were getting only 12% of our pages served from the cache, and sometimes this was as low as 6%. It was almost pointless running the cache at all.</p>
<p>The biggest problem for us is the breadth of the website. On a typical day we have 30,000 unique visitors, but they land on 23,000 distinct URLs. Over the course of a month this balloons to 145,000 distinct landing pages.  Worse still, they look at over half a million distinct pages on the site.</p>
<p>To try and improve the performance of the existing IIS cache we tried writing the page cache to disk. Under test conditions with relatively small numbers of pages this worked well, but to get even 50% of our pages from one month’s visits in the cache it meant having 250,000 pages written to the disk. In the end the NT file system on our servers starting grinding to a halt, not because of request volume but purely because of the number of individual files involved.</p>
<p><strong>Using Varnish To Cache WhatClinic.com</strong></p>
<p>We came up with some ways around the NT file system problem but decided in the end it would be better to move the cache off the main box altogether. At the same time we decided to look at <a href="https://www.varnish-cache.org/">Varnish</a> as a solution, with a view to hosting it on AWS.</p>
<p>On the upside Varnish is lightweight and powerful, but it also introduced a number of new problems for us to overcome:</p>
<p><strong>1. Varnish Caches Cookies</strong></p>
<p>We use a cookie to store all kinds of information about a new visitor, including things like their country of origin, so we can display clinics’ prices in the visitor’s local currency. To get around Varnish serving up pages based on one person’s cookie all the time we had to move our cookie drop into a javascript call rather than doing it on the page. No big deal, but something to be aware of.</p>
<p><strong>2. All Requests Go Through The Varnish Box</strong></p>
<p>To determine a visitor’s location we look at their IP address, but since all requests were going through the Varnish server our own server was only seeing one IP address hit it all the time. We changed the code to pass the referring IP address along and so we could pick it off.</p>
<p>Problem solved, except now our default access logs don’t record the proper IP address of each visitor. We use Google Analytics and our own logs for the bulk of our reporting so this isn’t a big deal, but at some point we might have to look at writing our own access logs with the referring IP address if only to give us the peace of mind that when something does go wrong we can track it in the raw log data.</p>
<p><strong>3. Altering Our Landing Pages</strong></p>
<p>Depending on whether you have just landed on WhatClinic.com, or are browsing subsequent pages, we alter the layout of the page. The layout differences are quite extensive even though the data is all the same, so it isn’t efficient to make the changes on the client side. We need to cache two different versions of the same page.</p>
<p>The solution involved getting Varnish to pass along the referring URL and using something like (isReferringDomainWhatClinic.com) as part of the key for the cache as well as the requested URL itself. In the end this was pretty easy to do too, but it did double the number of pages in the cache. However, we were trying in particular to improve the speed of our landing pages so it is worth it to us.</p>
<p><strong>4. Time To Live</strong></p>
<p>As we said in the intro, we have a very broad site. Our pages also change quite infrequently, so we wanted to have the maximum possible time to live for the cached pages, in the order of several months. However, some pages do change, and a change to any one of our customer&#8217;s data may have effects that ripple over hundreds of pages that their clinic might appear on.</p>
<p>The solution was to set our time to live to several months, and then remove pages from the cache only when they had been updated. Having implemented a means to remove the pages from our cache, we then had to determine when a change to a clinic&#8217;s data had occurred and which pages were affected by the change, so we knew which pages to remove from the cache and update.</p>
<p>Working out exactly which pages were affected turned out to be a little problematic but we solved it eventually and we’re reasonably happy that we’ve covered all the cases. We also coded a big red “Remove All This Clinic’s Data From The Cache” for use in case of emergencies.</p>
<p><strong>The Results</strong></p>
<p>Overall, it has been a big win. After about three weeks of operation we have a page hit rate of around 65%, which is a huge improvement from the 12% we used to get. Cached pages are returned now somewhere in the order of 100-200ms instead of 2000-5000ms, and the load on our server has dropped dramatically, improving performance for those pages which are never going to be in the cache too.</p>
<p>Of course, having improved the efficiency of generating the page html, we are now looking at the speed of all our own JavaScript, our external calls to analytics, our social media buttons and other external client-side calls.</p>
<p>Performance improvements never end, do they?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/09/using-varnish-to-speed-up-whatclinic-com.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Who Do You Want To Be?</title>
		<link>http://blog.whatclinic.com/2011/08/who-do-you-want-to-be.html</link>
		<comments>http://blog.whatclinic.com/2011/08/who-do-you-want-to-be.html#comments</comments>
		<pubDate>Wed, 31 Aug 2011 08:48:40 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[authentication]]></category>
		<category><![CDATA[google plus]]></category>
		<category><![CDATA[identity]]></category>
		<category><![CDATA[verification]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1742</guid>
		<description><![CDATA[[Image: The Nonsense Blog] Identity, or rather the ability to create new identities, is a key facet of the internet. Some people troll away on internet forums under assumed names just to cause trouble, while others use made up identities to expose political scandal or even circumvent local laws. This ability to change identity at [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.whatclinic.com/wp-content/uploads/2011/08/mask.jpg"><img class="alignnone size-full wp-image-1743" title="mask" src="http://blog.whatclinic.com/wp-content/uploads/2011/08/mask.jpg" alt="hiding behind a mask" width="500" height="351" /></a><br />
[Image: <a title="The nonsense blog" href="http://www.thinknonsense.com/bb/2008/10/30/real-face-behind-the-mask/">The Nonsense Blog</a>]</p>
<p>Identity, or rather the ability to create new identities, is a key facet of the internet. Some people troll away on internet forums under assumed names just to cause trouble, while others use made up identities to <a title="Guido Fawkes" href="http://order-order.com/">expose political scandal</a> or even <a title="Super Injunction Twitter" href="http://twitter.com/InjunctionSuper">circumvent local laws</a>. This ability to change identity at will is both a boon to the creative and a bane to the legislative, but in its own way it drives innovation and change.</p>
<p><strong>Identity And Social Media</strong></p>
<p>Eric Schmidt of Google spoke recently at the Edinburgh International Television Festival to announce the <a title="Eric Schmidt at the Edinburgh International Television Festival" href="http://www.guardian.co.uk/technology/2011/aug/26/eric-schmidt-chairman-google-education">launch of Google Television in Europe</a>, which should hit our shores in the new year. However, he also took some questions, including one from <a title="Andy Carvin asks Eric Schmidt a question" href="https://plus.google.com/117378076401635777570/posts/2y7vqXBtLny">Andy Carvin</a> about Google Plus (G+). He asked &#8220;how Google justifies the policy [of making people use their real name] given that real identities could put people at risk?&#8221;</p>
<p>Schmidt&#8217;s answer was that G+ was built primarily an identity service, and that people were free not to use it if they felt they could be putting themselves at risk. I found the answer a little disappointing, especially given the tone of his actual speech which took issue with the ongoing split between the sciences and the arts in the UK.</p>
<p>By forcing G+ users to use their real identities Google are in effect silencing the weird and the creative along with the subversive and the disruptive, leaving them to create their &#8220;fake&#8221; identities on message boards and Twitter and Facebook instead. Google&#8217;s attitude appears to be driven by their desire to <a title="Google Plus data in search results" href="http://www.wired.com/epicenter/2011/08/google-studying-re-ranking-search-results-using-1-button-data-but-its-touchy/">use G+ data as part of their search results algorithm</a> as a way of reducing web spam, but this seems like a short sighted method of guaranteeing the authenticity of a +1 click for instance.</p>
<p><strong>Authentication vs Identity</strong></p>
<p>Rather than focus purely on identity I think Facebook and Twitter are getting it right by focusing on authentication. By verifying that you are the person who created a certain Facebook or Twitter account you can continue that internet persona uninterrupted on a myriad of different sites. You could potentially Like things more than once, or share them on multiple Twitter accounts, but does that really cause a problem when a real person does it once or twice?</p>
<p>Say you <em>could</em> have more than one G+ account then, how many people would go to go to the trouble of creating two accounts to game Google&#8217;s search results? A lot unfortunately. There&#8217;s real money riding on it after all, and knowing the experience of Black Hat SEO practitioners and their &#8220;creative&#8221; ways of building links, they&#8217;ve probably already gotten around Google&#8217;s current protections anyway.</p>
<p>Google really are going to be fighting an uphill battle to keep it to one account per person. Twitter suffers a great deal from fake accounts being set up for spam purposes. Facebook apparently less so, even though Facebook Likes are now almost a web currency of their own. But companies with people as smart as the ones at Google, Facebook and Twitter should be able to decouple the ideas of multiple (valid) identities and spambots created purely to manipulate results.</p>
<p><strong>Who Are You Right Now?</strong></p>
<p>I think <a title="Fred Wilson on online identities" href="http://www.avc.com/a_vc/2011/08/indentity-authentication-and-provisioning-them-online.html">Fred Wilson&#8217;s take on Identity and Authentication</a> is pretty spot on too. He comes at it from a slightly different angle, not so much about fake or hidden identities but rather about his real identity being split across different sites for different reasons. It was for exactly this reason that G+ made the leap forward it did with Circles, letting people split out who they talk to based on some common themes, but by tying it all to your real name it restricts itself unnecessarily.</p>
<p>Quinton O&#8217;Reilly also recently covered some of the problems that arise from having a <a title="the problem with an online persona" href="http://www.simplyzesty.com/social-media/can-a-person%E2%80%99s-personality-be-defined-by-social-media/">publicly accessible profile</a> with its own unique persona, especially when potential employers come looking to dig up some dirt on you. To me that&#8217;s all the more reason why people should be allowed to have different accounts, or identities if they want to. In fact, it&#8217;s almost exactly the reason that most people I know on LinkedIn are members of that site. They don&#8217;t want their Facebook profiles perused by employers, colleagues or customers!</p>
<p><strong>Should Businesses Care?</strong></p>
<p>Which brings me to the business end of things. Most people who use WhatClinic.com use their real names when they create an enquiry and they use real email addresses and real phone numbers. But do we know if they just created that email account, or just bought a prepaid mobile phone? No. Do we know if they&#8217;re using a pseudonym? No. Should we care? Not really. So long as the clinic can actually contact the user they&#8217;re free to call themselves whatever they want.</p>
<p>People change depending on the situation they&#8217;re in. They do it in the real world and they do it online, and I doubt even Google are going to be able to stop that.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/08/who-do-you-want-to-be.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More Strange Analytics Behaviour</title>
		<link>http://blog.whatclinic.com/2011/08/more-strange-analytics-behaviour.html</link>
		<comments>http://blog.whatclinic.com/2011/08/more-strange-analytics-behaviour.html#comments</comments>
		<pubDate>Thu, 18 Aug 2011 11:22:29 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[bug]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[sessions]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1738</guid>
		<description><![CDATA[Google Analytics was recently updated to change how it calculated when a visitor&#8217;s session ended. We were told this should have only a small effect, around 1% on average, on how our visitors were being counted. The change went live on Thursday August 11th. If you look at the graph above you can see on [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.whatclinic.com/wp-content/uploads/2011/08/non-unique-visitors.png"><img class="alignnone size-full wp-image-1736" title="non-unique-visitors" src="http://blog.whatclinic.com/wp-content/uploads/2011/08/non-unique-visitors.png" alt="Google Analytics over-reporting visits" width="500" /></a></p>
<p>Google Analytics was recently updated to change how it calculated when a visitor&#8217;s session ended. We were told this should have only a small effect, around 1% on average, on how our visitors were being counted. The change went live on Thursday August 11th. If you look at the graph above you can see on Friday the 12th our reported visits increased by over 40%. This trend continued for nearly a week.</p>
<p>Now a 40% increase in traffic would obviously be welcome, but our unique visitors report told a very different story. Nothing had changed much at all!</p>
<p><a href="http://blog.whatclinic.com/wp-content/uploads/2011/08/unique-visitors.png"><img class="alignnone size-full wp-image-1737" title="unique-visitors" src="http://blog.whatclinic.com/wp-content/uploads/2011/08/unique-visitors.png" alt="Google Analytics unique visitors" width="500" /></a></p>
<p>So, what was going on? It turns out there were some bugs in the Analytics update that created new sessions for users when they should have had only one. Full details are in the <a title="update to google analytics session change" href="http://analytics.blogspot.com/2011/08/update-to-sessions-in-google-analytics.html">update to the announcement</a> of the original change. It would seem that in our case visitors who clicked the back button in their browser to go back to the landing page they arrived on were being counted multiple times.</p>
<p>Google pushed a fix to this problem on Tuesday the 16th of August and everything seems to be back to normal now, but I&#8217;m sure we&#8217;re not alone in having spent some time trying to work out what was going on and what changes we&#8217;d need to make to be able to compare reports from before and after the change. Thankfully it looks like that&#8217;s not an issue anymore. Still, it seems like a pretty big bug to slip through the net for such an important product.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/08/more-strange-analytics-behaviour.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>How To Export More Than 500 Rows From The New Google Analytics</title>
		<link>http://blog.whatclinic.com/2011/07/how-to-export-more-than-500-rows-from-the-new-google-analytics.html</link>
		<comments>http://blog.whatclinic.com/2011/07/how-to-export-more-than-500-rows-from-the-new-google-analytics.html#comments</comments>
		<pubDate>Thu, 07 Jul 2011 08:00:55 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Stuff we've learned]]></category>
		<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[500 rows]]></category>
		<category><![CDATA[export limit]]></category>
		<category><![CDATA[google analytics]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1683</guid>
		<description><![CDATA[I switched to the new Google Analytics interface and almost immediately ran into that old problem of wanting to export more than 500 rows of data without having to resort to using API calls. The old &#8220;limit=50000&#8243; trick doesn&#8217;t work with the new format, but thankfully there is a work around which I came across [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.whatclinic.com/wp-content/uploads/2011/07/new-analytics-export.png"><img class="alignnone size-full wp-image-1684" title="New Google Analytics Export" src="http://blog.whatclinic.com/wp-content/uploads/2011/07/new-analytics-export.png" alt="New Google Analytics Export" width="600" height="126" /></a></p>
<p>I switched to the new Google Analytics interface and almost immediately ran into that old problem of wanting to export more than 500 rows of data without having to resort to using API calls. The old &#8220;limit=50000&#8243; trick doesn&#8217;t work with the new format, but thankfully there is a work around which I came across on the <a title="convonix blog" href="http://www.convonix.com/blog/web-analytics/how-to-export-more-than-500-results-in-new-version-of-google-analytics">Convonix blog</a>.</p>
<p>If you choose to show more than the standard 10 rows using the drop down at the bottom of the page, a new &#8220;rowcount&#8221; variable is added to your URL. For example, I changed a page to display 25 rows and the variable looks like this:</p>
<p><a href="http://blog.whatclinic.com/wp-content/uploads/2011/07/analytics-rowcount-variable.png"><img class="alignnone size-full wp-image-1685" title="Google Analytics Row Count Variable" src="http://blog.whatclinic.com/wp-content/uploads/2011/07/analytics-rowcount-variable.png" alt="Google Analytics Row Count Variable" width="250" height="48" /></a></p>
<p>By changing the 25 you can change how many rows get displayed and then export them, up to a 50,000 row limit apparently. I&#8217;d caution against relying on this as a long term solution though. The previous 50,000 row limit trick got reduced to 20,000 after so many people started using it, and I imagine the same will happen with this trick once its use catches on. In the meantime though, enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/07/how-to-export-more-than-500-rows-from-the-new-google-analytics.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>How Not To Use The Rel=&#8221;Canonical&#8221; Tag</title>
		<link>http://blog.whatclinic.com/2011/06/how-not-to-use-the-relcanonical-tag.html</link>
		<comments>http://blog.whatclinic.com/2011/06/how-not-to-use-the-relcanonical-tag.html#comments</comments>
		<pubDate>Tue, 07 Jun 2011 13:48:29 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Stuff we've learned]]></category>
		<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[canonical]]></category>
		<category><![CDATA[experiment]]></category>
		<category><![CDATA[rel]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1666</guid>
		<description><![CDATA[One of the fun problems we have working at WhatClinic.com is trying to organise the millions of pages that result from listing tens of thousands of clinics in thousands of locations for thousands of treatments. Our search results pages list up to 12 clinics at a time, and when they’re full they offer a great [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1667" class="wp-caption alignnone" style="width: 568px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/06/rel-canonical-usage.gif"><img class="size-full wp-image-1667 " title="Rel Canonical" src="http://blog.whatclinic.com/wp-content/uploads/2011/06/rel-canonical-usage.gif" alt="Rel=Canonical Usage" width="558" height="295" /></a><p class="wp-caption-text">Typical proper use of the rel=&quot;canonical&quot; tag (via SEOMoz.org)</p></div>
<p>One of the fun problems we have working at WhatClinic.com is trying to organise the millions of pages that result from listing tens of thousands of clinics in thousands of locations for thousands of treatments.</p>
<p>Our search results pages list up to 12 clinics at a time, and when they’re full they offer a great user experience. Lots of choice and lots of information is presented along with a simple way to contact whichever of the clinics takes your fancy.</p>
<p>However, not every combination of clinic type + location + treatment will have a full page of results. In fact with only a little knowledge you could probably guess the URL of a page with no results on it. The obvious solution to these empty pages is to return a <a href="http://en.wikipedia.org/wiki/HTTP_404">404 response code</a> and not to link to the pages internally, minimising the chance that they’ll be found by users or search engines alike.</p>
<p><strong>What’s Right For The User?</strong></p>
<p>Add one clinic to the page though and we’re left with a quandary. Is this really a useful page for a user? Wouldn’t they like more choice? We know for instance that pages with more clinics on them have a better conversion rate, so would we be better off sending users to a “parent” location page instead, i.e. a location that contains the smaller location but should have more than one clinic on offer?</p>
<p>Another option available to us would be to fill the rest of the page with 11 of the nearest clinics to the location (which could be tens if not hundreds of miles away in some cases), but this would massively increase the duplication of data served across the pages on our site as clinics’ listings would appear in far more locations than they currently do.</p>
<p><strong>Similar Pages – The Rel=”Canonical” Solution</strong></p>
<p>We decided we’d like to see what effect the first option had, i.e. sending the users to a parent page, but we were uncomfortable with 301 redirecting every page that only had one clinic on it, so we decided to try a slightly softer approach.</p>
<p>Having read an article on SEOMoz about <a href="http://www.seomoz.org/blog/using-canonical-tag-to-get-more-than-one-anchor-text-value-11283">using the Rel=”Canonical” tag to get more than one keyword to rank</a> for a given piece of content, we decided to try what we thought was quite a clever scheme that would serve the user and the search engines.</p>
<p>We would put a Rel=”Canonical” tag on our search results pages with only one clinic listed, and we’d hope to send people searching Google for Place A to the search results of Place B, which would contain the search results for Place A and more, giving the user a better choice.</p>
<p><strong>Anchor Text Isn’t A Very Strong Ranking Signal For Pages With A Rel=”Canonical”</strong></p>
<p>Unfortunately for us, the experiment hasn’t exactly gone to plan. We were cautious and only put the Rel=”Canonical” links on a subset of our one result pages, but even still we have enough data to see that for now at least none of the Place B pages are ranking for Place A keywords.</p>
<p>Of a sample set of 20 one result pages with a Rel=”Canonical” tag, 14 have been crawled and no longer appear in Google’s index, and searching using the “Place A” keyword for these pages doesn’t return the Place B search results page.</p>
<p>You might think, well Google have decided that the Place A and Place B pages aren’t sufficiently similar to be a valid use of the Rel=”Canonical” tag, and you might be right, but the fact that original Place A URLs are no longer appearing in the index seems to counter this supposition.</p>
<p>More likely it seems is that the anchor text of the links pointing at Place A pages isn’t a strong enough signal for the Place B pages to rank for keywords based on “Place A”.</p>
<p><strong>Back To The Drawing Board</strong></p>
<p>So it looks like we’re back to square one on this particular problem. I think the next thing to try is the option discussed above where we fill out the search results. It seems like a good thing to do for the user, but I am slightly worried about diluting our content by potentially overusing it. We’ll be sure to keep you posted about the results when we try it out.</p>
<p>Have you run any experiments with the Rel=”Canonical” tag yet? For what purpose, and what results did you see? Let us know in the comments.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/06/how-not-to-use-the-relcanonical-tag.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Digging Deeper Into Your Analytics</title>
		<link>http://blog.whatclinic.com/2011/05/digging-deeper-into-your-analytics.html</link>
		<comments>http://blog.whatclinic.com/2011/05/digging-deeper-into-your-analytics.html#comments</comments>
		<pubDate>Thu, 19 May 2011 08:00:58 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Stuff we've learned]]></category>
		<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[conversion rates]]></category>
		<category><![CDATA[google analytics]]></category>
		<category><![CDATA[keyword length]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1649</guid>
		<description><![CDATA[We all like to keep an eye on the usual metrics when looking at our Google Analytics accounts. Visits, unique visitors, bounce rates, time on site, and conversion rates all get a look in. These are all great pieces of information for making sure that things are working the way you expect them to on [...]]]></description>
			<content:encoded><![CDATA[<p>We all like to keep an eye on the usual metrics when looking at our Google Analytics accounts. Visits, unique visitors, bounce rates, time on site, and conversion rates all get a look in. These are all great pieces of information for making sure that things are working the way you expect them to on your website, but what if you want to look a little deeper?</p>
<p>Unfortunately Analytics can’t answer every question you might have about your site, in which case it’s time to dust off your Excel For Dummies book and get stuck into manipulating the data yourself. For those of you looking for a good guide to some of the most useful Excel functions for SEO analysis I can recommend the <a href="http://www.distilled.net/excel-for-seo/">Microsoft Excel for SEO</a> guide from Distilled.net.</p>
<p>Digging deeper often requires large amounts of data to give meaningful answers, so you’re going to want to get familiar with adding the <a href="http://blog.whatclinic.com/2009/05/how-to-export-more-than-500-rows-in-google-analytics.html">“&amp;limit=50000</a>” to your GA URLs, or better still start using the <a href="http://code.google.com/apis/analytics/docs/gdata/home.html">Google Analytics Data Export API</a> or the <a href="http://excellentanalytics.com/">Excellent Analytics</a> Excel plug-in.</p>
<p><strong>Keyword Lengths and Conversion Rates</strong></p>
<p>I’m a firm believer that the more you know about your visitors and their behaviour the better you can tailor your product to suit their needs. So, from time to time we go and look at some metrics that are slightly off the beaten track. Have a look at the graph below for instance:</p>
<div id="attachment_1651" class="wp-caption alignnone" style="width: 310px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/05/traffic-and-conversion-rate-by-keyword-length.jpg"><img class="size-medium wp-image-1651" title="traffic-and-conversion-rate-by-keyword-length" src="http://blog.whatclinic.com/wp-content/uploads/2011/05/traffic-and-conversion-rate-by-keyword-length-300x180.jpg" alt="Traffic by keyword length with conversion rate" width="300" height="180" /></a><p class="wp-caption-text">The longer the keyword, the more likely the conversion.</p></div>
<p>It charts the traffic and email enquiry conversion rate of traffic over a recent two week period. The first thing that struck me was the more keywords people use to find WhatClinic.com the more likely they are to convert. The second thing was that just over 50% of our email conversions come from people who use 4 or 5 keywords to find the site.</p>
<p>All well and good you say, but what use is information like this? Well, for a website like ours with a long tail focus it shows us how long the keywords in the long tail are. We typically optimise pages for one or two keywords, usually two or three words in length. The data above suggests that maybe some pages should be optimised for slightly longer keywords, or perhaps even two longer keywords.</p>
<p>Thanks to other curious SEOs like SharkSEO we also know that you can <a href="http://sharkseo.com/whitehat/meta-descriptions/">write two completely different meta descriptions</a> for the same page and the search engines will pick the description that best matches the keyword being searched for. This opens up some new possibilities about how to organise our data and our site structure. Using the keyword length and conversion data above we can make more informed decisions about how to optimise the resulting pages.</p>
<p><strong>Are Keywords Getting Longer?</strong></p>
<p>Just over a year ago I wrote about how <a href="http://blog.whatclinic.com/2010/03/is-the-long-tail-getting-longer.html">people were using longer keywords</a> to find WhatClinic.com. Seeing as we’re talking about keyword lengths again I thought I’d take a quick peek at some data from this year. I was in for a surprise.</p>
<div id="attachment_1652" class="wp-caption alignnone" style="width: 310px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/05/traffic-by-keyword-length-2010-2011.jpg"><img class="size-medium wp-image-1652" title="traffic-by-keyword-length-2010-2011" src="http://blog.whatclinic.com/wp-content/uploads/2011/05/traffic-by-keyword-length-2010-2011-300x180.jpg" alt="keyword length 2010 and 2011" width="300" height="180" /></a><p class="wp-caption-text">Keyword length doesn&#39;t seem to be changing, but that&#39;s not the whole story.</p></div>
<p>If my data was to be believed keyword lengths were almost exactly the same as they were a year ago. The answer seemed too neat to me, so I decided to do a little segmentation. My suspicion was that by looking at our traffic as a whole I was missing some underlying trends, and it turns out I was right.</p>
<div id="attachment_1650" class="wp-caption alignnone" style="width: 310px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/05/irish-traffic-by-keyword-length-compared-to-average.jpg"><img class="size-medium wp-image-1650" title="irish-traffic-by-keyword-length-compared-to-average" src="http://blog.whatclinic.com/wp-content/uploads/2011/05/irish-traffic-by-keyword-length-compared-to-average-300x180.jpg" alt="WhatClinic.com Irish traffic by keyword length" width="300" height="180" /></a><p class="wp-caption-text">Irish visitors account for more than their fair share of our short keyword traffic.</p></div>
<p>Traffic from Ireland accounts for around 19% of our total visits, but as you can see from the chart above it accounts for over 30% of our one and two word keyword traffic. Again the question is how is this information useful or actionable? The simple answer again is to do with the messaging – the page title and the meta description in particular.</p>
<p>In Google.ie we now rank quite well for certain one word keywords like “<a href="http://www.google.ie/search?hl=en&amp;q=braces">braces</a>” or “<a href="http://www.google.ie/search?hl=en&amp;q=dentist">dentist</a>”. While this is great for us in terms of traffic, the pages are really optimised for people looking for our page about braces in Ireland, or dentists in Ireland. This means that as the keywords used to find these pages get more generic / head / short tail that maybe we should look at changing the messaging on them to better reflect more closely what the user is looking for. For the cases above, I think that the messaging might be OK, but we’ll test some alternatives and see how they affect CTR and conversion rates.</p>
<p><strong>The Importance Of Segmentation</strong></p>
<p>The Irish traffic above really skewed the keyword length data above. Seeing as our website deals with so many geographies and our keyword rankings quite a lot across them, any decisions about site structures and one page optimisation should only be made once the overall site figures have been sliced enough to have confidence in them.</p>
<p>Excluding the Irish traffic, keywords have gotten slightly longer since 2010, but not massively so. It is the relative shortening of Irish keywords that is much more significant to us on this occasion.</p>
<div id="attachment_1653" class="wp-caption alignnone" style="width: 310px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/05/traffic-by-keyword-length-2010-2011-incl-excl-ireland.jpg"><img class="size-medium wp-image-1653" title="traffic-by-keyword-length-2010-2011-incl-excl-ireland" src="http://blog.whatclinic.com/wp-content/uploads/2011/05/traffic-by-keyword-length-2010-2011-incl-excl-ireland-300x180.jpg" alt="keyword length with separate Irish data 2010 2011" width="300" height="180" /></a><p class="wp-caption-text">Irish visitors are skewing the data</p></div>
<p>We have previously observed similar big differences in user behaviour based on whether the landing takes place on a brochure / listing page or on one of our search results pages. We’ve even observed that the nearer the top of the tree structure a user lands the more likely they are to convert.</p>
<p>It’s often worth digging deeper than the reports or segments in Google Analytics can offer by themselves because the information that comes out can offer you a clearer picture of some of the bigger underlying trends affecting your site and give you the information you need to not only stay ahead of your competitors in the SERPs, but ultimately make your site better for your users.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/05/digging-deeper-into-your-analytics.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Groupon Secretly Prefers Selling Beauty Deals To Hair Deals</title>
		<link>http://blog.whatclinic.com/2011/05/why-groupon-secretly-prefers-selling-beauty-deals-to-hair-deals.html</link>
		<comments>http://blog.whatclinic.com/2011/05/why-groupon-secretly-prefers-selling-beauty-deals-to-hair-deals.html#comments</comments>
		<pubDate>Tue, 17 May 2011 08:00:46 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Health 2.0]]></category>
		<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[customer loyalty]]></category>
		<category><![CDATA[customer retention]]></category>
		<category><![CDATA[group deals]]></category>
		<category><![CDATA[groupon]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1631</guid>
		<description><![CDATA[Today we have a guest post from Ronan Perceval of Phorest.com. According to a recent article on TechCrunch.com 20% of all Groupon and CityDeals worldwide are for hair and beauty treatments. That is a lot of money: approximately $1bn a year if you count all the deal sites and growing fast. But of this 20% [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.whatclinic.com/wp-content/uploads/2011/05/group-deals.jpg"><img class="alignnone size-full wp-image-1632" title="group-deals" src="http://blog.whatclinic.com/wp-content/uploads/2011/05/group-deals.jpg" alt="Group buying deal sites" width="500" height="200" /></a></p>
<p>Today we have a guest post from Ronan Perceval of <a title="Salon Software" href="http://www.phorest.com/">Phorest.com</a>.</p>
<p>According to a recent article on TechCrunch.com 20% of all Groupon and CityDeals worldwide are for hair and beauty treatments. That is a lot of money: approximately $1bn a year if you count all the deal sites and growing fast.</p>
<p>But of this 20% the majority are for beauty treatments rather than hair. This is because beauty customers are less loyalty to one salon than hair customers. According to data collected from 1,000 salons using <a href="http://www.phorest.com/">Phorest.com</a> salon software an average of 45% of customers who visit a hair salon in any one year will continue to visit that salon. For beauty salons the figure is only 30%.</p>
<p>This is because when people find a stylist that makes their hair look good, they are much more likely to want to return to that particular person than they are to the therapist that gives them a spray tan or massage that any therapist in a particular salon can be expected to carry out to the same standard.</p>
<p><strong>Loyalty To Groupon</strong></p>
<p>Groupon likes selling beauty offers because customers go from deal to deal, from salon to salon. In this way they stay loyal to Groupon rather than the salon after getting a deal and Groupon can continue milking those beauty customers for buying offers.</p>
<p>Groupon doesn’t like hair offers as much because customers are much more likely to stay with that salon after the deal and not use Groupon again for a hair offer. I was chatting to a hair and beauty salon owner yesterday who told me that they had run a beauty offer on Groupon CityDeal and wanted to run a hair offer next but the Groupon salesperson was adamant that they run another beauty one.</p>
<p>We ran a survey of 1,000 salon customers last week asking them how they chose their current hair salon and their regular beauty salon. The results are interesting. 9% of people first experienced their current hair salon because of an internet deal but only 4% had experienced their regular beauty salon for the first time this way. And this is despite the fact that there are 9 times as many internet deals for beauty than hair.</p>
<p><strong>Be Careful What You Offer</strong></p>
<p>For the dental and cosmetic beauty clinics on WhatClinic.com the advice is clear: if you are considering running a group deal think carefully about the treatment or service you are offering. Are customers who use the deal likely to return to you for this treatment again, or when the time comes will they just use another deal to go to another clinic?</p>
<p>Try to think of a way to make the deal depend on return visits in order to get the best value from it – maybe offer a 10% discount on all treatment for a 12 month period? And make sure you get to demonstrate why they should come back – excellent customer service, skilled staff, modern equipment, etc.</p>
<p><strong>About the author</strong>: Ronan Perceval is the CEO of <a href="http://www.phorest.com/">Phorest.com</a>, a leading provider of salon software to thousands of salons and spas in the UK and Ireland. Phorest also operate <a href="http://www.myzanadoo.com/">MyZanadoo.com</a>, the UK and Ireland&#8217;s number 1 destination for booking salon and spa appointments online.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/05/why-groupon-secretly-prefers-selling-beauty-deals-to-hair-deals.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Airbnb Gets Social, Maybe Creepy?</title>
		<link>http://blog.whatclinic.com/2011/05/airbnb-gets-social-maybe-creepy.html</link>
		<comments>http://blog.whatclinic.com/2011/05/airbnb-gets-social-maybe-creepy.html#comments</comments>
		<pubDate>Fri, 13 May 2011 08:00:35 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[airbnb]]></category>
		<category><![CDATA[facebook connect]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1645</guid>
		<description><![CDATA[Airbnb, the website that let&#8217;s people rent out their spare rooms like hotel rooms has added a new feature which uses your Facebook account to let you know if any of your friends know anything about the people renting out the rooms. On the surface this is a really useful feature. I&#8217;d certainly rather rent [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.whatclinic.com/wp-content/uploads/2011/05/airbnb.jpg"><img class="alignnone size-full wp-image-1646" title="airbnb" src="http://blog.whatclinic.com/wp-content/uploads/2011/05/airbnb.jpg" alt="Airbnb Social Connection" width="500" height="250" /></a></p>
<p><a title="Room Rental And Sublets" href="http://www.airbnb.com">Airbnb</a>, the website that let&#8217;s people rent out their spare rooms like hotel rooms has <a title="Airbnb Facebook Connect" href="http://www.airbnb.com/social/">added a new feature</a> which uses your Facebook account to let you know if any of your friends know anything about the people renting out the rooms.</p>
<p>On the surface this is a really useful feature. I&#8217;d certainly rather rent a room from someone who I might have something in common with that a complete stranger, but as with all internet privacy issues, the question of where to draw the line crops up.</p>
<p>I logged in to Airbnb using Facebook connect and checked for rooms in Dublin. Two people who are friends of my friends were renting out rooms. I would put this down to not that many people in Dublin using the service yet. I checked London. Three friends of friends this time. I guess I don&#8217;t know that many people in London yet.</p>
<p>Then I checked New York. Eleven people in my social graph were renting out rooms this time, but this time it included people who had gone to the same school as me. I have to say I think that connection is pretty tenuous. I left school 17 years ago, and I&#8217;m not in touch with many people from there any more. Maybe Airbnb should provide some sort of controls on which parts of your social graph to use?</p>
<p>I have to admit it definitely feels a bit creepy having such open access to information about your friends&#8217; friends, but I guess both sides have opted in at least. My real concern is when companies and their websites start trying to take advantage of this information without really informing you in the process.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/05/airbnb-gets-social-maybe-creepy.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How SearchMetrics And Sistrix Got It Wrong, And Why</title>
		<link>http://blog.whatclinic.com/2011/04/how-searchmetrics-and-sistrix-got-it-wrong-and-why.html</link>
		<comments>http://blog.whatclinic.com/2011/04/how-searchmetrics-and-sistrix-got-it-wrong-and-why.html#comments</comments>
		<pubDate>Tue, 19 Apr 2011 09:15:40 +0000</pubDate>
		<dc:creator>Philip Boyle</dc:creator>
				<category><![CDATA[Stuff we've learned]]></category>
		<category><![CDATA[Web/Tech]]></category>
		<category><![CDATA[farmer]]></category>
		<category><![CDATA[google algorithm]]></category>
		<category><![CDATA[panda]]></category>
		<category><![CDATA[searchmetrics]]></category>
		<category><![CDATA[sistrix]]></category>

		<guid isPermaLink="false">http://blog.whatclinic.com/?p=1595</guid>
		<description><![CDATA[Just over a week ago Google&#8217;s Panda algorithm update was rolled out for all &#8220;English language Google users&#8221;. Given that our own traffic is largely English-language, and largely Google sourced, we paid particular attention to both our own traffic, which thankfully has increased since the Panda update, and any reports that were available about how [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_1600" class="wp-caption alignnone" style="width: 310px"><a href="http://blog.whatclinic.com/wp-content/uploads/2011/04/searchmetrics-ehow-co-uk-opi.jpg"><img class="size-medium wp-image-1600" title="searchmetrics-ehow-co-uk-opi" src="http://blog.whatclinic.com/wp-content/uploads/2011/04/searchmetrics-ehow-co-uk-opi-300x115.jpg" alt="SearchMetrics OPI Graph For eHow.co.uk" width="300" height="115" /></a><p class="wp-caption-text">SearchMetrics OPI Graph For eHow.co.uk</p></div>
<p>Just over a week ago Google&#8217;s <a title="high quality sites algorithm update" href="http://googlewebmastercentral.blogspot.com/2011/04/high-quality-sites-algorithm-goes.html">Panda algorithm update</a> was rolled out for all &#8220;English language Google users&#8221;. Given that our own traffic is largely English-language, and largely Google sourced, we paid particular attention to both our own traffic, which thankfully has increased since the Panda update, and any reports that were available about how other sites were being affected.</p>
<p>Two reports in particular got a lot of attention from bloggers and on Twitter. They were from <a title="SearchMetrics Panda UK Update" href="http://blog.searchmetrics.com/us/2011/04/12/googles-panda-update-rolls-out-to-uk/">SearchMetrics</a> and <a title="Sistrix Panda Europe update" href="http://www.sistrix.com/blog/990-google-panda-on-its-way-to-europe.html">Sistrix</a> and they attempted to give some indication of the drop in &#8220;visibility&#8221; of particular websites after the algorithm was rolled out.</p>
<h3>Two Problems With &#8220;Visibility&#8221;</h3>
<p>The first problem with both of these reports is that they measure whether or not certain websites appear in the search results for a number of keywords that the companies are tracking. SearchMetrics for their part say that &#8220;<a title="SearchMetrics US Panda update" href="http://blog.searchmetrics.com/us/2011/03/03/google-farmer-update-whos-really-affected/">Over 55 million domains and 25 million keywords are continuously monitored</a>&#8220;. Sistrix say that their report is &#8220;<a title="Sistrix panda algorithm data" href="http://www.sistrix.com/blog/990-google-panda-on-its-way-to-europe.html">based on a dataset of one million keywords</a>&#8220;.</p>
<p>But what about the visibility the websites in question have for keywords these two companies aren&#8217;t tracking? How much of the websites&#8217; visibility is <em>invisible</em> to SearchMetrics and Sistrix? The answer to these questions started to come quite quickly from companies who appeared on the &#8220;biggest loser&#8221; lists.</p>
<p>Demand Media, owners of eHow.com <a title="demand media panda press release" href="http://ir.demandmedia.com/phoenix.zhtml?c=215358&amp;p=irol-newsArticle&amp;ID=1551166&amp;highlight=">issued a press release</a> aimed at their investors and an accompanying <a title="demand media panda blog post" href="http://www.demandmedia.com/blog/another-statement-about-search-engine-algorithm-changes/">blog post</a> both decrying the inaccuracy of the reports. Doug Scott of DiscountVouchers.co.uk went a step further and <a title="DiscountVouchers.co.uk post panda traffic" href="http://www.holisticsearch.co.uk/2011/04/18/google-panda-discountvouchers/">published pictures</a> of his Google Analytics account to show how &#8220;<a title="Doug Scott statement" href="http://www.acorndomains.co.uk/general-board/85677-discountvouchers-co-uk-not-hit-panda.html">we have lost none of our traffic</a>&#8220;. A number of other companies have since raised their hands and said that they haven&#8217;t been affected nearly as badly as the loser lists suggest.</p>
<p>Out of interest, I looked back at our own organic traffic since January 1st and looked at how many different keywords were used to find our website. We have had over 1.4 million unique visitors to the site since the start of the year, and they used 1 million distinct keywords to find us. There is no way that either SearchMetrics or Sistrix could hope to measure our visibility in the search results accurately with that many different keywords in play. The only people who could release accurate data about any website&#8217;s visibility would be the search engines themselves, and they&#8217;re not likely to do so any time soon.</p>
<h3>Visibility Does Not Equal Traffic</h3>
<p>The second problem is that readers were confusing visibility losses with traffic losses, and SearchMetrics and Sistrix didn&#8217;t do a lot to correct this misconception at the time. In fact SearchMetrics mentioned &#8220;the statistical value of traffic distribution&#8221; and Sistrix mentioned &#8220;click-through rate on specific positions&#8221; as elements of their visibility indices, i.e. they were trying to calculate the websites&#8217; traffic for the tracked keywords based on their search results positions.</p>
<p>As with our own example, many of the sites mentioned will have very high numbers of distinct keywords bringing in small amounts of traffic each, meaning that the SearchMetrics and Sistrix visibility indices bear little or no relation to the sites&#8217; actual traffic. In fact, wary of some of the criticism coming their way SearchMetrics have now published a new blog post saying that &#8220;<a title="searchmetrics explains its OPI" href="Searchmetrics OPI does not calculate the real traffic coming in to web pages">Searchmetrics OPI does not calculate the real traffic coming in to web pages</a>&#8220;.</p>
<h3>Why Publish A Visibility Index Based On Incomplete Data?</h3>
<p>So why would these companies publish visibility loser lists knowing as they must how people would misinterpret them? The answer is clear: links and online mentions. A quick look at Yahoo Site Explorer shows that the SearchMetrics article has <a title="external links in to searchmetrics" href="http://siteexplorer.search.yahoo.com/search?p=blog.searchmetrics.com%2Fus%2F2011%2F04%2F12%2Fgoogles-panda-update-rolls-out-to-uk%2F&amp;bwm=i&amp;bwmo=d">over 670 external links in</a>, and the Sistrix article has <a title="sistrix external links in" href="http://siteexplorer.search.yahoo.com/search?p=www.sistrix.com%2Fblog%2F990-google-panda-on-its-way-to-europe.html&amp;bwm=i&amp;bwmo=d">over 150</a>. SearchMetrics generated 111 comments on their post along with 225 ratings, and Sistrix managed 15 comments. There were also lots of Twitter mentions, although these are harder to measure because of the different URL shortening services used. [Figures correct as of 19th April 2011 @ 10am]</p>
<p>SearchMetrics and Sistrix both managed to hop on a rapidly moving story that was being closely monitored by some of the most-likely-to-link people in the world, the SEO community, and they succeeded brilliantly in creating classic link-bait. Their data sets were used to fill a temporary gap in knowledge and spread like wildfire. I even tweeted about the SearchMetrics report in relation to the loss reported for Qype.co.uk because it was so astonishing. What a pity then to find out that the data behind the reports couldn&#8217;t hope to tell the whole story in relation to the mentioned websites&#8217; total visibility and traffic.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.whatclinic.com/2011/04/how-searchmetrics-and-sistrix-got-it-wrong-and-why.html/feed</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>

