<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Internet Business &#38; Marketing Strategy - Andy Beard &#187; robots.txt</title>
	<atom:link href="http://andybeard.eu/tag/robotstxt/feed" rel="self" type="application/rss+xml" />
	<link>http://andybeard.eu</link>
	<description>Internet Marketing, Lead Acquisition, Online Business Strategy and Social Media with Original Opinion and Loads of Attitude</description>
	<lastBuildDate>Sun, 12 Feb 2012 06:16:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Dell Better At Social Media Than SEO?</title>
		<link>http://andybeard.eu/2529/dell-better-at-social-media-than-seo.html</link>
		<comments>http://andybeard.eu/2529/dell-better-at-social-media-than-seo.html#comments</comments>
		<pubDate>Tue, 08 Dec 2009 20:36:42 +0000</pubDate>
		<dc:creator>Andy Beard</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[SEO Blog]]></category>
		<category><![CDATA[301 redirect]]></category>
		<category><![CDATA[dell]]></category>
		<category><![CDATA[http header]]></category>
		<category><![CDATA[robots.txt]]></category>

		<guid isPermaLink="false">http://andybeard.eu/?p=2529</guid>
		<description><![CDATA[
<p><a href="http://www.bloomberg.com/apps/news?pid=newsarchive&#038;sid=akXzD_6YNHCk">Lots</a> <a href="http://www.techcrunch.com/2009/12/08/online-holiday-spending-reaches-16-billion-social-media-continues-to-influence-purchases/">of</a> <a href="http://www.marketingpilgrim.com/2009/12/social-media-darling-dell-updates-numbers.html">reports</a> are out today about how effectively Dell is using social media marketing and especially Twitter to generate revenue, $6.5M in sales are the headlines, I wonder what that translates to in margins.</p>
<p><a href="http://andybeard.eu/2529/dell-better-at-social-media-than-seo.html" class="more-link">Read more on Dell Better At Social Media Than SEO?&#8230;</a></p>
<div class="topsy_widget_data topsy_theme_brick-red" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fandybeard.eu%252F2529%252Fdell-better-at-social-media-than-seo.html%22%2C%20%22style%22%3A%20%22small%22%2C%20%22title%22%3A%20%22Dell%20Better%20At%20Social%20Media%20Than%20SEO%3F%22%20%7D);"></div>


	Tags: <a href="http://andybeard.eu/tag/301-redirect" title="301 redirect" rel="tag">301 redirect</a>, <a href="http://andybeard.eu/tag/dell" title="dell" rel="tag">dell</a>, <a href="http://andybeard.eu/tag/http-header" title="http header" rel="tag">http header</a>, <a href="http://andybeard.eu/tag/robotstxt" title="robots.txt" rel="tag">robots.txt</a>, <a href="http://andybeard.eu/tag/seo" title="SEO Blog" rel="tag">SEO Blog</a><br />
]]></description>
			<content:encoded><![CDATA[
<p><a href="http://www.bloomberg.com/apps/news?pid=newsarchive&#038;sid=akXzD_6YNHCk">Lots</a> <a href="http://www.techcrunch.com/2009/12/08/online-holiday-spending-reaches-16-billion-social-media-continues-to-influence-purchases/">of</a> <a href="http://www.marketingpilgrim.com/2009/12/social-media-darling-dell-updates-numbers.html">reports</a> are out today about how effectively Dell is using social media marketing and especially Twitter to generate revenue, $6.5M in sales are the headlines, I wonder what that translates to in margins.</p>
<p>However yesterday I read a post <a href="http://peterlaird.blogspot.com/2009/12/ranking-cloud-computing-vendors-2009.html">ranking cloud computing vendors</a> based on mind share using a points system based on Google ranking for the term [cloud computing]</p>
<p>The winner was Rackspace, but <a href="http://andybeard.eu/1879/rackspace-cloud-mosso-link-to-us-payola.html">we have some idea</a> how they achieved that</p>
<p>But I was more interested in the result for Dell</p>
<p><img src="http://cdn5.andybeard.name/wp-content/uploads/dell-no-snippet.png" alt="dell-no-snippet" title="dell-no-snippet" width="486" height="400" class="aligncenter size-full wp-image-2532" /></p>
<p>First off they are doing some funny redirects so the displyed URL in the snippet<br />
www.dell.com/cloudcomputing<br />
Redirects to their cloud computing page on a subdomain which I assume is some kind of CDN</p>
<p>http://content.dell.com/us/en/enterprise/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz</p>
<p>Notice that there is only a URL displayed in the snippet and not an excerpt from the page or meta description. Google is just displaying the page as a reference, but for some reason couldn&#8217;t crawl the page.</p>
<p>Now before I show you what they have done wrong I did try to contact Dell about this yesterday. I sent a tweet to the guy who maintains at least one of their social media profiles, and since this tweet he has been tweeting on his account so I assume he has had a chance to read the tweet I sent.</p>
<p><img src="http://cdn5.andybeard.name/wp-content/uploads/dell-robotstxt.png" alt="dell-robotstxt" title="dell-robotstxt" width="436" height="74" class="aligncenter size-full wp-image-2531" /></p>
<p>I didn&#8217;t receive a response, and as this is a significant problem I am blogging about it, mainly because Google seemed to be doing something contrary to what I previously understood (they weren&#8217;t if you read more) and have seen written by authoritative sources such as Vanessa Fox.</p>
<p>Here is the robots.txt of that Dell subdomain</p>
<div style="font-size:x-small">
User-agent: *<br />
Allow: /at/de/home/<br />
Allow: /au/en/business/<br />
Allow: /ca/en/business/<br />
Allow: /de/de/corp/<br />
Allow: /de/de/home/<br />
Allow: /hk/en/business/<br />
Allow: /ie/en/business/<br />
Allow: /ie/en/home/<br />
Allow: /in/en/business/<br />
Allow: /my/en/business/<br />
Allow: /nz/en/business/<br />
Allow: /sg/en/business/<br />
Allow: /uk/en/business/<br />
Allow: /uk/en/home/<br />
Allow: /us/en/business/<br />
Allow: /us/en/corp/<br />
Allow: /us/en/enterprise/<br />
Allow: /us/en/home/<br />
Allow: http://content.dell.com<br />
Allow: http://content.dell.com/<br />
Allow: http://content.dell.com/au.sitemap.txt<br />
Allow: http://content.dell.com/hk.sitemap.txt<br />
Allow: http://content.dell.com/in.sitemap.txt<br />
Allow: http://content.dell.com/jp.sitemap.txt<br />
Allow: http://content.dell.com/my.sitemap.txt<br />
Allow: http://content.dell.com/nz.sitemap.txt<br />
Allow: http://content.dell.com/sg.sitemap.txt<br />
Disallow: /
</div>
<p>The page concerned us under /us/en/enterprise/ so should be controlled by this directive.<br />
Allow: /us/en/enterprise/</p>
<p>As <a href="http://janeandrobot.com/library/managing-robots-access-to-your-website">Vanessa states</a></p>
<blockquote><p>If conflicts exist in the file, the robot obeys the longest (and therefore generally more specific) line.</p></blockquote>
<p>So somthing is going wrong in between the URL displayed in the search results, and the final landing page.</p>
<h2>Server Header Analysis</h2>
<p>Search result<br />
>> [302] http://www.dell.com/cloudcomputing<br />
>> [302] http://content.dell.com/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz<br />
>> [301] /us/en/enterprise/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz</p>
<div style="font-size:x-small">
<p>http://www.dell.com/cloudcomputing</p>
<p>GET /cloudcomputing HTTP/1.1<br />
Host: www.dell.com<br />
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.0; en-GB; rv:1.9.1.5) Gecko/20091102 Firefox/3.5.5<br />
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8<br />
Accept-Language: en-gb,en;q=0.5<br />
Accept-Encoding: gzip,deflate<br />
Accept-Charset: UTF-8,*<br />
Keep-Alive: 300<br />
Connection: keep-alive<br />
Referer: http://www.google.com/search?hl=en&#038;pws=0&#038;gl=US&#038;q=cloud+computing&#038;start=20&#038;sa=N<br />
Cookie: SITESERVER=ID=6596edc940b547a0afe7a6a1db4e40ee; s_vi=[CS]v1|258EA06C05011703-600001124062905C[CE]; StormPCookie=pl=pl&#038;pc=pl&#038;bandwidth=NA</p>
<p>HTTP/1.x 302 Found<br />
Content-Type: text/html; charset=utf-8<br />
Location: http://content.dell.com/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz<br />
Server: Microsoft-IIS/7.0<br />
Set-Cookie: stop_mobi=; path=/<br />
X-Powered-By: ASP.NET<br />
x-ua-compatible: IE=7<br />
Date: Tue, 08 Dec 2009 19:42:34 GMT<br />
Content-Length: 196</p>
<p>http://content.dell.com/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz</p>
<p>GET /cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz HTTP/1.1<br />
Host: content.dell.com<br />
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.0; en-GB; rv:1.9.1.5) Gecko/20091102 Firefox/3.5.5<br />
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8<br />
Accept-Language: en-gb,en;q=0.5<br />
Accept-Encoding: gzip,deflate<br />
Accept-Charset: UTF-8,*<br />
Keep-Alive: 300<br />
Connection: keep-alive<br />
Referer: http://www.google.com/search?hl=en&#038;pws=0&#038;gl=US&#038;q=cloud+computing&#038;start=20&#038;sa=N<br />
Cookie: SITESERVER=ID=6596edc940b547a0afe7a6a1db4e40ee; s_vi=[CS]v1|258EA06C05011703-600001124062905C[CE]; StormPCookie=pl=pl&#038;pc=pl&#038;bandwidth=NA</p>
<p>HTTP/1.x 301 Moved Permanently<br />
Date: Tue, 08 Dec 2009 19:42:35 GMT<br />
Server: Microsoft-IIS/6.0<br />
X-Powered-By: ASP.NET<br />
X-AspNet-Version: 2.0.50727<br />
Location: /us/en/enterprise/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz<br />
Cache-Control: private<br />
Content-Length: 0</p>
<p>http://content.dell.com/us/en/enterprise/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz</p>
<p>GET /us/en/enterprise/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz HTTP/1.1<br />
Host: content.dell.com<br />
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.0; en-GB; rv:1.9.1.5) Gecko/20091102 Firefox/3.5.5<br />
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8<br />
Accept-Language: en-gb,en;q=0.5<br />
Accept-Encoding: gzip,deflate<br />
Accept-Charset: UTF-8,*<br />
Keep-Alive: 300<br />
Connection: keep-alive<br />
Referer: http://www.google.com/search?hl=en&#038;pws=0&#038;gl=US&#038;q=cloud+computing&#038;start=20&#038;sa=N<br />
Cookie: SITESERVER=ID=6596edc940b547a0afe7a6a1db4e40ee; s_vi=[CS]v1|258EA06C05011703-600001124062905C[CE]; StormPCookie=pl=pl&#038;pc=pl&#038;bandwidth=NA</p>
<p>HTTP/1.x 200 OK<br />
Cache-Control: private,max-age=0<br />
Date: Tue, 08 Dec 2009 19:42:36 GMT<br />
Content-Type: text/html; charset=utf-8<br />
Expires: Wed, 01 Jan 1997 12:00:00 GMT<br />
Server: Microsoft-IIS/6.0<br />
X-Powered-By: ASP.NET<br />
X-AspNet-Version: 2.0.50727<br />
X-Awesomed-By: Thundera RE-TJN<br />
Set-Cookie: lwp=c=us&#038;l=en&#038;s=biz&#038;cs=555; domain=.dell.com; path=/<br />
Set-Cookie: dus=ci=cloud-computing&#038;th=Default; path=/<br />
Content-Encoding: gzip<br />
Vary: Accept-Encoding<br />
Transfer-Encoding: chunked
</p></div>
<h2>Problem &#8211; Robots.txt Disallow Match</h2>
<p>The inbetween URL redirect is blocked by robots.txt</p>
<p>http://content.dell.com/cloud-computing.aspx?c=us&#038;cs=555&#038;l=en&#038;s=biz</p>
<p>It is actually matched by<br />
Allow: http://content.dell.com/<br />
Disallow: /</p>
<p>Google has decided to take the most restrictive version, to disallow the URL.</p>
<p>I haven&#8217;t checked how many links are pointing to that URL both from internal and external sources, but I would guess it is a fair amount and I am assuming at this time that is a page they would want to rank as in their line of business it is probably quite a competitive term.</p>
<p>At a guess without that one URL blocked by robots.txt, they would have been much higher on the comparison charts and ranking higher for cloud computing &#8211; most likely that is a 1st page result thrown away by robots.txt</p>
<p>I haven&#8217;t checked how many other key landing pages are blocked in a similar manner &#8211; I bet it is quite a few</p>
<p>This is just chalking up one more reason why robots.txt should be avoided as it is so easy to mess things up.</p>
<p>p.s. My niece is 18 in 2 months &#038; I am sure <a href="http://andybeard.eu/contact">she would love a Dell</a></p>
<div class="topsy_widget_data topsy_theme_brick-red" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fandybeard.eu%252F2529%252Fdell-better-at-social-media-than-seo.html%22%2C%20%22style%22%3A%20%22small%22%2C%20%22title%22%3A%20%22Dell%20Better%20At%20Social%20Media%20Than%20SEO%3F%22%20%7D);"></div>


	Tags: <a href="http://andybeard.eu/tag/301-redirect" title="301 redirect" rel="tag">301 redirect</a>, <a href="http://andybeard.eu/tag/dell" title="dell" rel="tag">dell</a>, <a href="http://andybeard.eu/tag/http-header" title="http header" rel="tag">http header</a>, <a href="http://andybeard.eu/tag/robotstxt" title="robots.txt" rel="tag">robots.txt</a>, <a href="http://andybeard.eu/tag/seo" title="SEO Blog" rel="tag">SEO Blog</a><br />
]]></content:encoded>
			<wfw:commentRss>http://andybeard.eu/2529/dell-better-at-social-media-than-seo.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Are Blogger &#124; Blogspot Blogs SEO Friendly?</title>
		<link>http://andybeard.eu/1832/blogger-blogspot-blogs-seo.html</link>
		<comments>http://andybeard.eu/1832/blogger-blogspot-blogs-seo.html#comments</comments>
		<pubDate>Sun, 31 May 2009 19:57:59 +0000</pubDate>
		<dc:creator>Andy Beard</dc:creator>
				<category><![CDATA[blogging tips]]></category>
		<category><![CDATA[SEO Blog]]></category>
		<category><![CDATA[301 redirect]]></category>
		<category><![CDATA[blogger]]></category>
		<category><![CDATA[blogspot]]></category>
		<category><![CDATA[custom domain]]></category>
		<category><![CDATA[labels]]></category>
		<category><![CDATA[robots.txt]]></category>

		<guid isPermaLink="false">http://andybeard.eu/?p=1832</guid>
		<description><![CDATA[<strong>Just because Blogger / Blogspot is owned by Google doesn't make it SEO Friendly</strong>

For years I have been reading absurd suggestions that Google's Blogger / Blogspot service is SEO friendly. At one time this was semi-true, and it is even possible with the current incarnation. However if you use the service "as intended", without expert knowledge, you might find yourself up a creek without a paddle.

I am writing this on a Sunday, thus I am going to avoid vulgarities, cussing, and even an attention grabbing headline. The Google engineers at Blogger deserve that and more. Blogspot currently stinks for SEO.]]></description>
			<content:encoded><![CDATA[
<p><strong>Just because Blogger / Blogspot is owned by Google doesn&#8217;t make it SEO Friendly</strong></p>
<p>For years I have been reading absurd suggestions that Google&#8217;s Blogger / Blogspot service is SEO friendly. At one time this was semi-true, and it is even possible with the current incarnation. However if you use the service &#8220;as intended&#8221;, without expert knowledge, you might find yourself up a creek without a paddle.</p>
<p>I am writing this on a Sunday, thus I am going to avoid vulgarities, cussing, and even an attention grabbing headline. The Google engineers at Blogger deserve that and more. Blogspot currently stinks for SEO.</p>
<h2>Google Webmaster Guidelines Mess Up</h2>
<p>This is probably the <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=35769">root cause of the Blogger problems</a></p>
<blockquote><p>Use robots.txt to prevent crawling of search results pages or other auto-generated pages that don&#8217;t add much value for users coming from search engines.</p></blockquote>
<h2>Blogspot Doesn&#8217;t Have Categories</h2>
<p>There was a historical lacking feature in Blogspot, no categories, and users wanted them.</p>
<p>So Blogger introduced labels, but they unfortunately followed the advice from their own Webmaster Guidelines, such as this example from Blogger&#8217;s own Blogger Buzz blog.</p>
<pre>User-agent: Mediapartners-Google
Disallow: 

User-agent: *
Disallow: /search

Sitemap: http://buzz.blogger.com/feeds/posts/default?orderby=updated</pre>
<p>That causes label pages such as this one on the Google Webmaster blog not to be crawled</p>
<p>http://googlewebmastercentral.blogspot.com/search/label/webmaster%20tools</p>
<p>The links to those tag pages don&#8217;t have nofollow, but Google can&#8217;t crawl them to read what is on the page, but pages blocked using robots.txt still accumulate PageRank.</p>
<div id="attachment_1841" class="wp-caption aligncenter" style="width: 468px"><img class="size-full wp-image-1841" title="Blogger blocks labels with Robots.txt" src="http://cdn5.andybeard.name/wp-content/uploads/labels-still-blocked.png" alt="Blogger blocks labels with Robots.txt" width="458" height="212" /><p class="wp-caption-text">Blogger blocks labels with Robots.txt</p></div>
<p>That creates &#8220;<a href="http://andybeard.eu/1121/seo-linking-gotchas-even-the-pros-make.html">hanging or dangling pages</a>&#8221; and the PageRank gets shared out between all the pages in the internet</p>
<p>There are easy solutions to this:-</p>
<h3>Best solution</h3>
<ul>
<li>Have excerpts on the label pages</li>
<li>Possibly noindex the label pages with either meta noindex or robots.txt noindex</li>
<li>Add nofollow to all external links on the label pages, both in the content and sidebar</li>
</ul>
<h3>Stop-gap solution</h3>
<ul>
<li>Nofollow label links from content &#8211; they still wouldn&#8217;t give any deep indexing benefit, but at least they wouldn&#8217;t throw PageRank away.</li>
</ul>
<p>Note: Those solutions are for blogger developers to implement instead of using robots.txt disallow, not for Blogger users. Sorry Blogger users, you have been at a significant disadvantage for years unless you can find a smart (viable) alternative to creating categories by hand.</p>
<h2>Blogspot Indexing Problems</h2>
<p>With labels being blocked by robots.txt, some but not all Blogspot blogs have major indexing problems.</p>
<p>As an example, it doesn&#8217;t seem like Google passes any juice through the dropdown menus used on the Webmaster Central blog.</p>
<div id="attachment_1836" class="wp-caption aligncenter" style="width: 384px"><img class="size-full wp-image-1836" title="Dropdown Menus On Blogspot Could Cause Indexing Problems" src="http://cdn5.andybeard.name/wp-content/uploads/dropdown-menu.png" alt="Dropdown Menus On Blogspot Could Cause Indexing Problems" width="374" height="255" /><p class="wp-caption-text">Dropdown Menus On Blogspot Could Cause Indexing Problems</p></div>
<p>Of couse not every Blogspot publisher displays date based archives, and even the traditional &#8220;older posts / newer posts&#8221; navigation ends up being blocked by the robots.txt file.</p>
<p>As an example this link from <a href="http://tins.rklau.com/">Rick Klau&#8217;s blog</a> (A Blogger Product Manager)</p>
<p>http://tins.rklau.com/search?updated-max=2009-05-04T11%3A53%3A00-07%3A00&#038;max-results=7</p>
<p>You can see by the lack of snippet and cache that this is blocked by robots.txt</p>
<div id="attachment_1837" class="wp-caption aligncenter" style="width: 430px"><img class="size-full wp-image-1837" title="Blogger Older Posts Navigation Blocked With Robots.txt" src="http://cdn5.andybeard.name/wp-content/uploads/older-posts.png" alt="Blogger Older Posts Navigation Blocked With Robots.txt" width="420" height="196" /><p class="wp-caption-text">Blogger Older Posts Navigation Blocked With Robots.txt</p></div>
<p>Fortunately for Rick he doesn&#8217;t allow his labels and pagination links to be the only method for Google to crawl his content, plus he has a huge amount of &#8220;Google Juice&#8221; to play with.</p>
<p>This is actually the best navigation implementation I have seen on any Blogspot blog, as it has allowed all his content to be indexed, or certainly the vast majority.</p>
<div id="attachment_1842" class="wp-caption aligncenter" style="width: 201px"><img class="size-full wp-image-1842" title="blogger-navigation-that-works" src="http://cdn5.andybeard.name/wp-content/uploads/blogger-navigation-that-works.png" alt="Blogger Navigation That Works" width="191" height="364" /><p class="wp-caption-text">Blogger Navigation That Works</p></div>
<p>This is still &#8220;ball linking&#8221; rather than a structured appraoch, but for Blogspot it is about all you could hope to achieve.</p>
<p>By having this massive navigation on every page, effectively turning every page into a sitemap, there are so many link on page that a few external or internal label links don&#8217;t really matter.</p>
<p>This isn&#8217;t the case with the majority of Blogger blogs, and blocking with robots.txt is an abomination.</p>
<p>At the same time Rick has been trying to persuade people to stop using the old, unreliable Blogspot system which allowed you to publish to your own domain, s<a href="http://buzz.blogger.com/2009/05/ftp-vs-custom-domains.html">uggesting that there are no problems</a>.</p>
<p><strong>How Hard Is It To Leave Blogger?</strong></p>
<p>Whilst people talk about it being easy to leave Blogger to another platform, the importer has been bugged for a long time &#8211; Blogger strips stopwords from generated URLs, WordPress doesn&#8217;t.</p>
<p>Then there is only what I would term a &#8220;Interstitial Defamation Wall&#8221;</p>
<div id="attachment_1843" class="wp-caption aligncenter" style="width: 510px"><img class="size-full wp-image-1843" title="blogger-redirects" src="http://cdn5.andybeard.name/wp-content/uploads/blogger-redirects.png" alt="Redirects from Blogspot to a custom domain now hosted externally get slapped with an interstitial warning." width="500" height="354" /><p class="wp-caption-text">Redirects from Blogspot to a custom domain now hosted externally get slapped with an interstitial warning.</p></div>
<p>See for yourself <a href="http://niche-website.blogspot.com">http://niche-website.blogspot.com</a></p>
<p>I had already left Blogspot before Google decided to finally provide custom domain support. I had set up a complicated script with meta redirects, and I even regained my rankings and wrote about it.</p>
<p>When custom domains became available, I decided that it would be best to assign andybeard.eu to my blogspot account, and then point it at WordPress so that visitors and search spiders saw an immediate 301 redirect.</p>
<p>Google when you set this up are very reassuring</p>
<div id="attachment_1844" class="wp-caption aligncenter" style="width: 493px"><img class="size-full wp-image-1844" title="blogger-custom-domain" src="http://cdn5.andybeard.name/wp-content/uploads/blogger-custom-domain.png" alt="Blogger Custom Domain Setup" width="483" height="274" /><p class="wp-caption-text">Blogger Custom Domain Setup</p></div>
<p>It seems this is only true if you stay with Blogger forever. I should also point out that during my testing I discovered that I could no longer assign andybeard.eu, and I was forced to use www.andybeard.eu</p>
<p><strong>Links &amp; Rankings</strong></p>
<p>The main thing that ties any Blogger to using Blogspot are incoming links and rankings. When I moved I probably had 24,000 links to my old Blogspot, depending on which reporting you believed.</p>
<p>Even a couple of weeks ago Yahoo was reporting ~10K , though it seemed a lot less today. Google in Webmaster tools only reports 100 or so, though they may have removed loads due to the redirects.</p>
<p>I don&#8217;t care that much, building up links takes time but isn&#8217;t something that can&#8217;t be fairly easily replaced over time, at least for me.</p>
<p>However for any potential blogging service provider (and honestly anyone running WordPress might soon be able to offer blog hosting within 3-6 months if WordPress core merges with WPMU) this is a serious barrier to overcome.</p>
<p>If you use Blogspot, but with a custom domain from the beginning, this isn&#8217;t a problem, but for everyone else, it might be a deciding factor to stay.</p>
<p>I could switch back to the delayed meta refresh method, but I can&#8217;t see why I should have to &#8211; Google should never have inserted such a message between me and my visitors.</p>
<p>I am looking to <a href="http://andybeard.eu/1816/seo-blogs-need-a-serious-spring-clean.html">clean up some old articles because there is too much SEO junk on the web</a>, but deciding how to handle my old &#8220;How to move from Blogspot to WordPress&#8221; articles are a major problem.</p>
<ul>
<li>I am adamant that people really should still leave Blogger for their own good, if they value their online business</li>
<li>I am not in a position to advise on best practice, and the most suitable method is messed up.</li>
</ul>
<p>As far as I am concerned this interstitial isn&#8217;t passing any juice through to Andybeard.eu, though there is a YES/No link, but it uses onclick, but returns false, but the URL is not constructed&#8230; ask Sebastian if you want a <a href="http://sebastians-pamphlets.com/how-to-turn-click-tracking-into-miserable-failure/">definitive answer</a>.</p>
<div class="topsy_widget_data topsy_theme_brick-red" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fandybeard.eu%252F1832%252Fblogger-blogspot-blogs-seo.html%22%2C%20%22style%22%3A%20%22small%22%2C%20%22title%22%3A%20%22Are%20Blogger%20%7C%20Blogspot%20Blogs%20SEO%20Friendly%3F%22%20%7D);"></div>


	Tags: <a href="http://andybeard.eu/tag/301-redirect" title="301 redirect" rel="tag">301 redirect</a>, <a href="http://andybeard.eu/tag/blogger" title="blogger" rel="tag">blogger</a>, <a href="http://andybeard.eu/tag/blogspot" title="blogspot" rel="tag">blogspot</a>, <a href="http://andybeard.eu/tag/custom-domain" title="custom domain" rel="tag">custom domain</a>, <a href="http://andybeard.eu/tag/labels" title="labels" rel="tag">labels</a>, <a href="http://andybeard.eu/tag/robotstxt" title="robots.txt" rel="tag">robots.txt</a>, <a href="http://andybeard.eu/tag/seo" title="SEO Blog" rel="tag">SEO Blog</a><br />
]]></content:encoded>
			<wfw:commentRss>http://andybeard.eu/1832/blogger-blogspot-blogs-seo.html/feed</wfw:commentRss>
		<slash:comments>83</slash:comments>
		</item>
		<item>
		<title>Google Reconsideration or Reinclusion Request</title>
		<link>http://andybeard.eu/1247/google-reconsideration-request.html</link>
		<comments>http://andybeard.eu/1247/google-reconsideration-request.html#comments</comments>
		<pubDate>Tue, 26 Feb 2008 19:58:17 +0000</pubDate>
		<dc:creator>Andy Beard</dc:creator>
				<category><![CDATA[blogging tips]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[SEO Blog]]></category>
		<category><![CDATA[nofollow]]></category>
		<category><![CDATA[paid links]]></category>
		<category><![CDATA[paid posts]]></category>
		<category><![CDATA[paid reviews]]></category>
		<category><![CDATA[reconsideration request]]></category>
		<category><![CDATA[reinclusion request]]></category>
		<category><![CDATA[robots.txt]]></category>

		<guid isPermaLink="false">http://andybeard.eu/2008/02/google-reconsideration-request.html</guid>
		<description><![CDATA[Now Google have <a href="http://smackdown.blogsblogsblogs.com/2008/02/21/google-reconsideration-request-language-modified-again/">changed their wording for reconsideration requests</a> (formerly reinclusion requests), I have filed one for this domain.

I am not going to call this a perfect example of a reconsideration request, but I decided that it was better to be 100% honest about my thought process for both now and in the future, because whilst I am now <a href="http://andybeard.eu/2008/02/paid-reviews-red-flag.html">blocking paid reviews with robots.txt</a>, there are so many things still not specified within the webmaster guidelines that it is a potential minefield, especially for someone who has previously been the target of a manual penalty.

Here is exactly what I sent to the Google webmaster team.]]></description>
			<content:encoded><![CDATA[
<p>Now Google have <a href="http://smackdown.blogsblogsblogs.com/2008/02/21/google-reconsideration-request-language-modified-again/">changed their wording for reconsideration requests</a> (formerly reinclusion requests), I have filed one for this domain.<br />
(note to Google, why isn&#8217;t Michael&#8217;s permalink ranking?)</p>
<p>I am not going to call this a perfect example of a reconsideration request, but I decided that it was better to be 100% honest about my thought process for both now and in the future, because whilst I am now <a href="http://andybeard.eu/2008/02/paid-reviews-red-flag.html">blocking paid reviews with robots.txt</a>, there are so many things still not specified within the webmaster guidelines that it is a potential minefield, especially for someone who has previously been the target of a manual penalty.</p>
<p>Here is exactly what I sent to the Google webmaster team.</p>
<blockquote><p>I honestly still believe I didn&#8217;t break the spirit of the webmaster guidelines, the webmaster guidelines as most frequently described by Google employees on official duties in regards to paid links and reviews, and even the &#8220;letter of the law&#8221;.</p>
<p>I have a very high rejection rate on paid reviews, approaching 80% &#8211; that shows editorial discretion far beyond many if not all paid directories.<br />
Content was always highly targeted to my audience</p>
<p>I retained editorial control of links &#8211; in all my blogging I give good search engine friendly links that are descriptive of the target &#8211; such practice is effectively law in the UK and Europe, though who is responsible for regulation isn&#8217;t certain.</p>
<p>Almost a year ago, when Google first made it possible to report a site for paid links, I reported myself with a request for clarification &#8211; at that time how my reviews were regarded by Google was not clearly specified, in many ways it still isn&#8217;t.</p>
<p>Adam Lasnik previously suggested multiple times that sites which were predominately high quality content were not your target for penalties &#8211; paid reviews content currently represents less than 3% of my original content &#8211; just 9 reviews</p>
<p>I strive to provide an example of paid review content that is benefiting readers in general, and has a reason to be indexed and counted as editorial content.<br />
The compensation I receive is more a token gesture, like a box of chocolates to say thanks, as the time I spend on them means I would earn more flipping burgers in McDs &#8211; how could that class as paid links? It doesn&#8217;t even cover my time. </p>
<p>Penalties have not been handed out evenly, I know Googlers read blogs that have written paid reviews where the links were not blocked in any way, and have even commented on the specific reviews. Those sites remain unpunished.</p>
<p>Googlers continually promote Google services from their private blogs, and certainly gain financial compensation from increased stock prices. Just today Matt wrote the following post without a specific disclaimer that he is a Google employee.<br />
<a href="http://www.mattcutts.com/blog/adding-new-features-to-google/">http://www.mattcutts.com/blog/adding-new-features-to-google/</a><br />
Surely such posts should have a nofollow to Google &#8211; it has purely promotional (though helpful) intent.</p>
<p>All that being said, I had no intention to break the Google guidelines, and if what is required for the Google Toolbar to truthfully depict the authority of my website is for my editorial links in paid reviews to be blocked from Google in some way, I am going to comply.</p>
<p>All permalinks to my paid reviews are now blocked using robots.txt &#8211; I have checked that this is the case within webmaster tools</p>
<p>Where excerpts of my articles appear on duplicate content pages, all links are nofollowed, such as on tags pages <a href="http://andybeard.eu/tag/volusion_review">http://andybeard.eu/tag/volusion_review</a></p>
<p>Ultimately the Google search results will be poorer quality, because a good paid review is a better result than an article based upon a press release, or a SEO optimized press release itself. I would argue that the links are of higher value as well.</p>
<p>There are still going to be cases where what I write will be within the grey area not defined by your webmaster guidelines.</p>
<p>I earn money from Google through Adsense &#8211; should I nofollow every link to Google and Google services now?</p>
<p>I have paid advertisers, many are purchased by my regular readers or services I frequently write about in an editorial manner. Do I now need to nofollow every past current and future link to them because I have accepted a small amount of money for display advertising?</p>
<p>Do I need to nofollow affiliate links? I can&#8217;t see an automatic way that Google can tell the difference between an affiliate link, and a paid link that has a tracking parameter.<br />
It is somewhat strange that Google has provided help for merchants in cleaning up search results with affiliate links using redirects, and thus gaining an SEO benefit from them. Affiliate links very frequently are not editorial endorsements.</p>
<p>My content gets syndicated often on authority sites such as Webpronews and Searchnewz, and many lesser sites &#8211; I trust that if they choose to publish my syndicated content as an editorial decision, that the fact that content was originally a paid review is no longer a problem.</p>
<p>Best regards</p>
<p>Andy Beard</p></blockquote>
<p>Before filing it I checked that the changes I previously made had taken effect and the pages were actually blocked.</p>
<p>I also made changes to my <a href="http://andybeard.eu/disclosure-policy">disclosure policy</a> to make sure that it was 100% clear to anyone from Google performing a manual inspection that all links in paid reviews <b>from my domain</b> will not affect search results.</p>
<p>This was important based on the discussion regarding <a href="http://www.seo-scoop.com/2007/12/02/admission-of-guilt-will-no-longer-be-required-for-google-reconsideration-request/">Donna&#8217;s reconsideration request</a>.</p>
<h3>Something For The Naysayers</h3>
<p>There were people who for some reason thought that blocking my paid reviews using robots.txt would for some reason be extremely harmful to my search traffic.<br />
It is true that a document that can&#8217;t be indexed cannot rank for long-tail phrases within it, but pages blocked with robots.txt can still rank in Google.</p>
<p><img src='http://cdn5.andybeard.name/wp-content/uploads/robotstxt-ranking.png' alt='Robots.txt Blocked Can Still Rank' /></p>
<p>That is just a snapshot of a SERP, it is certainly bouncing around a little and the position is changing daily.</p>
<p>That being said, that page was previously ranking 12th, and whilst it isn&#8217;t a high traffic term, it is quite competitive with lots of theme and plugin authors also attracting lots of links.<br />
I haven&#8217;t done as much as I could do to promote the page because it is a paid review.</p>
<p>In addition I have flattened my internal linking structure over the last week &#8211; frequent detractors obviously wouldn&#8217;t look on that as a <a href="http://andybeard.eu/2008/02/pagerank-google-search-ranking-factor.html">major search ranking factor</a>.</p>
<div class="topsy_widget_data topsy_theme_brick-red" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fandybeard.eu%252F1247%252Fgoogle-reconsideration-request.html%22%2C%20%22style%22%3A%20%22small%22%2C%20%22title%22%3A%20%22Google%20Reconsideration%20or%20Reinclusion%20Request%22%20%7D);"></div>


	Tags: <a href="http://andybeard.eu/tag/google" title="Google" rel="tag">Google</a>, <a href="http://andybeard.eu/tag/nofollow" title="nofollow" rel="tag">nofollow</a>, <a href="http://andybeard.eu/tag/paid-links" title="paid links" rel="tag">paid links</a>, <a href="http://andybeard.eu/tag/paid-posts" title="paid posts" rel="tag">paid posts</a>, <a href="http://andybeard.eu/tag/paid-reviews" title="paid reviews" rel="tag">paid reviews</a>, <a href="http://andybeard.eu/tag/reconsideration-request" title="reconsideration request" rel="tag">reconsideration request</a>, <a href="http://andybeard.eu/tag/reinclusion-request" title="reinclusion request" rel="tag">reinclusion request</a>, <a href="http://andybeard.eu/tag/robotstxt" title="robots.txt" rel="tag">robots.txt</a><br />
]]></content:encoded>
			<wfw:commentRss>http://andybeard.eu/1247/google-reconsideration-request.html/feed</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>SEO Linking Gotchas Even The Pros Make</title>
		<link>http://andybeard.eu/1121/seo-linking-gotchas-even-the-pros-make.html</link>
		<comments>http://andybeard.eu/1121/seo-linking-gotchas-even-the-pros-make.html#comments</comments>
		<pubDate>Tue, 27 Nov 2007 20:38:09 +0000</pubDate>
		<dc:creator>Andy Beard</dc:creator>
				<category><![CDATA[blogging tips]]></category>
		<category><![CDATA[SEO Blog]]></category>
		<category><![CDATA[wordpress]]></category>
		<category><![CDATA[digg]]></category>
		<category><![CDATA[goog]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google PageRank]]></category>
		<category><![CDATA[Internal Linking]]></category>
		<category><![CDATA[Linking Structure]]></category>
		<category><![CDATA[matt cutts]]></category>
		<category><![CDATA[meta follow]]></category>
		<category><![CDATA[meta nofollow]]></category>
		<category><![CDATA[meta noindex]]></category>
		<category><![CDATA[nofollow]]></category>
		<category><![CDATA[pagerank]]></category>
		<category><![CDATA[robots.txt]]></category>
		<category><![CDATA[search engine optimization]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://andybeard.eu/2007/11/seo-linking-gotchas-even-the-pros-make.html</guid>
		<description><![CDATA[<p>I am going to attempt to debunk almost every Wordpress SEO &#034;Expert&#034; article ever written, and in some respects this article even debunks some of the things I have written in the past.</p>
<p>This article does not reference Google Toolbar PageRank in any way</p>
<p>First of all you are going to need to do a little homework.</p>
<h3>Eric Enge interview with Matt Cutts</h3>
<p>The <a href="http://www.stonetemple.com/articles/interview-matt-cutts.shtml">Eric Enge interview with Matt Cutts</a> was truly exceptional and revealed a number of gotchas that for some reason continue to be circulated.</p>
<p>Key takeaways</p>
<p>
Matt Cutts: &#8230; Now, robots.txt says you are not allowed to crawl a page, and Google</p>]]></description>
			<content:encoded><![CDATA[
<p><a href="http://www.semmys.org/"><img style="border: 0;" src="http://cdn5.andybeard.name/wp-content/uploads/lg_blue_runner_up.gif" alt="2008 SEMMY Runner-Up" align="right" /></a> I am going to attempt to debunk almost every WordPress SEO &#8220;Expert&#8221; article ever written, and in some respects this article even debunks some of the things I have written in the past.</p>
<p><b>This article does not reference Google Toolbar PageRank in any way</b></p>
<p>First of all you are going to need to do a little homework.</p>
<h3>Eric Enge interview with Matt Cutts</h3>
<p>The <a href="http://www.stonetemple.com/articles/interview-matt-cutts.shtml">Eric Enge interview with Matt Cutts</a> was truly exceptional and revealed a number of gotchas that for some reason continue to be circulated.</p>
<p><b>Key takeaways</b></p>
<blockquote><p>
<strong>Matt Cutts: &#8230; </strong>Now, robots.txt says you are not allowed to crawl a page, and Google therefore does not crawl pages that are forbidden in robots.txt. However, they can accrue PageRank, and they can be returned in our search results.</p></blockquote>
<blockquote><p>
<strong>Matt Cutts: &#8230;</strong> So, with robots.txt for good reasons we&#8217;ve shown the reference even if we can&#8217;t crawl it, whereas if we crawl a page and find a Meta tag that says NoIndex, we won&#8217;t even return that page. For better or for worse that&#8217;s the decision that we&#8217;ve made. I believe Yahoo and Microsoft might handle NoIndex slightly differently which is little unfortunate, but everybody gets to choose how they want to handle different tags.</p>
<p><strong>Eric Enge:</strong> Can a NoIndex page accumulate PageRank?</p>
<p><strong>Matt Cutts:</strong> A NoIndex page can accumulate PageRank, because the links are still followed outwards from a NoIndex page.</p>
<p><strong>Eric Enge:</strong> So, it can accumulate and pass PageRank.</p>
<p><strong>Matt Cutts:</strong> Right, and it will still accumulate PageRank, but it won&#8217;t be showing in our Index. So, I wouldn&#8217;t make a NoIndex page that itself is a dead end. You can make a NoIndex page that has links to lots of other pages.</p>
<p>For example you might want to have a master Sitemap page and for whatever reason NoIndex that, but then have links to all your sub Sitemaps.
</p></blockquote>
<p>I have just provided a couple of highlights, I am not attempting to replace a need for visiting the site I am citing. This is something I hate seeing, when people take other people&#8217;s content and repurpose it, thus making the original article worthless.<br />
There are a few other gotchas in there, <strong>I suggest you read it 2 or 3 times</strong> to really understand what was said, and what wasn&#8217;t said.</p>
<h3>Dangling Pages</h3>
<p>One of the best descriptions of <a href="http://www.webworkshop.net/pagerank.html">dangling pages</a> is on the Webworkshop site, though they are assuming that links are totally taken out of the equation based on what they quote from the PageRank paper.</p>
<blockquote><p>
&#8220;Dangling links are simply links that point to any page with no outgoing links. They affect the model because it is not clear where their weight should be distributed, and there are a large number of them. Often these dangling links are simply pages that we have not downloaded yet&#8230;&#8230;&#8230;.Because dangling links do not affect the ranking of any other page directly, we simply remove them from the system until all the PageRanks are calculated. After all the PageRanks are calculated they can be added back in without affecting things significantly.&#8221; &#8211; extract from the original PageRank paper by Googleâ€™s founders, Sergey Brin and Lawrence Page.
</p></blockquote>
<p><b>Alternate interpretation</b></p>
<p><i>This is just an aside, as the amount of juice lost to dangling pages currently is hard to determine, and could be handled differently</i></p>
<p>They are assuming that if page A links to 6 other pages, 5 of them being dangling links, then the website will be treated as only having 2 pages until the end of the calculation.</p>
<p>Whilst I haven&#8217;t delved into the maths (and probably couldn&#8217;t through lack of information and lack of knowledge), it also seems to me that at the time the pages are taken out of the cyclic calculation, a percentage of the link value can still be taken with them.</p>
<p>Thus though the site for cyclic calculations will be just 2 pages, the link from A to B might only transfer 1/6 of the juice on each cycle.</p>
<p>At the time the original paper was written, Google only had a small proportion of the web indexed due to hardware and operating system restraints.<br />
In modern times they have a lot more indexed, thus a more complex way of handling dangling pages could be possible.</p>
<p>More food for thought, a link to a page that is considered supplemental could be treated as a full link or as a link to a dangling page, or some other variant.</p>
<p>Even more food for thought, a site with multiple interlinked pages with no external links at all could be looked on as a &#8220;dangling site&#8221;.</p>
<p><i>Ultimately what is important is that dangling pages are a juice leak, though it is difficult to determine exactly how much</i></p>
<h3>Additional Research On Link Juice Flow</h3>
<p>I have referenced these works before, and I am just going to keep on referring people to them.</p>
<ul>
<li><a href="http://www.seofaststart.com/download">SEOFastStart by Dan Thies</a> &#8211; a good introduction to SEO, and also introduces the ideas of controlling juice around a website &#8211; no email signup required</li>
<li><a href="http://www.revengeofthemininet.com/">Revenge of the Mininet by Michael Campbell</a> &#8211; a timeless classic as long as PageRank continues to be important &#8211; the download page isn&#8217;t hidden if you really don&#8217;t want to sign up to Michael&#8217;s mailing list, but I have been on his list for years.</li>
<li>Dynamic Linking by Leslie Rhode &#8211; A bonus that comes with Revenge of the Mininet</li>
</ul>
<p>I mentioned these is a comment on SEOmoz recently in a discussion on PageRank, and for some reason my comment received just 2 up votes and one down vote.</p>
<p>I don&#8217;t gain in any material way from promoting these free ebooks, though I might gain some goodwill. The main reason I link to them is because they are a superb resource, and it saves me countless hours writing beginners material.</p>
<p><b>OK, On to some debunking</b></p>
<h3>Blocking Pages With Robots.txt Creates Dangling Pages On The First Tier</h3>
<p>In the quoted paragraph above, Matt clearly states that pages blocked with Robots.txt still accumulate juice from the links they receive.</p>
<p><strong>Those pages don&#8217;t have any external 2nd tier links that are visible to a &#8216;bot, thus they are dangling pages.</strong></p>
<p>How much juice they leak depends on how Google currently factor in dangling pages, but Matt himself suggests not to create dangling pages.</p>
<p>If you read any SEO Guide that suggests that the ultimate cure for duplicate content is to block it with robots.txt, I suggest you might want to question the author about dangling pages.</p>
<h3>Meta NoIndex Follow Duplicate Content</h3>
<p>This is a better solution than using Robots.txt, because it doesn&#8217;t create dangling pages. Links on a duplicate content page are still followed, however both internal and external links are followed and thus are leaks, often multiple leaks for the same piece of content when using CMS systems such as WordPress which create site-wide links in the sidebar when using poorly designed themes, plugins, and especially WordPress Widgets.</p>
<p>If you read an article suggesting using Meta Noindex Follow, ask the author how they are controlling external links on duplicate content pages.</p>
<h3>Meta NoIndex Nofollow Duplicate Content</h3>
<p>If you use Meta Noindex Nofollow, whilst this is handled slightly differently by Google to Robots.txt, as the page won&#8217;t appear in search results, it is still a page accumulating Google Juice if you link to it, another dangling page or node.<br />
Second tier leaks from the page won&#8217;t leak, but the page as a whole will leak depending on how Google are currently handling dangling pages.</p>
<p>I don&#8217;t see people recommending this frequently, but as with Robots.txt, ask the author about dangling pages.</p>
<h3>Dynamic Linking &#038; rel=&#8221;nofollow&#8221;</h3>
<p>Extensive use of Nofollow and other forms of dynamic linking are the only way to effectively prevent duplicate content pages in some way having a effect on your internal linking structure and juice flow. The Wikipedia page on <a href="http://en.wikipedia.org/wiki/Nofollow">Nofollow</a> really isn&#8217;t correct.</p>
<h3>The Dangling Sales Page</h3>
<p>To finish I want to give you an example of how a sales page that previously might have benefited from lots of links can easily be turned into a dangling page and effectively discounted from cyclic PageRank calculations.</p>
<p><strong>Sales pages started off just as a single page with no links:-</strong></p>
<p><img src='http://cdn5.andybeard.name/wp-content/uploads/single-page.png' alt='Single Page' /></p>
<p>Despite all the links coming to the site from external sources, this website is a dangling page, thus excluded from iterative PageRank calculations. It might still benefit from anchor text and other factors, but it effectively is not part of Google&#8217;s global mesh and passes on no influence.</p>
<p><strong>Add Legal Paperwork And Reciprocal Links Directory:-</strong></p>
<p><img src='http://cdn5.andybeard.name/wp-content/uploads/sales-letter-variant.png' alt='Sales Letter Variant with Reciprocal Link Directory' /></p>
<p>A much more structured site, and whilst it gains some benefit from reciprocating links there are 2 factors that are almost universally overlooked.</p>
<ol>
<li><strong>No Longer A Dangling Page</strong> &#8211; because the site now has external links, it is valid as part of the global ranking calculations. Other pages as mentioned above were previously stating that the amount of juice passed to dangling pages was minimal, so this could be potentially a huge boost.</li>
<li><strong>More Pages Indexed</strong> &#8211; it is only a few pages, but with PageRank it is often not just how much juice you have flowing into a site, but what you do with it.</li>
</ol>
<p>The reciprocal low quality links might not have had a huge amount of value compared to the benefit of being a member of the &#8220;iteration club&#8221; and having a few more pages indexed.</p>
<p><strong>Add a link to the designer</strong></p>
<p><img src='http://cdn5.andybeard.name/wp-content/uploads/single-page-with-designer-credit.png' alt='Single Page With Designer Credit' /></p>
<p>Some early single page sales letters were not dangling pages, but didn&#8217;t benefit from any internal iterations, and acted as a conduit of juice to their web design firm.</p>
<p><strong>The Danger of Using Nofollow or Robots.txt on Unimportant Pages</strong></p>
<p><img src='http://cdn5.andybeard.name/wp-content/uploads/sales-letter-nofollowed.png' alt='The Danger of Using Nofollow or Robots.txt on Unimportant Pages' /></p>
<p>I have actually seen this on a few sites:-</p>
<ul>
<li>Reciprocal Link Directory Removed</li>
<li>Link to web designer removed</li>
<li>Nofollow added to legal papers that are looked on as being unimportant</li>
</ul>
<p>Such a website is now out of the iteration club, it is a dangling page as it is no longer voting on other pages.</p>
<h3>My Own Gotcha</h3>
<p>I mentioned that this catches me out as well.</p>
<p>A while ago I wrote an article about <a href="http://andybeard.eu/2007/03/blog_ranking.html">linking to Technorati</a> being a problem. It might still be true, but the amount of juice lost through such links might also be lower than I thought, due to Technorati using meta nofollow on every page. Technorati tag pages are themselves dangling pages with no external links.</p>
<p>Wikipedia and Digg on the other hand are not dangling pages. They still have external links to other sites, and thus any links to them are part of iterative calculations. </p>
<p>I would still say it is best to have tags pointing to your own domain tag pages, and to use nofollow on links to Wikipedia and Digg, though with Digg I suggest that is only on links to submission pages which contain no content.</p>
<p>Stumbleupon is also tricky &#8211; there are no external links from individual pages, but there is extensive internal linking.</p>
<p>With Digg and Stumbleupon, profiles rank extremely well, so you can use them for reputation management even if you get no juice direct from the profile.</p>
<p>I think I was the first to describe <a href="http://andybeard.eu/2007/01/wikipedia-nofollow-plugin-wikidigg.html">Wikipedia as a black hole of link equity</a>, explained <a href="http://andybeard.eu/2007/01/exactly-why-nofollow-at-wikipedia-is-bad.html">why you should nofollow Wikipedia</a> extensively, and was one of the first to promote <a href="http://whatjapanthinks.com/wikipedia-nofollow/">Ken&#8217;s Nofollow Wikipedia plugin</a>.</p>
<p>You would have thought in 10 months they would have come up with an alternative to using nofollow on all those out-bound links.</p>
<p>They do however link out to a few trusted sites without nofollow, from just a few pages. I suppose Google does still allow them to be part of their iterative calculations.</p>
<h3>Another Own Gotcha</h3>
<p>This isn&#8217;t 100% something I can fix. I have suggested people use robots.txt on certain sites knowing it wasn&#8217;t the perfect solution.</p>
<p>You might notice on this site I don&#8217;t use an extensive robots.txt, and the <a href="http://andybeard.eu/2007/06/wordpress-seo-masterclass-for-competitive-niches.html">design of my site structure</a> is deliberate, but then at the same time I use nofollow with lots of custom theme modifications, and should use it a lot more.</p>
<p>Eventually I will come up with solutions to make things a little easier.</p>
<h3>Tools In The Wrong Hands Can Be Dangerous</h3>
<p><strong>Using Robots.txt and Meta Noindex, Follow as a cure for duplicate content is a SEO bodge job or SEO bandaid. It may offer some benefits depending on how dangling pages are being handled, but is certainly not an ideal solution due to the amount of leaks that typically remain or dangling pages that are created. </strong></p>
<div class="topsy_widget_data topsy_theme_brick-red" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fandybeard.eu%252F1121%252Fseo-linking-gotchas-even-the-pros-make.html%22%2C%20%22shorturl%22%3A%20%22http%3A%2F%2Fbit.ly%2Fb10ivR%22%2C%20%22style%22%3A%20%22small%22%2C%20%22title%22%3A%20%22SEO%20Linking%20Gotchas%20Even%20The%20Pros%20Make%22%20%7D);"></div>


	Tags: <a href="http://andybeard.eu/tag/digg" title="digg" rel="tag">digg</a>, <a href="http://andybeard.eu/tag/goog" title="goog" rel="tag">goog</a>, <a href="http://andybeard.eu/tag/google" title="Google" rel="tag">Google</a>, <a href="http://andybeard.eu/tag/google-pagerank" title="Google PageRank" rel="tag">Google PageRank</a>, <a href="http://andybeard.eu/tag/internal-linking" title="Internal Linking" rel="tag">Internal Linking</a>, <a href="http://andybeard.eu/tag/linking-structure" title="Linking Structure" rel="tag">Linking Structure</a>, <a href="http://andybeard.eu/tag/matt-cutts" title="matt cutts" rel="tag">matt cutts</a>, <a href="http://andybeard.eu/tag/meta-follow" title="meta follow" rel="tag">meta follow</a>, <a href="http://andybeard.eu/tag/meta-nofollow" title="meta nofollow" rel="tag">meta nofollow</a>, <a href="http://andybeard.eu/tag/meta-noindex" title="meta noindex" rel="tag">meta noindex</a>, <a href="http://andybeard.eu/tag/nofollow" title="nofollow" rel="tag">nofollow</a>, <a href="http://andybeard.eu/tag/pagerank" title="pagerank" rel="tag">pagerank</a>, <a href="http://andybeard.eu/tag/robotstxt" title="robots.txt" rel="tag">robots.txt</a>, <a href="http://andybeard.eu/tag/search-engine-optimization" title="search engine optimization" rel="tag">search engine optimization</a>, <a href="http://andybeard.eu/tag/seo" title="SEO Blog" rel="tag">SEO Blog</a>, <a href="http://andybeard.eu/tag/wikipedia" title="wikipedia" rel="tag">wikipedia</a><br />
]]></content:encoded>
			<wfw:commentRss>http://andybeard.eu/1121/seo-linking-gotchas-even-the-pros-make.html/feed</wfw:commentRss>
		<slash:comments>98</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: basic
Database Caching 34/68 queries in 0.013 seconds using disk: basic
Object Caching 1407/1483 objects using disk: basic
Content Delivery Network via cdn5.andybeard.name

Served from: andybeard.eu @ 2012-02-13 05:40:24 -->
