<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Internet Business &#38; Marketing Strategy - Andy Beard &#187; Canonicalization</title>
	<atom:link href="http://andybeard.eu/tag/canonicalization/feed" rel="self" type="application/rss+xml" />
	<link>http://andybeard.eu</link>
	<description>Internet Marketing, Lead Acquisition, Online Business Strategy and Social Media with Original Opinion and Loads of Attitude</description>
	<lastBuildDate>Sun, 12 Feb 2012 06:16:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>BUG: Google Now Counting Open Graph Tags For Canonicalization?</title>
		<link>http://andybeard.eu/3692/open-graph-canonicalization.html</link>
		<comments>http://andybeard.eu/3692/open-graph-canonicalization.html#comments</comments>
		<pubDate>Sun, 12 Feb 2012 05:57:14 +0000</pubDate>
		<dc:creator>Andy Beard</dc:creator>
				<category><![CDATA[SEO Blog]]></category>
		<category><![CDATA[Canonicalization]]></category>

		<guid isPermaLink="false">http://andybeard.eu/?p=3692</guid>
		<description><![CDATA[
<p>First of all, I want to offer my condolences to <a href="https://twitter.com/#!/realbkbrown">Bobbi Kristina Brown</a> on the death of her mother.</p>
<p>When I heard the news I immediately went to Google to find out more information.</p>
<p><a href="http://andybeard.eu/3692/open-graph-canonicalization.html" class="more-link">Read more on BUG: Google Now Counting Open Graph Tags For Canonicalization?&#8230;</a></p>


	Tags: <a href="http://andybeard.eu/tag/canonicalization" title="Canonicalization" rel="tag">Canonicalization</a><br />
]]></description>
			<content:encoded><![CDATA[
<p>First of all, I want to offer my condolences to <a href="https://twitter.com/#!/realbkbrown">Bobbi Kristina Brown</a> on the death of her mother.</p>
<p>When I heard the news I immediately went to Google to find out more information.</p>
<p>I saw this search result:</p>
<p><img src="http://cdn5.andybeard.name/wp-content/uploads/whitney-SEO-600x475.png" alt="" title="whitney-SEO" width="600" height="475" class="aligncenter size-medium wp-image-3693" /></p>
<p>At the time that link went directly to <a href="http://www.whitneyhouston.com/us/home">http://www.whitneyhouston.com/us/home</a></p>
<p>(it now redirects to <a href="http://www.whitneyhouston.com/us/remembering">http://www.whitneyhouston.com/us/remembering</a> which may end up with similar issues)</p>
<p>I checked the robots.txt to see if there was an issue:</p>
<pre class="brush: plain; title: ; notranslate">
#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these &quot;robots&quot; where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used:    http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/wc/robots.html
#
# For syntax checking, see:
# http://www.sxw.org.uk/computing/robots/check.html

User-agent: *
Crawl-delay: 10
# Directories
Disallow: /includes/
Disallow: /misc/
Disallow: /modules/
Disallow: /profiles/
Disallow: /scripts/
Disallow: /themes/
# Files
Disallow: /CHANGELOG.txt
Disallow: /cron.php
Disallow: /INSTALL.mysql.txt
Disallow: /INSTALL.pgsql.txt
Disallow: /install.php
Disallow: /INSTALL.txt
Disallow: /LICENSE.txt
Disallow: /MAINTAINERS.txt
Disallow: /update.php
Disallow: /UPGRADE.txt
Disallow: /xmlrpc.php
# Paths (clean URLs)
Disallow: /admin/
Disallow: /comment/reply/
Disallow: /logout/
Disallow: /node/add/
Disallow: /search/
Disallow: /user/register/
Disallow: /user/password/
Disallow: /user/login/
# Paths (no clean URLs)
Disallow: /?q=admin/
Disallow: /?q=comment/reply/
Disallow: /?q=logout/
Disallow: /?q=node/add/
Disallow: /?q=search/
Disallow: /?q=user/password/
Disallow: /?q=user/register/
Disallow: /?q=user/login/
# Friendlist Links
Disallow: /friendlist/add/
Disallow: /us/friendlist/add/
Disallow: /ar/friendlist/add/
Disallow: /au/friendlist/add/
Disallow: /at/friendlist/add/
Disallow: /be/friendlist/add/
Disallow: /br/friendlist/add/
Disallow: /ca/friendlist/add/
Disallow: /co/friendlist/add/
Disallow: /fi/friendlist/add/
Disallow: /fr/friendlist/add/
Disallow: /de/friendlist/add/
Disallow: /gr/friendlist/add/
Disallow: /hk/friendlist/add/
Disallow: /ie/friendlist/add/
Disallow: /it/friendlist/add/
Disallow: /jp/friendlist/add/
Disallow: /my/friendlist/add/
Disallow: /nl/friendlist/add/
Disallow: /nz/friendlist/add/
Disallow: /ph/friendlist/add/
Disallow: /pl/friendlist/add/
Disallow: /pt/friendlist/add/
Disallow: /ru/friendlist/add/
Disallow: /sg/friendlist/add/
Disallow: /es/friendlist/add/
Disallow: /se/friendlist/add/
Disallow: /ch-de/friendlist/add/
Disallow: /tw/friendlist/add/
Disallow: /tr/friendlist/add/
Disallow: /uk/friendlist/add/
Disallow: /th/friendlist/add/
</pre>
<p>No issue that I can see there, unless there was something wrong with the redirect.</p>
<p>The redirect was a clean 301, and nothing that I could see in the headers would cause anything to be blocked.</p>
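<p>The robots.txt check can also be repeated offline. The following is a minimal Python sketch using the standard library <code>urllib.robotparser</code> module. The shortened rule list and the helper name <code>is_allowed</code> are my own assumptions for illustration, and it only shows how a standards-following parser reads the rules, not how Googlebot itself behaves.</p>

```python
from urllib.robotparser import RobotFileParser

# A shortened copy of the rules quoted above (assumption: the omitted
# Disallow lines do not cover the home page either).
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10
Disallow: /includes/
Disallow: /admin/
Disallow: /search/
Disallow: /us/friendlist/add/
"""


def is_allowed(url, user_agent="Googlebot"):
    """Return True if the rules above let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "http://www.whitneyhouston.com/us/home",
        "http://www.whitneyhouston.com/admin/",
    ):
        print(url, is_allowed(url))
```

<p>Under these rules the home page path is not covered by any Disallow line, which matches what I saw when reading the file by hand.</p>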
<p>The head of the page was also quite clean:</p>
<pre class="brush: plain; title: ; notranslate">
&lt;meta http-equiv=&quot;Content-Type&quot; content=&quot;text/html; charset=utf-8&quot; /&gt;
  &lt;meta http-equiv=&quot;Content-Type&quot; content=&quot;text/html; charset=utf-8&quot; /&gt;
&lt;script type=&quot;text/javascript&quot; src=&quot;http://adm.fwmrm.net/p/sonymusic_live/AdManager.js&quot;&gt;&lt;/script&gt;
&lt;meta name=&quot;loginMethod&quot; content=&quot;anonymous&quot;/&gt;
&lt;meta name=&quot;siteSection&quot; content=&quot;home&quot;/&gt;
&lt;meta property=&quot;og:site_name&quot; content=&quot;The Official Whitney Houston Site&quot;/&gt;
&lt;meta property=&quot;og:title&quot; content=&quot;Whitney Houston&quot;/&gt;
&lt;meta property=&quot;og:type&quot; content=&quot;&quot;/&gt;
&lt;meta property=&quot;og:url&quot; content=&quot;http://www.whitneyhouston.com&quot;/&gt;
&lt;meta property=&quot;og:description&quot; content=&quot;Check out Whitney Houston at http://www.whitneyhouston.com&quot;/&gt;
&lt;link rel=&quot;shortcut icon&quot; href=&quot;http://www.whitneyhouston.com/sites/whouston/files/favicon_2.ico&quot; type=&quot;image/x-icon&quot; /&gt;
&lt;meta name=&quot;description&quot; content=&quot;Official Whitney Houston website featuring Whitney Houston news, music, videos, album info, tour dates and more. &quot; /&gt;

&lt;meta name=&quot;keywords&quot; content=&quot;Whitney Houston&quot; /&gt;

  &lt;title&gt;Whitney Houston | The Official Whitney Houston Site&lt;/title&gt;
  &lt;link type=&quot;text/css&quot; rel=&quot;stylesheet&quot; media=&quot;all&quot; href=&quot;/sites/all/modules/contrib/views/css/views.css?8&quot; /&gt;
&lt;link type=&quot;text/css&quot; rel=&quot;stylesheet&quot; media=&quot;all&quot; href=&quot;http://www.whitneyhouston.com/sites/whouston/files/css/css_c1430b97100c9d627ca5594b4b0016ee.css&quot; /&gt;
  &lt;meta http-equiv=&quot;X-UA-Compatible&quot; content=&quot;IE=8&quot; /&gt;
</pre>
<p>It is not conclusive, but when I tried visiting the site as Googlebot nothing funny happened. They could potentially still be doing additional validation and treating me differently from the real Googlebot, but that is rare.</p>
<p>The only thing I can think of that might be an issue is this:</p>
<pre class="brush: plain; title: ; notranslate">
&lt;meta property=&quot;og:url&quot; content=&quot;http://www.whitneyhouston.com&quot;/&gt;
</pre>
<p>The Facebook linter/debugger actually throws an error if you have an og:url that redirects to a page that redirects back. Facebook shouldn&#8217;t treat that as an error, because any brand may switch landing pages and URLs but still want to retain votes on a single canonical URL. I hit this issue when the uQast sales funnel moved pages, and we have potentially lost hundreds of likes.</p>
<p><img src="http://cdn5.andybeard.name/wp-content/uploads/Circular-Redirect.png" alt="" title="Circular-Redirect" width="510" height="211" class="aligncenter size-full wp-image-3694" /></p>
<p>However, when I had these issues on that sales funnel it didn&#8217;t affect Google, maybe because we had a canonical set to the actual landing page, which Google took as the preference.</p>
<p>I have never seen Google treat an Open Graph tag as rel canonical, but it is the only potential issue I can see.</p>
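<p>To make the suspected mismatch concrete, here is a rough Python sketch that pulls og:url and rel canonical out of a page head and reports any that point somewhere other than the URL actually served. The class and function names are hypothetical, the comparison ignores everything except a trailing slash, and it says nothing about how Google really weighs these hints.</p>

```python
from html import unescape
from html.parser import HTMLParser

# Two tags from the head quoted above, kept entity-escaped as in that
# listing and decoded before parsing.
SAMPLE_HEAD = unescape(
    "&lt;meta property='og:url' content='http://www.whitneyhouston.com'/&gt;"
    "&lt;meta property='og:title' content='Whitney Houston'/&gt;"
)


class UrlHintParser(HTMLParser):
    """Collect og:url and rel=canonical values from a document head."""

    def __init__(self):
        super().__init__()
        self.hints = {}

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("property") == "og:url":
            self.hints["og:url"] = attrs.get("content")
        if tag == "link" and attrs.get("rel") == "canonical":
            self.hints["canonical"] = attrs.get("href")


def url_hints(head_html):
    """Return a dict of the URL hints declared in head_html."""
    parser = UrlHintParser()
    parser.feed(head_html)
    parser.close()
    return parser.hints


def conflicting_hints(served_url, head_html):
    """Return the hints that name a URL other than the one served."""
    served = served_url.rstrip("/")
    return {
        name: value
        for name, value in url_hints(head_html).items()
        if value and value.rstrip("/") != served
    }


if __name__ == "__main__":
    print(conflicting_hints("http://www.whitneyhouston.com/us/home", SAMPLE_HEAD))
```

<p>For the page in question the only declared hint is the og:url, and it names the bare domain rather than the /us/home page that was being served.</p>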
<p>There may be something funky happening with geolocation, but Google doesn&#8217;t seem to be picking that up as it should either.</p>
<p>You also won&#8217;t find Whitney&#8217;s other pages on Google easily, which is partially caused by this indexation error on the primary domain.</p>
<p><a href="http://www.myspace.com/whitneyhouston">Whitney Houston on Myspace</a><br />
<a href="http://www.facebook.com/WhitneyHouston">Whitney Houston on Facebook</a></p>
<p>I think this might be a rare bug in canonicalization &#8211; some of the localization and redirects happening are not exactly ideal, but shouldn&#8217;t be preventing indexation in this way.</p>
<p>p.s. I am deliberately not trying to grab search traffic for this sad event &#8211; I just want fans to be able to find a place to pay their respects. My first attempt (on Google+) <a href="https://plus.google.com/u/0/102279602913916787678/posts/2o71ZEk7kpg">to get word out to Googlers</a> has so far not had a response.</p>


	Tags: <a href="http://andybeard.eu/tag/canonicalization" title="Canonicalization" rel="tag">Canonicalization</a><br />
]]></content:encoded>
			<wfw:commentRss>http://andybeard.eu/3692/open-graph-canonicalization.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>rel=&quot;author&quot; is Same-Site Only</title>
		<link>http://andybeard.eu/3596/relauthor.html</link>
		<comments>http://andybeard.eu/3596/relauthor.html#comments</comments>
		<pubDate>Tue, 07 Jun 2011 17:55:07 +0000</pubDate>
		<dc:creator>Andy Beard</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[SEO Blog]]></category>
		<category><![CDATA[web 2.0]]></category>
		<category><![CDATA[accessibility]]></category>
		<category><![CDATA[attribution]]></category>
		<category><![CDATA[Canonicalization]]></category>
		<category><![CDATA[rel-author]]></category>
		<category><![CDATA[relauthor]]></category>

		<guid isPermaLink="false">http://andybeard.eu/?p=3596</guid>
		<description><![CDATA[
<p>After Google announced rel="author" support, I managed to ping Google&#8217;s @mattcutts on Twitter, and he clarified the use case a little.</p>
<p>As Twitter&#8217;s search is still so terrible at finding things, I am adding the conversation here.</p>
<p><a href="http://andybeard.eu/3596/relauthor.html" class="more-link">Read more on rel="author" is Same-Site Only&#8230;</a></p>


	Tags: <a href="http://andybeard.eu/tag/accessibility" title="accessibility" rel="tag">accessibility</a>, <a href="http://andybeard.eu/tag/attribution" title="attribution" rel="tag">attribution</a>, <a href="http://andybeard.eu/tag/canonicalization" title="Canonicalization" rel="tag">Canonicalization</a>, <a href="http://andybeard.eu/tag/google" title="Google" rel="tag">Google</a>, <a href="http://andybeard.eu/tag/rel-author" title="rel-author" rel="tag">rel-author</a>, <a href="http://andybeard.eu/tag/relauthor" title="relauthor" rel="tag">relauthor</a><br />
]]></description>
			<content:encoded><![CDATA[
<p>After Google announced rel="author" support, I managed to ping Google&#8217;s @mattcutts on Twitter, and he clarified the use case a little.</p>
<p>As Twitter&#8217;s search is still so terrible at finding things, I am adding the conversation here.</p>
<!-- tweet id : 78134332937609216 --><div id='bbpBox_78134332937609216' class='bbpBox'><p>New rel="author" support <a href="http://goo.gl/FCK3l" rel="nofollow">http://goo.gl/FCK3l</a> ( @<a href="http://twitter.com/intent/user?screen_name=mattcutts" class="twitter-action">mattcutts</a> is this suitable for cross domain attribution too for syndicated content? )</p><p><a href='http://twitter.com/intent/user?screen_name=AndyBeard'>@AndyBeard</a> (Andy Beard), <a title='tweeted on June 7, 2011 5:20 pm' href='http://twitter.com/#!/AndyBeard/status/78134332937609216'>June 7, 2011 5:20 pm</a> via web</p></div><!-- end of tweet -->
<!-- tweet id : 78138848982482945 --><div id='bbpBox_78138848982482945' class='bbpBox'><p>@<a href="http://twitter.com/intent/user?screen_name=AndyBeard" class="twitter-action">AndyBeard</a> for now it's same-site, just to be safe. My (personal) guess is we'll see if that can be expanded over time in a trusted way.</p><p><a href='http://twitter.com/intent/user?screen_name=mattcutts'>@mattcutts</a> (Matt Cutts), <a title='tweeted on June 7, 2011 5:38 pm' href='http://twitter.com/#!/mattcutts/status/78138848982482945'>June 7, 2011 5:38 pm</a> via web</p></div><!-- end of tweet -->
<!-- tweet id : 78139667706413056 --><div id='bbpBox_78139667706413056' class='bbpBox'><p>@<a href="http://twitter.com/intent/user?screen_name=mattcutts" class="twitter-action">mattcutts</a> thanks for the clarification &amp; intended current use</p><p><a href='http://twitter.com/intent/user?screen_name=AndyBeard'>@AndyBeard</a> (Andy Beard), <a title='tweeted on June 7, 2011 5:42 pm' href='http://twitter.com/#!/AndyBeard/status/78139667706413056'>June 7, 2011 5:42 pm</a> via web</p></div><!-- end of tweet -->
<!-- tweet id : 78140760536530944 --><div id='bbpBox_78140760536530944' class='bbpBox'><p>@<a href="http://twitter.com/intent/user?screen_name=AndyBeard" class="twitter-action">AndyBeard</a> sure thing. Remember, rel=canonical also started as same-site only, then as we trusted it more, it became cross-site.</p><p><a href='http://twitter.com/intent/user?screen_name=mattcutts'>@mattcutts</a> (Matt Cutts), <a title='tweeted on June 7, 2011 5:46 pm' href='http://twitter.com/#!/mattcutts/status/78140760536530944'>June 7, 2011 5:46 pm</a> via web</p></div><!-- end of tweet -->
<!-- tweet id : 78141827135455232 --><div id='bbpBox_78141827135455232' class='bbpBox'><p>@<a href="http://twitter.com/intent/user?screen_name=mattcutts" class="twitter-action">mattcutts</a> I can't sneak a rel="canonical" into an author bio link, or ask content partners such as @<a href="http://twitter.com/intent/user?screen_name=WebProNews" class="twitter-action">WebProNews</a> to include it</p><p><a href='http://twitter.com/intent/user?screen_name=AndyBeard'>@AndyBeard</a> (Andy Beard), <a title='tweeted on June 7, 2011 5:50 pm' href='http://twitter.com/#!/AndyBeard/status/78141827135455232'>June 7, 2011 5:50 pm</a> via web</p></div><!-- end of tweet -->
<p>My last point is at least partially related to Google&#8217;s Panda update: it is quite common, possibly more so than before, for original content not to rank while scraped copies of it do.</p>
<p>There are reasons why that happens, but a microformat rel="author" and possibly something new&#8230; rel="original" for a link to the canonical source would be useful.</p>
<p>Something like this would be easier to implement than the <a href="http://searchnewscentral.com/2010111790/General-SEO/source-attribution-meta-tags-solving-news-syndication-duplicate-content-issues.html">metatag alternative currently in testing with newspapers</a> (original-source &amp; syndication-source).</p>
<p>It would be really easy to implement in a number of CMSs, though in most cases it would be theme-dependent rather than part of core.</p>


	Tags: <a href="http://andybeard.eu/tag/accessibility" title="accessibility" rel="tag">accessibility</a>, <a href="http://andybeard.eu/tag/attribution" title="attribution" rel="tag">attribution</a>, <a href="http://andybeard.eu/tag/canonicalization" title="Canonicalization" rel="tag">Canonicalization</a>, <a href="http://andybeard.eu/tag/google" title="Google" rel="tag">Google</a>, <a href="http://andybeard.eu/tag/rel-author" title="rel-author" rel="tag">rel-author</a>, <a href="http://andybeard.eu/tag/relauthor" title="relauthor" rel="tag">relauthor</a><br />
]]></content:encoded>
			<wfw:commentRss>http://andybeard.eu/3596/relauthor.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The Ultimate WordPress Htaccess File?</title>
		<link>http://andybeard.eu/612/the-ultimate-wordpress-htaccess-file.html</link>
		<comments>http://andybeard.eu/612/the-ultimate-wordpress-htaccess-file.html#comments</comments>
		<pubDate>Mon, 02 Apr 2007 16:11:32 +0000</pubDate>
		<dc:creator>Andy Beard</dc:creator>
				<category><![CDATA[blogging tips]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[mininet]]></category>
		<category><![CDATA[SEO Blog]]></category>
		<category><![CDATA[vre]]></category>
		<category><![CDATA[wordpress]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[Canonicalization]]></category>
		<category><![CDATA[htaccess]]></category>
		<category><![CDATA[htaccess code]]></category>
		<category><![CDATA[Htaccess WordPress]]></category>
		<category><![CDATA[mod rewrite]]></category>
		<category><![CDATA[search engine optimization]]></category>
		<category><![CDATA[trailing slashes]]></category>

		<guid isPermaLink="false">http://andybeard.eu/2007/04/the-ultimate-wordpress-htaccess-file.html</guid>
		<description><![CDATA[<p>Canonicalization is a serious problem for webmasters; just read this <a href="http://www.mattcutts.com/blog/canonicalization-update/">latest entry by Matt Cutts</a> or this <a href="http://www.johnon.com/267/trailing-slash-seo.html">great post from John Andrews</a>. However, telling webmasters that they should fix these issues isn't enough: webmasters &#038; bloggers need solutions.</p>
<h3>In Search of the Ultimate Htaccess file</h3>
<p>A couple of months ago Alister Cameron posted a <a href="http://www.alistercameron.com/2007/01/12/two-wordpress-plugins-you-dont-need-and-shouldnt-use/">simple solution to .htaccess</a> such that you didn't need to use a plugin to convert URLs using www to URLs without.</p>
<p>At the time I suggested a couple of improvements, and also mentioned I would post about it here on my blog, hopefully to help</p>]]></description>
			<content:encoded><![CDATA[
<p>Canonicalization is a serious problem for webmasters; just read this <a href="http://www.mattcutts.com/blog/canonicalization-update/">latest entry by Matt Cutts</a> or this <a href="http://www.johnon.com/267/trailing-slash-seo.html">great post from John Andrews</a>. However, telling webmasters that they should fix these issues isn&#8217;t enough: <b>webmasters &#038; bloggers need solutions</b>.</p>
<h3>In Search of the Ultimate Htaccess file</h3>
<p>A couple of months ago Alister Cameron posted a <a href="http://www.alistercameron.com/2007/01/12/two-wordpress-plugins-you-dont-need-and-shouldnt-use/">simple solution to .htaccess</a> such that you didn&#8217;t need to use a plugin to convert URLs using www to URLs without.</p>
<p>At the time I suggested a couple of improvements, and also mentioned I would post about it here on my blog, hopefully to help develop what could be looked on as the &#8220;Ultimate&#8221; .htaccess file for WordPress, something you could just drop in your root folder and be done with it.<br />
My own interest was in running multiple niche websites using WordPress as a CMS, so I really wanted to avoid anything that would make the content look dated.</p>
<div class="important" style="width:400px;">I am not an htaccess guru, and this is all cobbled together from code suggested by other people in various places.<br />
<b>Before using any of this code, make a backup of your existing .htaccess, and be prepared to copy it back if testing proves something is broken.</b></div>
<p>Let&#8217;s start off with the default .htaccess for WordPress once you turn on mod_rewrite for SEO-friendly URLs:</p>
<pre class="brush: plain; title: ; notranslate">
# BEGIN WordPress
&lt;IfModule mod_rewrite.c&gt;
RewriteEngine On
RewriteBase /
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule . /index.php [L]
&lt;/IfModule&gt;
# END WordPress
</pre>
<p>The first thing we want to do is get rid of the www if someone uses it. I know there are two schools of thought on whether URLs should have www by default or not. I prefer without, and never type www unless I can&#8217;t access a site without it (broken htaccess).</p>
<p>Secondly, we also want to get rid of trailing slash problems.</p>
<p>The base rules that Alister first suggested are below. Note that, as written, they redirect to the www hostname rather than away from it:</p>
<pre class="brush: plain; title: ; notranslate">
&lt;IfModule mod_rewrite.c&gt;
RewriteEngine On
RewriteCond %{HTTP_HOST} !^www.alistercameron.com$ [NC]
RewriteRule ^(.*)$ http://www.alistercameron.com/$1 [R=301,L]

RewriteCond %{REQUEST_URI} ^/[^\.]+[^/]$
RewriteRule ^(.*)$ http://%{HTTP_HOST}/$1/ [R=301,L]
&lt;/IfModule&gt;
</pre>
<p>However, we want this to be the <b>Ultimate htaccess code</b>, so we don&#8217;t want to have to enter the domain name. I am not sure whether this will work if you have multiple blogs in sub-folders.</p>
<p>The code below uses HTTP_HOST rather than a URL added manually to every .htaccess file you create. If you are setting up 50 blogs (niche marketers do things like this, and fill them with unique original content &#8211; not everyone creates splogs) then being able to use one default file is a major advantage.</p>
<pre class="brush: plain; title: ; notranslate">
# If subdomain www exists, remove it first
RewriteCond %{HTTP_HOST} ^www\.([^\.]+\.[^\.]+)$ [NC]
RewriteRule ^(.*)$ http://%1/$1 [R=301,L]
</pre>
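<p>One possible variation, offered as an untested sketch: the pattern above only matches hosts with exactly two labels after the www (such as www.example.com), so a host like www.example.co.uk would not be redirected. Assuming you want www stripped from any host name, the capture group could simply take everything after the prefix. The example hosts are only illustrations.</p>
<pre class="brush: plain; title: ; notranslate">
# Sketch: strip a leading www. from any host name, however many labels follow
# (assumes RewriteEngine On has already been set earlier in the file)
RewriteCond %{HTTP_HOST} ^www\.(.+)$ [NC]
RewriteRule ^(.*)$ http://%1/$1 [R=301,L]
</pre>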
<p>Can we improve on the trailing slashes code?</p>
<p>Possibly&#8230;</p>
<p>A while ago I was also reading a post over on Aaron Wall&#8217;s SEO Book blog. Within the comments were suggested improvements to the code Aaron posted.<br />
Finding the exact reference is a problem, as it wasn&#8217;t <a href="http://www.seobook.com/archives/001714.shtml">on this thread</a>.</p>
<p>Searching on a phrase in the code these days only brings up a reference on Alister&#8217;s blog where I mentioned it in the comments, so I have no idea who to attribute this htaccess code to.</p>
<pre class="brush: plain; title: ; notranslate">
# If requested resource does not exist as a file
RewriteCond %{REQUEST_FILENAME} !-f
# and does not end with a period followed by a filetype
RewriteCond %{REQUEST_URI} !\..+$
# and does not end with a slash
RewriteCond %{REQUEST_URI} !/$
# then add a trailing slash and redirect
RewriteRule (.*) $1/ [R=301,L]
</pre>
<p><b>I am not an htaccess guru</b>, but this seems to take more potential situations into account, such as files to download.</p>
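<p>A possible tightening of the file-type test, again as a sketch rather than tested code: assuming the intent is to skip only URLs whose last path segment ends in an extension (for example /downloads/report.zip, a made-up path), the condition can be limited to a dot followed by letters and digits at the very end of the URI. That way a dot earlier in the path does not stop the trailing slash being added. It assumes RewriteEngine On and RewriteBase / are set elsewhere in the same file.</p>
<pre class="brush: plain; title: ; notranslate">
# If requested resource does not exist as a file
RewriteCond %{REQUEST_FILENAME} !-f
# and the last path segment does not end in .something (letters and digits only)
RewriteCond %{REQUEST_URI} !\.[a-zA-Z0-9]+$
# and does not end with a slash
RewriteCond %{REQUEST_URI} !/$
# then add a trailing slash and redirect
RewriteRule ^(.*)$ $1/ [R=301,L]
</pre>
<p>One trade-off to be aware of: a permalink slug that itself ends in a dot plus letters or digits, such as /version-1.5, would still be treated as a file and left without a trailing slash.</p>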
<p>If you put all this code together, you end up with something like this:</p>
<pre class="brush: plain; title: ; notranslate">
&lt;IfModule mod_rewrite.c&gt;
RewriteEngine On
# If subdomain www exists, remove it first
RewriteCond %{HTTP_HOST} ^www\.([^\.]+\.[^\.]+)$ [NC]
RewriteRule ^(.*)$ http://%1/$1 [R=301,L]

# If requested resource does not exist as a file
RewriteCond %{REQUEST_FILENAME} !-f
# and does not end with a period followed by a filetype
RewriteCond %{REQUEST_URI} !\..+$
# and does not end with a slash
RewriteCond %{REQUEST_URI} !/$
# then add a trailing slash and redirect
RewriteRule (.*) $1/ [R=301,L]
&lt;/IfModule&gt;

# BEGIN WordPress
&lt;IfModule mod_rewrite.c&gt;
RewriteEngine On
RewriteBase /
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule . /index.php [L]
&lt;/IfModule&gt;

# END WordPress
</pre>
<p><strong>So can anyone offer any constructive improvements?</strong></p>
<p>If you offer improvements, please provide code samples and explain exactly why each one is an improvement so people can learn from it (as I said, I am a newbie at this).<br />
Code can be entered using code tags in square brackets.</p>


	Tags: <a href="http://andybeard.eu/tag/blogging" title="blogging" rel="tag">blogging</a>, <a href="http://andybeard.eu/tag/blogging-tips" title="blogging tips" rel="tag">blogging tips</a>, <a href="http://andybeard.eu/tag/canonicalization" title="Canonicalization" rel="tag">Canonicalization</a>, <a href="http://andybeard.eu/tag/htaccess" title="htaccess" rel="tag">htaccess</a>, <a href="http://andybeard.eu/tag/htaccess-code" title="htaccess code" rel="tag">htaccess code</a>, <a href="http://andybeard.eu/tag/htaccess-wordpress" title="Htaccess WordPress" rel="tag">Htaccess WordPress</a>, <a href="http://andybeard.eu/tag/mod-rewrite" title="mod rewrite" rel="tag">mod rewrite</a>, <a href="http://andybeard.eu/tag/search-engine-optimization" title="search engine optimization" rel="tag">search engine optimization</a>, <a href="http://andybeard.eu/tag/seo" title="SEO Blog" rel="tag">SEO Blog</a>, <a href="http://andybeard.eu/tag/trailing-slashes" title="trailing slashes" rel="tag">trailing slashes</a><br />
]]></content:encoded>
			<wfw:commentRss>http://andybeard.eu/612/the-ultimate-wordpress-htaccess-file.html/feed</wfw:commentRss>
		<slash:comments>40</slash:comments>
		</item>
	</channel>
</rss>

