SEO Linking Gotchas Even The Pros Make

I am going to attempt to debunk almost every Wordpress SEO “Expert” article ever written, and in some respects this article even debunks some of the things I have written in the past.

This article does not reference Google Toolbar PageRank in any way

First of all you are going to need to do a little homework.

Eric Enge interview with Matt Cutts

The Eric Enge interview with Matt Cutts was truly exceptional and revealed a number of gotchas that for some reason continue to be circulated.

Key takeaways

Matt Cutts: … Now, robots.txt says you are not allowed to crawl a page, and Google therefore does not crawl pages that are forbidden in robots.txt. However, they can accrue PageRank, and they can be returned in our search results.

Matt Cutts: … So, with robots.txt for good reasons we’ve shown the reference even if we can’t crawl it, whereas if we crawl a page and find a Meta tag that says NoIndex, we won’t even return that page. For better or for worse that’s the decision that we’ve made. I believe Yahoo and Microsoft might handle NoIndex slightly differently which is little unfortunate, but everybody gets to choose how they want to handle different tags.

Eric Enge: Can a NoIndex page accumulate PageRank?

Matt Cutts: A NoIndex page can accumulate PageRank, because the links are still followed outwards from a NoIndex page.

Eric Enge: So, it can accumulate and pass PageRank.

Matt Cutts: Right, and it will still accumulate PageRank, but it won’t be showing in our Index. So, I wouldn’t make a NoIndex page that itself is a dead end. You can make a NoIndex page that has links to lots of other pages.

For example you might want to have a master Sitemap page and for whatever reason NoIndex that, but then have links to all your sub Sitemaps.

I have just provided a couple of highlights, I am not attempting to replace a need for visiting the site I am citing. This is something I hate seeing, when people take other people’s content and repurpose it, thus making the original article worthless.
There are a few other gotchas in there, I suggest you read it 2 or 3 times to really understand what was said, and what wasn’t said.

Dangling Pages

One of the best descriptions of dangling pages is on the Webworkshop site, though they are assuming that links are totally taken out of the equation based on what they quote from the PageRank paper.

“Dangling links are simply links that point to any page with no outgoing links. They affect the model because it is not clear where their weight should be distributed, and there are a large number of them. Often these dangling links are simply pages that we have not downloaded yet……….Because dangling links do not affect the ranking of any other page directly, we simply remove them from the system until all the PageRanks are calculated. After all the PageRanks are calculated they can be added back in without affecting things significantly.” – extract from the original PageRank paper by Google’s founders, Sergey Brin and Lawrence Page.

Alternate interpretation

This is just an aside, as the amount of juice lost to dangling pages currently is hard to determine, and could be handled differently

They are assuming that if page A links to 6 other pages, 5 of them being dangling links, then the website will be treated as only having 2 pages until the end of the calculation.

Whilst I haven’t delved into the maths (and probably couldn’t through lack of information and lack of knowledge), it also seems to me that at the time the pages are taken out of the cyclic calculation, a percentage of the link value can still be taken with them.

Thus though the site for cyclic calculations will be just 2 pages, the link from A to B might only transfer 1/6 of the juice on each cycle.

At the time the original paper was written, Google only had a small proportion of the web indexed due to hardware and operating system restraints.
In modern times they have a lot more indexed, thus a more complex way of handling dangling pages could be possible.

More food for thought, a link to a page that is considered supplemental could be treated as a full link or as a link to a dangling page, or some other variant.

Even more food for thought, a site with multiple interlinked pages with no external links at all could be looked on as a “dangling site”.

Ultimately what is important is that dangling pages are a juice leak, though it is difficult to determine exactly how much

Additional Research On Link Juice Flow

I have referenced these works before, and I am just going to keep on referring people to them.

  • SEOFastStart by Dan Thies – a good introduction to SEO, and also introduces the ideas of controlling juice around a website – no email signup required
  • Revenge of the Mininet by Michael Campbell – a timeless classic as long as PageRank continues to be important – the download page isn’t hidden if you really don’t want to sign up to Michael’s mailing list, but I have been on his list for years.
  • Dynamic Linking by Leslie Rhode – A bonus that comes with Revenge of the Mininet

I mentioned these is a comment on SEOmoz recently in a discussion on PageRank, and for some reason my comment received just 2 up votes and one down vote.

I don’t gain in any material way from promoting these free ebooks, though I might gain some goodwill. The main reason I link to them is because they are a superb resource, and it saves me countless hours writing beginners material.

OK, On to some debunking

Blocking Pages With Robots.txt Creates Dangling Pages On The First Tier

In the quoted paragraph above, Matt clearly states that pages blocked with Robots.txt still accumulate juice from the links they receive.

Those pages don’t have any external 2nd tier links that are visible to a ‘bot, thus they are dangling pages.

How much juice they leak depends on how Google currently factor in dangling pages, but Matt himself suggests not to create dangling pages.

If you read any SEO Guide that suggests that the ultimate cure for duplicate content is to block it with robots.txt, I suggest you might want to question the author about dangling pages.

Meta NoIndex Follow Duplicate Content

This is a better solution than using Robots.txt, because it doesn’t create dangling pages. Links on a duplicate content page are still followed, however both internal and external links are followed and thus are leaks, often multiple leaks for the same piece of content when using CMS systems such as Wordpress which create site-wide links in the sidebar when using poorly designed themes, plugins, and especially Wordpress Widgets.

If you read an article suggesting using Meta Noindex Follow, ask the author how they are controlling external links on duplicate content pages.

Meta NoIndex Nofollow Duplicate Content

If you use Meta Noindex Nofollow, whilst this is handled slightly differently by Google to Robots.txt, as the page won’t appear in search results, it is still a page accumulating Google Juice if you link to it, another dangling page or node.
Second tier leaks from the page won’t leak, but the page as a whole will leak depending on how Google are currently handling dangling pages.

I don’t see people recommending this frequently, but as with Robots.txt, ask the author about dangling pages.

Dynamic Linking & rel=”nofollow”

Extensive use of Nofollow and other forms of dynamic linking are the only way to effectively prevent duplicate content pages in some way having a effect on your internal linking structure and juice flow. The Wikipedia page on Nofollow really isn’t correct.

The Dangling Sales Page

To finish I want to give you an example of how a sales page that previously might have benefited from lots of links can easily be turned into a dangling page and effectively discounted from cyclic PageRank calculations.

Sales pages started off just as a single page with no links:-

Single Page

Despite all the links coming to the site from external sources, this website is a dangling page, thus excluded from iterative PageRank calculations. It might still benefit from anchor text and other factors, but it effectively is not part of Google’s global mesh and passes on no influence.

Add Legal Paperwork And Reciprocal Links Directory:-

Sales Letter Variant with Reciprocal Link Directory

A much more structured site, and whilst it gains some benefit from reciprocating links there are 2 factors that are almost universally overlooked.

  1. No Longer A Dangling Page – because the site now has external links, it is valid as part of the global ranking calculations. Other pages as mentioned above were previously stating that the amount of juice passed to dangling pages was minimal, so this could be potentially a huge boost.
  2. More Pages Indexed – it is only a few pages, but with PageRank it is often not just how much juice you have flowing into a site, but what you do with it.

The reciprocal low quality links might not have had a huge amount of value compared to the benefit of being a member of the “iteration club” and having a few more pages indexed.

Add a link to the designer

Single Page With Designer Credit

Some early single page sales letters were not dangling pages, but didn’t benefit from any internal iterations, and acted as a conduit of juice to their web design firm.

The Danger of Using Nofollow or Robots.txt on Unimportant Pages

The Danger of Using Nofollow or Robots.txt on Unimportant Pages

I have actually seen this on a few sites:-

  • Reciprocal Link Directory Removed
  • Link to web designer removed
  • Nofollow added to legal papers that are looked on as being unimportant

Such a website is now out of the iteration club, it is a dangling page as it is no longer voting on other pages.

My Own Gotcha

I mentioned that this catches me out as well.

A while ago I wrote an article about linking to Technorati being a problem. It might still be true, but the amount of juice lost through such links might also be lower than I thought, due to Technorati using meta nofollow on every page. Technorati tag pages are themselves dangling pages with no external links.

Wikipedia and Digg on the other hand are not dangling pages. They still have external links to other sites, and thus any links to them are part of iterative calculations.

I would still say it is best to have tags pointing to your own domain tag pages, and to use nofollow on links to Wikipedia and Digg, though with Digg I suggest that is only on links to submission pages which contain no content.

Stumbleupon is also tricky – there are no external links from individual pages, but there is extensive internal linking.

With Digg and Stumbleupon, profiles rank extremely well, so you can use them for reputation management even if you get no juice direct from the profile.

I think I was the first to describe Wikipedia as a black hole of link equity, explained why you should nofollow Wikipedia extensively, and was one of the first to promote Ken’s Nofollow Wikipedia plugin.

You would have thought in 10 months they would have come up with an alternative to using nofollow on all those out-bound links.

They do however link out to a few trusted sites without nofollow, from just a few pages. I suppose Google does still allow them to be part of their iterative calculations.

Another Own Gotcha

This isn’t 100% something I can fix. I have suggested people use robots.txt on certain sites knowing it wasn’t the perfect solution.

You might notice on this site I don’t use an extensive robots.txt, and the design of my site structure is deliberate, but then at the same time I use nofollow with lots of custom theme modifications, and should use it a lot more.

Eventually I will come up with solutions to make things a little easier.

Tools In The Wrong Hands Can Be Dangerous

Using Robots.txt and Meta Noindex, Follow as a cure for duplicate content is a SEO bodge job or SEO bandaid. It may offer some benefits depending on how dangling pages are being handled, but is certainly not an ideal solution due to the amount of leaks that typically remain or dangling pages that are created.

Related posts

This entry was posted in SEO Blog, blogging tips, wordpress and tagged , , , , , , , , , , , , , , , . Bookmark the permalink. Both comments and trackbacks are currently closed.

61 Comments

  1. Igor The Troll (123 comments.)
    Posted November 27, 2007 at 8:56 pm | Permalink

    Andy, excelent post. The best analsyis of the matter I have seen! Will recomend it to Roy Cumbrowski to have a look at.

    I also have a matter with Akismet.
    http://www.phsdl.net/phsdl-vs-akismet-complaint.php

    Thank you,
    Igor

    • Andy Beard (1682 comments.)
      Posted November 27, 2007 at 9:05 pm | Permalink

      I have never been an Akismet fan http://andybeard.eu/tag/akismet

      As to the article, I feel I had to write what I had to write, as it is no good pointing this kind of thing in comments on other blogs.

      I just don’t want people to think that if they have used robots.txt or noindex follow for their duplicate content issues that they have “SEOed” their site.

  2. Igor The Troll (123 comments.)
    Posted November 27, 2007 at 9:11 pm | Permalink

    Andy, really I have seen many SEO bloggers chalenged this, including Rand Fishkin at SeoMoz and his grandfather Si, but by far you did the best job. You hit the nail right on the head.

    Home Run

  3. Patrick Altoft (2 comments.)
    Posted November 27, 2007 at 10:45 pm | Permalink

    To be honest my preferred method of removing content is sometimes just to cloak it and 301 googlebot to the homepage and let other users see it as normal (not recommended but not too dangerous).

    Robots.txt isn’t useful as you say because if a page has enough juice to get indexed you need to pass the juice to another page.

    • Andy Beard (1682 comments.)
      Posted November 27, 2007 at 11:00 pm | Permalink

      For some of the links it is possibly a viable solution, but I am not sure I would want to do that for all the pages that many people suggest are blocked with Robots.txt or meta tags.

      There is also a risk of diluting anchor text pointing to the home page.

  4. Hobo (17 comments.)
    Posted November 27, 2007 at 10:58 pm | Permalink

    Nice post Andy, and yesterdays.

    Andy I have a question that’s going round my head but perhaps you have come across before.

    If a page has 10 links going to 6 pages, (ie 5 links go to one page each and the other 5 go to the 6th) do you think the “link equity” is split 10 ways (with the 6th page getting the most) or is it split 6 ways evenly to the 6 unique urls?

    Shaun

    • Posted November 28, 2007 at 12:46 am | Permalink

      It is something I have never tested myself, and I have seen conflicting opinions not only on number but also on placement.

      Though this isn’t a conclusive test, it will be interesting to see if the anchor text from this comment is passed to the home page.

  5. Stephen Cronin (8 comments.)
    Posted November 28, 2007 at 1:44 am | Permalink

    Andy, great post! I have a question: How Google treats feeds? Does it crawl XML? Does it know what feeds are and ignore them? Or does it actually pass PR to the feed, with PR passed onwards via links in the feed content? I link to my main feed on every page of my site. Should I nofollow that link?

    Sorry if this is a ’silly question’, but I really don’t know the answer to this…

    • Andy Beard (1682 comments.)
      Posted November 28, 2007 at 8:55 am | Permalink

      Stephen, Feedbuner offer an option to have a feed blocked from search engines.
      I did it a while back as a test, but if most of the links from your feed point back to your own content, it isn’t actually a major problem.
      Think: tags, categories, related posts, internal links

      Links to comment feeds on every page are more of a problem. A while back it seems Google knocked them all into supplemental, and Matt Cutts I seem to remember stating that Google understands blog structure quite well.

      I would still use add_link_attribute plugin on as many unnecessary features as possible, especially on navigation on those duplicate content pages.

  6. chris (1 comments.)
    Posted November 28, 2007 at 1:44 am | Permalink

    In terms of a blog

    leaks for the same piece of content when using CMS systems such as Wordpress which create site-wide links in the sidebar when using poorly designed themes, plugins, and especially Wordpress Widgets.

    So we should remove things like, categories, archives ect.. and use a noindex on the frontpage to improve our seo?

    • Andy Beard (1682 comments.)
      Posted November 28, 2007 at 9:17 am | Permalink

      Chris there isn’t a single method of setting up a blog for SEO, a lot depends on how you want to place emphasis on particular content.

      I have mentioned before that if you were a gambler, the following would apply.

      6+1=7
      3+4=7

      I have provided a number of solutions in my Wordpress SEO masterclass
      http://andybeard.eu/2007/06/wordpress-seo-masterclass-for-competitive-niches.html

      The biggest problem is that different themes handle things link sidebar elements differently, and Wordpress widgets.

      One of the key tools is the add_link_attribute plugin.

      Eventually I hope to have some better solutions with a friendly interface.

  7. Chris (1 comments.)
    Posted November 28, 2007 at 2:30 am | Permalink

    Good lord there is a lot to learn. GREAT post. Thanks very much. Bookmarked.

  8. Local SEO Guide (5 comments.)
    Posted November 28, 2007 at 6:05 am | Permalink

    Quite a mind-blowing post Andy. Glad to see you are thinking overtime.

  9. Posted November 28, 2007 at 7:35 am | Permalink

    Wow- Best post I have read in weeks. Really exceptional analysis, Andy- Thanks for raising the bar!

  10. Chip (2 comments.)
    Posted November 28, 2007 at 9:18 am | Permalink

    I knew some of that stuff myself, but i wasn’t aware of the robots.txt facts. Although it’s more than obvious.

    Good article indeed, this is an evergreen.

    • Andy Beard (1682 comments.)
      Posted November 28, 2007 at 9:24 am | Permalink

      Chip it wasn’t obvious for me, and coming up with an explanation so that I wasn’t ripped to shreds by lots of SEOs wasn’t easy either.

  11. John Sadler (8 comments.)
    Posted November 28, 2007 at 10:37 am | Permalink

    Excellent reference to the Dan Thesis book. Thanks Andy!

  12. Mike Taylor (2 comments.)
    Posted November 28, 2007 at 1:36 pm | Permalink

    Wow, great post. I am going to have to re-read this at least twice to take it all in. Thanks.

  13. Seo Blog (1 comments.)
    Posted November 28, 2007 at 2:26 pm | Permalink

    Excelent post!

  14. Drew Stauffer (17 comments.)
    Posted November 28, 2007 at 10:20 pm | Permalink

    Andy have you ever designed your own Wordpress theme for the public? One that is properly developed with SEO in mind?

    • Andy Beard (1682 comments.)
      Posted November 30, 2007 at 8:52 pm | Permalink

      Drew it is a challenge I am working on but making it so it is usable by the majority of bloggers is something very hard to achieve.

      • Lord Matt (28 comments.)
        Posted January 25, 2008 at 5:17 pm | Permalink

        I have been working on a project called “the one theme” inspired by CSS Zen Garden. The idea would be to create a single markup that could converted to theme WordPress, Blogger, NucleusCMS, typo, etc and then the pretty stuff is done only by CSS. I’d love to create the “perfect” SEO base with that markup (or designed with SEO in mind). If only to separate the design and the HTML (making wordpress themes safer).

  15. Michael Streko (1 comments.)
    Posted November 29, 2007 at 10:41 pm | Permalink

    Andy you in depth posts and views never cease to amaze. Great post & even better for the person who stumbled this (where i found it)

  16. Carsten Cumbrowski (10 comments.)
    Posted November 30, 2007 at 7:18 am | Permalink

    “Extensive use of Nofollow and other forms of dynamic linking are the only way to effectively prevent duplicate content pages in some way having a effect on your internal linking structure and juice flow. The Wikipedia page on Nofollow really isn’t correct.”

    Hi Andy,

    I contributed a substantial part of the article to nofollow at Wikipedia and have it on my watch list and keep it updated as much as I can and time permits me to do so.

    Please elaborate what exactly is not correct in the current article. From what I take away from your post, could you argue that the article does not explain additional uses of nofollow, like the control of flow of linkjuice within your own website.

    This could be mentioned and makes sense IMO. It would at least provide some positive aspects to the whole thing and that webmasters seem to make the best possible thing out of this new tool (in contradiction to search engines who’s repurpose of the nofollow attribute causes more problems than anything else).

    Thanks

    • Igor The Troll (123 comments.)
      Posted November 30, 2007 at 8:26 pm | Permalink

      Roy, exactly! rel=”nofollow” best SEO practises is to control internal flow of PR iteration – juice…rand says this in his post as well.

    • Andy Beard (1682 comments.)
      Posted November 30, 2007 at 8:50 pm | Permalink

      Carsten the specific sections are these ones

      nofollow is a non-standard HTML attribute value used to instruct search engines that a hyperlink should not influence the link target’s ranking in the search engine’s index. It is intended to reduce the effectiveness of certain types of spamdexing, thereby improving the quality of search engine results and preventing spamdexing from occurring in the first place.

      I think it is currently clear that the scope is a little deeper now.

      What nofollow is not for

      The nofollow attribute value is not meant for blocking access to content or preventing content to be indexed by search engines. The proper methods for blocking search engine spiders to access content on a website or for preventing them to include the content of a page in their index are the Robots Exclusion Standard (robots.txt) for blocking access and on page Meta Elements that are designed to specify on an individual page level, what search engine spider should or should not do with the content of the crawled page.

      I think this needs to be revised in light of what Matt said in the interview on Stone Temple.

      Nofollow has been used for the control of juice since it was created almost 3 years ago, though references I know of are all on private email lists or private access web documents.
      am not sure how you can reference them but it would be more appropriate than referencing current discussion other than that Google is on record to say they are ok with it.

  17. SEO Agency (1 comments.)
    Posted November 30, 2007 at 12:41 pm | Permalink

    This is the best post I’ve read in months! Absolutely brilliant collection of gems, organised in a readable and useful way. I visit here every now and again, but for what it’s worth you’ve won my RSS subscription, andy!

  18. James Lee (1 comments.)
    Posted December 3, 2007 at 5:01 pm | Permalink

    I must admit that I’m coming around to understand Google’s position on the whole paid linking situation. Their seems to be a cottage industry evolving around manipulating back links and page rank.

    Sites like PPP are really harming the whole search results for everyone. I find that I might want to do a write up on one of my blogs talking about someone’s site and linking to them. This situation ruins that.

    In reality this all has derailed what page rank was supposed to be about…getting credit for others linking to your site honestly…which can’t really happen now.

  19. Graham (1 comments.)
    Posted December 5, 2007 at 4:01 pm | Permalink

    Thanks for the article Andy,

    The whole duplicate content issue is a really difficult one to deal with – the worst thing is you never know when the algorithms are gonnna change and you have to re-think everything again.

  20. cheap laptops (2 comments.)
    Posted December 7, 2007 at 3:09 pm | Permalink

    Wow Andy. Im not exaggerating when I say this has got to be the best post of have read in a very long time on an SEO blog (and I have all the main ones on my feed). This really changes my mindset on a lot of issues and opens my eyes to things that I may have been doing wrong in the past. Ive gota say congratulations on an excellent post. Keep it up.

  21. Alex Gogan (1 comments.)
    Posted December 8, 2007 at 7:19 pm | Permalink

    Excellent post, have book market this and will be reading this a lot. It will be given in snips to my tech and desgin guys to follow. Thanks again.

  22. Valentine (1 comments.)
    Posted December 9, 2007 at 8:36 am | Permalink

    Thanks for putting the time together to write this. These longer posts are often the best.

  23. Pet Lover
    Posted December 18, 2007 at 12:55 am | Permalink

    Wow. Definitely need to bookmark this one and read it over a second time after I get home from work. It answers a lot of questions I had about NoIndex pages. Very, very good post!

    Thanks Andy!

  24. Kamerovy (2 comments.)
    Posted December 18, 2007 at 7:37 am | Permalink

    Nice article. Especially the part about Dangling Pages was very helpfull for me.

  25. David DeAngelo (2 comments.)
    Posted December 19, 2007 at 10:58 pm | Permalink

    This is an eye opener. I always thought linking to external sites does not do any good to a sites/page pagerank. Infact, it seems now that we could be losing out by having dangling pages

  26. Utah SEO Pro (5 comments.)
    Posted December 21, 2007 at 5:32 am | Permalink

    Andy, I will have to agree with everyone else in stating that this is one of the best, if not the best, pieces of content I’ve read on here. I’ve relied on robots.txt for duplicate content issues but not anymore. Thank you!

  27. Seotaal Seo Blog (1 comments.)
    Posted January 8, 2008 at 7:18 pm | Permalink

    Nice article, was very helpfull. Keep on blogging :).

  28. Vinayak (1 comments.)
    Posted January 12, 2008 at 6:52 pm | Permalink

    Hi,
    I found your article very interesting and helpful. Made some changes to my blog following some of the advices.
    A big thanks!

  29. Posted January 21, 2008 at 8:48 pm | Permalink

    Excellent in depth article, have had to make a few changes to my blog after reading this.

  30. Jack Humphrey (4 comments.)
    Posted January 24, 2008 at 8:14 pm | Permalink

    This got my vote for the Semmy Award. Best one hands down! Great job.

    • Andy Beard (1682 comments.)
      Posted January 24, 2008 at 8:24 pm | Permalink

      Much appreciated Jack, though making it to the final was reward in itself with such notable judges and so much competition.

  31. gossipm (1 comments.)
    Posted February 2, 2008 at 8:12 am | Permalink

    Quite a lot of info to take in and digest. Still trying to understand seo and how it works. This post was definitely helpful. Thanks.

  32. Julie Francis (2 comments.)
    Posted February 2, 2008 at 2:26 pm | Permalink

    This no follow tag appears to be as dangerous as updating a plugin when you have a lot of other plugins, you never know quite how it’s going to fit in.I think I’ll wait another six months for the knowledge level to rise before I tackle no follow. Can you find and replace a whole blog then? Hell, I’ve got no rank to share so I may as well send my juice on.

    Your excellent instruction is bookmarked though, I’ll be back.

  33. CloudStrife (3 comments.)
    Posted March 1, 2008 at 6:37 pm | Permalink

    Firstly , thanks for the wonderful pictures you displayed , I fully get what you mean very easily. It is definitely not an easy job to get so much traffic so easily . The most important item in a website is : Users

  34. SEO Expert (2 comments.)
    Posted March 7, 2008 at 9:30 pm | Permalink

    Andy – once again you cease to impress with the effort and diligence to writing a great post. We are starting to use some of these tools and graphics for our internal training and documentation.

    What comes as second nature to us as internet marketers is not often as easily grasped by our employers and clients. i can’t tell you how many times I have pencil or white-board drawn the same chicken scratch tactical approaches for link structures and now I can just point them to your work here.

    Thanks for taking such effort to put out awesome content and diagrams. I have bookmarked your site on some social bookmarking sites and will spread the word to all I work with in industry.

    Best,
    Joe

  35. Posted March 23, 2008 at 2:20 am | Permalink

    I had a customer who was linking to Wikipedia in almost every single post that he wrote. All dofollow links, of course. Your explaination of why to nofollow Wikipedia references (with a little creative addition or two) really helped him understand.

  36. Edward Beckett (2 comments.)
    Posted April 8, 2008 at 8:52 am | Permalink

    Fantastic piece Andy …

    To add to the heirchial site design argument …

    Long, Long, Ago …

    Someone described in Great Detail …

    The Google PageRank Algorithm …

    And, how it works …

    • Andy Beard (1682 comments.)
      Posted April 8, 2008 at 9:33 am | Permalink

      That is one of the better pages, certainly far better than the Wikipedia page. Unfortunately like everywhere else it didn’t take into consideration dangling pages, which mess up everyone’s thought process when they actually stop to think about what is happening.

  37. Edward Beckett (2 comments.)
    Posted April 14, 2008 at 5:56 am | Permalink

    @Andy

    True … However, If read very carefully … we may deduce from Ian’s writing that dangling pages are not a wise move …

    I try to use the implementation from his fifth example, which fully suggests that child pages should support the parent(s).

    Unfortunately, this too leads to child pages that “sacrifice” their PR … to the benefit of the parent …

  38. Posted April 21, 2008 at 10:05 am | Permalink

    Very good article. Keep on eriting such helpful articles. I like to read english artickes because it more helpful then russian blogs :))

  39. Indianapolis SEO (1 comments.)
    Posted May 11, 2008 at 8:56 pm | Permalink

    Wow Andy,
    Everytime I start to think I understand the game someone like yourself throws me a little “wrench/news” that reminds me that the world is not flat. Your argument on the issues of dangling pages is tough to agree 100% without just drinking the kool-aid.

  40. zania (1 comments.)
    Posted May 30, 2008 at 3:33 am | Permalink

    As so many others said above. This post is really useful and a real eye opener for someone trying her best to grapple with SEO and being thwarted at every turn with conflicting information.
    Like Julie, I think I’ll wait a couple of months for the knowledge level to accumulate before I even attempt to tackle the pros and cons of no follow and dangling pages, so this post is bookmarked until then ;)

  41. SEO Pune
    Posted June 5, 2008 at 12:00 pm | Permalink

    Andy, great post..especially for SEO beginners

    • Andy Beard (1682 comments.)
      Posted June 5, 2008 at 12:16 pm | Permalink

      Are you claiming to be an SEO beginner, as you made some of these mistakes with a number of pages on your site?

  42. Gas Saver Guide (1 comments.)
    Posted June 11, 2008 at 12:45 am | Permalink

    I’m glad there’s Matt around to fill me in on this stuff all the time. Interesting little loopholes.

  43. Trevor Stolber (1 comments.)
    Posted June 26, 2008 at 12:43 pm | Permalink

    This just goes to show the importance of testing in SEO. I actively test lots of things to get hard and fast answers to many of the things that remain speculative in SEO.

  44. Cheap Laptops (2 comments.)
    Posted July 19, 2008 at 3:17 pm | Permalink

    Good information about SEO, and very valid information on how faster SEO works.

  45. Nick (1 comments.)
    Posted August 4, 2008 at 9:18 pm | Permalink

    Great post – very informative and easy to read. I am still pretty new to SEO so have bookmarked your blog as I obviously still have lots and lots to learn. Thanks.

  46. Webrevolve (1 comments.)
    Posted March 25, 2009 at 4:40 pm | Permalink

    Excellent post a very interesting read, i too have never been an Akismet fan what so ever, can’t understand why it comes pre-packaged with wordpress, everytime i upload a fresh install of wordpress i forget that i like to remove the plugin from the wp-content folder

  47. emmaseoagency
    Posted July 20, 2009 at 2:44 pm | Permalink

    as a seo beginner i appreciate your post very much, thanks!

  48. seo_company
    Posted August 14, 2009 at 2:12 pm | Permalink

    I love that everyone is getting there comment published, everyone seems to be twittering

37 Trackbacks

  1. [...] Take 15 minutes to read the Interview and then read Andy’s brilliant analysis in his post, SEO Link Gotchas Even the Pros Make and reflect on it a bit- there is a lot of meat in this post. Guaranteed you are making at LEAST [...]

  2. By Search Engine Optimisation Blog » on November 28, 2007 at 11:53 am

    [...] on from Andy Beard’s recent post on controlled linking, I’ve devised a simple table and guide to help you decide which linking method to use with [...]

  3. [...] are many things that help your blog and no one uses them. Even the most experienced bloggers. And they help your PageRank, your SERP, your traffic, your [...]

  4. [...] SEO Linking Gotchas Even The Pros Make [...]

  5. SearchCap: The Day In Search, November 28, 2007…

    Below is what happened in search today, as reported on Search Engine Land and from other places across the web…….

  6. [...] Andy Beard: SEO Linking Gotchas Even The Pros Make [...]

  7. By My Radical Blogs on December 3, 2007 at 12:41 am

    [...] writer that always has tips and advice on how to improve your blogging. One of his latest articles SEO Linking Gotchas got me thinking. Then I read an article by Mert Erkal  on 10 Blog Traffic [...]

  8. [...] Linking Gotchas Even the Pros Make (Andy Beard): Andy debunks a ton of articles and explains his reasoning. Pretty daring and altogether a wonderful read. [...]

  9. [...] MówiÄ…c w skrócie Dangling page’s to strony, które nie linkujÄ… dalej w głąb serwisu lub do innych serwisów. Podstrony tego typu sÄ… wyłączone z bieżących wyliczeÅ„ real Page Rank. WiÄ™cej… [...]

  10. By The Best Money Making Blog Post of 2007 on January 6, 2008 at 3:32 pm

    [...] Beard wrote a post about the Top SEO linkng mistakes even the Pro’s make. It made me step back a minute and look at my SEO abilities. Thanks [...]

  11. [...] zal ik in deze post een korte samenvatting weergeven van de blog van Andy Bear. Hij legt in zijn blog uit wat de nadelen zijn van de meta noindex [...]

  12. [...] Category SEO Linking Gotchas Even The Pros Make Google Category Dancing With The Gevil – Defamed By Google? Digg Favorites Slapped By Google Blogs [...]

  13. [...] In a recent interview of Matt Cutts done by Eric Enge (found at Andy Beard’s Blog) Matt explains how a page which has been restricted by robots.txt will not be spidered by [...]

  14. By SEO - 2008 Finalists » SEMMYS.org on January 24, 2008 at 6:12 pm

    [...] SEO Linking Gotchas Even The Pros Make Andy Beard | 11/27/07 [...]

  15. [...] it’s a sales page it’s not a “dangling page” at least not from what I think I know of “dangling pages” because although it [...]

  16. [...] I discussed in my SEO Linking Gotcha's post, Robots.txt does not prevent a page from accumulating PageRank, it purely stops a page from [...]

  17. By SEO - 2008 Winner » SEMMYS.org on February 1, 2008 at 5:14 pm

    [...] SEO Linking Gotchas Even The Pros Make Andy Beard | 11/27/07 [...]

  18. By Andy Beard Niche Marketing | Monetize Your Blog on February 2, 2008 at 5:43 pm

    [...] SEO Linking Gotchas Even The Pros Make [...]

  19. By The SEMMYS - Hobo SEO UK on February 7, 2008 at 1:38 am

    [...] SEO Linking Gotchas Even The Pros Make – Andy Beard [...]

  20. By Does Flash SEO still suck? • Tim Nash UK SEO Blog on February 9, 2008 at 5:25 pm

    [...] SWFAddress. On large sites such as the Samsung website (an SEO nightmare) this issue can cause dangling pages as well as usability and navigational [...]

  21. [...] It might be harder to rank, pages blocked using robots.txt still gather PageRank, and can appear in the index, though they would be looked on as dangling pages. [...]

  22. [...] to Matt Cutts, who was interviewed by Eric Enge which was published by Andy Beard, robots.txt will prevent the GoogleBot from crawling any page that is restricted within. However, [...]

  23. [...] Robots.txt – A lot of things are being blocked off in his Robots.txt file, including date based archives, paged content, images and author pages. He is also blocking off tags – I am not going to dispute that decision, it is something to test and track. There is overlap between categories and tags – it is possible to make that work ok by including additional content and different layouts, and with some SEO linking structures you would block both, others you would enhance both. One good thing he is doing is blocking all his /go/ affiliate links. Important:- Robots.txt is a bandaid as it can create hanging or dangling pages. [...]

  24. [...] Pages blocked by Robots.txt still accumulate Google Juice. [...]

  25. [...] a few of my posts, this one on Wordpress SEO is worthwhile, and people are still making these linking gotchas, and linking [...]

  26. By SEO - All 2008 Nominees » SEMMYS.org on May 19, 2008 at 5:52 pm

    [...] SEO Linking Gotchas Even The Pros Make Andy Beard | 11/27/07 [...]

  27. [...] 如果你想了解更高级的链接结构优化技巧,可以参考slightly shady seo 或 Andy Beard [...]

  28. [...] SEO Linking Gotchas Even Pros Make by Andy Beard [...]

  29. By META no-index? on April 22, 2009 at 6:58 pm

    [...] zal ik in deze post een korte samenvatting weergeven van de blog van Andy Bear. Hij legt in zijn blog uit wat de nadelen zijn van de meta noindex [...]

  30. [...] SEO Linking Gotchas Even the Pros Make Claim your FREE SEO guide for beginners quick! Friend us on Twitter or LinkedIn. [...]

  31. [...] I guess it all comes down to how much value these pages add to your visitors if they land on these pages from search results. I see many websites (especially big sites) that allow their internal search results to be spidered by Googlebot. Of course, if you follow Google’s advice and if people are linking to your search results pages too, and you robots.txt them out, you might be losing out on incoming link equity. Andy beard wrote about something similar some time ago- – SEO Linking Gotchas Even the Pros Make. [...]

  32. [...] Perhaps it comes down to how much value these pages add to your visitors if they land on these pages from search results. I see many websites (especially big sites) that allow their internal search results to be spidered by Googlebot. Of course, if you follow Google’s advice and if people are linking to your search results pages too, and you robots.txt them out, you might be losing out on incoming link equity. Andy beard wrote about something similar some time ago- – SEO Linking Gotchas Even the Pros Make. [...]

  33. [...] be done. I haven’t written anything on this yet but you can consult Slightly Shady SEO or Andy Beard B6: Site – Robots.txt 1. Check for robots.txt file. Does one exist? 2. See what’s being [...]

  34. By Dangling Pages - WebProWorld on May 25, 2009 at 6:27 am

    [...] page to keep its pr? I also think you are incorrect in reference to how the pr is transferd. Visit SEO Linking Gotchas Even The Pros Make. Getting back to my original question, now that I have ‘followed’ the links on my ‘NoIndex’ [...]

  35. By Are Blogger | Blogspot Blogs SEO Friendly? on May 31, 2009 at 7:58 pm

    [...] creates "hanging or dangling pages" and the PageRank gets shared out between all the pages in the [...]

  36. [...] Perhaps it comes down to how much value these pages add to your visitors if they land on these pages from search results. I see many websites (especially big sites) that allow their internal search results to be spidered by Googlebot. Of course, if you follow Google’s advice and if people are linking to your search results pages too, and you robots.txt them out, you might be losing out on incoming link equity. Andy beard wrote about something similar some time ago- – SEO Linking Gotchas Even the Pros Make. [...]

  • RAP Bank

    I have just signed up for RAP Bank myself - call this a mini-review.
    It seems to be one of the smartest ideas for a long time as a great way to consolidate 100s of affiliate programs under a single entity for promotion.

    It is also one of the best implementations of WordPress I have ever seen.
    It is not without its bugs, as this is effectively pre-launch. For example I just found a bug in the keyword based search RSS feeds (yes RSS feeds for any term will be hot), but it looks like they might be onto a winner.
    Check it out - signup for the basic service is free.

    >Affiliate Program

    P.S. It is not immediately obvious who is running the site, but looking at my Paypal receipt (yes I signed up for Vendor Plus) I believe the site is run by Bill Ortell From Rap-Extras.com who is RAP certified and does the membership add-on that is highly popular.