Duplicate Content - An Ounce Of Prevention Is Worth A Pound Of Cure

May 14, 2007 – 1:14 pm

If you're new here, you may want to subscribe to my RSS feed. Thanks for visiting!

Duplicate content is a nasty thing. Slowly but surely it will sap your website of any ability to rank in Google, and that alone is enough to destroy your dreams of online domination. Where does most duplicate content come from? Generally the content comes from template-driven web design where duplicate URLs are created automatically.

Refer to a URL only once

If these URLs are created in muliple spots, you need to clean it up fast. The new Google does really want to carry multiple copies of the same version, so it’s better for everyone’s sake if you just present one final form to the SERPs. This alone can guarantee better results in the modern Google. If you make sure you present clean HTML that is original, it’s quite easy to rank for many terms.

Now I can’t tell you exactly how to clean up your website because every site has a different architecture, but I can point out some possible steps to take to prevent major problems.

For popular CMS scripts like VBulletin and Wordpress you should be able to find a module or plug-in to do the work for you.

Otherwise, follow these tips:

  • Disallow any extra copies of URLs like print-friendly pages - also no-follow them or run them through a re-direct - I’ve included a simple explanation on how

So here’s the basic steps to take for your print-friendly pages:

1) Redirect the links to the print-friendly page. You can do no-follow, or use a re-direct. To redirect do the following:

  • In robots.txt add the following link: Disallow: /my-out.php
  • Upload my-out.php to your root directory
  • Change all of your links to the following format:
    My Redirected Link
  • Disallow the print friendly versions in robots.txt - Disallow: /printfriendly.php

Here’s the code in my-out.php


< ?php
parse_str($_SERVER['QUERY_STRING'],$vars);
$url = $vars['url'];

if ($url) {
header("Location: $url");
}

?>

Put this together and you have now disallowed the duplicate content, and removed the link popularity you send to the pages by re-directing the links. This should help you clean up fast. If you implement this system up front, it can save you a great deal of frustration later.

If you enjoyed this post, subscribe to the Sootle RSS feed!.

  1. 4 Responses to “Duplicate Content - An Ounce Of Prevention Is Worth A Pound Of Cure”

  2. Do directory redirects such as http://www.build-reciprocal-links.com/jump.asp?id=4197 count as duplicate content? I have noticed that Google search lists/indexes both http://www.build-reciprocal-links.com/jump.asp?id=4197 and the url it redirects to.

    By David Maxwell on May 21, 2007

  3. The content at both URLs is duplicate, so I would say yes.

    By Darren McLaughlin on May 21, 2007

  4. I thought that the search engines eventually recognize permanent redirects and then attribute the apparent content for the redirect URLs to the pages/URLs they are actually redirecting to.

    But I do not think that is true for temporary redirects.

    By David Maxwell on May 21, 2007

  1. 1 Trackback(s)

  2. May 15, 2007: Search Engine Optimization For Google In The Here And Now

Post a Comment