Recently I’ve been laying the groundwork for some future SEO projects doing some site migrations onto content management systems, showing search spiders friendlier content, and cleaning up some URL structures. I’ve been using Google Sitemaps to help keep track of how the projects are progressing.
Some of the problems I’ve been working to overcome are uncrawlable product/shopping cart URL’s like this
example.com/gifts/PSProductResults.aspx?SCAT_Id=3272&DPSV_Id=59659&CATY_Id=570&ASI=&WS=&LN=&pF=&pT=
another is a hodgepodge of architecture techniques like
example.com/foo/index.php
example.com/foo/index.html
example.com/foo.html
While it is completely possible to rank with a mix like that is does make it really hard to maintain and keep track of without loosing stuff on the way.
First a bit of a warning, IF YOUR SITE IS RANKING AND YOU ARE GETTING ORGANIC LISTINGS AND TRAFFIC DON’T MUCK WITH IT! If you do decide to muck with it do it very, very, slowly. I’d recommend logging in and creating a sitemaps account first before you actually make any changes. It gives you an idea of what’s going on and any problems that exist right off the bat. Note to the sitemaps team do you have to show the ‘submit a reinclusion request’ on the login screen if someone isn’t banned, it scares non-seo types who think it’s there for a reason.
Another thing I learned is many site owners really have no idea what’s going on on their sites. They know about a handful of pages, but by and large they really don’t have a full understanding of what pages are on their sites. I didn’t find any hidden spam but on the 6 sites I worked on all of them had old pages that were linked to from somewhere and had either duplicate, outdated or missing information. For example if the information was originally on example.com/page-1.htm and a significant update was made the information it was then put on example.com/page-2.htm . However what invariably happened was a few dangling links existed to page-1.htm and it stayed in the index. If it’s an update keep it on the same page if at all possible. If not delete the old page (keep a backup offline) and 301 the old page to the new page. If it’s for an annual event build a date into the URL structure like:
example.com/event/2005/
example.com/event/2006/
For the shopping cart site I had to roll up my sleeves and get into some programming. First I blocked the robots from the entire dynamic shopping cart structure. I then wrote some programming to generate a flat friendly URL structure for each of the departments and products. In addition I wrote a script to automatically generate a new XML sitemap with links to each of the departments and products, and send a ping to Google. I wouldn’t consider these doorway pages per se as they are perfectly readable for human visitors. I’d consider them teaser or jump pages leading directly to the products. (note: Some of you might ask why not use javascript to bring them directly to the product page instead of adding a page/click. While that would work it’s a little too iffy on the ‘sneaky redirects’ and ‘cloaking’ for an otherwise clean client website.)
For heavily/frequently crawled sites you may get feedback sooner, but I’ve found it takes 2-4 weeks from when you start moving pages around for the new sitemaps data to start to show up. So go out and do of all of the changes at once, ping sitemaps, wait and then check back to see what’s changed, and make adjustments.
Now above I recommended not changing things dramatically if you are ranking and getting traffic. I went completely against that advice here on this blog recently. I did this for two reasons, first I want to see what can really go wrong when you do a large scale site migration (1000+ pages). Second I think it’s important to experiment and try new stuff from time to time, you can really learn a lot from the mistakes you make. Here’s a screen shot of my missing pages you can see I’ve got over 180 pages that are missing that I need to 301 to the new locations.

While the experiment is clearly still in progress I haven’t seen a dramatic drop

Related Information
- Google Sitemaps – Have a conversation with Google about your site
- GOODROI Internet Marketing » Those Google Engineer’s are getting better looking
- Vanessa Fox of Google Sitemaps Interview (mp3 format)
- GOODROI Internet Marketing » Feb. 23 – Google Sitemaps with Vanessa Fox, Google Engineer
- GOODROI Internet Marketing » Follow-up on Google Sitemaps with Vanessa Fox
- Interview with Vanessa Fox of Google Sitemaps – Sugarrae

Related posts:
- Suggestions For Google Sitemaps Hey Google Sitemaps and Google Sitemaps Blog (not sure which...
- Using Google Sitemaps for Competitive Intelligence I’m a big fan of the Google Webmaster Central Program...
- Google Sitemaps Review I launched multiple new websites in July (more than 2...
- Google Sitemaps for Large Scale Sites If you’ve got a any first hand experience working with...
- Google and Site Searches As DaveN noted earlier today, when you search for a...
Advertisers:
- Text Link Ads - New customers can get $100 in free text links.
- BOTW.org - Get a premier listing in the internet's oldest directory.
- Ezilon.com Regional Directory - Check to see if your website is listed!
- Need an SEO Audit for your website, look at my SEO Consulting Services
- Directory Journal - Get permanent deep links in a search engine friendly directory
- TigerTech - Great Web Hosting service at a great price.
- Article-Writing-services.org - Article Writing Services creates quality content for websites and blogs at no cost to site owners.
- Link Building Services - Hire WeBuildLink.com for well-planned advanced link building campaigns. Very affordable. Contact us now for a FREE evaluation.
- Try HOTH Plus+ NOW - The First 1-Stop Link Building Solution Powered by 100% College Educated Copywriters!
- Professional website designs - Get a unique brand image with website designs that sets you apart and convert your visitors into customers. Make a brand, not just a website
See my disclaimer about advertising and affiliate links










