Google News is Getting the Date Wrong on Old News Stories

Michael Gray

By Michael Gray
In Google  


Last week there was a bit of a brouhaha over Google news incorrectly giving the wrong date for a news story. In that particular incident Google news picked up a 6 year old story about United Airlines filing bankruptcy which upset the stock market for that particular stock. I’ve seen things like that happen before in the past despite Google saying that it was an isolated incident. Here’s an example of where Google News is still assigning the wrong date.

do the following query in Google News [inurl:2007 site:nytimes.com]

In the screen shot below take note of the dates Google is assigning to the two stories September 14, 2008 and September 4, 2008.

Now visit the actual pages those two stories are on and notice the original publication date

Subprime Troubles Sack HRJ

Vintage Neil Young, Still Working for the Muse

Google is assigning a date of September 14 2008 for a news story that was published on August 24 2007, and September 4th of 2008 for a story that was published on October 28, 2007.

Big deal you say? Well What if I’m in the financial market and as part of my job I have to monitor HRJ Capital, and I go to Google News and search for them [HRJ Capital] I get that story that was originally published on August 24, 2007 but that Google thinks was published on September 14th 2008. Do you see how assigning the wrong date to old news could cause a problem?

Related posts:
  1. What if Google Was Wrong Earlier this week Danny Sullivan and many others noted that...
  2. Google and Nazi News Every few days it seems Google does everything they possibly...
  3. When Google Gets Duplicate Content Wrong There’s lots of hand wringing among publishers about duplicate content,...

See my disclaimer about advertising and affiliate links

tla starter kit

{ 3 trackbacks }

Google News & Indexing Old Stories As New
September 16, 2008 at 2:21 pm
What if Google gets date data wrong? | Strategist.org.uk
September 16, 2008 at 9:27 pm
Google updates forum searches | vipey.com
October 9, 2008 at 2:39 pm

{ 9 comments }

Chris September 16, 2008 at 2:14 pm

Hey Michael,

The same issue has been going on for months on Google Reader as well. The only difference I see is that when a new comment is added Google Reader makes the date the same as the latest post. So it could be a similar issue on Google News as well.

~ Chris

Anne H September 16, 2008 at 9:27 pm

Interesting. Using your example, I only see the story for Neil Young. This incident does seem different than the UAL one in that the NYT includes a publication date. I thought maybe it was a recent comment that triggered the new date, but that doesn’t seem to be the case. Maybe it’s the “var google_hints” that is triggering it ;-)

Thomas September 16, 2008 at 9:36 pm

In my capital markets research, I’ve been noticing this problem for some time now. In some cases, I think some web portals are obscuring the date in order to make their content seem “timeless” or always relevant. It makes research frustrating because the true dates of events starts to become obscured. Sounds like we need a webservice call OrigPubOn: or something. We need to preserve the actual dates of news and insights that are delivered to the web and not be beholden to the date that Google crawls it.

Ben McKay September 17, 2008 at 4:27 am

One of my clients works in the commodities and futures markets, relying on and releasing news all the time – they’re going to be miffed when I tell them this as it might make readers think that they are trying to rehash/repackage old news…

Shane September 17, 2008 at 11:34 am

Is it possible that the news site did republish the story (and by publish, I mean updated the HTML so the “HEAD” date is new) such that Google picked it up again and is listing the most recent update time rather than the original publish time? Maybe they need two fields: original date and last updated date.

That also fits in line with what folks above and myself have seen with Google Reader behavior.

Hannah September 17, 2008 at 12:50 pm

A report I wrote about a Jewish Holocaust survivor in 2006 turned up on Google in July with a new publish date.
The family found the story about their mother, who died in 2007, in 2008.
As a result, we were able to connect and talk about the legacy their mother had left behind. It’s not always negative when Google glitches, but it can have extremely deep and emotional impacts on families.

Chat Man September 17, 2008 at 4:50 pm

Mike,

The Sun-Sentinel, the site that is the purported source of Google’s ‘old news’ reported as ‘new news’ fiasco, doesn’t have a critcal element in place on their pages:

AND – the Popular Stories section (3 or 4 items down from the top of the right column) does not have ‘nofollow’ in the listed links (although, it seems that the sun-sentinel.com site does have a ‘nofollow’ policy, as evident in other links on the home page). This is where, apparently, the headline appeared when Google scraped and distributed the story.

Do you think these two elements, used properly, could have thwarted Evil Google’s plan to bring down the Tribune and United Airlines?

It’s just an idea.

SEO Chatter

Chat Man September 17, 2008 at 4:51 pm

Sorry, I forgot to alter the code in my last post. The critical element I’m referring to is

David September 18, 2008 at 7:15 am

Mike, I think you miss the point here on the “nofollow” and even placement of a date on a news story (many times I see the URL or the title tag carrying this). So many people seem to think THEY NEED to please, pacify or meet the Google guidelines if they want included in the index, but instead we see Google pickup on content/links that actually break the very rules they are imposing. MAYBE Google can’t do their job properly and index and sift the world’s content and expect us to meet their algo. Sorry, but this event for United Airlines, the examples posted just show how irrelevant that Google is becoming, and then use “suggestions” to web sites to meet their failing also. (Twitter is another fine example)

I for one am growing tired of Google this and Google that. I am sorry to say but I think we can expect US and Euro involvement/regulations coming to the WildWildWeb since we have many that game, others that just can’t contain the content released to the world.

David

Comments on this entry are closed.