34 Finding Out When a Page Was Published Using Google

Many pages will tell you the date they were published. But some pages don’t give publication dates, and some can’t be trusted.

Take, for example, this story from fake site ABCNews.co (a hoax site that attempts to to look like an ABC news site).

pro

Figure 86

You’ll note that the publication date was November 11.

That’s what the site looks like today. But we can see what it looked like previously, courtesy of archive.org‘s Wayback Machine.

Here’s what it looked like in March, sporting a publish date of March 24:

hoax.PNG

Figure 87

Here it is in June, sporting a date of June 16:

june.PNG

Figure 88

And in September, it sported a date of September 11:

september

Figure 89

 

Hoax sites often do this date incrementation to increase the share rate on older stories. People are more likely to share things if they believe they are breaking news and not yesterday’s story.

So how do we get some sense of when this story was first published?

We can’t get there exactly but we can often use Google to get close. Google stores the date of the first time it indexed a page–on popular sites this date is usually within a couple days of the true publish date (on unknown sites it is much less reliable).

To get Google to show the indexed date of a page, you’ll need to do two things:

  • Set up a search that will only return that particular page by using the “site:” search term
  • Trigger display date by setting a date range that ends with the current day.

Here’s what that looks like in this case:

Figure 90

As you can see, we’ve taken the URL of the page and entered the following as the search term:

site:abcnews.com.co/donald-trump-protester-speaks-out-i-was-paid-to-protest/

Then we’ve used date filtering to create a filter that doesn’t exclude anything (its date range is all possible dates), but triggers this sort of date display in Google.

Again, this is not a rock-solid publication date, but we can say that there was some content at this URL at this date, and in most cases, with a URL like this, that means the story was up by then.

 

License

Icon for the Creative Commons Attribution 4.0 International License

Web Literacy for College Students 2nd Ed Copyright © 2020 by NSCC and Michael A. Caulfield is licensed under a Creative Commons Attribution 4.0 International License, except where otherwise noted.

Share This Book