« Sony’s amazing crapware-free PC | Main | Dear Adobe, can we please have a 64-bit Flash player? »

July 29, 2008

Google Counts 1 Trillion+ Unique Web URLs

The information age will squish us if it isn't managed properly. The issue will be to weed out the bloat with better and better search capabilities. Google has always led the pack (much to Microsoft's chagrin) What isn't spoken about:

  • How much info will vanish as content is lost through website abandonment. Many point out that the web is free, but it costs real money to keep sites hosted and pay the annual domain name registration fees.
  • Digital Noise overwhelms us...

Blog popularity fed much of the information bloat and advertising revenue will help keep many informative sites alive. But we notice many forums are dying and disappearing. Google has social responsibilities...

Link: Google Counts More Than 1 Trillion Unique Web URLs

From: www.cio.com

Juan Carlos Perez | IDG News Service
July 25, 2008

In a discovery that would probably send the Dr. Evil character of the "Austin Powers" movies into cardiac arrest, Google recently detected more than a trillion unique URLs on the Web.

This milestone awed Google search engineers, who are seeing the Web growing by several billion individual pages every day, company officials wrote in a blog post Friday.

Feeling the pain at the pump?

Need a Pizza 2 for 1 Discount Finder?


In addition to announcing this finding, Google took the opportunity to promote the scope and magnitude of its index.

"We don't index every one of those trillion pages -- many of them are similar to each other, or represent auto-generated content ... that isn't very useful to searchers. But we're proud to have the most comprehensive index of any search engine, and our goal always has been to index all the world's data," wrote Jesse Alpert and Nissan Hajaj, software engineers in Google's Web Search Infrastructure Team.

It had been a while since Google had made public pronouncements about the size of its index, a topic that routinely generated controversy and counterclaims among the major search engine players years ago.

Those days of index-size envy ended when it became clear that most people rarely scan more than two pages of Web results. In other words, what matters is delivering 10 or 20 really relevant Web links, or, even better, a direct factual answer, because few people will wade through 5,000 results to find the desired information.

It will be interesting to see if this announcement from Google, posted on its main official blog, will trigger a round of reactions from rivals like Yahoo, Microsoft and Ask.com.

In the meantime, Google also disclosed interesting information about how and with what frequency it analyzes these links.

"Today, Google downloads the web continuously, collecting updated page information and re-processing the entire web-link graph several times per day. This graph of one trillion URLs is similar to a map made up of one trillion intersections. So multiple times every day, we do the computational equivalent of fully exploring every intersection of every road in the United States. Except it'd be a map about 50,000 times as big as the U.S., with 50,000 times as many roads and intersections," the officials wrote.

Copyright © 2008 IDG News Service. All rights reserved. IDG News Service is a trademark of International Data Group, Inc.

Feeling the pain at the pump?

Need a Pizza 2 for 1 Discount Finder?

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d83452fc7d69e200e553dd779e8834

Listed below are links to weblogs that reference Google Counts 1 Trillion+ Unique Web URLs:

Comments

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been saved. Comments are moderated and will not appear until approved by the author. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Comments are moderated, and will not appear until the author has approved them.

My Photo

Twitter Updates

    follow me on Twitter

    Google Site Search

    blogroll

    Mexico 2005

    • Puerto Aventura
      Paradise at the Mexican Riviera. A photo journal of our trip in July 2005.

    March Storm 2003

    • Whoa! It's getting deep out here!
      A photo journal of the snow storm that hit Littleton, Colorado on March 17th, 2003.

    Holiday Storm 2006

    • Snow Accumulation Above our Porch
      December 20th, 2006 New pictures are included from what the press is calling the 'Holiday Storm of 2006' comparing some of the scenes around our home in Littleton, Colorado to the March 2003 storm. Denver International Airport is closed, all the schools are shut-down and we wait for show plows and the sun. It will be a white Christmas this year even if the streets and sidewalks are clear next Monday and we can walk around in t-shirts :)

    Amazon Ad



    GPS Install

    • Proper Installation Confirmed
      This GPS Installation Guide will show you what to expect and how to go about installing GPS equipment in your vehicle. tekniaXP news and teknia tech have info on locating your ODB port.
    Products from tekniaXP
    Powered by Stylehive
    Blog powered by TypePad
    Member since 08/2003

    Spammer Be Gone

    • SPAMMER BE GONE

    Google Page Rank

    • Check Page Rank of any
      web site pages instantly:
      This free page rank checking tool
      is powered by Page Rank Checker service