Featured Research

from universities, journals, and other organizations

Search Engines Biased, Out-Of-Date, And Index No More Than 16% Of The Web

Date:
July 12, 1999
Source:
NEC Research Institute
Summary:
A new NEC Research Institute study analyzes the accessibility and distribution of information on the web. Among the studies findings: Search engine coverage has decreased substantially since Dec. 97, with no engine indexing more than about 16% of the publicly indexable web.

A new NEC Research Institute study analyzes the accessibility and distribution of information on the web. The study was conducted by Dr. Steve Lawrence and Dr. C. Lee Giles and will appear in the July 8 issue of the journal Nature.

-- LOW COVERAGE -- Search engine coverage has decreased substantially since Dec. 97, with no engine indexing more than about 16% of the publicly indexable web.

-- UNEQUAL ACCESS -- Search engines are more likely to index sites that have more links to them (more 'popular' sites). They are also typically more likely to index US sites than non-US sites, and more likely to index commercial sites than educational sites.

-- OUT-OF-DATE -- Indexing of new or modified pages by just one of the major search engines can take months.

-- AMOUNT OF INFORMATION -- The publicly indexable web contains about 800 million pages encompassing about 15 terabytes of data (about 6 terabytes of textual content after removing HTML tags, comments, and extra whitespace); it also contains about 180 million images.

-- TYPE OF INFORMATION -- 83% of sites contain commercial content and 6% contain scientific/educational content. Only 1.5% of sites contain pornographic content.

The web is transforming society, and the search engines are an important part of the process. For example, consumers use search engines to locate and buy goods or to research many decisions (such as choosing a vacation destination, medical treatment or election vote).

Search engine indexing and ranking may have economic, social, political, and scientific effects. For example, indexing and ranking of online stores can substantially effect economic viability; delayed indexing of scientific research can lead to the duplication of work or slower progress; and delayed or biased indexing may affect social or political decisions.

One of the great promises of the web is to equalize access to information. As the web fast becomes a major communications medium, attention should be paid to the accessibility of information on the web, in order to minimize unequal access to information, and maximize the benefits of the web for society.

For more information see http://wwwmetrics.com.

###

The NEC Research Institute conducts long-term, fundamental research in computer and physical sciences. The mission of the Institute is to contribute significant new understanding of computer and communication (C&C) technologies for the future. Institute research activities have a long-term goal of significant advances in the understanding of intelligence and information processing in biological and machine systems, and in the physical and system aspects of future computer architectures.


Story Source:

The above story is based on materials provided by NEC Research Institute. Note: Materials may be edited for content and length.


Cite This Page:

NEC Research Institute. "Search Engines Biased, Out-Of-Date, And Index No More Than 16% Of The Web." ScienceDaily. ScienceDaily, 12 July 1999. <www.sciencedaily.com/releases/1999/07/990712075603.htm>.
NEC Research Institute. (1999, July 12). Search Engines Biased, Out-Of-Date, And Index No More Than 16% Of The Web. ScienceDaily. Retrieved August 1, 2014 from www.sciencedaily.com/releases/1999/07/990712075603.htm
NEC Research Institute. "Search Engines Biased, Out-Of-Date, And Index No More Than 16% Of The Web." ScienceDaily. www.sciencedaily.com/releases/1999/07/990712075603.htm (accessed August 1, 2014).

Share This




More Computers & Math News

Friday, August 1, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

Google Mystery Barge Headed For The Scrap Yard

Google Mystery Barge Headed For The Scrap Yard

Newsy (Aug. 1, 2014) We may never know what was going on inside one of Google's mystery barges in Portland, Maine as it's now headed for the scrap yard. Video provided by Newsy
Powered by NewsLook.com
Escaping Email: Inspired Vision or Pipe Dream?

Escaping Email: Inspired Vision or Pipe Dream?

AP (Aug. 1, 2014) Dustin Moskovitz is plotting an escape from email, using his communications expertise in an attempt to change the way people connect at work, where the incessant drumbeat of email has become an excruciating annoyance. (Aug. 1) Video provided by AP
Powered by NewsLook.com
Google (Kind Of) Complies With 'Right To Be Forgotten Law'

Google (Kind Of) Complies With 'Right To Be Forgotten Law'

Newsy (July 31, 2014) Google says it is following Europe's new "Right To Be Forgotten Law," which eliminates user information upon request, but only to a certain degree. Video provided by Newsy
Powered by NewsLook.com
Tesla, Panasonic Ink Deal To Make Huge Battery 'Gigafactory'

Tesla, Panasonic Ink Deal To Make Huge Battery 'Gigafactory'

Newsy (July 31, 2014) The deal will help build a massive battery factory that Tesla says will produce 500,000 lithium batteries by 2020. Video provided by Newsy
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:

Breaking News:
from the past week

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile: iPhone Android Web
Follow: Facebook Twitter Google+
Subscribe: RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins