Featured Research

from universities, journals, and other organizations

Intelligent Life Sciences Search Engine: Grid Browser Understands Technical Terms And Context

Date:
May 23, 2009
Source:
ICT Results
Summary:
A web browser that can understand technical terms in life sciences and automatically find additional resources and services has been developed. It could lead to a new generation of intelligent search engines.

A web browser that can understand technical terms in life sciences and automatically find additional resources and services has been developed by European researchers. It could lead to a new generation of intelligent search engines.

The life sciences community has built numerous databases – such as those for gene sequencing and disease information – that are available to researchers as ‘grid’ services.

“Grid computing is essentially about building virtual organisations that are independent of the physical location where they reside,” says Michael Schroeder of Technische Universität Dresden.

The problem is how to link those services to other scientific information found on the web. Schroeder is coordinator of the EU-funded Sealife project, which has created a ‘semantic grid browser’ to make grid services for the life sciences much more accessible.

“We have the web on the one hand and then we have grid computing, with its many services, on the other,” he says. A semantic grid browser seamlessly integrates them.

“It tries to understand what it finds on web pages, interprets this content and then links it, on the fly, to services that might be useful to the user.”

A matter of semantics

The key to the Sealife browser is a ‘semantic hyperlink’ that shows up on the page to direct users to relevant services. The link is not put there by the website but by the browser itself.

How does it do that?

First, the browser needs to understand the content of the page and identify terms which could be linked to grid services. An example tested in the Sealife project is the naming of genes. Each human gene has an average of 5.5 names, Schroeder points out, but if it can be identified correctly, a link can be made to a wealth of information about that gene.
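The synonym problem described above can be illustrated with a minimal Python sketch. It is not the Sealife algorithm, only a plain dictionary lookup that maps several names to one canonical gene symbol and attaches a link. The synonym table, the function name `find_gene_links`, and the `https://example.org/gene/` address are hypothetical placeholders.

```python
import re

# Hypothetical synonym table: each alias (lowercased) maps to one canonical symbol.
# The table is illustrative only; a real system would load a curated gene dictionary.
GENE_SYNONYMS = {
    "tp53": "TP53",
    "p53": "TP53",
    "lfs1": "TP53",
    "brca1": "BRCA1",
    "rnf53": "BRCA1",
}

# Placeholder base URL standing in for a grid service endpoint (assumption).
SERVICE_URL = "https://example.org/gene/"


def find_gene_links(text):
    """Return (matched_term, canonical_symbol, link) for each known gene name in text."""
    results = []
    for match in re.finditer(r"[A-Za-z0-9]+", text):
        term = match.group(0)
        canonical = GENE_SYNONYMS.get(term.lower())
        if canonical is not None:
            results.append((term, canonical, SERVICE_URL + canonical))
    return results
```

Because every alias resolves to the same canonical symbol, a page mentioning "p53" and another mentioning "LFS1" would both be linked to the same record. A lookup this simple cannot handle ambiguous names, which is the problem the article turns to next.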

The browser must also be able to handle ambiguity. “If I see ‘Jaguar’ on a web page, what is it? Is it an animal? Is it a car? Is it the Mac operating system?” Sealife uses specialised algorithms to work out the context from other words on the page and correctly interpret the meaning.
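One simple way to picture context-based disambiguation is to count how many words on the page overlap with a small vocabulary for each possible meaning. The sketch below does exactly that. It is a deliberately naive stand-in, not the specialised algorithms used in Sealife, and the vocabularies in `SENSE_CONTEXTS` are invented for illustration.

```python
# Hypothetical context vocabularies for each sense of the ambiguous term "jaguar".
SENSE_CONTEXTS = {
    "animal": {"cat", "jungle", "prey", "species", "habitat"},
    "car": {"engine", "drive", "dealer", "sedan", "speed"},
    "operating system": {"mac", "apple", "install", "software", "os"},
}


def disambiguate(page_text, sense_contexts=SENSE_CONTEXTS):
    """Pick the sense whose context words overlap most with the words on the page.

    Returns None when no sense shares any word with the page.
    """
    # Lowercase each word and strip surrounding punctuation before comparing.
    words = {w.strip(".,;:!?()\"'").lower() for w in page_text.split()}
    best_sense, best_score = None, 0
    for sense, context in sense_contexts.items():
        score = len(words & context)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense
```

A page that mentions "prey" and "jungle" alongside "Jaguar" would be scored towards the animal sense, while one that mentions "install" and "Mac" would be scored towards the operating system.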

It is still not an exact science, though. The Sealife team entered their algorithm in an international competition with 50 others to identify names of genes. They won, with an 81% success rate, though Schroeder says they have now got that up to 87%.

Background knowledge

The second challenge is the background knowledge that allows the browser to make sense of the identified terms. Such knowledge is formally known as an ‘ontology’, a systematic hierarchy of concepts and their relation to one another. Biology, with its extensive taxonomies, is an ideal field for semantic grid browsing.
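To make the idea of a concept hierarchy concrete, here is a small Python sketch of an ontology reduced to "is a" relations, with a helper that walks from a concept up to its broader categories. The concepts, the `IS_A` table, and the function names are assumptions chosen for illustration; real ontologies carry many more relation types.

```python
# Hypothetical child -> parent ("is a") relations; a real ontology would be far larger.
IS_A = {
    "jaguar": "big cat",
    "big cat": "mammal",
    "mammal": "animal",
}


def ancestors(concept, is_a=IS_A):
    """Return the chain of broader concepts above `concept`, nearest first."""
    chain = []
    seen = {concept}
    while concept in is_a:
        concept = is_a[concept]
        if concept in seen:  # guard against accidental cycles in the table
            break
        seen.add(concept)
        chain.append(concept)
    return chain


def is_kind_of(concept, candidate, is_a=IS_A):
    """True if `candidate` is the concept itself or one of its ancestors."""
    return concept == candidate or candidate in ancestors(concept, is_a)
```

With background knowledge in this form, a browser that has recognised a term can also reason about its broader categories, for example offering services registered for "mammal" on a page about jaguars.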

“All these efforts of building hierarchical classification systems have been at the core of biology for centuries,” says Schroeder. “Biologists are used to it and there are many efforts to make information exchangeable.”

But outside the life sciences such systematic classification is not so well developed, and the Sealife project has created editors to build ontologies from published literature in any specific field of interest.

“We developed algorithms that grind through this data, identify the key concepts and then the ontology editor offers these concepts to you,” Schroeder explains. “If you agree, it then searches the web to find things that look like definitions. This whole process of building this background knowledge cannot be fully automated but you can ease the pain of doing this quite significantly.”
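The first step Schroeder describes, grinding through text to surface candidate concepts for a human to accept or reject, might be approximated by simple term counting. The Python sketch below is one such rough approximation under stated assumptions: the stopword list, the minimum word length, and the function name `candidate_concepts` are all hypothetical, and the project's actual algorithms are not described in the article.

```python
import re
from collections import Counter

# Small illustrative stopword list (assumption); real systems use much larger ones.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "are",
             "for", "by", "with", "that"}


def candidate_concepts(documents, top_n=5):
    """Return the top_n most frequent non-stopword terms across the documents."""
    counts = Counter()
    for doc in documents:
        tokens = re.findall(r"[a-z]+", doc.lower())
        # Skip stopwords and very short tokens, which rarely name concepts.
        counts.update(t for t in tokens if t not in STOPWORDS and len(t) > 2)
    return [term for term, _ in counts.most_common(top_n)]
```

The returned list is only a set of suggestions. As the quote above makes clear, a person still decides which candidates belong in the ontology.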

Different varieties of the Sealife browser build on work by partners in Edinburgh, Manchester, London and Sophia-Antipolis, as well as in Dresden. They have been tested in three scenarios: evidence-based medicine, mining of scientific and patent literature, and in molecular biology. In each case, the focus has been on infectious diseases.

Browser that understands everything?

TU Dresden has spun off a new company, Transinsight, to exploit work done in Sealife. The company has sold semantic browsers to such major customers as BASF and Unilever and runs the GoPubMed search engine, which is linked to the respected PubMed archive of biomedical literature.

But there is no reason why a semantic browser should be confined to specialised academic areas. Could we have a browser that understands everything? Schroeder thinks that is not as far-fetched as it may seem. “The vision is to include every domain,” he says. “For example, if we were able to extract and formalise the knowledge in Wikipedia we would have this general background knowledge that covers all areas.”

Many researchers look forward to a next-generation search engine that can understand what the user is looking for and return much more relevant results than today’s engines can. “This will involve integrating information,” says Schroeder, “because very often answers to questions are not provided in one document as a single statement that I can pick up by keywords.

“In the future, we will need background knowledge and this is at the core of Sealife. If we build semantic into search, and make it scaleable, then you will have the next-generation search engine.”

The Sealife project received funding from the ICT strand of the EU’s Sixth Framework Programme for research.

 


Story Source:

The above story is based on materials provided by ICT Results. Note: Materials may be edited for content and length.


Cite This Page:

ICT Results. "Intelligent Life Sciences Search Engine: Grid Browser Understands Technical Terms And Context." ScienceDaily. ScienceDaily, 23 May 2009. <www.sciencedaily.com/releases/2009/05/090521084719.htm>.
ICT Results. (2009, May 23). Intelligent Life Sciences Search Engine: Grid Browser Understands Technical Terms And Context. ScienceDaily. Retrieved April 24, 2014 from www.sciencedaily.com/releases/2009/05/090521084719.htm
ICT Results. "Intelligent Life Sciences Search Engine: Grid Browser Understands Technical Terms And Context." ScienceDaily. www.sciencedaily.com/releases/2009/05/090521084719.htm (accessed April 24, 2014).
