Featured Research

from universities, journals, and other organizations

Multilingual Culture and Heritage Internet Search System Developed

Date:
December 30, 2008
Source:
ICT Results
Summary:
European researchers say they are pushing online culture and heritage research way beyond Google by using a smart search system that is multilingual, multimedia and optimized for cultural heritage. Better yet, this promising system has wide application in other fields.

European researchers say they are pushing online culture and heritage research way beyond Google by using a smart search system that is multilingual, multimedia and optimised for cultural heritage. Better yet, this promising system has wide application in other fields.

European researchers have developed an optimised search system that can access an enormous quantity of cultural heritage resources that reside online. Current technology like Google takes a scattergun approach, dishing up dozens of links of sometimes variable quality.

“Right now, if you do a search online, you get lots of irrelevant overload,” explains Pasquale Savino, coordinator of the MultiMatch project, which set out to create state-of-the-art search technology for cultural heritage information.

The MultiMatch system targets searches using a variety of smart search methods. Better yet, the concept can be applied to other fields, like sport, politics, economics and technology.

“Consider that many portals already offer a specialised catalogue, but in many cases the selection and classification of data is done manually, while the MultiMatch platform can perform this work automatically,” Savino reveals.

Three trumps

Savino says MultiMatch trumps standard search in three, vital ways. “The system does not simply query the web, it also searches through archives, many of them not publically available,” he notes.

Archives like the National Library of Austria (ONB), Biblioteca Virtual Miguel de Cervantes and the Israel National Library, though currently the system accesses just a portion of these resources for research purposes.

It also supports multimedia searches, and not simply by looking for pictures by name. It can look for pictures using other pictures. If a user has one picture, say of Picasso’s Guernica, the system can search for images in a similar style. It can do the same types of search for sound and video resources, too.

MultiMatch is also fluent in six languages. A search entered in Polish can be targeted to look for results in Spanish, or English, Italian, Dutch and German – the other four languages that the system currently recognises.

Finally, MultiMatch presents its results in an aggregated way, with resources clearly identified by type and sorted by priority, whether it is relevance, historical period or some other criteria. It is a prioritised, sorted and easily grasped layout of results, a bit like a newspaper created on the fly, for your particular query.

Culture crawl

MultiMatch began by selecting well-known cultural heritage sites like the Biblioteca Virtual Miguel de Cervantes, to populate its database. Next, it used well-known cultural heritage websites to ‘train’ web crawlers.

A web crawler is an automated program that accesses a website and traverses through the site by following the links present on the pages. Crawlers index links and information found on the various websites.

The MultiMatch crawlers are self-learning, so after they were shown cultural heritage websites, followed by sites that were not related to cultural heritage, the crawlers ‘learned’ what to look for. Over time, the system becomes self-refining, as it learns appropriate and inappropriate websites.

The system can also identify relevant material via an in-depth ‘crawling’ of selected cultural heritage institutions. And the system is not just multilingual, it speaks metadata as well, the lingua franc of the ‘semantic web’ – an attempt to help machines ‘understand’ the context and significance of specific types of data.

The result is that MultiMatch can take advantage of whatever metadata descriptions are in place, typically in an archive.

But MultiMatch goes further. If there is no metadata, it tries to infer the semantic content of a page – what it means and what it refers to – and this, too, is self-learning, and so will improve over time.

MultiMatch can also automatically extract information which can then be used to create cross-referencing, via hyperlinks, between related material, such as the biography of an artist, exhibitions of his/her work, a video documentary or critical appraisals, and so on.

Obsessive wikis

“We hope, in the future, to take functionality further, so that you could search for Cubism, for example, or any art movement. The query would return a categorised and prioritised table of contents for that very specific topic. The system can not do that yet, but it is something we want to develop in the future,” Savino explains.

It would be like a personalised Wikipedia, created on the fly, that caters uniquely to your obsession with Cubism.

In the meantime, Savino and the MultiMatch team are focusing on three prototype demonstrators to test the prototype of the system. One will support teachers trying to develop a lesson plan, the other two will focus on archiving and tourism applications that are still to be finalised.

Commercial breaks

The team, however, do not expect any surprises in the tests: the system has been working reliably in the lab up to now. Once it is validated, however, several of the partners will incorporate aspects of the work into their commercial products.

“There is also the possibility that one of the partners will develop a new product from our results,” notes Savino, though he emphasises that the current platform version is a prototype and would need more work to make into a commercial product.

The lead partner, Savino’s ISTI-CNR, will keep the demonstrator running online for at least one year after the project finishes, in the winter of 2008, to carry on further work.

In the meantime, MultiMatch technology will be used in two other European-funded projects: Europeana, a major effort to provide online access to 2 million digital objects from the continent’s archives, museums and libraries, and the European Film Gateway, a similar project specialising its work in moving images.

“These projects are mainly using the technology we developed to ensure interoperability between different archiving systems, and multilingual search and discovery,” confirms Savino.

It is an indication of the value of the Multimatch search technology. The MultiMatch project received funding from the ICT strand of Sixth Framework Programme for research. The technology was also showcased at the ICT 2008 meeting in Lyon.


Story Source:

The above story is based on materials provided by ICT Results. Note: Materials may be edited for content and length.


Cite This Page:

ICT Results. "Multilingual Culture and Heritage Internet Search System Developed." ScienceDaily. ScienceDaily, 30 December 2008. <www.sciencedaily.com/releases/2008/12/081224094636.htm>.
ICT Results. (2008, December 30). Multilingual Culture and Heritage Internet Search System Developed. ScienceDaily. Retrieved October 21, 2014 from www.sciencedaily.com/releases/2008/12/081224094636.htm
ICT Results. "Multilingual Culture and Heritage Internet Search System Developed." ScienceDaily. www.sciencedaily.com/releases/2008/12/081224094636.htm (accessed October 21, 2014).

Share This



More Computers & Math News

Tuesday, October 21, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

Thanks, Marty McFly! Hoverboards Could Be Coming In 2015

Thanks, Marty McFly! Hoverboards Could Be Coming In 2015

Newsy (Oct. 21, 2014) If you've ever watched "Back to the Future Part II" and wanted to get your hands on a hoverboard, well, you might soon be in luck. Video provided by Newsy
Powered by NewsLook.com
Robots to Fly Planes Where Humans Can't

Robots to Fly Planes Where Humans Can't

Reuters - Innovations Video Online (Oct. 21, 2014) Researchers in South Korea are developing a robotic pilot that could potentially replace humans in the cockpit. Unlike drones and autopilot programs which are configured for specific aircraft, the robots' humanoid design will allow it to fly any type of plane with no additional sensors. Ben Gruber reports. Video provided by Reuters
Powered by NewsLook.com
Japanese Scientists Unveil Floating 3D Projection

Japanese Scientists Unveil Floating 3D Projection

Reuters - Innovations Video Online (Oct. 20, 2014) Scientists in Tokyo have demonstrated what they say is the world's first 3D projection that floats in mid air. A laser that fires a pulse up to a thousand times a second superheats molecules in the air, creating a spark which can be guided to certain points in the air to shape what the human eye perceives as an image. Matthew Stock reports. Video provided by Reuters
Powered by NewsLook.com
Apple Enters Mobile Payment Business

Apple Enters Mobile Payment Business

AP (Oct. 20, 2014) Apple is making a strategic bet with the launch of Apple Pay, the mobile pay service aimed at turning your iPhone into your wallet. (Oct. 20) Video provided by AP
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:

Breaking News:

Strange & Offbeat Stories


Space & Time

Matter & Energy

Computers & Math

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile: iPhone Android Web
Follow: Facebook Twitter Google+
Subscribe: RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins