Featured Research

from universities, journals, and other organizations

In the quest for excellence in machine translation

Date:
June 27, 2013
Source:
Elhuyar Fundazioa
Summary:
The IXA Group of the UPV/EHU-University of the Basque Country has just embarked on a new European project to take a qualitative leap forward in machine translation: QT Leap. The project aims to significantly improve the quality of today’s results in machine translation, which are far from perfect. The project has a duration of three years.

The IXA Group of the UPV/EHU-University of the Basque Country has just embarked on a new European project to take a qualitative leap forward in machine translation: QT Leap. The project aims to significantly improve the quality of today's results in machine translation, which are far from perfect. The project has a duration of three years.

Related Articles


All machine translators have their limitations. The translations are not perfect by any means, but they are suitable tools for providing help on more than one occasion. The UPV/EHU's IXA Group has recently secured a European project to overcome these limitations and to go on conducting research into machine translation: QTLeap. The IXA Group will be working in collaboration with many other European institutions that are pioneers in machine translation: the DKFI of Germany, University of Lisbon, the Charles University in Prague, the Bulgarian Academy of Sciences IICT-BAS, the Humboldt University of Berlin, and the University of Groningen.

"It's easier to produce a good machine translation between language pairs that are grammatically and morphologically similar, like the Spanish-Catalan or the Spanish-Galician pairs," explained Kepa Sarasola, a member of the IXA Group. In the case of Basque, however, the difficulty is greater and the quality more doubtful. "There are three main difficulties: firstly, the big morphological and grammatical differences between Basque and other languages; secondly, the selection of suitable equivalents that one word has in other languages (having to choose which meaning to use in each specific context); and finally, having a limited corpus of translated texts." Compared with other languages, Basque has a totally different structure and it is very difficult for machine translators to get the order of the elements in the translation right. In addition, one of the main challenges facing Basque machine translators is to obtain a sufficiently large number of translated texts so that large corpora can be produced.

Closer to perfection

Members of the UPV/EHU's IXA Group have been working hard on machine translators, and with the European QTLeap project they want to take a new step forward in this field and on the research level. In this project they will be endeavouring to overcome the limitations of today's machine translators.

For this purpose they will be using treebanks. "The aim behind using a lot of sentences that have been properly analysed syntactically, in other words by means of treebanks, is to help machine translators select the syntax better," says Sarasola.

On the other hand, the information needed to distinguish the meaning of a word is not only to be found in dictionaries, it can also be obtained from the Internet. So certain Internet resources will be used to properly distinguish between the meanings of words in order to dispel any doubts. For example, in large text collections like Wikipedia, very often the meaning of a word has been properly specified. In other words, "if there is a link below a word, the link will lead to one Wikipedia meaning or another." If you gather a lot of these links together, computers can learn from them in order to distinguish between meanings. Wikipedia is just one of the possible sources. "Today there are more and more texts on the Internet with links of this kind; all these possibilities are known as Linked Open Data," he added. That would help, for example, to clarify whether the Basque word baso is bosque (forest) or vaso (glass) in Spanish.

"We will also be working specifically on proper names, people's names, names of organisations or geographical locations. In fact, it is impossible to have all proper names properly tagged, but having a large number of them sorted would greatly improve the quality of the translations," said Sarasola. That would provide an opportunity, for example, to keep the name Pilar del Castillo as it is in translations and not to end up with the result gazteluko pilarea (the castle pillar).

These resources can be consulted in two ways: off-line or on-line. Off-line, a lot of information can be gathered and organised before starting to translate, and it can then be easily used in the translations. In an on-line consultation, on the other hand, while translating the program can resort to the Internet and look up how to translate the word or proper name that the machine does not understand.

Therefore, these three main branches -- treebanks, Internet resources and proper names -- will be studied by the IXA Group from November onwards in the QTLeap project, to look for new solutions for the three main problems dogging current Basque machine translators -- morphological and grammatical differences, meaning disambiguation, and poor corpus.

The long journey of machine translation

In the 1950s people started working on ideas to develop machine translation. Since then, many approaches have been put forward with the aim of achieving a successful system. The UPV/EHU's IXA Group has been working hard on these tools over the last fifteen years in collaboration with Elhuyar.

Recent years have seen a flowering of tools over the Internet for doing machine translation, including ones that anyone can download and use. One of them is Opentrad. OpenTrad is an open-code-based machine translation system available in all of Spain's official languages: Spanish, Catalan, Galician and Basque. Opentrad translation is underpinned by two technologies: Matxin (Spanish-Basque) developed by the UPV/EHU's IXA Group, and Apertium (Spanish-Catalan-Galician, etc.).


Story Source:

The above story is based on materials provided by Elhuyar Fundazioa. Note: Materials may be edited for content and length.


Cite This Page:

Elhuyar Fundazioa. "In the quest for excellence in machine translation." ScienceDaily. ScienceDaily, 27 June 2013. <www.sciencedaily.com/releases/2013/06/130627102545.htm>.
Elhuyar Fundazioa. (2013, June 27). In the quest for excellence in machine translation. ScienceDaily. Retrieved October 24, 2014 from www.sciencedaily.com/releases/2013/06/130627102545.htm
Elhuyar Fundazioa. "In the quest for excellence in machine translation." ScienceDaily. www.sciencedaily.com/releases/2013/06/130627102545.htm (accessed October 24, 2014).

Share This



More Computers & Math News

Friday, October 24, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

Microsoft Riding High On Strong Surface, Cloud Performance

Microsoft Riding High On Strong Surface, Cloud Performance

Newsy (Oct. 24, 2014) — Microsoft's Q3 earnings showed its tablets and cloud services are really hitting their stride. Video provided by Newsy
Powered by NewsLook.com
The Best Apps to Organize Your Life

The Best Apps to Organize Your Life

Buzz60 (Oct. 23, 2014) — Need help organizing your bills, schedules and other things? Ko Im (@konakafe) has the best apps to help you stay on top of it all! Video provided by Buzz60
Powered by NewsLook.com
Nike And Apple Team Up To Create Wearable ... Something

Nike And Apple Team Up To Create Wearable ... Something

Newsy (Oct. 23, 2014) — For those looking for wearable tech that's significantly less nerdy than Google Glass, Nike CEO Mark Parker says don't worry, It's on the way. Video provided by Newsy
Powered by NewsLook.com
Chameleon Camouflage to Give Tanks Cloaking Capabilities

Chameleon Camouflage to Give Tanks Cloaking Capabilities

Reuters - Innovations Video Online (Oct. 22, 2014) — Inspired by the way a chameleon changes its colour to disguise itself; scientists in Poland want to replace traditional camouflage paint with thousands of electrochromic plates that will continuously change colour to blend with its surroundings. The first PL-01 concept tank prototype will be tested within a few years, with scientists predicting that a similar technology could even be woven into the fabric of a soldiers' clothing making them virtually invisible to the naked eye. Matthew Stock reports. Video provided by Reuters
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
 
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:  

Breaking News:

Strange & Offbeat Stories

 

Space & Time

Matter & Energy

Computers & Math

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:  

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile iPhone Android Web
Follow Facebook Twitter Google+
Subscribe RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins