Featured Research

from universities, journals, and other organizations

Grammar Lost Translation Machine In Researchers Fix Try Will

Date:
September 9, 2005
Source:
University of Southern California
Summary:
The makers of a University of Southern California computer translation system consistently rated among the world's best are teaching their software something new: English grammar.

A Tree Grows in Translation: Grammatical structure, long in second place, is emerging as a key to better English in the finished product.
Credit: Image courtesy of Information Sciences Institute, University of Southern California Viterbi School of Engineering

Related Articles


Software matchesup phrases that consistently show up in parallel fashion — the English"my brother's pants" and Spanish "los pantalones de mi hermano," — andthen use these matches to piece together translations of new material.

Itworks — but only to a point. ISI machine translation expert DanielMarcu (left) says that when such a system is "trained on enoughrelevant bilingual text ... it can break a foreign language up intophrasal units, translate each of them fairly well into English, and dosome re-ordering. However, even in this good scenario, the output isstill clearly not English. It takes too long to read, and it isunsatisfactory for commercial use."

So Marcu and colleague KevinKnight (right), both ISI project leaders who also hold appointments inthe USC Viterbi School of Engineering department of computer science,have begun an intensive $285,000 effort, called the Advanced LanguageModeling for Machine Translation project, to improve the system theycreated at ISI by subjecting the texts that come out of theirtranslation engine to a follow-on step: grammatical processing.

Thestep seems simple, but is actually imposingly difficult. "For example,there is no robust algorithm that returns 'grammatical' or'ungrammatical' or 'sensible' or 'nonsense' in response to a user-typedsequence of words," Marcu notes.

The problem grows out of anatural language feature noted by M.I.T. language theorist Noam Chomskydecades ago. Language users have literally a limitless ability to nestand cross-nest phrases and ideas into intricate referential structures— "I was looking for the stirrups from the saddle that my ex-wife'soldest daughter took with her when she went to Jack's new place inColorado three years ago, but all she had were Louise's second-handsaddle shoes, the ones Ethel's dog chewed during the fire."

Unravelingthese verbal cobwebs (or, in the more common description, tracingbranching "trees" of connections) is such a daunting task thatprogrammers long ago went in the brute force direction of matchingphrases and hoping that the relation of the phrases would become clearto readers.

With the limits of this approach becoming clear,researchers have now begun applying computing power to trying toassemble grammatical rules. According to Knight, one crucial step hasbeen the creation of a large database of English text whose syntax hasbeen hand-decoded by humans, the "Penn Treebank."

Using this andother sources, computer scientists have begun developing ways to modelthe observed rules. A preliminary study by Knight and two colleagues in2003 showed that this approach might be able to improve translations.

Accordingly,for their study, "We propose to implement a trainable tree-basedlanguage model and parser, and to carry out empiricalmachine-translation experiments with them. USC/ISI's state-of-the-artmachine translation system already has the ability to produce, for anyinput sentence, a list of 25,000 candidate English outputs. This listcan be manipulated in a post-processing step. We will re-rank theselists of candidate string translations with our tree- based languagemodel, and we plan for better translations to rise to the top of thelist."

One crucial trick that the system must be able to do is topick out separate trees from the endless strings of words. But this isdoable, Knight believes -- and in the short, not the long term.

Referringto the annual review of translation systems by the National Instituteof Science and Technology, in which ISI consistently gains top scores,"we want to have the grammar module installed and working by the nextevaluation, in August 2006," he said.

Knight and Marcu arecofounders and, respectively, chief scientist and chief technology andoperating officer of a spinoff company, Language Weaver.


Story Source:

The above story is based on materials provided by University of Southern California. Note: Materials may be edited for content and length.


Cite This Page:

University of Southern California. "Grammar Lost Translation Machine In Researchers Fix Try Will." ScienceDaily. ScienceDaily, 9 September 2005. <www.sciencedaily.com/releases/2005/09/050909074409.htm>.
University of Southern California. (2005, September 9). Grammar Lost Translation Machine In Researchers Fix Try Will. ScienceDaily. Retrieved October 24, 2014 from www.sciencedaily.com/releases/2005/09/050909074409.htm
University of Southern California. "Grammar Lost Translation Machine In Researchers Fix Try Will." ScienceDaily. www.sciencedaily.com/releases/2005/09/050909074409.htm (accessed October 24, 2014).

Share This



More Computers & Math News

Friday, October 24, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

The Best Apps to Organize Your Life

The Best Apps to Organize Your Life

Buzz60 (Oct. 23, 2014) — Need help organizing your bills, schedules and other things? Ko Im (@konakafe) has the best apps to help you stay on top of it all! Video provided by Buzz60
Powered by NewsLook.com
Nike And Apple Team Up To Create Wearable ... Something

Nike And Apple Team Up To Create Wearable ... Something

Newsy (Oct. 23, 2014) — For those looking for wearable tech that's significantly less nerdy than Google Glass, Nike CEO Mark Parker says don't worry, It's on the way. Video provided by Newsy
Powered by NewsLook.com
Chameleon Camouflage to Give Tanks Cloaking Capabilities

Chameleon Camouflage to Give Tanks Cloaking Capabilities

Reuters - Innovations Video Online (Oct. 22, 2014) — Inspired by the way a chameleon changes its colour to disguise itself; scientists in Poland want to replace traditional camouflage paint with thousands of electrochromic plates that will continuously change colour to blend with its surroundings. The first PL-01 concept tank prototype will be tested within a few years, with scientists predicting that a similar technology could even be woven into the fabric of a soldiers' clothing making them virtually invisible to the naked eye. Matthew Stock reports. Video provided by Reuters
Powered by NewsLook.com
Internet of Things Aims to Smarten Your Life

Internet of Things Aims to Smarten Your Life

AP (Oct. 22, 2014) — As more and more Bluetooth-enabled devices are reaching consumers, developers are busy connecting them together as part of the Internet of Things. (Oct. 22) Video provided by AP
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
 
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:  

Breaking News:

Strange & Offbeat Stories

 

Space & Time

Matter & Energy

Computers & Math

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:  

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile iPhone Android Web
Follow Facebook Twitter Google+
Subscribe RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins