Featured Research

from universities, journals, and other organizations

Scientists create automated 'time machine' to reconstruct ancient languages

Date:
February 12, 2013
Source:
University of California - Berkeley
Summary:
Ancient languages hold a treasure trove of information about the culture, politics and commerce of millennia past. Yet, reconstructing them to reveal clues into human history can require decades of painstaking work. Now, scientists have created an automated "time machine," of sorts, that will greatly accelerate and improve the process of reconstructing hundreds of ancestral languages.

Proto-Austronesian “genealogical tree.”
Credit: Image courtesy of University of California - Berkeley

Ancient languages hold a treasure trove of information about the culture, politics and commerce of millennia past. Yet, reconstructing them to reveal clues into human history can require decades of painstaking work. Now, scientists at the University of California, Berkeley, have created an automated "time machine," of sorts, that will greatly accelerate and improve the process of reconstructing hundreds of ancestral languages.

In a compelling example of how "big data" and machine learning are beginning to make a significant impact on all facets of knowledge, researchers from UC Berkeley and the University of British Columbia have created a computer program that can rapidly reconstruct "proto-languages" -- the linguistic ancestors from which all modern languages have evolved. These earliest-known languages include Proto-Indo-European, Proto-Afroasiatic and, in this case, Proto-Austronesian, which gave rise to languages spoken in Southeast Asia, parts of continental Asia, Australasia and the Pacific.

"What excites me about this system is that it takes so many of the great ideas that linguists have had about historical reconstruction, and it automates them at a new scale: more data, more words, more languages, but less time," said Dan Klein, an associate professor of computer science at UC Berkeley and co-author of the paper published online Feb. 11 in the journal Proceedings of the National Academy of Sciences.

The research team's computational model uses probabilistic reasoning -- which explores logic and statistics to predict an outcome -- to reconstruct more than 600 Proto-Austronesian languages from an existing database of more than 140,000 words, replicating with 85 percent accuracy what linguists had done manually. While manual reconstruction is a meticulous process that can take years, this system can perform a large-scale reconstruction in a matter of days or even hours, researchers said.

Not only will this program speed up the ability of linguists to rebuild the world's proto-languages on a large scale, boosting our understanding of ancient civilizations based on their vocabularies, but it can also provide clues to how languages might change years from now.

"Our statistical model can be used to answer scientific questions about languages over time, not only to make inferences about the past, but also to extrapolate how language might change in the future," said Tom Griffiths, associate professor of psychology, director of UC Berkeley's Computational Cognitive Science Lab and another co-author of the paper.

The discovery advances UC Berkeley's mission to make sense of big data and to use new technology to document and maintain endangered languages as critical resources for preserving cultures and knowledge. For example, researchers plan to use the same computational model to reconstruct indigenous North American proto-languages.

Humans' earliest written records date back less than 6,000 years, long after the advent of many proto-languages. While archeologists can catch direct glimpses of ancient languages in written form, linguists typically use what is known as the "comparative method" to probe the past. This method establishes relationships between languages, identifying sounds that change with regularity over time to determine whether they share a common mother language.

"To understand how language changes -- which sounds are more likely to change and what they will become -- requires reconstructing and analyzing massive amounts of ancestral word forms, which is where automatic reconstructions play an important role," said Alexandre Bouchard-Côté, an assistant professor of statistics at the University of British Columbia and lead author of the study, which he started while a graduate student at UC Berkeley.

The UC Berkeley computational model is based on the established linguistic theory that words evolve along the branches of a family tree -- much like a genealogical tree -- reflecting linguistic relationships that evolve over time, with the roots and nodes representing proto-languages and the leaves representing modern languages.

Using an algorithm known as the Markov chain Monte Carlo sampler, the program sorted through sets of cognates, words in different languages that share a common sound, history and origin, to calculate the odds of which set is derived from which proto-language. At each step, it stored a hypothesized reconstruction for each cognate and each ancestral language.

"Because the sound changes and reconstructions are closely linked, our system uses them to repeatedly improve each other," Klein said. "It first fixes its predicted sound changes and deduces better reconstructions of the ancient forms. It then fixes the reconstructions and re-analyzes the sound changes. These steps are repeated, and both predictions gradually improve as the underlying structure emerges over time."


Story Source:

The above story is based on materials provided by University of California - Berkeley. The original article was written by Yasmin Anwar. Note: Materials may be edited for content and length.


Journal Reference:

  1. A. Bouchard-Cote, D. Hall, T. L. Griffiths, D. Klein. Automated reconstruction of ancient languages using probabilistic models of sound change. Proceedings of the National Academy of Sciences, 2013; DOI: 10.1073/pnas.1204678110

Cite This Page:

University of California - Berkeley. "Scientists create automated 'time machine' to reconstruct ancient languages." ScienceDaily. ScienceDaily, 12 February 2013. <www.sciencedaily.com/releases/2013/02/130212112025.htm>.
University of California - Berkeley. (2013, February 12). Scientists create automated 'time machine' to reconstruct ancient languages. ScienceDaily. Retrieved October 22, 2014 from www.sciencedaily.com/releases/2013/02/130212112025.htm
University of California - Berkeley. "Scientists create automated 'time machine' to reconstruct ancient languages." ScienceDaily. www.sciencedaily.com/releases/2013/02/130212112025.htm (accessed October 22, 2014).

Share This



More Mind & Brain News

Wednesday, October 22, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

Working Mother Getaway: Beaches Turks & Caicos

Working Mother Getaway: Beaches Turks & Caicos

Working Mother (Oct. 22, 2014) — Feast your eyes on this gorgeous family-friendly resort. Video provided by Working Mother
Powered by NewsLook.com
What Your Favorite Color Says About You

What Your Favorite Color Says About You

Buzz60 (Oct. 22, 2014) — We all have one color we love to wear, and believe it or not, your color preference may reveal some of your character traits. In celebration of National Color Day, Krystin Goodwin (@kyrstingoodwin) highlights what your favorite colors may say about you. Video provided by Buzz60
Powered by NewsLook.com
First-Of-Its-Kind Treatment Gives Man Ability To Walk Again

First-Of-Its-Kind Treatment Gives Man Ability To Walk Again

Newsy (Oct. 21, 2014) — A medical team has for the first time given a man the ability to walk again after transplanting cells from his brain onto his severed spinal cord. Video provided by Newsy
Powered by NewsLook.com
Portable Breathalyzer Gets You Home Safely

Portable Breathalyzer Gets You Home Safely

Buzz60 (Oct. 21, 2014) — Breeze, a portable breathalyzer, gets you home safely by instantly showing your blood alcohol content, and with one tap, lets you call an Uber, a cab or a friend from your contact list to pick you up. Sean Dowling (@SeanDowlingTV) has the details. Video provided by Buzz60
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
 
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:  

Breaking News:

More Coverage


Computerized 'Rosetta Stone' Reconstructs Ancient Languages

Feb. 11, 2013 — Researchers have used a sophisticated new computer system to quickly reconstruct protolanguages -- the rudimentary ancient tongues from which modern languages ... read more

Strange & Offbeat Stories

 

Health & Medicine

Mind & Brain

Living & Well

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:  

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile iPhone Android Web
Follow Facebook Twitter Google+
Subscribe RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins