Featured Research

from universities, journals, and other organizations

Language: New analysis contradicts earlier findings

Date:
June 2, 2014
Source:
Linguistic Society of America
Summary:
New research presents evidence that the methods employed by the authors of articles published in international science journals are not supported by a more rigorous linguistic analysis. The new analysis comes in response to a number of papers published in high-profile science publications that have argued that statistical analyses of symbol combinations can provide insights into the origins of written language.

New research published in the June 2014 issue of Language presents evidence that the methods employed by the authors of articles published in prestigious international science journals are not supported by a more rigorous linguistic analysis. The Language article, "A statistical comparison of written language and non-linguistic symbol systems," was authored by Richard Sproat, a Research Scientist at Google, based on work he previously did at the Oregon Health & Science University.

Sproat's analysis comes in response to a number of papers published in high-profile science publications that have argued that statistical analyses of symbol combinations can provide insights into the origins of written language. One paper, by Rajesh Rao (University of Washington), Iravatham Mahadevan (Indus Research Centre) and colleagues at the TATA Institute in Mumbai, India, appeared in 2009 in the journal Science. It argued that a particular statistical measure -- bigram conditional entropy -- showed that the Indus Valley symbols behave more like those in linguistic texts than those of non-linguistic systems. In another paper in the Proceedings of the Royal Society, Rob Lee and colleagues (University of Exeter) claimed that a more sophisticated set of entropic measures put Pictish symbols in the same category as linguistic texts. Both papers (and other subsequent papers by Rao and his colleagues) received a large amount of attention from the news media. In these popular media accounts, the techniques were often presented as demonstrating that the symbol systems in question were written language, though this was not necessarily the intention of the authors.

Understanding statistical techniques for analyzing symbol systems and what they do and do not show is of fundamental importance to language science, as there are many old or ancient symbol systems whose function is largely or completely unknown. Examples include the Easter Island rongorongo inscriptions (19th century), the Pictish symbols of Scotland (6th century onwards), and the Indus Valley symbols (Northern India, Pakistan, 3rd millennium BCE). As part of his work on the question of whether symbol systems such as these exemplify written language, Sproat developed large, structured collections of text, or corpora, from a variety of non-linguistic systems, both ancient and modern, including Mesopotamian deity symbols (Babylonia), Totem poles (Pacific Northwest), Pennsylvania barn stars ("hex signs"), weather forecast icon sequences from http://www.wunderground.com, and Unicode characters for Asian emoticons. He compared these to corpora developed from fourteen languages representing a variety of different writing-system types, both ancient and modern.

From the point of view of the measures that had been proposed in the previous literature, all of the non-linguistic symbol systems in Sproat's collection or corpora behaved the same as the linguistic systems. However, he also found that a novel measure of the amount of local repetition and a version of one of Lee and colleagues' entropic measures with a different setting than they used could accurately distinguish two different categories of symbol systems. Moreover, his statistical procedure, unlike the earlier ones, classifies both the Pictish and Indus Valley symbols as non-linguistic.

Despite these promising results, Sproat cautions against relying too heavily on statistical measures to analyze ancient symbol systems that have not been deciphered. All statistical measures are heavily influenced by, among other things, the size of the corpus, the length of texts, and what kind of text is involved. Shopping lists, for example, have statistical properties that distinguish them from running prose from a novel. He argues that a truly reliable demonstration that a collection of symbols exemplifies written language requires supporting empirical evidence, such as a credible decipherment or independent archeological evidence of a related culture of active literacy. What is clear, however, is that the previously proposed statistical methods simply do not work for the intended purpose.


Story Source:

The above story is based on materials provided by Linguistic Society of America. Note: Materials may be edited for content and length.


Journal Reference:

  1. Richard Sproat. A statistical comparison of written language and nonlinguistic symbol systems. Language, 2014 [link]

Cite This Page:

Linguistic Society of America. "Language: New analysis contradicts earlier findings." ScienceDaily. ScienceDaily, 2 June 2014. <www.sciencedaily.com/releases/2014/06/140602101559.htm>.
Linguistic Society of America. (2014, June 2). Language: New analysis contradicts earlier findings. ScienceDaily. Retrieved August 1, 2014 from www.sciencedaily.com/releases/2014/06/140602101559.htm
Linguistic Society of America. "Language: New analysis contradicts earlier findings." ScienceDaily. www.sciencedaily.com/releases/2014/06/140602101559.htm (accessed August 1, 2014).

Share This




More Computers & Math News

Friday, August 1, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

Google (Kind Of) Complies With 'Right To Be Forgotten Law'

Google (Kind Of) Complies With 'Right To Be Forgotten Law'

Newsy (July 31, 2014) Google says it is following Europe's new "Right To Be Forgotten Law," which eliminates user information upon request, but only to a certain degree. Video provided by Newsy
Powered by NewsLook.com
Tesla, Panasonic Ink Deal To Make Huge Battery 'Gigafactory'

Tesla, Panasonic Ink Deal To Make Huge Battery 'Gigafactory'

Newsy (July 31, 2014) The deal will help build a massive battery factory that Tesla says will produce 500,000 lithium batteries by 2020. Video provided by Newsy
Powered by NewsLook.com
Sprint's Custom Prepaid Plans Draw Net Neutrality Fire

Sprint's Custom Prepaid Plans Draw Net Neutrality Fire

Newsy (July 31, 2014) Sprint's Virgin Mobile Custom plan offers optional social network access that doesn't count against data caps — but critics are crying foul. Video provided by Newsy
Powered by NewsLook.com
Britain Testing Driverless Cars on Roadways

Britain Testing Driverless Cars on Roadways

AP (July 30, 2014) British officials said on Wednesday that driverless cars will be tested on roads in as many as three cities in a trial program set to begin in January. Officials said the tests will last up to three years. (July 30) Video provided by AP
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:

Breaking News:
from the past week

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile: iPhone Android Web
Follow: Facebook Twitter Google+
Subscribe: RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins