Science News

Painstaking Search For Quotes In Television Programs Comes To An End

ScienceDaily (July 8, 2009) — The journalist recalls more or less what Ulla Schmidt said regarding the health reform, but needs the exact wording to be able to cite her. A new speech recognition system helps to search TV broadcasts. It does not need to be updated and so does not entail any running costs.

When was the financial crisis first mentioned in the news? What was it that Angela Merkel said concerning the presidential elections? Until now, journalists, archivists and media observers have had to search painstakingly to find a specific section in a TV recording – or have had to invest heaps of money in speech recognition software. The systems currently available to take over the search have to be regularly updated by specialists and therefore entail high running costs.

These systems are based on a kind of thesaurus containing all the words they can recognize. However, new topics and personalities bring along new words like “financial crisis” or names such as “Obama”. These terms need to be transferred to the thesaurus so that they can be found.

Researchers at the Fraunhofer Institute for Intelligent Analysis and Information Systems IAIS in Sankt Augustin have developed a speech recognition system that does not require expensive updating measures. “Our system is based on a syllable thesaurus instead of a word thesaurus. Conventional speech recognizers can only discern a limited number of words, while the total number of words in existence is too vast to handle. The number of existing syllables, on the other hand, is manageable. With about 10,000 stored syllables we can make up any word,” says IAIS scientist Daniel Schneider. The program can even acquire new words independently by composing them from the stored syllables: fi-nan-cial cri-sis. It does not need to be updated and so does not entail any running costs.

For each search, the programs are first of all split into segments. Whenever a new speaker starts to talk or a film contribution begins – in which case the content of the audio track changes – the program saves the following scene as a new segment. The user can then navigate from speaker to speaker, and can choose to watch only the contributions of one particular interview partner. In a second step, the individual words are analyzed by speech algorithms. Users can apply the program just like a conventional search engine. You simply enter the search term, and a few milliseconds later the program has scanned 10,000 hours of processed data. Just like an Internet search engine, it displays the results in context in their given sentences. The user then simply clicks on a word to play back the relevant section of film material.

The system can find over 85 percent of the spoken words in a program, and 99 out of a 100 located contributions are correct. A license model of the program is already available.

Email or share this story:
| More

Story Source:

Adapted from materials provided by Fraunhofer-Gesellschaft.

APA

MLA

Note: If no author is given, the source is cited instead.

Search ScienceDaily

Number of stories in archives: 78,103

Find with keyword(s):
 
Enter a keyword or phrase to search ScienceDaily's archives for related news topics,
the latest news stories, reference articles, science videos, images, and books.

 

Science Video News


Image Based Search Engine Created

VizSeek is one of the first search engines on the Internet to use a photograph, a 2D image, or a 3D model and transform it into a 3D shape. The. ...  > full story

Breaking News

... from NewsDaily.com

In Other News ...

Copyright Reuters 2008. See Restrictions.

Free Subscriptions

... from ScienceDaily

Get the latest science news with our free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Feedback

... we want to hear from you!

Tell us what you think of the new ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Post this page to your favorite social bookmarking site:
close
Include this item in your blog or web site:
close
Cite this article in your essay, paper, or report:
close
Email this page's link to a friend or colleague:
close