Featured Research

from universities, journals, and other organizations

Mining the blogosphere: Researchers develop tools that make sense of social media

Date:
September 6, 2012
Source:
Concordia University
Summary:
Can a computer "read" an online blog and understand it? Several computer scientists are helping to get closer to that goal.

Can a computer "read" an online blog and understand it? Several Concordia computer scientists are helping to get closer to that goal.

Leila Kosseim, associate professor in Concordia's Faculty of Engineering and Computer Science, and a recently-graduated doctoral student, Shamima Mithun, have developed a system called BlogSum that has potentially vast applications. It allows an organization to pose a question and then find out how a large number of people talking online would respond. The system is capable of gauging things like consumer preferences and voter intentions by sorting through websites, examining real-life self-expression and conversation, and producing summaries that focus exclusively on the original question.

"Huge quantities of electronic texts have become easily available on the Internet, but people can be overwhelmed, and they need help to find the real content hiding in the mass of information," explains Kosseim, one of the lead researchers at Concordia's Computational Linguistics Laboratory (CLaC lab).

Analyzing informally-written language poses unique challenges compared to analyzing, for example, a news article. Blogs, forums and the like contain opinions, emotions and speculations, not to mention spelling errors and poor grammar. A summarization tool must address two particular problems, question irrelevance (sentences that are not relevant to the main question), and discourse incoherence, (sentences in which the intent of the writer is unclear).

BlogSum met these challenges with demonstrable efficiency. The researchers developed and tested their tool by examining a set of blogs and review sites. BlogSum used "discourse relations" to crunch the data -- ways of filtering and ordering sentences into coherent summaries. BlogSum was measured against prior computational rankings and achieved mostly superior results. In addition, it was evaluated by actual human subjects, who also found it to be superior. Summaries produced by BlogSum reduced question irrelevance and discourse incoherence, successfully distilling large amounts of text into highly readable summaries.

This study is an example of Natural Language Processing (NLP), in which Concordia, through the CLaC lab, is a leader. NLP stands at the intersection of artificial intelligence and linguistics, seeking to enable computers to derive meaning from human language.

"The field of natural language processing is starting to become fundamental to computer science, with many everyday applications -- making search engines find more relevant documents or making smart phones even smarter," explained Kosseim.


Story Source:

The above story is based on materials provided by Concordia University. Note: Materials may be edited for content and length.


Cite This Page:

Concordia University. "Mining the blogosphere: Researchers develop tools that make sense of social media." ScienceDaily. ScienceDaily, 6 September 2012. <www.sciencedaily.com/releases/2012/09/120906131352.htm>.
Concordia University. (2012, September 6). Mining the blogosphere: Researchers develop tools that make sense of social media. ScienceDaily. Retrieved July 23, 2014 from www.sciencedaily.com/releases/2012/09/120906131352.htm
Concordia University. "Mining the blogosphere: Researchers develop tools that make sense of social media." ScienceDaily. www.sciencedaily.com/releases/2012/09/120906131352.htm (accessed July 23, 2014).

Share This




More Computers & Math News

Wednesday, July 23, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

Robot Parking Valet Creates Stress-Free Travel

Robot Parking Valet Creates Stress-Free Travel

AP (July 23, 2014) 'Ray' the robotic parking valet at Dusseldorf Airport in Germany lets travelers to avoid the hassle of finding a parking spot before heading to the check-in desk. (July 23) Video provided by AP
Powered by NewsLook.com
Six Indicted in StubHub Hacking Scheme

Six Indicted in StubHub Hacking Scheme

AP (July 23, 2014) Six people were indicted Wednesday in an international ring that took over more than 1,000 StubHub users' accounts and fraudulently bought tickets that were then resold. (July 23) Video provided by AP
Powered by NewsLook.com
The Reviews Are In For The Amazon Fire Phone

The Reviews Are In For The Amazon Fire Phone

Newsy (July 23, 2014) Amazon's first smartphone, the Fire Phone, is set to ship this week, and so far the reviews have been pretty mixed. Video provided by Newsy
Powered by NewsLook.com
Bigger Apple Phone, Bigger Orders

Bigger Apple Phone, Bigger Orders

Reuters - Business Video Online (July 22, 2014) Apple is asking suppliers to make 70 to 80 million units of its new larger screen iPhone, a lot more initially than its current model. Fred Katayama reports. Video provided by Reuters
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:

Breaking News:
from the past week

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile: iPhone Android Web
Follow: Facebook Twitter Google+
Subscribe: RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins