Featured Research

from universities, journals, and other organizations

Pushing genome data analysis one step forward

Date:
October 28, 2012
Source:
Centre for Genomic Regulation
Summary:
Due to the exponential increase in sequencing capacity, efficient tools for data analysis are becoming essential to process the vast amount of biological data. Scientists have now developed a tool for the interpretation of genomic data that is several times faster and much more accurate than other tools currently being used.

Due to the exponential increase in sequencing capacity, efficient tools for data analysis are becoming essential to process the vast amount of biological data. The GEM project, led by Paolo Ribeca from the Centro Nacional de Análisis Genómico (CNAG) and involving scientists from that center and from the Centre for Genomic Regulation (CRG), has produced a tool for the interpretation of genomic data that is several times faster and much more accurate than other tools currently in use.

The study has been published in the journal Nature Methods.

If we use the well-known comparison of the genome with a book, then we can say without fear of being wrong that it is a very complicated book. It is thousands of times bigger than a regular book, with more than 3 billion letters in total, each one being an A, C, G or T, for the four possible bases of the DNA code. One can see the genome as a sequence of millions of words with no breaks between them and no capitalization or punctuation. Most words occur only once in the genome, but some can be found thousands of times with small variations. And reading this book gets even more complicated when you can only see short sentences of a few words, each one randomly extracted from the book.

The latest-generation sequencing techniques used at the CNAG and the CRG involve breaking the genome into small pieces (akin to short sentences from the book), sequencing those pieces, and then trying to locate them again in the genome. This last step, mandatory in most biological experiments, means assigning each sentence to its correct original location. However, this can be an extremely difficult task: sentences might be misspelled (sequencing is not a perfect process, and introduces errors) or slightly different (the genome of the individual being sequenced usually contains small variations compared with the reference genome). In addition, each sequencing experiment produces billions of short sentences.

This is the starting point that led some researchers at the CRG and the CNAG to design a computer program that finds sequences in the reference genome quickly and accurately. Such tools, called 'mappers', are essential for interpreting data in genomic studies, as they represent the first analysis step in many biological experiments. After five years of development, the result is the GEM (Genomic Multitool) mapper.

The GEM mapper is several times faster than other reference programs in the field: it maps about 40 million sequences per hour onto the large human reference genome on a single CPU core. Because it uses algorithms that guarantee no matches are missed, GEM is also much more accurate than comparable programs. In addition, GEM lets the search parameters be tuned to the specific requirements of the biological experiment being performed, a versatility that most existing tools cannot offer.
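The journal reference describes the approach as "alignment by filtration". As a rough illustration of how a filtration strategy can avoid missing matches, the Python sketch below uses the classic pigeonhole idea: if a read matches a location with at most k substitutions, and the read is cut into k + 1 non-overlapping pieces, at least one piece must match exactly. This is a toy under stated assumptions, not GEM's actual algorithm; the function names, the tiny genome and the read are all hypothetical, and only substitutions (no insertions or deletions) are handled.

```python
from collections import defaultdict


def build_index(genome, seed_len):
    """Map every substring of length seed_len to the positions where it starts."""
    index = defaultdict(list)
    for i in range(len(genome) - seed_len + 1):
        index[genome[i:i + seed_len]].append(i)
    return index


def hamming(a, b):
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def map_read(read, genome, index, seed_len, max_mismatches):
    """Return every start position where read fits with <= max_mismatches substitutions."""
    n_seeds = max_mismatches + 1  # pigeonhole: at least one seed is error-free
    if n_seeds * seed_len > len(read):
        raise ValueError("read too short for this many non-overlapping seeds")

    # Filtration step: exact seed hits propose candidate locations.
    candidates = set()
    for s in range(n_seeds):
        offset = s * seed_len
        seed = read[offset:offset + seed_len]
        for pos in index.get(seed, ()):
            start = pos - offset
            if 0 <= start <= len(genome) - len(read):
                candidates.add(start)

    # Verification step: keep only candidates within the mismatch budget.
    return sorted(
        p for p in candidates
        if hamming(read, genome[p:p + len(read)]) <= max_mismatches
    )


# Hypothetical toy data: the read is genome[10:22] with one letter changed.
genome = "TTGACCGATAGCATCGGATTACAGGCTA"
index = build_index(genome, seed_len=4)
print(map_read("GCTTCGGATTAC", genome, index, seed_len=4, max_mismatches=2))  # prints [10]
```

Because only locations proposed by an exact seed hit are verified, most of the genome is never compared against the read, yet no location within the mismatch budget can be skipped.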

GEM's performance will help address a practical problem: the dramatic increase in the amount of sequencing data. For example, the CNAG started operations in 2010 with a fleet of 12 second-generation sequencers that generated roughly 50 Gbases per day. Thanks to the recent spectacular advances in sequencing technology, today, only two and a half years later, the CNAG generates almost 20 times more data with the same number of sequencing machines. However, it would have been impossible to increase the computing resources of the CNAG accordingly (a problem common to biomedical research everywhere in the world). Hence, the development of more efficient analysis tools like GEM is essential to keep up with the increasing rate of data production.
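To get a feel for these numbers, the short Python calculation below combines the daily output quoted here with the single-core mapping rate of about 40 million sequences per hour quoted earlier. The read length is not given in the article, so the value of 100 bases is purely an assumption for illustration, as are the function and constant names; the results are order-of-magnitude estimates only.

```python
GBASES_PER_DAY_2010 = 50          # from the article
GROWTH_FACTOR = 20                # "almost 20 times more data", rounded up
READS_PER_CORE_HOUR = 40_000_000  # from the article, single CPU core
ASSUMED_READ_LENGTH = 100         # hypothetical: not stated in the article


def core_hours_per_day(gbases_per_day, read_length, reads_per_core_hour):
    """Core-hours needed to map one day of sequencing output."""
    reads_per_day = gbases_per_day * 1e9 / read_length
    return reads_per_day / reads_per_core_hour


before = core_hours_per_day(GBASES_PER_DAY_2010, ASSUMED_READ_LENGTH, READS_PER_CORE_HOUR)
after = core_hours_per_day(GBASES_PER_DAY_2010 * GROWTH_FACTOR, ASSUMED_READ_LENGTH,
                           READS_PER_CORE_HOUR)

print(f"2010 output:  {before:.1f} core-hours per day")
print(f"today:        {after:.1f} core-hours per day")
print(f"cores kept busy around the clock today: {after / 24:.1f}")
```

Under these assumptions the mapping workload grows in step with the data, from about 12.5 to about 250 core-hours per day, which is why a slower mapper would translate directly into a proportionally larger computing bill.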

The GEM tools are a clear example of research excellence and a world-class tool developed entirely in Spain: although the project is led by an Italian team member, all of the work has been carried out in Barcelona. This accomplishment was made possible by the very early adoption of next-generation sequencing machines at the CRG (in 2008), and by the subsequent sustained investment in sequencing technologies by the Catalan and Spanish governments, which culminated in the creation of the CNAG.

The research was funded by the Spanish Ministerio de Educación y Ciencia (Consolider program), by the US National Institutes of Health/National Human Genome Research Institute, and by the European Union (READNA and ESGI programs).


Story Source:

The above story is based on materials provided by Centre for Genomic Regulation. Note: Materials may be edited for content and length.


Journal Reference:

  1. Marco-Sola S, Sammeth M, Guigó R and Ribeca P. The GEM mapper: fast, accurate and versatile alignment by filtration. Nat. Methods, 2012 DOI: 10.1038/NMETH.2221

Cite This Page:

Centre for Genomic Regulation. "Pushing genome data analysis one step forward." ScienceDaily. ScienceDaily, 28 October 2012. <www.sciencedaily.com/releases/2012/10/121028142215.htm>.
