Featured Research

from universities, journals, and other organizations

Whole genome analysis speeds up: 240 full genomes in 50 hours

Date:
February 19, 2014
Source:
University of Chicago Medical Center
Summary:
Although the time and cost of sequencing the human genome has plummeted, analyzing the 3 billion base pairs of genetic information can take months. Researchers working with Beagle -- one of the world’s fastest supercomputers devoted to life sciences -- report they can analyze 240 full genomes in 50 hours.

Beagle, a Cray XE6 supercomputer at Argonne National Laboratory, supports computation, simulation and data analysis for the biomedical research community.
Credit: Argonne National Laboratory

Although the time and cost of sequencing an entire human genome has plummeted, analyzing the resulting three billion base pairs of genetic information from a single genome can take many months.

In the journal Bioinformatics, however, a University of Chicago-based team -- working with Beagle, one of the world's fastest supercomputers devoted to life sciences -- reports that genome analysis can be radically accelerated. This computer, based at Argonne National Laboratory, is able to analyze 240 full genomes in about two days.

"This is a resource that can change patient management and, over time, add depth to our understanding of the genetic causes of risk and disease," said study author Elizabeth McNally, MD, PhD, the A. J. Carlson Professor of Medicine and Human Genetics and director of the Cardiovascular Genetics Clinic at the University of Chicago Medicine.

"The supercomputer can process many genomes simultaneously rather than one at a time," said first author Megan Puckelwartz, a graduate student in McNally's laboratory. "It converts whole genome sequencing, which has primarily been used as a research tool, into something that is immediately valuable for patient care."

Because the genome is so vast, those involved in clinical genetics have turned to exome sequencing, which focuses on the two percent or less of the genome that codes for proteins. This approach is often useful. An estimated 85 percent of disease-causing mutations are located in coding regions. But the rest, about 15 percent of clinically significant mutations, come from non-coding regions, once referred to as "junk DNA" but now known to serve important functions. If not for the tremendous data-processing challenges of analysis, whole genome sequencing would be the method of choice.

To test the system, McNally's team used raw sequencing data from 61 human genomes and analyzed that data on Beagle. They used publicly available software packages and one quarter of the computer's total capacity. They found that shifting to the supercomputer environment improved accuracy and dramatically accelerated speed.

"Improving analysis through both speed and accuracy reduces the price per genome," McNally said. "With this approach, the price for analyzing an entire genome is less than the cost of the looking at just a fraction of genome. New technology promises to bring the costs of sequencing down to around $1,000 per genome. Our goal is get the cost of analysis down into that range."

"This work vividly demonstrates the benefits of dedicating a powerful supercomputer resource to biomedical research," said co-author Ian Foster, director of the Computation Institute and Arthur Holly Compton Distinguished Service Professor of Computer Science. "The methods developed here will be instrumental in relieving the data analysis bottleneck that researchers face as genetic sequencing grows cheaper and faster."

The finding has immediate medical applications. McNally's Cardiovascular Genetics clinic, for example, relies on rigorous interrogation of the genes from an initial patient as well as multiple family members to understand, treat and prevent disease. More than 50 genes can contribute to cardiomyopathy. Other genes can trigger heart failure, rhythm disorders or vascular problems.

"We start genetic testing with the patient," she said, "but when we find a significant mutation we have to think about testing the whole family to identify individuals at risk."

The range of testable mutations has radically expanded. "In the early days we would test one to three genes," she said. "In 2007, we did our first five-gene panel. Now we order 50 to 70 genes at a time, which usually gets us an answer. At that point, it can be more useful and less expensive to sequence the whole genome."

The information from these genomes combined with careful attention to patient and family histories "adds to our knowledge about these inherited disorders," McNally said. "It can refine the classification of these disorders," she said. "By paying close attention to family members with genes that place then at increased risk, but who do not yet show signs of disease, we can investigate early phases of a disorder. In this setting, each patient is a big-data problem."

Beagle, a Cray XE6 supercomputer housed in the Theory and Computing Sciences (TCS) building at Argonne National Laboratory, supports computation, simulation and data analysis for the biomedical research community. It is available for use by University of Chicago researchers, their collaborators and "other meritorious investigators." It was named after the HMS Beagle, the ship that carried Charles Darwin on his famous scientific voyage in 1831.

The National Institutes of Health and the Doris Duke Charitable Foundation funded this study. Additional authors include Lorenzo Pesce, Viswateja Nelakuditi, Lisa Dellefave-Castillo and Jessica Golbus of the University of Chicago; Sharlene Day of the University of Michigan; Thomas Coppola of the University of Pennsylvania; and Gerald Dorn of Washington University.


Story Source:

The above story is based on materials provided by University of Chicago Medical Center. Note: Materials may be edited for content and length.


Journal Reference:

  1. M. Puckelwartz, L. Pesce, V. Nelakuditi, L. Dellefave-Castillo, J. Golbus, S. Day, T. Cappola, G. Dorn, I. Foster, E. McNally. Supercomputing for the parallelization of whole genome analysis. Bioinformatics, 2014; DOI: 10.1093/bioinformatics/btu071

Cite This Page:

University of Chicago Medical Center. "Whole genome analysis speeds up: 240 full genomes in 50 hours." ScienceDaily. ScienceDaily, 19 February 2014. <www.sciencedaily.com/releases/2014/02/140219173146.htm>.
University of Chicago Medical Center. (2014, February 19). Whole genome analysis speeds up: 240 full genomes in 50 hours. ScienceDaily. Retrieved April 18, 2014 from www.sciencedaily.com/releases/2014/02/140219173146.htm
University of Chicago Medical Center. "Whole genome analysis speeds up: 240 full genomes in 50 hours." ScienceDaily. www.sciencedaily.com/releases/2014/02/140219173146.htm (accessed April 18, 2014).

Share This



More Computers & Math News

Friday, April 18, 2014

Featured Research

from universities, journals, and other organizations


Featured Videos

from AP, Reuters, AFP, and other news services

Twitter Introduces Facebook-Style App Install Ads

Twitter Introduces Facebook-Style App Install Ads

Newsy (Apr. 17, 2014) Twitter hopes to make money on app install ads, which has proven to be a successful strategy for Facebook. Video provided by Newsy
Powered by NewsLook.com
Heartbleed Hack Leads To Arrest

Heartbleed Hack Leads To Arrest

Newsy (Apr. 17, 2014) A 19-year-old computer science student has been arrested in relation to a data breach of 900 social insurance numbers from Canada's revenue agency. Video provided by Newsy
Powered by NewsLook.com
Apple Rumored To Introduce Song ID Service In Next iOS Build

Apple Rumored To Introduce Song ID Service In Next iOS Build

Newsy (Apr. 17, 2014) Sources close to Apple told Bloomberg the company plans to introduce an integrated song identification service during the launch of its next iOS. Video provided by Newsy
Powered by NewsLook.com
Honda's New ASIMO Robot, More Human-Like Than Ever

Honda's New ASIMO Robot, More Human-Like Than Ever

AFP (Apr. 17, 2014) It walks and runs, even up and down stairs. It can open a bottle and serve a drink, and politely tries to shake hands with a stranger. Meet the latest ASIMO, Honda's humanoid robot. Duration: 00:54 Video provided by AFP
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.

Save/Print:
Share:

Breaking News:
from the past week

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News



Save/Print:
Share:

Free Subscriptions


Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile


Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?


Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile: iPhone Android Web
Follow: Facebook Twitter Google+
Subscribe: RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins