Featured Research

from universities, journals, and other organizations

Study Suggests That Publicly Available Genome Data May Contain Small But Significant Errors

Date: February 16, 2006
Source: New York University
Summary: A comparison of inferred probe maps with the available sequence assembly shows how difficult it is to establish a canonical and accurate sequence or physical map, and suggests ways the two types of data can be combined to increase confidence in the assembly.

Since the genome sequence of the bacterial pathogen Haemophilus influenzae was published in 1995, the genetic sequences of many larger, more complex, and medically or commercially significant organisms, including humans, have also been elucidated.

However, the techniques used to derive these genetic sequences are imperfect, and many researchers may be unaware of potential errors lurking within the publicly available, published, or "canonical," sequence. If an organism's genome is unstable or variable, or contains rearrangements within a population or between strains, there may be no single true linear structure that is valid for that organism, and imposing a linear sequence may not be biologically meaningful.

Now, researchers at Cold Spring Harbor Laboratory and New York University describe a high-throughput microarray technique that tests many samples simultaneously and can be used to assemble physical maps and validate genomic sequence assemblies. The findings appear in the latest issue of the Journal of Computational Biology.

The research was conducted by Joseph West, John Healy, and Michael Wigler of Cold Spring Harbor Laboratory, and William Casey and Bud Mishra of NYU's Courant Institute of Mathematical Sciences. Mishra is a Professor of Computer Science and Mathematics at the Courant Institute and also has an appointment in the Department of Cell Biology at NYU's School of Medicine.

Using a microarray hybridization method, in which fluorescently labeled snippets from the genome of the fission yeast S. pombe bind to probes arrayed on a glass slide, the researchers computationally derived the "distance" between probes in the genome and organized the probes along it. They then compared the resulting physical map of the S. pombe genome with the corresponding map computed from the publicly available S. pombe sequence. The comparison showed a small number of significant discrepancies between their results and those of the map derived from the public sequence released in 2002. S. pombe's genome is only about 14 million bases long (less than 1 percent the size of the human genome) and is widely considered a gold standard in whole-genome assembly.

The authors show that, under appropriate experimental conditions, array hybridization data can be used to establish the physical distance between unique arrayed probes. Each probe is a sequence of DNA, in this case 70 base pairs long, that occurs only once in the target organism's genome and so serves as a landmark in that genome. The probes can then be placed in the order in which they occur in the target genome, in much the same way as a mapmaker can locate landmarks at the correct coordinates by consulting a three-dimensional rendering of a geographical map. The distances between pairs of landmarks can be used to assemble physical maps, as an aid to sequence assembly, or as an independent method for validating a sequence assembly and indicating where errors need correction.
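
The idea of ordering landmarks from pairwise distances can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes exact, noise-free distances between probes on a single linear molecule, and the function name order_probes, the probe names, and the coordinates are all hypothetical. The sketch takes the two probes farthest apart as the ends of the map and places every other probe by its distance from one end.

```python
from itertools import combinations

def order_probes(dist):
    """Order probes on a line from a table of pairwise distances.

    dist maps frozenset({a, b}) -> distance between probes a and b.
    Assumes exact distances on one linear molecule (a simplification).
    """
    probes = sorted({p for pair in dist for p in pair})
    # The two probes farthest apart must sit at the two ends of the map.
    end_a, _end_b = max(combinations(probes, 2),
                        key=lambda ab: dist[frozenset(ab)])
    # Each probe's position is its distance from the chosen end.
    pos = {p: 0 if p == end_a else dist[frozenset((p, end_a))]
           for p in probes}
    return sorted(probes, key=pos.get), pos

# Hypothetical probe coordinates (in bases), used only to build distances.
coords = {"A": 5000, "B": 0, "C": 7300, "D": 1200}
dist = {frozenset((a, b)): abs(coords[a] - coords[b])
        for a, b in combinations(coords, 2)}

order, pos = order_probes(dist)
print(order)  # ['B', 'D', 'A', 'C']
```

The orientation of the recovered map is arbitrary, since distances alone cannot tell a map from its mirror image. Real hybridization data would be noisy, so a practical method would need a statistical fit and not an exact sort.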

A comparison of the inferred probe maps with the available sequence assembly shows how difficult it is to establish a canonical and accurate sequence or physical map, and suggests ways the two types of data can be combined to increase confidence in the assembly.
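
A comparison of this kind can be sketched in a few lines. The example below is illustrative only and is not taken from the study: the function name flag_discrepancies, the probe positions, and the tolerance are hypothetical, and both maps are assumed to share the same orientation and units. It walks along the probes in assembly order and reports neighboring pairs whose spacing differs between the two maps by more than the tolerance.

```python
def flag_discrepancies(inferred, assembly, tolerance):
    """Return neighboring probe pairs whose spacing differs between maps.

    inferred, assembly: dicts mapping probe name -> position in bases.
    tolerance: largest spacing difference (in bases) treated as agreement.
    """
    # Only probes present in both maps can be compared.
    shared = sorted(set(inferred) & set(assembly), key=assembly.get)
    flagged = []
    for left, right in zip(shared, shared[1:]):
        gap_assembly = assembly[right] - assembly[left]
        # Signed gap: a negative value means the inferred order is inverted.
        gap_inferred = inferred[right] - inferred[left]
        if abs(gap_assembly - gap_inferred) > tolerance:
            flagged.append((left, right, gap_assembly, gap_inferred))
    return flagged

# Hypothetical positions: one from a sequence assembly, one from an array map.
assembly = {"B": 0, "D": 1200, "A": 5000, "C": 7300}
inferred = {"B": 0, "D": 1250, "A": 3900, "C": 6200}

flagged = flag_discrepancies(inferred, assembly, tolerance=200)
print(flagged)  # [('D', 'A', 3800, 2650)]
```

A flagged interval does not say which map is wrong; it only marks a region where the two data sources disagree and further checking is warranted.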

This physical mapping technology is simple to implement and relatively inexpensive. It is likely to have significant commercial impact through genetic studies of diseases such as cancer and autism. In addition, it complements other mapping and sequencing technologies (e.g., the Optical Mapping and Sequencing being developed by Mishra) and cancer array CGH studies (e.g., the ROMA project of Wigler and a versatile cancer genome analysis project of Mishra).



Story Source:

The above story is based on materials provided by New York University. Note: Materials may be edited for content and length.


Cite This Page:

New York University. "Study Suggests That Publicly Available Genome Data May Contain Small But Significant Errors." ScienceDaily. ScienceDaily, 16 February 2006. <www.sciencedaily.com/releases/2006/02/060216191949.htm>.
