Science News
from research organizations

New approach to 'spell checking' gene sequences

May 21, 2012
CSIRO Australia
Scientists have found a better way to 'spell check' gene sequences.

DNA fragments separated on a gel, a common fist step to gene sequencing.
Credit: Image courtesy of CSIRO Australia

A PhD student from CSIRO and the University of Queensland has found a better way to 'spell check' gene sequences and help biologists better understand the natural world.

The student, Lauren Bragg, has contributed to the May issue of the journal Nature Methods highlighting her new approach and its software implementation called Acacia.

Acacia analyses the output of next-generation gene sequencing instruments which read the four-letter alphabet of As, Cs, Ts and Gs -- the 'bases' that code for DNA and spell out the genes of different living organisms. Acacia specifically applies to important parts of microbe genes called amplicons.

Just as a computer spell checker finds typing errors in words, so Acacia finds errors in the DNA code of amplicon sequences produced during gene sequencing.

Acacia shows clear improvements over the two error-correction tools currently used by biologists for amplicon sequences and it's easier for biologists to use.

Ms Bragg's development of Acacia is part of the field of bioinformatics, a blend of computer science, statistics and biology. Despite her surname, however, she is modest about her achievements.

"It's exciting to be published in a journal like Nature Methods but I get more satisfaction from hearing how my software is helping biologists fix sequencing errors." she said.

Machine errors in the long lengths of A, C, G and T code can cause biologists to misinterpret which genes are there, or which microbial species might exist in a environmental samples from, say, a waste water treatment plant or from the ocean or even our guts.

Acacia works by using the statistical theory of likelihoods to analyze the code for DNA bases which may have been mistakenly added or deleted -- common errors in gene sequencing.

"The Nature article is our way of telling the international biology community that there's a new software tool they can use for error-correcting that's pretty easy to use, quick and reliable."

"That way, they won't think they've discovered a new microbe species when they haven't or overlooked one they should have found," she said.

The method, or algorithm, that Acacia uses took 18 months for Ms Bragg to fully develop and test.

Now it's nose to the grindstone to get the thesis done.

Story Source:

Materials provided by CSIRO Australia. Note: Content may be edited for style and length.

Journal Reference:

  1. Lauren Bragg, Glenn Stone, Michael Imelfort, Philip Hugenholtz, Gene W Tyson. Fast, accurate error-correction of amplicon pyrosequences using Acacia. Nature Methods, 2012; 9 (5): 425 DOI: 10.1038/nmeth.1990

Cite This Page:

CSIRO Australia. "New approach to 'spell checking' gene sequences." ScienceDaily. ScienceDaily, 21 May 2012. <>.
CSIRO Australia. (2012, May 21). New approach to 'spell checking' gene sequences. ScienceDaily. Retrieved May 23, 2017 from
CSIRO Australia. "New approach to 'spell checking' gene sequences." ScienceDaily. (accessed May 23, 2017).