Featured Research

from universities, journals, and other organizations

Ancient Origins Found In Arabidopsis Genome

December 25, 2000
Cornell University
A weedy, inedible member of the mustard family, related to broccoli and cauliflower, has become the first plant to yield the secrets of its primordial origins. In a computational research effort at Cornell University, the plant, Arabidopsis thaliana, was shown to contain genetic evidence of its emergence between 50 million and 200 million years ago.

ITHACA, N.Y. -- A weedy, inedible member of the mustard family, related to broccoli and cauliflower, has become the first plant to yield the secrets of its primordial origins. In a computational research effort at Cornell University, the plant, Arabidopsis thaliana, was shown to contain genetic evidence of its emergence between 50 million and 200 million years ago.

Related Articles

The finding, say the Cornell researchers, will be invaluable to those using Arabidopsis as a genetic model for other plant species, unlocking genes for important traits in agricultural crops like corn, tomatoes and wheat.

The researchers report on their discovery in the Dec. 15, 2000 edition of the journal Science.

A decade ago, Arabidopsis was widely adopted by plant scientists as an easily manipulated model for other plants because it is simple to grow in the laboratory, has a short life cycle and has a small genome -- only about 140 million base pairs of DNA compared with wheat, which might have as many as 16 billion pairs. This year, the entire DNA sequence of the plant was completed, and for the first time researchers were able to understand the sequence of the 25,000 genes necessary for an organism to function as a flowering plant. Using this genome sequence -- which is in the public domain on the Internet -- the Cornell researchers used computers to sort through the plant's DNA and find its genetic roots.

"We can take the entire genome of one plant and look back at it," says Steven D. Tanksley, the Liberty Hyde Bailey professor of plant breeding at Cornell and an author on the paper. "We are going back into genetic time, and we can see what the ancient genome looked like. If we can understand what the ancestral gene content in one plant is, then we can use that to learn the gene content in other plants." Tanksley and the lead researcher, Todd Vision, a Cornell visiting scientist, explained that for many plant genomes there is a lot of empty material between the proteins. Tanksley suggested that understanding a genome is like driving along a highway. On the East Coast, you do not have to drive far before you reach another city, while out west, there are long distances between cities. The point of the analogy is that scientists can gather more general genetic information from Arabidopsis in a shorter period of time. Says Tanksley: "Arabidopsis is the East Coast of DNA sequencing."

The researchers used a computer program called BLAST to classify the thousands of genes in Arabidopsis into gene families. BLAST (an acronym for Basic Local Alignment Search Tool) is a sequence similarity program designed to support analysis of nucleotide and protein databases. It was developed at the National Center for Biotechnology Information, part of the National Institutes of Health, in Bethesda, Md. The researchers then used novel algorithms to find large chunks of the chromosomes that were duplicated long ago.

In the process of duplication, all the genetic material in a species doubles, creating what is known as a polyploid. The researchers inferred that Arabidopsis was an ancient polyploid because it contained evidence of multiple duplications.

Although duplicated chromosomes diverged from one another and became scrambled over the eons, the research team was able to find 103 duplicated chromosome segments that ranged in age from 50 million to 200 million years. "We figured out where gene family members are located and used that information to find the ancient duplicated segments," says Vision, who is a molecular biologist at the Center for Agricultural Bioinformatics (CAB) at Cornell. The CAB is supported by the U.S. Department of Agriculture, Agricultural Research Service, in partnership with the College of Agriculture and Life Sciences and the Theory Center at Cornell.

With help from the dating estimates obtained by paleobotanists, the team was able to look at the duplicated gene sequences and deduce when the genome duplications in Arabidopsis occurred. The team found that a few large duplication events were responsible for the pattern they saw. "Our work was entirely computational, but a lot of other researchers' laboratory work went into it before that," says Vision. He draws an analogy between finding prehistoric genetic relationships and the development of language. Many words in Romance languages like Spanish, Italian, French and Portuguese are derived from Latin. "We can see the roots of the modern words as being derived from Latin," he says. "In our case, we are finding the genetic roots of the genes before they duplicated and diverged." The paper, "The Origins of Genomic Duplications in Arabidopsis ," was authored by Vision, Tanksley and Daniel G. Brown of the Whitehead Institute at the Massachusetts Institute of Technology. Brown participated in the research while completing his doctoral degree, which he earned from the Department of Computer Science at Cornell last spring. The research was funded by the USDA Agriculture Research Service, and grants from the National Science Foundation and the Office of Naval Research.

Related World Wide Web sites: The following sites provide additional information on this news release. Some might not be part of the Cornell University community, and Cornell has no control over their content or availability.

o The Arabidopsis Information Resource: http://www.arabidopsis.org/home.html

o Todd Vision page: http://www.igd.cornell.edu/~tvision/ToddVision.html

o Cornell Thory Center: http://www.tc.cornell.edu

o National Center for Biotechnology Information (NCBI) BLAST site (somewhat technical): http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/information3.html

o USDA-ARS Center for Agricultural Bioinformatics at Cornell: http://genome.cornell.edu/index.html

How Cornell's computing resources help to "blast" against databases

Running a massive BLAST search on the Arabidopsis genome was easier for Cornell University researcher Todd Vision than it might have been for many other genomics researchers, thanks to the Cornell Theory Center (CTC). The center maintains a special computing resource in Rhodes Hall on the Cornell campus in conjunction with the U.S. Department of Agriculture's Center for Agricultural Bioinformatics (CAB).

The resource, loosely named the 'genomics cluster,' consists of 12 computers, each made up of four 500-Mhz Pentium III processors running the Windows 2000 operating system. With software developed for CTC, eight of the machines run as a parallel-processing cluster, effectively a supercomputer.

The cluster is primarily used for searches using BLAST (an acronym for Basic Local Alignment Search Tool), a program that searches gene and protein databases for pattern matches, much the same way a text searcher will match words and phrases. BLAST servers are available elsewhere to the worldwide research community through World Wide Web interfaces, but they are not suitable for running a large batch of queries such as the one Vision used to track the genetic history of Arabidopsis .

The BLAST server on the Web takes one query and "blasts" it against the database, Vision explained. "But I needed to run twenty-something thousand proteins. Imagine sitting there and clicking the mouse that many times. Doing it on Theory Center computers allowed us to do it in a batch. Doing it all on local computers allows you more speed, more flexibility and less hands-on processing."

Vision also assembled on a Theory Center computer a special version of the Arabidopsis genome database that would tell him the location on the genome of each protein it found. The processing, he said, took only about half a day on just one of the four-processor Pentiums.

Other computers in the resource are used for databases. One is a server for the CAB Web site known as Demeter's Genomes, which makes extensive databases of plant genomes available to the research community.

The genomics cluster was established several years ago with about $400,000 in funding from the USDA and is maintained by an annual USDA grant. David Schneider, Theory Center staff researcher, is principal investigator of the genomics cluster.

Story Source:

The above story is based on materials provided by Cornell University. Note: Materials may be edited for content and length.

Cite This Page:

Cornell University. "Ancient Origins Found In Arabidopsis Genome." ScienceDaily. ScienceDaily, 25 December 2000. <www.sciencedaily.com/releases/2000/12/001225061358.htm>.
Cornell University. (2000, December 25). Ancient Origins Found In Arabidopsis Genome. ScienceDaily. Retrieved March 29, 2015 from www.sciencedaily.com/releases/2000/12/001225061358.htm
Cornell University. "Ancient Origins Found In Arabidopsis Genome." ScienceDaily. www.sciencedaily.com/releases/2000/12/001225061358.htm (accessed March 29, 2015).

Share This

More From ScienceDaily

More Plants & Animals News

Sunday, March 29, 2015

Featured Research

from universities, journals, and other organizations

Featured Videos

from AP, Reuters, AFP, and other news services

New Arthropod Fossil Might Be Relative Of Spiders, Scorpions

New Arthropod Fossil Might Be Relative Of Spiders, Scorpions

Newsy (Mar. 29, 2015) A 508-million-year-old arthropod that swam in the Cambrian seas is thought to share a common ancestor with spiders and scorpions. Video provided by Newsy
Powered by NewsLook.com
Vietnam Rice Boom Piles Pressure on Farmers and the Environment

Vietnam Rice Boom Piles Pressure on Farmers and the Environment

AFP (Mar. 29, 2015) Vietnam&apos;s drive to become the world&apos;s leading rice exporter is pushing farmers in the fertile Mekong Delta to the brink, say experts, with mounting costs to the environment. Duration: 02:35 Video provided by AFP
Powered by NewsLook.com
Raw: Lioness Has Rare Five-Cub Litter

Raw: Lioness Has Rare Five-Cub Litter

AP (Mar. 27, 2015) A lioness in Pakistan has given birth to five cubs, twice the usual size of a litter. Queen gave birth to two other cubs just nine months ago. (March 27) Video provided by AP
Powered by NewsLook.com
Jockey Motion Tracking Reveals Racing Prowess

Jockey Motion Tracking Reveals Racing Prowess

Reuters - Innovations Video Online (Mar. 26, 2015) Using motion tracking technology, researchers from the Royal Veterinary College (RVC) are trying to establish an optimum horse riding style to train junior jockeys, as well as enhance safety, health and well-being of both racehorses and jockeys. Matthew Stock reports. Video provided by Reuters
Powered by NewsLook.com

Search ScienceDaily

Number of stories in archives: 140,361

Find with keyword(s):
Enter a keyword or phrase to search ScienceDaily for related topics and research stories.


Breaking News:

Strange & Offbeat Stories

Plants & Animals

Earth & Climate

Fossils & Ruins

In Other News

... from NewsDaily.com

Science News

Health News

Environment News

Technology News


Free Subscriptions

Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

Get Social & Mobile

Keep up to date with the latest news from ScienceDaily via social networks and mobile apps:

Have Feedback?

Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?
Mobile: iPhone Android Web
Follow: Facebook Twitter Google+
Subscribe: RSS Feeds Email Newsletters
Latest Headlines Health & Medicine Mind & Brain Space & Time Matter & Energy Computers & Math Plants & Animals Earth & Climate Fossils & Ruins