Nov. 8, 2007 Genome Research is publishing a number of papers related to comparative analyses of twelve Drosophila (fly) genomes. The twelve fly genome project is unique in that the analysis of closely related species has allowed for a more complete and correct annotation of functional genes and regulatory elements in Drosophila melanogaster, a major model organism in genetics.
With a life span of just weeks, the fruit fly has been an important model organism in genetic studies for decades and has helped researchers unravel the rules that govern inheritance. Though there are many differences between fruit flies and humans, the two also share many genes that regulate the same biological functions.
Expanding universe of microRNAs
MicroRNAs (miRNAs) are short RNA molecules encoded by plant and animal genomes that have garnered significant interest for their ability to regulate gene expression. A number of miRNAs have been discovered in recent years, but it is likely that many have gone undetected. Two papers published in Genome Research utilize the twelve fly genomes to identify novel miRNAs, further refine the set of known miRNAs, and investigate the biology and origins of miRNA genes.
In a study led by Dr. David Bartel, a combination of computational methods and high-throughput sequencing techniques identified new miRNAs conserved across the Drosophila species. "The new fly genomes enabled us to predict new miRNAs, 20 of which we experimentally confirmed, and the genome alignments enabled us to more accurately predict the evolutionarily conserved targets of these and other miRNAs," explains Bartel.
While computational methods are important for identifying novel miRNAs, large-scale sequencing of small RNAs indicates that many miRNAs continue to evade prediction. "Most of the 59 novel miRNAs that we found were not predicted by us or by others," describes Bartel. "This illustrates the advantages of high-throughput sequencing of small RNAs, and the limitations of comparative sequence analysis for miRNA gene identification."
In a related paper, a study led by Dr. Manolis Kellis utilized the twelve Drosophila genomes to computationally predict and experimentally validate novel miRNAs by defining the structural and evolutionary properties of known miRNAs. Classification of newly identified miRNAs has revealed greater diversity in the regulation of gene expression by miRNAs, with increased potential for combinatorial regulation, and provided new insights into miRNA biogenesis and function. "We learned that both arms of a miRNA hairpin can produce functional miRNAs, which sometimes work cooperatively to target a common pathway," explains Kellis.
The combination of comparative and experimental analyses by both groups also provided novel evidence for emergent gene function, deriving from the portion of the miRNA hairpin previously believed to be discarded, and the strand of the DNA previously not thought to produce a miRNA.
Reference: Ruby J.G. et al. 2007. Evolution, biogenesis, expression, and target predictions of a substantially expanded set of Drosophila microRNAs. Genome Res. doi:10.1101/gr.6597907.
Reference: Stark A. et al. 2007. Systematic discovery and characterization of fly microRNAs using 12 Drosophila genomes. Genome Res. doi:10.1101/gr.6593807.
Revisiting D. melanogaster
Drosophila melanogaster is one of the most intensely studied model organisms in biology. Numerous studies over the years have defined nearly 14,000 protein-coding genes by experimental and computational methods, but these methods are likely to have produced some erroneous annotations and to have missed others. To assess the D. melanogaster protein-coding gene catalog, a group of researchers led by Dr. Manolis Kellis identified evolutionary signatures of protein-coding genes by comparative analysis of the twelve fly genomes. This strategy was then applied to evaluate the current catalog and to identify genes that have escaped annotation.
The study discovered hundreds of new genes, refined existing ones, and concluded that more than 10% of the protein-coding gene annotations require refinement.
Additionally, the work revealed abundant unusual gene structures. "We have learned that many brain-expressed proteins may be undergoing post-transcriptional changes by stop-codon read-through," explains Kellis. "We found 149 genes for which a conserved stop codon is followed by strong evidence of protein-coding selection for up to hundreds of amino acids, suggesting a new mechanism for post-transcriptional regulation in animal genomes." The researchers also report additional widespread evidence suggesting several diverse mechanisms of post-transcriptional regulation for protein-coding genes.
Reference: Lin M.F. et al. 2007. Revisiting the protein-coding gene catalog of Drosophila melanogaster using twelve fly genomes. Genome Res. doi:10.1101/gr.6679507.
Keeping genes in order
In the genomes of humans and other vertebrates, long-range regulatory DNA sequences known as highly conserved noncoding elements (HCNEs) have been found to cluster around genes involved in developmental processes, forming genomic regulatory blocks (GRBs). The GRBs are conserved in vertebrates, maintaining the order, or microsynteny, of associated genes on the chromosome. In this study, researchers utilize mosquito genome sequences and sequences available from the twelve fly genome project to investigate the microsynteny underlying GRBs across a wider range of evolution than previously possible.
"By using insect (Drosophila and mosquito) genome comparisons, we show that long-range regulation of developmental genes by arrays of highly conserved regulatory elements is an ancient feature that has shaped the evolution of metazoan genomes," says Dr. Boris Lenhard, senior investigator of the study.
"Additionally, we present genome-wide evidence that the responsiveness of genes to long-range regulation is partially determined by the type of their core promoter," explains Lenhard, addressing the issue of how some genes that are conserved in GRBs are not regulated by HCNEs.
Reference: Engström P.G. et al. 2007. Genomic regulatory blocks underlie extensive microsynteny conservation in insects. Genome Res. doi:10.1101/gr.6669607.
Tracing the origins of relocated genes
Investigations into the evolution of genomes have revealed significant upheaval in genome organization: insertions, deletions, rearrangement or duplication of large regions, and even duplication of entire genomes. In addition, individual genes have undergone genomic relocation. Sequencing of the twelve Drosophila genomes now allows deeper investigations into single gene relocation and its origins.
"The availability of twelve fly genomes provides a unique opportunity to investigate fine-scale events, such as relocation of individual genes, using whole genome comparative analysis across various levels of evolutionary divergence," explains primary author Arjun Bhutkar. Bhutkar and colleagues identify and characterize positionally relocated genes (PRGs) in the Drosophila genus, and provide evidence for two distinct origins of PRGs: transposition of genes at the level of DNA, and retrotransposition of RNAs into the genome.
The researchers extended their study to comparisons of Drosophila and other insect genomes. "Such analyses demonstrate the role of PRGs in evolutionary chromosomal organization," says Bhutkar. The study also highlights the role of PRGs in the creation of genomic diversity.
Reference: Bhutkar A. et al. 2007. Genome-scale analysis of positionally relocated genes. Genome Res. doi:10.1101/gr.7062307.
These papers will appear online on November 7 in Genome Research, concurrent with the publication of two main papers on the comparative sequence analyses of twelve fly genomes in the journal Nature.