ScienceDaily
Your source for the latest research news
Follow Facebook Twitter LinkedIn Subscribe RSS Feeds Newsletters
New:
  • Proteins That Predict Future Dementia Risk
  • How and When the Milky Way Came Together
  • Rare COVID-19 Response in Children Explained
  • Harvesting Light Like Nature Does
  • Optimizing the Immune System to Fight Cancer
  • Virtual Reality Warps Your Sense of Time
  • Mammals Can Use Their Intestines to Breathe
  • Which Animals Will Survive Climate Change?
  • Antarctic Ice Sheet Retreat: Chain Reaction?
  • Harnessing the Hum of Fluorescent Lights
advertisement
Follow all of ScienceDaily's latest research news and top science headlines!
Science News
from research organizations

1

2

A comprehensive map of the SARS-CoV-2 genome

Researchers have determined the virus' protein-coding gene set and analyzed new mutations' likelihood of helping the virus adapt

Date:
May 11, 2021
Source:
Massachusetts Institute of Technology
Summary:
Researchers have generated what they describe as the most complete gene annotation of the SARS-CoV-2 genome. In their study, they confirmed several protein-coding genes and found that a few others that had been suggested as genes do not code for any proteins.
Share:
FULL STORY

In early 2020, a few months after the Covid-19 pandemic began, scientists were able to sequence the full genome of the virus that causes the infection, SARS-CoV-2. While many of its genes were already known at that point, the full complement of protein-coding genes was unresolved.

advertisement

Now, after performing an extensive comparative genomics study, MIT researchers have generated what they describe as the most accurate and complete gene annotation of the SARS-CoV-2 genome. In their study, which appears today in Nature Communications, they confirmed several protein-coding genes and found that a few others that had been suggested as genes do not code for any proteins.

"We were able to use this powerful comparative genomics approach for evolutionary signatures to discover the true functional protein-coding content of this enormously important genome," says Manolis Kellis, who is the senior author of the study and a professor of computer science in MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) as well as a member of the Broad Institute of MIT and Harvard.

The research team also analyzed nearly 2,000 mutations that have arisen in different SARS-CoV-2 isolates since it began infecting humans, allowing them to rate how important those mutations may be in changing the virus' ability to evade the immune system or become more infectious.

Comparative genomics

The SARS-CoV-2 genome consists of nearly 30,000 RNA bases. Scientists have identified several regions known to encode protein-coding genes, based on their similarity to protein-coding genes found in related viruses. A few other regions were suspected to encode proteins, but they had not been definitively classified as protein-coding genes.

advertisement

To nail down which parts of the SARS-CoV-2 genome actually contain genes, the researchers performed a type of study known as comparative genomics, in which they compare the genomes of similar viruses. The SARS-CoV-2 virus belongs to a subgenus of viruses called Sarbecovirus, most of which infect bats. The researchers performed their analysis on SARS-CoV-2, SARS-CoV (which caused the 2003 SARS outbreak), and 42 strains of bat sarbecoviruses.

Kellis has previously developed computational techniques for doing this type of analysis, which his team has also used to compare the human genome with genomes of other mammals. The techniques are based on analyzing whether certain DNA or RNA bases are conserved between species, and comparing their patterns of evolution over time.

Using these techniques, the researchers confirmed six protein-coding genes in the SARS-CoV-2 genome in addition to the five that are well established in all coronaviruses. They also determined that the region that encodes a gene called ORF3a also encodes an additional gene, which they name ORF3c. The gene has RNA bases that overlap with ORF3a but occur in a different reading frame. This gene-within-a-gene is rare in large genomes, but common in many viruses, whose genomes are under selective pressure to stay compact. The role for this new gene, as well as several other SARS-CoV-2 genes, is not known yet.

The researchers also showed that five other regions that had been proposed as possible genes do not encode functional proteins, and they also ruled out the possibility that there are any more conserved protein-coding genes yet to be discovered.

"We analyzed the entire genome and are very confident that there are no other conserved protein-coding genes," says Irwin Jungreis, lead author of the study and a CSAIL research scientist. "Experimental studies are needed to figure out the functions of the uncharacterized genes, and by determining which ones are real, we allow other researchers to focus their attention on those genes rather than spend their time on something that doesn't even get translated into protein."

The researchers also recognized that many previous papers used not only incorrect gene sets, but sometimes also conflicting gene names. To remedy the situation, they brought together the SARS-CoV-2 community and presented a set of recommendations for naming SARS-CoV-2 genes, in a separate paper published a few weeks ago in Virology.

advertisement

Fast evolution

In the new study, the researchers also analyzed more than 1,800 mutations that have arisen in SARS-CoV-2 since it was first identified. For each gene, they compared how rapidly that particular gene has evolved in the past with how much it has evolved since the current pandemic began.

They found that in most cases, genes that evolved rapidly for long periods of time before the current pandemic have continued to do so, and those that tended to evolve slowly have maintained that trend. However, the researchers also identified exceptions to these patterns, which may shed light on how the virus has evolved as it has adapted to its new human host, Kellis says.

In one example, the researchers identified a region of the nucleocapsid protein, which surrounds the viral genetic material, that had many more mutations than expected from its historical evolution patterns. This protein region is also classified as a target of human B cells. Therefore, mutations in that region may help the virus evade the human immune system, Kellis says.

"The most accelerated region in the entire genome of SARS-CoV-2 is sitting smack in the middle of this nucleocapsid protein," he says. "We speculate that those variants that don't mutate that region get recognized by the human immune system and eliminated, whereas those variants that randomly accumulate mutations in that region are in fact better able to evade the human immune system and remain in circulation."

The researchers also analyzed mutations that have arisen in variants of concern, such as the B.1.1.7 strain from England, the P.1 strain from Brazil, and the B.1.351 strain from South Africa. Many of the mutations that make those variants more dangerous are found in the spike protein, and help the virus spread faster and avoid the immune system. However, each of those variants carries other mutations as well.

"Each of those variants has more than 20 other mutations, and it's important to know which of those are likely to be doing something and which aren't," Jungreis says. "So, we used our comparative genomics evidence to get a first-pass guess at which of these are likely to be important based on which ones were in conserved positions."

This data could help other scientists focus their attention on the mutations that appear most likely to have significant effects on the virus' infectivity, the researchers say. They have made the annotated gene set and their mutation classifications available in the University of California at Santa Cruz Genome Browser for other researchers who wish to use it.

"We can now go and actually study the evolutionary context of these variants and understand how the current pandemic fits in that larger history," Kellis says. "For strains that have many mutations, we can see which of these mutations are likely to be host-specific adaptations, and which mutations are perhaps nothing to write home about."

The research was funded by the National Human Genome Research Institute and the National Institutes of Health. Rachel Sealfon, a research scientist at the Flatiron Institute Center for Computational Biology, is also an author of the paper.

make a difference: sponsored opportunity

Story Source:

Materials provided by Massachusetts Institute of Technology. Original written by Anne Trafton. Note: Content may be edited for style and length.


Journal Reference:

  1. Irwin Jungreis, Rachel Sealfon, Manolis Kellis. SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes. Nature Communications, 2021; 12 (1) DOI: 10.1038/s41467-021-22905-7

Cite This Page:

  • MLA
  • APA
  • Chicago
Massachusetts Institute of Technology. "A comprehensive map of the SARS-CoV-2 genome: Researchers have determined the virus' protein-coding gene set and analyzed new mutations' likelihood of helping the virus adapt." ScienceDaily. ScienceDaily, 11 May 2021. <www.sciencedaily.com/releases/2021/05/210511081216.htm>.
Massachusetts Institute of Technology. (2021, May 11). A comprehensive map of the SARS-CoV-2 genome: Researchers have determined the virus' protein-coding gene set and analyzed new mutations' likelihood of helping the virus adapt. ScienceDaily. Retrieved May 20, 2021 from www.sciencedaily.com/releases/2021/05/210511081216.htm
Massachusetts Institute of Technology. "A comprehensive map of the SARS-CoV-2 genome: Researchers have determined the virus' protein-coding gene set and analyzed new mutations' likelihood of helping the virus adapt." ScienceDaily. www.sciencedaily.com/releases/2021/05/210511081216.htm (accessed May 20, 2021).

  • RELATED TOPICS
    • Health & Medicine
      • Human Biology
      • Genes
      • Medical Topics
      • Viruses
      • HIV and AIDS
      • Gene Therapy
      • Vaccines
      • Bird Flu
advertisement

  • RELATED TERMS
    • Gene
    • BRCA2
    • Genetic code
    • Human genome
    • BRCA1
    • Severe acute respiratory syndrome
    • Gene therapy
    • Protein

1

2

3

4

5
RELATED STORIES

Study Identifies New 'Hidden' Gene in COVID-19 Virus
Nov. 10, 2020 — Researchers have discovered a new 'hidden' gene in SARS-CoV-2 -- the virus that causes COVID-19 -- that may have contributed to its unique biology and pandemic potential. In a virus that only has ...
CRISPR Screen Identifies Genes, Drug Targets to Protect Against SARS-CoV-2 Infection
Oct. 26, 2020 — A new study demonstrates how changes in human genes can reduce SARS-CoV-2 infection and describes a wide array of genes that have not previously been considered as therapeutic targets for ...
Human Genome Could Contain Up to 20 Percent Fewer Genes, Researchers Reveal
Aug. 30, 2018 — A new study reveals that up to 20 percent of genes classified as coding (those that produce the proteins that are the building blocks of all living things) may not be coding after all because they ...
Improved Gene Expression Atlas Shows That Many Human Long Non-Coding RNAs May Actually Be Functional
Mar. 1, 2017 — Scientists have generated a comprehensive atlas of human long non-coding RNAs with substantially improved gene models, allowing them to better assess the diversity and functionality of these RNAs. ...
FROM AROUND THE WEB

ScienceDaily shares links with sites in the TrendMD network and earns revenue from third-party advertisers, where indicated.
  Print   Email   Share

advertisement

1

2

3

4

5
Most Popular
this week

HEALTH & MEDICINE
Three Reasons Why COVID-19 Can Cause Silent Hypoxia
(c) (c) Design Cells / AdobeNew Research Optimizes Body's Own Immune System to Fight Cancer
Boy or Girl? It's in the Father's Genes
MIND & BRAIN
(c) (c) SciePro / AdobeThe Cerebellum May Have Played an Important Role in the Evolution of the Human Brain
Pink Drinks Can Help You Run Faster and Further, Study Finds
(c) (c) tashatuvango / AdobeProteins That Predict Future Dementia, Alzheimer's Risk, Identified
LIVING & WELL
(c) (c) rolffimages / AdobeOur Dreams' Weirdness Might Be Why We Have Them, Argues New AI-Inspired Theory of Dreaming
Eating More Fruit and Vegetables Linked to Less Stress, Study Finds
Eating Mushrooms May Reduce the Risk of Cognitive Decline
advertisement

Strange & Offbeat
 

HEALTH & MEDICINE
Brain Stimulation Evoking Sense of Touch Improves Control of Robotic Arm
An Illuminating Possibility for Stroke Treatment: Nano-Photosynthesis
Engineered Organism Could Diagnose Crohn's Disease Flareups
MIND & BRAIN
Robotic 'Third Thumb' Use Can Alter Brain Representation of the Hand
(c) (c) kegfire / AdobeVirtual Reality Warps Your Sense of Time
(c) (c) rolffimages / AdobeOur Dreams' Weirdness Might Be Why We Have Them, Argues New AI-Inspired Theory of Dreaming
LIVING & WELL
Wisdom, Loneliness and Your Intestinal Multitude
People Affected by COVID-19 Are Being Nicer to Machines
Facial Recognition ID With a Twist: Smiles, Winks and Other Facial Movements for Access
SD
  • SD
    • Home Page
    • Top Science News
    • Latest News
  • Home
    • Home Page
    • Top Science News
    • Latest News
  • Health
    • View all the latest top news in the health sciences,
      or browse the topics below:
      Health & Medicine
      • Allergy
      • Alternative Medicine
      • Birth Control
      • Cancer
      • Diabetes
      • Diseases
      • Heart Disease
      • HIV and AIDS
      • Obesity
      • Stem Cells
      • ... more topics
      Mind & Brain
      • ADD and ADHD
      • Addiction
      • Alzheimer's
      • Autism
      • Depression
      • Headaches
      • Intelligence
      • Psychology
      • Relationships
      • Schizophrenia
      • ... more topics
      Living Well
      • Parenting
      • Pregnancy
      • Sexual Health
      • Skin Care
      • Men's Health
      • Women's Health
      • Nutrition
      • Diet and Weight Loss
      • Fitness
      • Healthy Aging
      • ... more topics
  • Tech
    • View all the latest top news in the physical sciences & technology,
      or browse the topics below:
      Matter & Energy
      • Aviation
      • Chemistry
      • Electronics
      • Fossil Fuels
      • Nanotechnology
      • Physics
      • Quantum Physics
      • Solar Energy
      • Technology
      • Wind Energy
      • ... more topics
      Space & Time
      • Astronomy
      • Black Holes
      • Dark Matter
      • Extrasolar Planets
      • Mars
      • Moon
      • Solar System
      • Space Telescopes
      • Stars
      • Sun
      • ... more topics
      Computers & Math
      • Artificial Intelligence
      • Communications
      • Computer Science
      • Hacking
      • Mathematics
      • Quantum Computers
      • Robotics
      • Software
      • Video Games
      • Virtual Reality
      • ... more topics
  • Enviro
    • View all the latest top news in the environmental sciences,
      or browse the topics below:
      Plants & Animals
      • Agriculture and Food
      • Animals
      • Biology
      • Biotechnology
      • Endangered Animals
      • Extinction
      • Genetically Modified
      • Microbes and More
      • New Species
      • Zoology
      • ... more topics
      Earth & Climate
      • Climate
      • Earthquakes
      • Environment
      • Geography
      • Geology
      • Global Warming
      • Hurricanes
      • Ozone Holes
      • Pollution
      • Weather
      • ... more topics
      Fossils & Ruins
      • Ancient Civilizations
      • Anthropology
      • Archaeology
      • Dinosaurs
      • Early Humans
      • Early Mammals
      • Evolution
      • Lost Treasures
      • Origin of Life
      • Paleontology
      • ... more topics
  • Society
    • View all the latest top news in the social sciences & education,
      or browse the topics below:
      Science & Society
      • Arts & Culture
      • Consumerism
      • Economics
      • Political Science
      • Privacy Issues
      • Public Health
      • Racial Disparity
      • Religion
      • Sports
      • World Development
      • ... more topics
      Business & Industry
      • Biotechnology & Bioengineering
      • Computers & Internet
      • Energy & Resources
      • Engineering
      • Medical Technology
      • Pharmaceuticals
      • Transportation
      • ... more topics
      Education & Learning
      • Animal Learning & Intelligence
      • Creativity
      • Educational Psychology
      • Educational Technology
      • Infant & Preschool Learning
      • Learning Disorders
      • STEM Education
      • ... more topics
  • Quirky
    • Top News
    • Human Quirks
    • Odd Creatures
    • Bizarre Things
    • Weird World
Free Subscriptions

Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

  • Email Newsletters
  • RSS Feeds
Follow Us

Keep up to date with the latest news from ScienceDaily via social networks:

  • Facebook
  • Twitter
  • LinkedIn
Have Feedback?

Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?

  • Leave Feedback
  • Contact Us
About This Site  |  Staff  |  Reviews  |  Contribute  |  Advertise  |  Privacy Policy  |  Editorial Policy  |  Terms of Use
Copyright 2021 ScienceDaily or by other parties, where indicated. All rights controlled by their respective owners.
Content on this website is for information only. It is not intended to provide medical or other professional advice.
Views expressed here do not necessarily reflect those of ScienceDaily, its staff, its contributors, or its partners.
Financial support for ScienceDaily comes from advertisements and referral programs, where indicated.
— CCPA: Do Not Sell My Information — — GDPR: Privacy Settings —