ScienceDaily
Your source for the latest research news
Follow Facebook Twitter LinkedIn Subscribe RSS Feeds Newsletters
New:
  • Mars Habitability Limited by Its Small Size
  • Plants Evolved Complexity in Two Bursts
  • Improving Survival of Cancer Patients
  • Climate Change Threatens Base of Polar Ecosytem
  • Cancer Cells’ Unexpected Genetic Tricks
  • We May Have Already Detected Dark Energy
  • Snakes and Dino-Killing Asteroid
  • Pancreatic 'Organoids' Mimic the Real Thing
  • Personality Matters, Even for Squirrels
  • Warming Climate: Animals 'Shapeshifting'
advertisement
Follow all of ScienceDaily's latest research news and top science headlines!
Science News
from research organizations

1

2

Could all your digital photos be stored as DNA?

A technique for labeling and retrieving DNA data files from a large pool could help make DNA data storage feasible

Date:
June 10, 2021
Source:
Massachusetts Institute of Technology
Summary:
Biological engineers have demonstrated a way to easily retrieve data files stored as DNA. This could be a step toward using DNA archives to store enormous quantities of photos, images, and other digital content.
Share:
FULL STORY

On Earth right now, there are about 10 trillion gigabytes of digital data, and every day, humans produce emails, photos, tweets, and other digital files that add up to another 2.5 million gigabytes of data. Much of this data is stored in enormous facilities known as exabyte data centers (an exabyte is 1 billion gigabytes), which can be the size of several football fields and cost around $1 billion to build and maintain.

advertisement

Many scientists believe that an alternative solution lies in the molecule that contains our genetic information: DNA, which evolved to store massive quantities of information at very high density. A coffee mug full of DNA could theoretically store all of the world's data, says Mark Bathe, an MIT professor of biological engineering.

"We need new solutions for storing these massive amounts of data that the world is accumulating, especially the archival data," says Bathe, who is also an associate member of the Broad Institute of MIT and Harvard. "DNA is a thousandfold denser than even flash memory, and another property that's interesting is that once you make the DNA polymer, it doesn't consume any energy. You can write the DNA and then store it forever."

Scientists have already demonstrated that they can encode images and pages of text as DNA. However, an easy way to pick out the desired file from a mixture of many pieces of DNA will also be needed. Bathe and his colleagues have now demonstrated one way to do that, by encapsulating each data file into a 6-micrometer particle of silica, which is labeled with short DNA sequences that reveal the contents.

Using this approach, the researchers demonstrated that they could accurately pull out individual images stored as DNA sequences from a set of 20 images. Given the number of possible labels that could be used, this approach could scale up to 1020 files.

Bathe is the senior author of the study, which appears today in Nature Materials. The lead authors of the paper are MIT senior postdoc James Banal, former MIT research associate Tyson Shepherd, and MIT graduate student Joseph Berleant.

advertisement

Stable storage

Digital storage systems encode text, photos, or any other kind of information as a series of 0s and 1s. This same information can be encoded in DNA using the four nucleotides that make up the genetic code: A, T, G, and C. For example, G and C could be used to represent 0 while A and T represent 1.

DNA has several other features that make it desirable as a storage medium: It is extremely stable, and it is fairly easy (but expensive) to synthesize and sequence. Also, because of its high density -- each nucleotide, equivalent to up to two bits, is about 1 cubic nanometer -- an exabyte of data stored as DNA could fit in the palm of your hand.

One obstacle to this kind of data storage is the cost of synthesizing such large amounts of DNA. Currently it would cost $1 trillion to write one petabyte of data (1 million gigabytes). To become competitive with magnetic tape, which is often used to store archival data, Bathe estimates that the cost of DNA synthesis would need to drop by about six orders of magnitude. Bathe says he anticipates that will happen within a decade or two, similar to how the cost of storing information on flash drives has dropped dramatically over the past couple of decades.

Aside from the cost, the other major bottleneck in using DNA to store data is the difficulty in picking out the file you want from all the others.

advertisement

"Assuming that the technologies for writing DNA get to a point where it's cost-effective to write an exabyte or zettabyte of data in DNA, then what? You're going to have a pile of DNA, which is a gazillion files, images or movies and other stuff, and you need to find the one picture or movie you're looking for," Bathe says. "It's like trying to find a needle in a haystack."

Currently, DNA files are conventionally retrieved using PCR (polymerase chain reaction). Each DNA data file includes a sequence that binds to a particular PCR primer. To pull out a specific file, that primer is added to the sample to find and amplify the desired sequence. However, one drawback to this approach is that there can be crosstalk between the primer and off-target DNA sequences, leading unwanted files to be pulled out. Also, the PCR retrieval process requires enzymes and ends up consuming most of the DNA that was in the pool.

"You're kind of burning the haystack to find the needle, because all the other DNA is not getting amplified and you're basically throwing it away," Bathe says.

File retrieval

As an alternative approach, the MIT team developed a new retrieval technique that involves encapsulating each DNA file into a small silica particle. Each capsule is labeled with single-stranded DNA "barcodes" that correspond to the contents of the file. To demonstrate this approach in a cost-effective manner, the researchers encoded 20 different images into pieces of DNA about 3,000 nucleotides long, which is equivalent to about 100 bytes. (They also showed that the capsules could fit DNA files up to a gigabyte in size.)

Each file was labeled with barcodes corresponding to labels such as "cat" or "airplane." When the researchers want to pull out a specific image, they remove a sample of the DNA and add primers that correspond to the labels they're looking for -- for example, "cat," "orange," and "wild" for an image of a tiger, or "cat," "orange," and "domestic" for a housecat.

The primers are labeled with fluorescent or magnetic particles, making it easy to pull out and identify any matches from the sample. This allows the desired file to be removed while leaving the rest of the DNA intact to be put back into storage. Their retrieval process allows Boolean logic statements such as "president AND 18th century" to generate George Washington as a result, similar to what is retrieved with a Google image search.

"At the current state of our proof-of-concept, we're at the 1 kilobyte per second search rate. Our file system's search rate is determined by the data size per capsule, which is currently limited by the prohibitive cost to write even 100 megabytes worth of data on DNA, and the number of sorters we can use in parallel. If DNA synthesis becomes cheap enough, we would be able to maximize the data size we can store per file with our approach," Banal says.

For their barcodes, the researchers used single-stranded DNA sequences from a library of 100,000 sequences, each about 25 nucleotides long, developed by Stephen Elledge, a professor of genetics and medicine at Harvard Medical School. If you put two of these labels on each file, you can uniquely label 1010 (10 billion) different files, and with four labels on each, you can uniquely label 1020 files.

Bathe envisions that this kind of DNA encapsulation could be useful for storing "cold" data, that is, data that is kept in an archive and not accessed very often. His lab is spinning out a startup, Cache DNA, that is now developing technology for long-term storage of DNA, both for DNA data storage in the long-term, and clinical and other preexisting DNA samples in the near-term.

"While it may be a while before DNA is viable as a data storage medium, there already exists a pressing need today for low-cost, massive storage solutions for preexisting DNA and RNA samples from Covid-19 testing, human genomic sequencing, and other areas of genomics," Bathe says.

The research was funded by the Office of Naval Research, the National Science Foundation, and the U.S. Army Research Office.

make a difference: sponsored opportunity

Story Source:

Materials provided by Massachusetts Institute of Technology. Original written by Anne Trafton. Note: Content may be edited for style and length.


Journal Reference:

  1. James L. Banal, Tyson R. Shepherd, Joseph Berleant, Hellen Huang, Miguel Reyes, Cheri M. Ackerman, Paul C. Blainey, Mark Bathe. Random access DNA memory using Boolean search in an archival file storage system. Nature Materials, 2021; DOI: 10.1038/s41563-021-01021-3

Cite This Page:

  • MLA
  • APA
  • Chicago
Massachusetts Institute of Technology. "Could all your digital photos be stored as DNA? A technique for labeling and retrieving DNA data files from a large pool could help make DNA data storage feasible." ScienceDaily. ScienceDaily, 10 June 2021. <www.sciencedaily.com/releases/2021/06/210610135710.htm>.
Massachusetts Institute of Technology. (2021, June 10). Could all your digital photos be stored as DNA? A technique for labeling and retrieving DNA data files from a large pool could help make DNA data storage feasible. ScienceDaily. Retrieved September 24, 2021 from www.sciencedaily.com/releases/2021/06/210610135710.htm
Massachusetts Institute of Technology. "Could all your digital photos be stored as DNA? A technique for labeling and retrieving DNA data files from a large pool could help make DNA data storage feasible." ScienceDaily. www.sciencedaily.com/releases/2021/06/210610135710.htm (accessed September 24, 2021).

  • RELATED TOPICS
    • Matter & Energy
      • Biometric
      • Organic Chemistry
      • Forensic Research
      • Microarrays
    • Computers & Math
      • Hacking
      • Encryption
      • Computers and Internet
      • Information Technology
advertisement

  • RELATED TERMS
    • 3D computer graphics
    • Scientific visualization
    • Search engine
    • Quantum computer
    • Computational genomics
    • Computer virus
    • Radiography
    • Computer worm

1

2

3

4

5
RELATED STORIES

New Study Shows the Potential of DNA-Based Data-Structures Systems
Aug. 12, 2021 — Engineers have created new dynamic DNA data structures able to store and recall information in an ordered way from DNA molecules. They also analyzed how these structures are able to be interfaced ...
New Twist on DNA Data Storage Lets Users Preview Stored Files
June 10, 2021 — Researchers have turned a longstanding challenge in DNA data storage into a tool, using it to offer users previews of stored data files -- such as thumbnail versions of image ...
Progress on Molecular Data Storage System
Feb. 4, 2020 — Scientists have shown that they can store and retrieve more than 200 kilobytes of digital image files by encoding the data in mixtures of new custom libraries of small ...
Molecular Thumb Drives: Researchers Store Digital Images in Metabolite Molecules
July 3, 2019 — In a step toward molecular storage systems that could hold vast amounts of data in tiny spaces, researchers have shown it's possible to store image files in solutions of common biological small ...
FROM AROUND THE WEB

ScienceDaily shares links with sites in the TrendMD network and earns revenue from third-party advertisers, where indicated.
  Print   Email   Share

advertisement

1

2

3

4

5
Most Popular
this week

SPACE & TIME
(c) sdecoret / stock.adobe.comHave We Detected Dark Energy? Scientists Say It's a Possibility
(c) dimazel / stock.adobe.comMars Habitability Limited by Its Small Size, Isotope Study Suggests
(c) dottedyeti / stock.adobe.comWill It Be Safe for Humans to Fly to Mars?
MATTER & ENERGY
(c) yuthana Choradet / stock.adobe.comA Universal Equation for the Shape of an Egg
(c) magicmine / stock.adobe.comEngineers Grow Pancreatic 'Organoids' That Mimic the Real Thing
Researchers Infuse Bacteria With Silver to Improve Power Efficiency in Fuel Cells
COMPUTERS & MATH
Three Reasons Why COVID-19 Can Cause Silent Hypoxia
(c) Dana.S / stock.adobe.comToward Next-Generation Brain-Computer Interface Systems
Taking Lessons from a Sea Slug, Study Points to Better Hardware for Artificial Intelligence
advertisement

Strange & Offbeat
 

SPACE & TIME
Carbon Dioxide Reactor Makes 'Martian Fuel'
Hubble Finds Early, Massive Galaxies Running on Empty
Unveiling Galaxies at Cosmic Dawn That Were Hiding Behind the Dust
MATTER & ENERGY
Tube-Shaped Robots Roll Up Stairs, Carry Carts, and Race One Another
Winged Microchip Is Smallest-Ever Human-Made Flying Structure
Blowing Up Medieval Gunpowder Recipes
COMPUTERS & MATH
Human Learning Can Be Duplicated in Solid Matter
Augmented Reality Helps Tackle Fear of Spiders
New DNA-Based Chip Can Be Programmed to Solve Complex Math Problems
SD
  • SD
    • Home Page
    • Top Science News
    • Latest News
  • Home
    • Home Page
    • Top Science News
    • Latest News
  • Health
    • View all the latest top news in the health sciences,
      or browse the topics below:
      Health & Medicine
      • Allergy
      • Alternative Medicine
      • Birth Control
      • Cancer
      • Diabetes
      • Diseases
      • Heart Disease
      • HIV and AIDS
      • Obesity
      • Stem Cells
      • ... more topics
      Mind & Brain
      • ADD and ADHD
      • Addiction
      • Alzheimer's
      • Autism
      • Depression
      • Headaches
      • Intelligence
      • Psychology
      • Relationships
      • Schizophrenia
      • ... more topics
      Living Well
      • Parenting
      • Pregnancy
      • Sexual Health
      • Skin Care
      • Men's Health
      • Women's Health
      • Nutrition
      • Diet and Weight Loss
      • Fitness
      • Healthy Aging
      • ... more topics
  • Tech
    • View all the latest top news in the physical sciences & technology,
      or browse the topics below:
      Matter & Energy
      • Aviation
      • Chemistry
      • Electronics
      • Fossil Fuels
      • Nanotechnology
      • Physics
      • Quantum Physics
      • Solar Energy
      • Technology
      • Wind Energy
      • ... more topics
      Space & Time
      • Astronomy
      • Black Holes
      • Dark Matter
      • Extrasolar Planets
      • Mars
      • Moon
      • Solar System
      • Space Telescopes
      • Stars
      • Sun
      • ... more topics
      Computers & Math
      • Artificial Intelligence
      • Communications
      • Computer Science
      • Hacking
      • Mathematics
      • Quantum Computers
      • Robotics
      • Software
      • Video Games
      • Virtual Reality
      • ... more topics
  • Enviro
    • View all the latest top news in the environmental sciences,
      or browse the topics below:
      Plants & Animals
      • Agriculture and Food
      • Animals
      • Biology
      • Biotechnology
      • Endangered Animals
      • Extinction
      • Genetically Modified
      • Microbes and More
      • New Species
      • Zoology
      • ... more topics
      Earth & Climate
      • Climate
      • Earthquakes
      • Environment
      • Geography
      • Geology
      • Global Warming
      • Hurricanes
      • Ozone Holes
      • Pollution
      • Weather
      • ... more topics
      Fossils & Ruins
      • Ancient Civilizations
      • Anthropology
      • Archaeology
      • Dinosaurs
      • Early Humans
      • Early Mammals
      • Evolution
      • Lost Treasures
      • Origin of Life
      • Paleontology
      • ... more topics
  • Society
    • View all the latest top news in the social sciences & education,
      or browse the topics below:
      Science & Society
      • Arts & Culture
      • Consumerism
      • Economics
      • Political Science
      • Privacy Issues
      • Public Health
      • Racial Disparity
      • Religion
      • Sports
      • World Development
      • ... more topics
      Business & Industry
      • Biotechnology & Bioengineering
      • Computers & Internet
      • Energy & Resources
      • Engineering
      • Medical Technology
      • Pharmaceuticals
      • Transportation
      • ... more topics
      Education & Learning
      • Animal Learning & Intelligence
      • Creativity
      • Educational Psychology
      • Educational Technology
      • Infant & Preschool Learning
      • Learning Disorders
      • STEM Education
      • ... more topics
  • Quirky
    • Top News
    • Human Quirks
    • Odd Creatures
    • Bizarre Things
    • Weird World
Free Subscriptions

Get the latest science news with ScienceDaily's free email newsletters, updated daily and weekly. Or view hourly updated newsfeeds in your RSS reader:

  • Email Newsletters
  • RSS Feeds
Follow Us

Keep up to date with the latest news from ScienceDaily via social networks:

  • Facebook
  • Twitter
  • LinkedIn
Have Feedback?

Tell us what you think of ScienceDaily -- we welcome both positive and negative comments. Have any problems using the site? Questions?

  • Leave Feedback
  • Contact Us
About This Site  |  Staff  |  Reviews  |  Contribute  |  Advertise  |  Privacy Policy  |  Editorial Policy  |  Terms of Use
Copyright 2021 ScienceDaily or by other parties, where indicated. All rights controlled by their respective owners.
Content on this website is for information only. It is not intended to provide medical or other professional advice.
Views expressed here do not necessarily reflect those of ScienceDaily, its staff, its contributors, or its partners.
Financial support for ScienceDaily comes from advertisements and referral programs, where indicated.
— CCPA: Do Not Sell My Information — — GDPR: Privacy Settings —