New! Sign up for our free email newsletter.
Science News
from research organizations

Artificial neural networks could power up curation of natural history collections

Deep learning techniques manage to differentiate between similar plant families with up to 99 percent accuracy

Date:
November 2, 2017
Source:
Pensoft Publishers
Summary:
Fed with new knowledge for centuries, natural history collections contain critical data for many scientific endeavors. While recent efforts in mass digitization have already provided unprecedented insight by generating large datasets from these collections, a new pilot project -- one of the first of its kind -- suggests that the key to efficiently studying these data might lie in the new-age deep learning techniques.
Share:
FULL STORY

Millions, if not billions, of specimens reside in the world's natural history collections, but most of these have not been carefully studied, or even looked at, in decades. While containing critical data for many scientific endeavors, most objects are quietly sitting in their own little cabinets of curiosity.

Thus, mass digitization of natural history collections has become a major goal at museums around the world. Having brought together numerous biologists, curators, volunteers and citizens scientists, such initiatives have already generated large datasets from these collections and provided unprecedented insight.

Now, a study, recently published in the open access Biodiversity Data Journal, suggests that the latest advances in both digitization and machine learning might together be able to assist museum curators in their efforts to care for and learn from this incredible global resource.

A team of researchers from the Smithsonian Department of Botany, Data Science Lab, and Digitization Program Office recently collaborated with NVIDIA to carry out a pilot project using deep learning approaches to dig into digitized herbarium specimens.

Their study is among the first to describe the use of deep learning methods to enhance our understanding of digitized collection samples. It is also the first to demonstrate that a deep convolutional neural network -- a computing system modelled after the neuron activity in animal brains that can basically learn on its own -- can effectively differentiate between similar plants with an amazing accuracy of nearly 100%.

In the paper, the scientists describe two different neural networks that they trained to perform tasks on the digitized portion (currently 1.2 million specimens) of the United States National Herbarium.

The team first trained a net to automatically recognize herbarium sheets that had been stained with mercury crystals, since mercury was commonly used by some early collectors to protect the plant collections from insect damage. The second net was trained to discriminate between two families of plants that share a strikingly similar superficial appearance.

The trained neural nets performed with 90% and 96% accuracy respectively (or 94% and 99% if the most challenging specimens were discarded), confirming that deep learning is a useful and important technology for the future analysis of digitized museum collections.

"The results can be leveraged both to improve curation and unlock new avenues of research," conclude the scientists.

"This research paper is a wonderful proof of concept. We now know that we can apply machine learning to digitized natural history specimens to solve curatorial and identification problems. The future will be using these tools combined with large shared data sets to test fundamental hypotheses about the evolution and distribution of plants and animals," says Dr. Laurence J. Dorr, Chair of the Smithsonian Department of Botany and co-author of the study.


Story Source:

Materials provided by Pensoft Publishers. The original story is licensed under a Creative Commons License. Note: Content may be edited for style and length.


Journal Reference:

  1. Eric Schuettpelz, Paul Frandsen, Rebecca Dikow, Abel Brown, Sylvia Orli, Melinda Peters, Adam Metallo, Vicki Funk, Laurence Dorr. Applications of deep convolutional neural networks to digitized natural history collections. Biodiversity Data Journal, 2017; 5: e21139 DOI: 10.3897/BDJ.5.e21139

Cite This Page:

Pensoft Publishers. "Artificial neural networks could power up curation of natural history collections." ScienceDaily. ScienceDaily, 2 November 2017. <www.sciencedaily.com/releases/2017/11/171102091102.htm>.
Pensoft Publishers. (2017, November 2). Artificial neural networks could power up curation of natural history collections. ScienceDaily. Retrieved April 25, 2024 from www.sciencedaily.com/releases/2017/11/171102091102.htm
Pensoft Publishers. "Artificial neural networks could power up curation of natural history collections." ScienceDaily. www.sciencedaily.com/releases/2017/11/171102091102.htm (accessed April 25, 2024).

Explore More

from ScienceDaily

RELATED STORIES