A new German-based project is setting out to rescue biodiversity data at risk of being lost, because they are not integrated in institutional databases, are kept in outdated digital storage systems, or are not properly documented.
The project, run by the Botanic Garden and Botanical Museum Berlin-Dahlem, provides a good example for a GBIF recommendation to establish hosting centres for biodiversity data. This is one of a set of data management recommendations just published by GBIF.
The team behind the German project called reBiND, or Biodiversity Needs Data, has started identifying threatened databases for archiving, and will make them accessible via the GBIF network.
The focus is initially on specimen and observational data that are already digitized but that are not part of the documentation process of a museum or other institution. Examples include data from diplomas or PhD theses, generally stored on a computer hard drive or a disc and often in danger of getting lost because of lack of documentation.
Examples of data being 'rescued' by the reBiND project are:
The workflow developed by the reBiND project uses the Biological Collections Access Services (BioCASe) provider software package to transform data into XML files. BioCASe is one of the publishing tools through which data are published to the GBIF network. Repair software detects and corrects any errors introduced during the conversion process. ReBiND aims to enable users with a minimum of technical background knowledge to transform and archive their biodiversity data.
At present, the XML files containing the rescued data are stored on the project's own server, in a database specifically designed for the purpose. The intention is to make the data discoverable and accessible globally through the GBIF network, and the team is working with GBIF Germany to bring this about.
The project expects to take on data rescue work globally. The team is also working on a best practice handbook on the rescue and storage of threatened digital data.
The three-year project is funded by the German Research Foundation (Deutsche Forschungsgemeinschaft).
The GBIF position paper on data hosting infrastructure for primary biodiversity data looks at the rescuing and re-hosting of data stored in formats that are difficult to access. It emphasizes that the biodiversity community must adopt standards and develop tools to enable data discovery and thus help preserve data.
Cite This Page: