How to preserve and share data in data repositories?

Preservation and sharing of data in a data repository

The preferred option for data preservation and sharing is to deposit data in an established, trustworthy research data repository.

A data repository is an online platform that is used to: 

  • Publish completed datasets.
  • Share datasets externally with different access levels and reuse conditions.
  • Preserve datasets in the long term. 

A data repository is a database infrastructure that:

  • Compiles data from (many) different providers/researchers. 
  • Manages data, typically in line with the FAIR data principles.
  • Gives access to data and associated metadata and documentation.  

Personal websites and databases, version control systems as well as cloud storage services (GitHub, Dropbox, Google Drive, etc.) are not considered data repositories. 

 

How to select a suitable data repository? 

There are hundreds of data repositories or archives to choose from. Keep in mind, however, that not all repositories are equivalent. Some repositories focus more on disseminating and making your data visible than on ensuring their preservation in the long term. 

Basic tips 

Additional considerations 

  • Does the repository match your data needs (e.g. in terms of accepted data types and formats, access levels, licenses, legal requirements for data protection…)? Read the data submission guidelines (see below) on the website of the repository itself to check the scope of the repository.
  • Does it charge for its services?
  • Does it have an explicit commitment to long-term preservation?
  • Does it provide a landing page for each dataset, with publicly available metadata?
  • Does it assign persistent and globally unique identifiers?
  • Does it provide clarity about access levels and conditions?
  • Does it provide information about usage licenses?
  • Is it a trustworthy repository?
  • Is it certified?
  • Is it community-based, or a commercial solution?
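
To compare several candidate repositories, the questions above can be recorded as a simple yes/no checklist. The following is only a minimal sketch in Python: the class name, the field names and the example answers are hypothetical and chosen for illustration, and the questions about trustworthiness and about community-based versus commercial solutions are left out because they rarely reduce to a plain yes or no.

```python
from dataclasses import dataclass, fields


@dataclass
class RepositoryCheck:
    """Yes/no answers for one candidate repository (hypothetical field names)."""
    matches_data_needs: bool
    costs_acceptable: bool
    commits_to_long_term_preservation: bool
    provides_landing_pages: bool
    assigns_persistent_identifiers: bool
    clear_access_conditions: bool
    states_usage_licenses: bool
    certified: bool


def unmet_criteria(check):
    """Return the names of all criteria that were answered with 'no'."""
    return [f.name for f in fields(check) if not getattr(check, f.name)]


# Made-up answers for an imaginary repository, for illustration only.
candidate = RepositoryCheck(
    matches_data_needs=True,
    costs_acceptable=True,
    commits_to_long_term_preservation=False,
    provides_landing_pages=True,
    assigns_persistent_identifiers=True,
    clear_access_conditions=True,
    states_usage_licenses=True,
    certified=False,
)

print(unmet_criteria(candidate))
```

A list of unmet criteria is a starting point for discussion, not a verdict: a repository without certification may still be the best fit for a given discipline.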

 

How to prepare data for preservation and sharing in a data repository? 

Once you have identified a suitable data repository or archive, check its data submission guidelines in advance, so you can adequately prepare your data for deposit. The data repository will have guidelines on how to build a data package. 

Examples of data submission guidelines: Dryad, 4TU.ResearchData, Pangaea, Sodha
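
What a data package must contain depends on the repository, so always follow its own guidelines first. As one possible preparation step, the sketch below lists every file in a package folder together with a SHA-256 checksum, which lets you and later users verify that files were not altered or corrupted. The manifest file name `MANIFEST.txt`, the function names and the example files are assumptions made for this illustration; none of the repositories named above is known to require this exact layout.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Compute the SHA-256 checksum of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(package_dir, manifest_name="MANIFEST.txt"):
    """Write '<checksum>  <relative path>' lines for every file in the package."""
    package_dir = Path(package_dir)
    files = [
        p for p in package_dir.rglob("*")
        if p.is_file() and p.name != manifest_name
    ]
    # Sort by relative path so the manifest is stable across runs.
    entries = sorted((p.relative_to(package_dir).as_posix(), p) for p in files)
    lines = [f"{sha256_of(p)}  {relative}" for relative, p in entries]
    (package_dir / manifest_name).write_text(
        "\n".join(lines) + "\n", encoding="utf-8"
    )
    return lines


# Demonstration on a throwaway folder with two made-up files.
with tempfile.TemporaryDirectory() as tmp:
    package = Path(tmp)
    (package / "data").mkdir()
    (package / "data" / "measurements.csv").write_text(
        "id,value\n1,3.2\n", encoding="utf-8"
    )
    (package / "readme.txt").write_text(
        "Example dataset description.\n", encoding="utf-8"
    )
    manifest = build_manifest(package)
    manifest_written = (package / "MANIFEST.txt").read_text(encoding="utf-8")

print("\n".join(manifest))
```

Run against a real package folder, `build_manifest("path/to/package")` would write the manifest next to the data files; the path here is a placeholder.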



Last modified Nov. 7, 2024, 2:21 p.m.