Data repositories

If you need to store your research data in a data repository, you have several options:

  • General data repositories
  • Domain-specific data repositories - recommended
  • Institutional data repositories

The repository should assign a DOI identifier to the dataset, let you set the access level (open or closed data), and let you select a license for further use of the data (e.g. Creative Commons CC BY).
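The three criteria above can be written down as a simple checklist. The following is only a minimal sketch: the `Repository` class, its field names, and the "Example Repository" entry are hypothetical and are not taken from any real repository's API.

```python
from dataclasses import dataclass


@dataclass
class Repository:
    """Hypothetical record describing what a repository offers."""
    name: str
    assigns_doi: bool
    supports_access_levels: bool   # open vs. closed data
    supports_license_choice: bool  # e.g. Creative Commons CC BY


def missing_requirements(repo: Repository) -> list[str]:
    """Return the criteria from the text that the repository does not meet."""
    missing = []
    if not repo.assigns_doi:
        missing.append("DOI assignment")
    if not repo.supports_access_levels:
        missing.append("access level setting")
    if not repo.supports_license_choice:
        missing.append("license selection")
    return missing


# Made-up example: a repository that offers no choice of license.
example = Repository("Example Repository", True, True, False)
print(missing_requirements(example))  # ['license selection']
```

An empty list means the repository meets all three criteria.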

General data repositories
They can be used by researchers in any field.

Zenodo

Zenodo is a general repository created by OpenAIRE and CERN. In addition to research data, it accepts articles, code, posters and presentations. Files up to 50 GB can be uploaded; larger files require an agreement. The repository can assign a DOI identifier to the results. The results can be licensed under a Creative Commons license.

The metadata follows the DataCite Metadata Schema. Metadata can be exported in the following formats: MARCXML, Dublin Core, DataCite, DCAT, JSON-LD (Schema.org). The repository does not hold CoreTrustSeal certification, because under current rules this certificate cannot be awarded to general repositories. Nevertheless, funders consider the Zenodo repository trustworthy.

National Data Repository

The National Data Repository is operated by CESNET and is still in pilot operation. In the future, it will be one of the main repositories of research data in the Czech Republic within the National Repository Platform. Control of stored records is managed by the National Technical Library. The size of the dataset is limited to 500 GB. The dataset is assigned a DOI identifier. The results can be licensed under a Creative Commons license.

Harvard Dataverse

Harvard Dataverse is a data repository that can be used by researchers in any field, although most of its records come from the social sciences. Files up to 1 TB can be uploaded. Datasets are assigned a DOI identifier. The metadata meets the Dublin Core, DataCite, OpenAIRE and other standards. The results can be licensed under a Creative Commons license.

Figshare

Figshare is a data repository from Digital Science, part of the Springer Nature portfolio. Files up to 20 GB can be uploaded. Datasets are assigned a DOI identifier by DataCite. The results can be licensed under a Creative Commons license. Metadata follows the DataCite standard. Figshare is considered a trusted repository and is ISO 27001 certified.

Dryad

The repository contains research data mainly from the life and medical sciences. You can upload a file up to 300 GB (up to 50 GB for free). Datasets are assigned a DOI identifier. The results can be licensed under a Creative Commons license.
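The upload limits listed above can be compared against the size of a planned deposit. The sketch below uses only the figures stated in this section and makes some simplifying assumptions: per-file and per-dataset limits are treated as one number, 1 TB is taken as 1000 GB, and Dryad's 50 GB free tier and Zenodo's option of larger uploads by agreement are ignored. The function name is illustrative.

```python
# Upload limits in GB, as stated in the repository descriptions above.
# Simplification: per-file and per-dataset limits are treated alike.
UPLOAD_LIMITS_GB = {
    "Zenodo": 50,                     # larger files require an agreement
    "National Data Repository": 500,
    "Harvard Dataverse": 1000,        # 1 TB, taken here as 1000 GB
    "Figshare": 20,
    "Dryad": 300,                     # only up to 50 GB is free
}


def repositories_accepting(size_gb: float) -> list[str]:
    """Return the general repositories whose stated limit covers size_gb."""
    return sorted(
        name for name, limit in UPLOAD_LIMITS_GB.items() if size_gb <= limit
    )


print(repositories_accepting(120))
# ['Dryad', 'Harvard Dataverse', 'National Data Repository']
```

Limits change over time, so check the figures against each repository's current terms before relying on them.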

Domain-specific data repositories
We recommend storing research data in subject repositories wherever possible. You can search for a subject repository in the repository registries re3data or OpenDOAR. Always read the repository's terms and conditions carefully to check whether you are allowed to deposit data there.

Czech Social Science Data Archive

The archive provides long-term preservation of and access to social science research data. Data storage is based on a contract between the data producer and the Institute of Sociology of the Czech Academy of Sciences (of which the CSSDA is a part). The repository is CoreTrustSeal certified. Research data are assigned a DOI identifier.

LINDAT/CLARIAH-CZ

A domain repository for linguistic data and tools, built by the Institute of Formal and Applied Linguistics of the Faculty of Mathematics and Physics, Charles University (MFF UK). The repository assigns a Handle identifier and allows you to select a license for the data.
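DOI and Handle identifiers are both persistent identifiers that resolve to a web address through a public resolver service (doi.org for DOIs, hdl.handle.net for Handles). The sketch below shows how a citable link might be built from either kind. The function name is illustrative, and the identifier values are made-up placeholders, not real records.

```python
# Public resolver services for the two identifier types mentioned in the text.
RESOLVERS = {
    "doi": "https://doi.org/",
    "handle": "https://hdl.handle.net/",
}


def resolver_url(scheme: str, identifier: str) -> str:
    """Build a resolvable link for a DOI or Handle identifier."""
    try:
        base = RESOLVERS[scheme.lower()]
    except KeyError:
        raise ValueError(f"unknown identifier scheme: {scheme}") from None
    return base + identifier.strip()


# Placeholder identifiers for illustration only.
print(resolver_url("doi", "10.1234/example"))
print(resolver_url("handle", "12345/example"))
```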

Data repositories recommended by some journals and publishers