In general, data should be submitted to discipline-specific, community-recognised repositories where possible, or to generalist repositories if no suitable community resource is available.
This list is derived from Scientific Data’s recommended repository list. The list is also available in figshare for free reuse, with attribution, by others. Scientific Data is an open access data journal published by Springer Nature.
Authors should consult journal information for authors in case of more specific repository recommendations. Some Springer Nature journals, such as Human Genome Variation, maintain their own research data repositories.
This list is not currently comprehensive for all research disciplines. Researchers from disciplines such as Computer Science and the Humanities should also consult the global registry of research data repositories re3data.org, or consider one of the general repositories from the recommended repositories list, such as figshare.
If an author’s preferred repository is not listed below, in the first instance we encourage repository managers to investigate listing their repositories with re3data.org. Life sciences repositories should also be listed with Biosharing. For those that meet the criteria, they should apply for listing with Scientific Data.
Nucleic acid sequence
Sequence information should be deposited following the MIxS guidelines.
Simple genetic polymorphisms or structural variations should be submitted to dbSNP or dbVar (please note that these repositories cannot accept sensitive data derived from human subjects); the NCBI Trace Archive may be used for capillary electrophoresis data, while SRA accepts NGS data only.
|Nucleic acid sequence repositories|
|Database of Genomic Variants Archive (DGVa)|
|European Variation Archive (EVA)|
|Protein sequence repositories|
Molecular & supramolecular structure
These repositories accept structural data for small molecules (COD); peptides and proteins (all); and larger assemblies (EMDB).
|Molecular & supramolecular structure repositories|
|Biological Magnetic Resonance Data Bank (BMRB)|
|Protein Circular Dichroism Data Bank (PCDDB)|
|Structural Biology Data Grid|
These data repositories all accept human-derived data (NeuroMorpho.org additionally accepts imaging data from other organisms). Please note that human-subject data submitted to OpenfMRI must be de-identified, while FCP/INDI can handle sensitive patient data.
|Functional Connectomes Project International Neuroimaging Data-Sharing Initiative (FCP/INDI)|
Please refer to the MIAME standard for microarray data. Molecular interaction data should be deposited with a member of the International Molecular Exchange Consortium (IMEx), following the MIMIx recommendations.
For data linking genotyping and phenotyping information in human subjects, we strongly recommend submission to dbGAP or EGA, which have mechanisms in place to handle sensitive data.
|The European Genome-phenome Archive (EGA)|
|Japanese Genotype-phenotype Archive|
|NCBI PubChem BioAssay|
Metabolomics data should be submitted following the MSI guidelines.
Taxonomy & species diversity
|Taxonomy & species diversity repositories|
|Global Biodiversity Information Facility (GBIF)|
Mathematical & modelling resources
|Mathematical & modelling resources repositories|
These resources provide information specific to a particular organism or disease pathogen. Where applicable, data records should be submitted both to a community repository and to one suitable for the type of data (e.g. transcriptome profiling; please see above).
|Organism-focused resources repositories|
|Mouse Genome Informatics (MGI)|
|Rat Genome Database (RGD)|
Earth & environmental sciencesTop
Some of the repositories in this section are suitable for datasets requiring restricted data access, which may be required for the preservation of study participant anonymity in clinical datasets. We suggest contacting repositories directly to determine those with data access controls best suited to the specific requirements of your study.
|Health sciences repositories|
|Virtual Skeleton Database (SICAS medical image repository)|
|National Database for Clinical Trials related to Mental Illness (NDCT)|
Springer Nature encourages authors to archive data to one of the above data-type specific repositories. However, generalist repositories can handle a wide variety of data, and may also be appropriate for storage of associated analyses, or experimental-control data, supplementing the primary data record.