Research data policies and services

Biological sciences

Nucleic acid sequence

Sequence information should be deposited following the MIxS guidelines.
Simple genetic polymorphisms or structural variations should be submitted to dbSNP or dbVar (please note that these repositories cannot accept sensitive data derived from human subjects); the NCBI Trace Archive may be used for capillary electrophoresis data, while SRA accepts NGS data only.

Nucleic acid sequence repositories
Database of Genomic Variants Archive (DGVa)
dbSNP
dbVar

DNA DataBank of Japan (DDBJ)

EBI Metagenomics

EMBL Nucleotide Sequence Database (ENA)

European Variation Archive (EVA)
GenBank
NCBI Assembly

NCBI Sequence Read Archive (SRA)

NCBI Trace Archive

Protein sequence

Protein sequence repositories

UniProtKB*

Molecular & supramolecular structure

These repositories accept structural data for small molecules (COD); peptides and proteins (all); and larger assemblies (EMDB).

Molecular & supramolecular structure repositories
Biological Magnetic Resonance Data Bank (BMRB)

Coherent X-ray Imaging Data Bank (CXIDB)

Crystallography Open Database (COD)

Electron Microscopy Data Bank (EMDB)

Protein Circular Dichroism Data Bank (PCDDB)
Structural Biology Data Grid

Worldwide Protein Data Bank (wwPDB)

Neuroscience

These data repositories all accept human-derived data (NeuroMorpho.org additionally accepts imaging data from other organisms). Please note that human-subject data submitted to OpenfMRI must be de-identified, while FCP/INDI can handle sensitive patient data.

Neuroscience repositories
Functional Connectomes Project International Neuroimaging Data-Sharing Initiative (FCP/INDI)

NeuroMorpho.org

OpenfMRI

Omics

Please refer to the MIAME standard for microarray data. Molecular interaction data should be deposited with a member of the International Molecular Exchange Consortium (IMEx), following the MIMIx recommendations.
For data linking genotyping and phenotyping information in human subjects, we strongly recommend submission to dbGAP or EGA, which have mechanisms in place to handle sensitive data.

Omics repositories

ArrayExpress

Biological General Repository for Interaction Datasets

Database of Interacting Proteins (DIP)

dbGAP
The European Genome-phenome Archive (EGA)

Gene Expression Omnibus (GEO)

GenomeRNAi

IntAct
Japanese Genotype-phenotype Archive
NCBI PubChem BioAssay

Metabolomics

Metabolomics data should be submitted following the MSI guidelines.

Metabolomics repositories

MetaboLights

Proteomics

We ask authors to submit proteomics data to members of the ProteomeXchange consortium (listed below), following the MIAPE recommendations.

Proteomics repositories

PeptideAtlas

PRIDE

ProteomeXchange

Taxonomy & species diversity

Taxonomy & species diversity repositories
Global Biodiversity Information Facility (GBIF)

Integrated Taxonomic Information System (ITIS)

KNB: The Knowledge Network for Biocomplexity

MorphoBank.org

NCBI Taxonomy*

Mathematical & modelling resources

Mathematical & modelling resources repositories

BioModels Database

Kinetic Models of Biological Systems (KiMoSys)

Cytometry & immunology

Cytometry & immunology repositories

FlowRepository

ImmPort

Organism-focused resources

These resources provide information specific to a particular organism or disease pathogen. Where applicable, data records should be submitted both to a community repository and to one suitable for the type of data (e.g. transcriptome profiling; please see above).

Organism-focused resources repositories

Eukaryotic Pathogen Database Resources (EuPathDB)

FlyBase

Influenza Research Database

Mouse Genome Informatics (MGI)
Rat Genome Database (RGD)

VectorBase

Xenbase

Zebrafish Model Organism Database (ZFIN)