Mandated data types
For the following data types submission to a community-endorsed, public repository is mandatory. Persistent identifiers (DOIs and accession numbers) assigned to the data by the repository must be appropriately cited and referenced in the published article.
DNA sequence data*
RNA sequence data*
Genome assembly data*
Protein sequence data
Any ProteomeXchange member repository
Genetic variation data
dbSNP (human variations less than 50bp)
dbVar (human variations greater than 50bp)
European Variation Archive (EVA) (all species)Genome Sequence Archive for Human (human variation)
Functional genomics data
Macromolecular structure data
Gene expression data
Crystallographic data for small molecules
*Novel DNA sequence, novel RNA sequence, and novel genome assembly data must be deposited to repositories that are part of the International Nucleotide Sequence Collaboration (INSDC), or those which are working towards INSDC inclusion (as listed in the table), unless there are privacy or ethics restrictions that prevent open sharing of such data. Novel DNA sequence, novel RNA sequence, and novel genome assembly data may in addition be deposited to any other repository (including regional or national repositories) as required.