How do I choose a repository for sharing and/or archiving my data?
NewSeveral factors should be considered when selecting a repository to share and archive your data.
Best practices for choosing a repository vary, but in general, researchers are encouraged to use (1) specific or required repositories (if applicable), (2) subject-specific repositories, (3) institutional repositories, or (4) generalist repositories, in that order.
Below is a flowchart prepared by the IU Research Data Steward that advises how to choose which type of repository to utilize in sharing your data.
Specific or Required Repositories
If your research funder or sponsor requires a specific database to store and share your data, then use the repository specified by your funder or sponsor.
Subject Specific Repositories
If your funder or sponsor does not require a specific repository, there are many specialized databases that you can use to store your data based on your discipline of research. One way to find a specialized database is through the RCRegistry of Data Repositories (re3data.org) Re3data is a global registry of research data repositories
Institutional Repositories
Institutional repositories are general repositories that serve an institutional community. RCIU DataCORE Provides access & preservation services for digital research data
You can find additional information regarding repositories and how to make your data accessible through the RCIU Libraries Open Data Website Benefits to open data
For more information about data sharing options, see the RCData Sharing Pathways Options for sharing data research for reuse
General Repositories
There are several general repositories available to store and share your data. The Generalist Repository Comparison Chart can help you compare the features of various generalist repository options to determine which is best suited for your data needs. Examples include RCFigshare Repository that stores any type of file and visualizes hundreds of file formats
For more information about data sharing options, see the RCData Sharing Pathways Options for sharing data research for reuse
There are several key features to consider when selecting a repository to use.
The NIH offers a list of RCDesirable Characteristics for All Data Repositories List of desirable characteristics to look for when choosing a repository
- Long-term sustainability
- Metadata
- Curation & quality assurance
- Free & easy access
- Broad & measured reuse
- Clear user guidance
- Security & integrity
- Confidentiality
- Common format
- Provenance
- Retention policy
Ensuring data and metadata follow FAIR Guiding Principles:
- Findable
- Accessible
- Interoperable
- Reusable
Yes, researchers can deposit the same dataset in multiple repositories; however, this approach should be done strategically and transparently to avoid confusion, versioning issues, or noncompliance with repository or funder policies. At Indiana University, data librarians generally recommend that researchers identify one authoritative version of the dataset, but there are legitimate scenarios where a secondary deposit, often referred to as “dual deposit”, is appropriate and even advisable.
Two key reasons to consider depositing in more than one location are redundancy and preservation. For example, if a researcher deposits data in a federally funded or domain-specific repository that meets funder requirements (e.g., GenBank, ICPSR, NIH dbGaP), they might also create a backup copy in IU’s RCScholarly Data Archive (SDA)
Researchers should avoid indiscriminate or uncoordinated duplication across multiple repositories, as this can create confusion about which version is definitive, especially when RCDOIs Mints DOIs for scholarly work deposited in DataCore & IUScholarWorks
The potential defunding or shutdown of a data repository is a growing concern in the research community. If a repository becomes inaccessible, researchers risk not only losing access to their data but also the ability for others to find, cite, and reuse it, which impacts reproducibility, compliance, and scholarly visibility. To mitigate this risk, researchers at Indiana University are encouraged to adopt redundant data preservation strategies and proactively plan for repository instability.
The first line of defense is to maintain a secure backup copy of any data deposited into an external repository. IU researchers can use the RCScholarly Data Archive (SDA)
Additionally, IU’s RCDataCORE Provides access & preservation services for digital research data
Finally, IU data librarians can assist with transition planning and repository evaluations. If researchers become aware that a repository they use is at risk, or if they receive notification of changes to access policies or infrastructure, they should immediately contact the repository for guidance. IU’s support teams can help extract data before shutdown, prepare a replacement deposit, and maintain data availability statements in line with funder or publisher requirements. Taking these proactive steps ensures that research outputs remain accessible, reliable, and preserved, regardless of external disruptions.