Data management checklist

Using a data management checklist

A checklist, such as the one below, can help with writing a thorough data management plan. It assists in identifying what good data practices to implement and which actions to take to optimise data sharing.

Resources and planning

  • Who will be responsible for which part of data management?
  • Will new skills be required for any activities?
  • Will you need extra resources to manage data, such as people, time, or hardware?
  • Will there be any costs associated with depositing data for long-term preservation and access?

Documenting

  • Will others be able to understand your data and use them properly?
  • Will the structured data be self-explanatory in terms of variable names, codes and abbreviations used?
  • Which descriptions and contextual documentation will explain how the data were collected, the methods used to create them, and what the data mean?
  • How will you label and organise data, and all the files?
  • How will you maintain consistency in data cataloguing?

Formatting

  • Will you be using standardised and consistent procedures to collect, process, transcribe, check, validate and verify data, such as standard protocols, templates or input forms?
  • Which data formats will you use? Do the formats and software you use enable sharing and the long-term sustainability of data, such as non-proprietary software and software based on open standards or specialised software widely used in the community?
  • When converting data across formats, will you check that no data, annotation or internal metadata has been lost or changed?

Storing

  • Will your digital and non-digital data, and any copies, be held in multiple safe and secure locations?
  • Will you work with personal or sensitive data? If so, will they be properly protected?
  • If data will be collected with mobile devices, how will you transfer and store the data?
  • If data will be held in multiple places, how will you keep track of versions? How will you know which version is the master version?
  • Will your files be backed up sufficiently and regularly, and will backups be stored safely?
  • Who will have access to which data during and after research? Will there be a need for access restrictions? How will these restrictions be managed after you are no longer around?
  • How long will you store your data for and have you planned which data to keep and which to destroy and when?

Confidentiality, ethics and consent

  • Will your Consent Form, Participant Information Sheet and any other protocols clearly describe how data will be collected, used, shared and reused?
  • Will you discuss data sharing plans with the respondents from whom you collect the data?
  • Will you have plans in place to handle participant withdrawal requests, including data deletion or withdrawal?
  • Will the data contain confidential or sensitive information? If so, how will you handle storing and sharing this data responsibly?
  • Will you consider cultural sensitivities or specific ethical considerations related to the communities or groups involved in your research?
  • If you will be collecting and processing personal data, have you considered conducting a Data Privacy Impact Assessment?
  • If you use third-party services or cloud storage, have you evaluated their compliance with data protection regulations?
  • Will you be anonymising the data for future reuse to enable wider sharing? Will you share more detailed versions of the data under more stringent access conditions?

Copyright

  • For both primary and secondary data, will you plan to clearly establish who owns the copyright in the data? Might there be joint copyright ownership?
  • If you will be purchasing or reusing someone else’s data sources, will you consider how that data might be shareable, if at all? For example, will you consider negotiating a new licence with the original supplier?

Sharing

  • Will you make all of your data available for sharing, or will you have to decide which data to preserve and share?
  • How and where will you preserve your research data for the long term?
  • Will you select and apply an appropriate licence that will clarify how your data can be reused?
  • Will you need a dissemination embargo, and if so, how long will it be?
  • How will you ensure your shared data meets the FAIR data principles?