Set up
Description¶
During Set up, the research is conceptualised, planned and funded. This stage highlights the environment in which dataset creation will happen, as the research question, mission and values will be defined and data collection practices outlined. At this stage, you are thinking about dataset creation as a potential method and/or output for your research, but are not actively undertaking creation yet - this stage merely sets the scene for it.
📔 Recommended documentation¶
Describe your project beginnings.
→ Write your Data Management Plan, Ethical Commitments and Mission Statement
→ Document your answers to questions regarding project conceptualisation. This will both be a guide as well as a place of reflection throughout your research.
Tasks¶
Identify a Research Topic¶
When undertaking this task, what should you consider?
Positionality¶
- How does the social, political, economic, cultural, academic context you are embedded in impact your choice of research?
- Are there policies/guidelines within your institution that are relevant to your project – if so, which ones?1
Silences¶
- What gaps exist in current research?
- What gaps are you trying to fill in your research? Time period, geography, actors, concepts?
- What is your primary source data and what are its advantages and shortcomings?
Representation¶
- Who is represented in the scholarship and sources that are used? And who is not represented?
Impact¶
- Who do the scholarship and/or research projects currently impact positively and/or negatively, and in what way? In terms of accessibility (digital divide), representation, use of language?
What are good practices in relation to this task?
- Reflect on these questions as part of project conceptualization.
- Document discussions held, research done and decisions made for future reference.
- Have your research be led by social justice values and positive societal impact - however your project may define these.2
- Ensure your research question(s) are shaped by the availability and content of sources. Heavily biased sources can be the best resources for one research question, but totally unfit to answer another.
Resources¶
Further websites with helpful guiding questions and toolkits to consider before the start of your research:
- Data Ethics Decision Aid: https://deda.dataschool.nl/en/
- USC Libraries, Inclusive and Responsible Dataset Usage: https://libguides.usc.edu/inclusive-datasets/considerations
- University of Bath, Research Data Service Homepage: Research Data Home: https://library.bath.ac.uk/research-data
- RDMkit, Ethical aspects: https://rdmkit.elixir-europe.org/ethics#which-aspects-of-rdm-might-raise-ethical-issues
- Always Already Computational - Collections as Data. 50 things you can do: https://collectionsasdata.github.io/fiftythings/
Write Funding Proposals¶
When undertaking this task, what should you consider?
Positionality¶
- How does the social, political, economic, cultural, academic context you are embedded in impact what topics of research are receiving funding? In what way are you aligning your research to this?
- Does your institution have in-house expertise on writing successful funding proposals? How does this impact your position?
Recruitment¶
- Will your recruitment practices be inclusive?
Collaboration¶
- In what way could you work with affected communities?
- In what way could you collaborate with specialist academics/researchers to mitigate potential biases in your research?
Representation¶
- Who is and who is not represented in your research funding outline?
Impact¶
- Who would your research impact positively and/or negatively, and in what way? In terms of agenda, accessibility (digital divide), representation, use of language?
What are good practices in relation to this task?
-
Include ethical considerations into your proposal, whether or not this is asked for. Show the funding agency that alongside their own mission, ethical considerations are necessary and urgent to think about.
-
Speak about how will you allocate money to the combatting of bias (e.g. extra personnel, part of research is on ethics issues, pot of money for collaboration, widen theme of research or source base as an attempt to combat bias)
-
Actively pursue the allocation of money to collaborate with affected communities. Ensure enough money is set aside for all aspects of community engagement (monetary remuneration for expertise; cultivating sense of community between collaborators/citizen scientists4). Also consider whether you can create sustainable networks that will contribute to the setting up of future projects.
-
Focus on the creation of a diverse and inclusive project and team.
Resources¶
- Loughborough University, Embedding EDI in Grant Proposals. Version: Feb 2023.
- Maastricht University, How to Address Diversity in Research Proposals - A Guide.
Write a Data Management Plan (DMP)¶
Description: A DMP is a roadmap for good data management at every stage of research. It is a living document that will reflect changes and updates made to your research.
When undertaking this task, what should you consider?
FAIR¶
- How do you plan to publish FAIR data?
- Do you foresee any issues with aligning your project with the FAIR principles? Why?
Impact¶
- In what way would your data harm other persons and the environment? 3
Ownership¶
- Who (legally) owns the data you will produce?
- How will you acknowledge data contributors and sources?
Durability¶
- How will you ensure your data remains FAIR in the future, after the project ends?
- What preservation strategies will you implement for long-term accessibility?
- Who will maintain the dataset once the project ends?
Privacy¶
- How will your research data be stored safely? Is there need for anonymisation?
What are good practices in relation to this task?
-
Follow online guidelines/templates for writing DMPs. Your institute may already have a template/guidelines that you can use.
-
Use your envisioned dataset documentation templates to guide writing your DMP. This ensures dataset documentation is considered throughout the dataset creation process and allows for more efficient documentation.
- This ensures your DMP includes details on version control practices and proposed usages of the created dataset.
- Examples of dataset documentation are data-envelopes and datasheets (for cultural heritage).
-
Use the FAIR Implementation Profile questionnaire to guide writing your DMP, specifically with regards to what specifications to consider to make your dataset FAIR compliant.
-
Include how harm to other beings and the environment can be identified and mitigated in a timely manner.5
-
Include versioning into your DMP to encourage regular re-evaluation during the project.
Resources¶
Take a look at your institutional (faculty, department) guidelines and/or Code of Conducts regarding DMPs. Below are external resources that may be helpful:
- DMP tool (for filling out templates of various funding institutions): https://dmponline.dcc.ac.uk/
- Examples of DMP templates: https://www.lib4ri.ch/research-data-management-resources
- Resource for best practices for Data Management: https://campus.dariah.eu/resources/pathfinders/dariah-pathfinder-to-data-management-best-practices-in-the-humanities
- FAIR: https://www.go-fair.org/
- FAIR Implementation Profiles (FIPs): https://www.go-fair.org/how-to-go-fair/fair-implementation-profile/
- Alignment FIPs to DMPs: Singh et al. (2025). Aligning Data Management Plans with Community Standards Using FAIR Implementation Profiles. In: Sfakakis, M., Garoufallou, E., Damigos, M., Salaba, A., Papatheodorou, C. (eds) Metadata and Semantic Research. MTSR 2024. Communications in Computer and Information Science, vol 2331. Springer, Cham. https://doi.org/10.1007/978-3-031-81974-2_14 (co-authored by Angelica Maineri and Shuai Wang).
- Data documentation:
- Data-envelopes for Cultural Heritage: https://aclanthology.org/2024.legal-1.9/
- Datasheets for Cultural Heritage: https://openhumanitiesdata.metajnl.com/articles/10.5334/johd.124
- AI Model Research Documentation Sheet (AIRDocS): https://zenodo.org/records/14550113
- The checklist for Collections as Data from the GlamLABS Community: https://www.emerald.com/insight/content/doi/10.1108/gkmc-06-2023-0195/full/html
- Datasheets at DRI: https://repository.dri.ie/catalog/8c980k93k#dri_download_modal_id
- Datasheets for Web Archive: https://blogs.bl.uk/webarchive/2024/11/datasheets-for-web-archives-toolkit-is-now-live.html
- Quality Measures for Datasets: https://datanutrition.org/labels/
Write Ethical Commitments¶
When undertaking this task, what should you consider?
Positionality¶
- How does the social, political, economic, cultural, academic context you are embedded in relate to advancing social justice?
- Are you conducting research from a position of historical privilege? (For example, are you based in a former colonial power studying colonized regions?) How can you then use the benefits of your context to advance the deconstruction of existing power structures?
Transparency¶
- Do you have any ethical concerns about the research? Think of:
- What accessibility barriers might your research create or reinforce?
- How might your research methodologies perpetuate existing inequalities?
What are good practices in relation to this task?
-
Use the format: We acknowledge [injustice]. We are committed to [action to lessen the burden of injustice]. This provides a succinct overview for readers to know what you stand for and how your actions will reflect this.6
-
Include your gut-feelings about the project, including concerns.7
-
Add positionality statements to your Ethical Commitments, clearly communicating to your audience your awareness of the context you work in and what you benefit from.8
-
Include versioning into your Ethical Commitments to encourage regular re-evaluation during the project.
Resources¶
Examples of Ethical Commitments:
- Globalise: https://docs.globalise.huygens.knaw.nl/ethics/policy/
- Enslaved.org: https://enslaved.org/statementofEthics/
- On these Grounds: https://onthesegrounds.org/s/OTG/page/ethical-commitments
Write a Mission Statement¶
Description: A mission statement articulates your project’s purpose, values, and approach. It serves as both an internal compass and a public declaration of your research intentions, ethical stance, and expected outcomes.
When undertaking this task, what should you consider?
Transparency¶
- How active will your intervention be?
- Who will your research impact positively/negatively?
- What is the proposed outcome of the research? Could there be unintended uses and/or consequences?
- What values drive your work and how are they reflected in your work?
- Are there any accessibility restrictions?
What are good practices in relation to this task?
-
Reflect on all questions above and incorporate answers, ideas and concerns into your mission statement. This will ensure expectation management for collaborators and audiences.
-
Outline values/research philosophy that drive your research
- For example: Community, Trust and Time, Knowledge (Co-)Production, Redistribution, Accountability, Interrogating Power, Imagination.10
-
Discuss how active your ethical intervention will be.
- For example:
Contentious language will not be included as preflabels in published datasets, but will feature in public user interface (Globalise).
Contentious language will feature in public user interface, but with a caveat box providing context (Slave Voyages).
Contentious language will be retained for the historicity of sources, but alternative terms will be used to write about the sources (Historical Database of Suriname and Caribbean).
- For example:
-
Include any parts of your research that should be contextualised up front: use of contentious categories, naming, terms, and/or keywords; errors that may be found; choices for use of specific language over others.9
-
Outline predicted outputs.
-
Outline imagined use of your research (data).
- Also include imagines accessibility restrictions (e.g. data can only be accessed onsite, or with an account).
- Also include restrictions, in terms of licensing.11
-
Include ways for audiences (both academic and broader) to contact, engage with and/or provide feedback to the project.
-
Include versioning into your Mission Statement to encourage regular re-evaluation during the project.
-
Acknowledge that responsible research will always evolve, and is therefore never-ending (even after project end). Manage expectations regarding this in your mission statement.
Resources¶
Take a look at your institutional (faculty, department) guidelines and/or Code of Conducts regarding DMPs. Below are external resources that may be helpful:
Examples of Mission Statements:
- Globalise: https://docs.globalise.huygens.knaw.nl/mission/mission_researchthemes/
- One More Voice: https://onemorevoice.org/html/documents/mission_statement.html
- DAIR Institute: https://www.dair-institute.org/research-philosophy/
- Jaap Kunst Collection - Hearing the Indonesian Archipelago: https://jaapkunst.org/about/
-
Adapted from Dataschool Utrecht, DEDA Dataschool (2022). ↩
-
Adapted from Educational Innovation Erasmus University Rotterdam, Positionality (accessed 21 August 2025). ↩
-
Adapted from RDMkit, Ethical aspects (accessed 21 August 2025). ↩
-
For example, as Historical Database of Suriname en the Caribbean (HDSC) have done. ↩
-
Adapted from RDMkit, Ethical aspects (accessed 21 August 2025). ↩
-
See for reference: On These Grounds, Ethical Commitments (accessed 21 August 2025). ↩
-
Adapted from Dataschool Utrecht, DEDA Dataschool (2022). ↩
-
From conversations with Globalise; Kelly and Vicky Breemen, forthcoming publication. ↩
-
Adapted from Jaap Kunst Sound Collection, Methodology (accessed 21 August 2025). ↩
-
Adapted from DAIR Institute, Our Reseach Philosophy (2022). ↩
-
Adapted from Jaap Kunst Sound Collection, Uses (accessed 21 August 2025). ↩