Research Data Management (RDM) and Open Science
Aalto University offers comprehensive services, guidance, and support to help you manage your data efficiently. Explore our collection of resources and external links to boost your research.
Research data is any material that informs your research and validates its findings. This might include results of experiments, measurements, observations from fieldwork, survey results, interview recordings, images or code. This data might be collected as a part of the research (e.g. measurements), produced within it (e.g. code) or it might be existing data is reused for the purposes of your research project (e.g. census data). To understand how to best manage your data, it is useful to take a moment to make a list of what exactly you are working with. Make a list for yourself of:
Gathering this information into a table gives a good overview of your data:
Data type | Origin of data | File format | File size |
Interviews | Collected in project, audio recordings | mp3 for audio recordings txt after transcribing |
2GB raw audio 150 text |
Census data | Existing data reused, Statistics Finland, open under CC-BY license | csv | 500KB |
Measurement spectra | Collected in project, XPS | vms | 3-5MB |
Most often research data will be in digital form and stored on servers. The storage should be big enough fit your material and it should create automatic backups of your work to prevent data loss. The storage should also be secure and protected against unauthorised access to your work. Exactly how secure depends on what type of data you are working with.
Data can be divided into four groups based on who is allowed to have access to it: public, internal, confidential and secret. Health data for example is secret and will require strict safety measures and access control. In addition to choosing a storage option that is safe enough, consider also where your data is stored during the gathering and analysis stages. Gathering or analysing data might require uploading it onto a software e.g. collecting interviews on Zoom and uploading them into Atlas.ti for decoding. The software solutions used should be approved by Aalto IT for your data type. Details of what the four grops of data classification mean in practice and what software is safe for which data type.
It is also worth considering whether you might want a storage option that allows secure sharing of your data with your supervisor or collaborators. Aalto University's IT services offers many storage options to choose from:
Your research group or department might be using Aalto's shared storage servers. Most often, these are already set up and you are granted access by the manager of the storage space, e.g. your supervisor.
Be sure to use your Aalto institutional account when logging into these services.
When you are gathering your data it is useful to keep notes of how exactly the data was gathered. This might be noting down the calibration of equipment used for the measurements, so that you can replicate the experiment later. If you are working in a group, other team members need to understand how the data was created to interpret it correctly. Consider what information is integral to understanding your data and consider where this information might be stored.
Tips for keeping track of your data:
There are rules that define how certain data types can be used and handled, and it is integral that you know if these apply to your data. These rules might be set in agreements with corporate partners or in legislation for example regarding the handling of personal data. also set some rules regarding what kinds of research setups with human subjects require ethical pre-review. Similarly there are ethics pre-reviews required for medical research and research with animals. Find out whether your research will need an ethical pre-review.
In general, research at Aalto University follows the . These outline good research practices like agreeing on authorship early on and reporting research findings in an honest and transparent way. In addition to the national guidelines, EU projects follow the which are closely aligned.
Have a look through the examples below and consider if you are working with this type of data. If so, set aside some time to familiarise yourself with the rules and what exactly they mean for your work, e.g. preparing a privacy notice or checking that your storage solutions are secure enough. These may be a bit daunting, but Aalto has an entire team of experts on hand to help you meet the legal and ethical demands of your data. You can reach them at researchdata@aalto.fi.
Personal data is any data that can be used to identify a person. In straight forward forms, it might be a person's name, address or a photograph of them. It could also be a handful of characteristics that when put together make a person identifiable, e.g. three data points: Aalto employee, nationality Swedish, field of research acoustics. Patient records and location data can all be used to identify a person. As a rule of thumb, if you are gathering data from human participants, it is best to assume you will be gathering personal data.
If handling personal data of any kind, follow Aalto University's advice on handling of personal data.
When conducting research with human participants, some research setups require ethical pre-review. These include research with vulnerable groups or medical research. Find out whether your research will need an ethical pre-review. Beware that an ethical pre-review, as the name suggests, can only be done before data collection has started.
Consider whether there is risk that your experimental results could be used for military purposes such as weapons and their delivery systems, or for terrorism or abuse of human rights. Could there be a risk to national security, defense or public security? If you suspect your data could be used to develop such technology or that these risks might apply, please refer to the Aalto instructions for dual use and request support from Aalto's specialist team.
Consider whether your data collection method or publishing your data could cause harm to the environment or biodiversity. This might for example include a risk of releasing toxic substances or revealing the location of endangered species.
If so, consider keeping problematic data confidential. Ethical review focused on environmental issues is currently not a standard procedure in Finland. For self-evaluation, you can work with the Do No Significant Harm principle.
Consider whether your research project will involve the development and/or the use of systems or techniques based on artificial intelligence (AI)?
If so, especially if working with data from human subjects, there may be ethical issues regarding for example privacy and data protection, avoiding bias, or issues of transparency and accountability for AI systems and their outcomes. More guidance of working with this type of data.
Consider whether there are any contracts in place that determine how your data should be handled or whom it can be shared with. These might be confidentiality agreements with company collaborators or restrictions to use if reusing existing datasets.
The ownership of data should be agreed upon as early as possible if working with external partners.
Whilst the end of your project might feel far away, it is worth considering what will happen to your data early on as it affects e.g. the information in your privacy notice. Looking at your list of data, consider where your various data types fall in the categories below:
Note that once your time at Aalto ends, so does access to cloud services and other storage solutions.
Once these have been considered, you have in essence created a data management plan or in short DMP. A DMP is a document that speficies how data will be managed in a research project, often with a focus on publishing datasets. Funders are increasingly requiring researchers to prepare a DMP as assurance of good data management practices and to encourage opening of datasets with an aim to increase the openness of science and benefit the wider research community.
Funders have their own DMP templates and requirements. More on information on funder requirements. Funder requirement often mention FAIR data, which is an acronym for data that is findable, accessible, interoperable and reusable. The FAIR principles provide good guidance for data management in general, but are most useful when creating datasets that will be shared with others.
Aalto University has a that includes questions on data management as well as guidance for answering these.
Aalto University offers comprehensive services, guidance, and support to help you manage your data efficiently. Explore our collection of resources and external links to boost your research.
Create a Data Management Plan (DMP) to ensure your research data is high-quality and FAIR: findable, accessible, interoperable, and reusable.
Meet your Data Agents — researchers offering hands-on support on data management.