Introduction to data management at SLU

Page reviewed: 08/06/2026

If you work in research or environmental monitoring and assessment, you probably handle data on a daily basis without considering it as 'data management'. This page explains what data management is and how you can develop your skills in this area.

What is data management?

Data management refers to the handling of data from research and environmental monitoring and assessment (EMA) during the planning and preparation of a project, during its implementation, and after it has been completed. Good data management makes your work easier, adds value to research and environmental monitoring and assessment, and helps you to comply with rules and regulations.

Good data management — meaning data management that complies with laws, principles, and best practice — is not an end in itself. Rather, it is a strategy for making research, as well as environmental monitoring and assessment, more efficient. Good data management also is also a way of preserving and disseminating results so that they can be put to better use. It is also an important part of good research practice.

Good data management:

enables researchers and environmental assessment staff to use their time more efficiently;
makes research as well as environmental monitoring and assessment more transparent;
facilitates the validation of research;
ensures that files and data are stored in a way that makes them easy to find;
reduces the risks of data loss and data breaches;
enables data reuse;
can lead to new collaborations;
demonstrates that public funds are being used responsibly.

Research and EMA (environmental monitoring and assessment) data are data that are collected or generated in accordance with scientific principles for use in various types of scientific analysis. Both types of data come in various forms, such as:

numerical data, such as measurement results
text, such as interview and survey responses
image and audio material
source code
maps and other forms of geographical data
observations, such as species occurrences.

The data lifecycle – from planning to long-term preservation and reuse of data

Data management is important at every phase of the so-called data life cycle. The data lifecycle (see the image below) is a conceptual model that illustrates the various phases of data management, from preparing and planning to publication and long-term preservation. Read more about the different stages of data management below.

Schematic diagram illustrating the research data life cycle. Illustration.

Prepare and plan data management

Planning data management helps you to identify the technical, legal and ethical requirements of a project relating to data. Before a project begins or at the start of the project, it is important to consider data management issues, such as:

What data will the project use?
Will personal or other sensitive data be collected?
How do I share data with my project partners?
How will the data be stored, and who will have access to it during the project?
How will the data be made available and preserved?

A data management plan (DMP) is a key component of data management planning. This document describes how data will be managed during and after a research project, environmental assessment programme or within a research infrastructure. Having a DMP in place makes it easier to anticipate potential problems before they arise, providing the best possible conditions for finding a solution that suits you and your project. A DMP can also serve as an introduction for staff members who join the project once it is already underway.

A DMP is a living document that should be updated continuously.

According to SLU’s data management policy, data management plans are mandatory for all new research and environmental assessment projects started at the university in September 2022 or later.

An increasing number of funding agencies (e.g. the Swedish Research Council, the Swedish Environmental Protection Agency, Forte, Riksbankens Jubileumsfond and the EU) now require projects to develop, maintain and adhere to a data management plan. Even if a funding agency does not make this a requirement, it is a requirement under SLU’s policy.

Check with your funding agency to determine what data management planning requirements apply, or contact SLU’s data management support team (dms@slu.se).

SLU offers an online tool for creating data management plans, making it easier to include all the necessary information.

Read our guide: Set up a data management plan in DMPonline

If you want to write directly in a document, you can use this checklist:

SND's Checklist for data management plans

Reuse data

Reusing data can save time and resources, as well as enabling studies to be validated. It also makes it possible to integrate datasets from different studies and disciplines, opening up opportunities for new collaborations.

There are many ways to discover and search for data, including through colleagues in the research field, scientific articles and literature databases, general search engines and various data repositories and portals.

Sources for searching for and finding data can be found on the SLU University Library's web: Find research data and environmental data.

Consider whether the data you have found is actually useful for your purposes.
Evaluate the data based on its quality and reliability. Ask yourself: is the source reliable? Are established standards used? Is the data sufficiently documented in terms of when and how it was collected and processed?
Check the terms of use and distribution, and ensure that you have obtained all the necessary permissions and consents.

Give credit to the people who collected and made the data available by citing secondary data in the same way as academic articles. The SLU University Library provides guidance on citing data in accordance with SLU’s Harvard style.

Reference list using the SLU Harvard style

You can find more detailed information on Researchdata.se: Reuse and cite data

Document data

Metadata and documentation are required to ensure that data can be validated, understood, found and reused. Documentation refers to descriptions of data intended primarily for human reading (and therefore consisting of running text). Metadata is also a form of documentation, but it must be structured so that it can be read by both humans and computers.

Read more at Researchdata.se: Document data.

Data should be documented at project level, file level and variable level. The relevant documentation varies depending on the research subject and methods. The important thing is to document data in a way that makes it understandable and reusable.

Things that are always important to document include:

How the data was collected (including everything from the sampling equipment and measuring instruments used, to the wording of questionnaires and interview questions).
Where the data was collected.
When the data was collected.
What codes and abbreviations mean.
Any possible ethical or legal restrictions that could limit how the data can be reused.

There are various ways to document data. The most suitable method depends on the scientific discipline and the format in which the data is stored. Different tools and approaches may also be more appropriate at different stages of a project. The following are some examples of approaches and tools:

One option is to use a separate supplementary document, such as a text file named 'readme.txt' and saved in the same location as the data files.
Some file formats support embedded metadata.
electronic laboratory notebooks.
Markdown (e.g. R Markdown in RStudio).
Jupyter Notebook.

It is important to investigate the requirements and options regarding documentation and metadata when publishing data in a research data repository. Please choose a data repository that offers extensive options for describing data.

Documentation should not only cover the final data, but also collection methods, processing, analyses and data handling during the project. This will facilitate both writing articles and describing and making the final datasets available.

Documentation may be published alongside data when the data is published in a data repository, through SLU’s data hosting mission, or when it is made available by other public authorities.

If the documentation is published separately, it should be given a persistent identifier to help users find and refer to it. The most appropriate way to make documentation and metadata available depends on their format. Options include SLU's research and environmental monitoring and assessment database and SLU’s e-archive.

SLU's research and environmental monitoring and assessment database

SLU's research and environmental monitoring and assessment database is intended for publications issued by SLU. It is suitable for manuals, instructions and reports.

SLU's e-archive

SLU’s e-archive can be used to archive documents and assign persistent identifiers to them. The documents may be available either publicly or on request. It is suitable for documents that cannot be published in SLU's research and environmental monitoring and assessment database. To archive a document, please contact Air – The Unit for Archives, Information Governance and Records (arkiv@slu.se).

Please note that documentation of data produced at SLU is considered a public document and must be archived by law. SLU’s e-archive is suitable for this purpose. The same applies to documentation published in repositories and publications registered in SLU's research and environmental monitoring and assessment database. Procedures for the automatic archiving of publications from the publication database are currently under development.

Store data

Data must be stored securely! To prevent data loss and unauthorised access, you should use a secure storage solution that includes backup. You should also keep your raw data and working files separate. This ensures that you always have an intact, unedited master file to fall back on.

It is important to choose a solution that meets your requirements based on the nature of the data to be stored, such as whether it contains personal or other sensitive information.

When choosing a storage solution, there are a few key considerations:

How much storage capacity does the project require?
Is the data active, or will it be stored long term?
Who needs access to the data?
Does the data contain any sensitive information, such as personal data or details about the occurrence of protected species?

Organise data

Data can easily become disorganised, whether you’re working alone or with others. That’s why it’s a good idea to have an organised approach to naming and storing files. Having a system for managing different versions of the same file is also beneficial, as this allows you to track changes and correct errors more easily.

Decide on a system for organising and naming files and folders to save time and avoid mistakes. A logical and consistent system will help you to find the right files quickly and easily.

It is a good idea to document your file management system, especially when collaborating with others. You could place a description of the folder structure and the naming conventions for files and folders in a supporting README file in the project’s root folder, where everyone involved can easily find it.

Read more at Researchdata.se: Organize data.

Files often change, and it is important to keep track of these changes. Saving multiple versions of files and being able to easily access them makes it easier to track changes and correct errors. Versioning can be achieved through the use of file names, tables, or software. Another way to create a system for accessing different versions of a dataset is to make changes in a programme that uses code or scripts.

Please ensure that the original data is not altered. If you need to edit it, create a copy first.

You can read more about documenting data further down this page.

Read more on Researchdata.se: Folder structure, file names, and versioning
Publishing research software openly: Advice and best practices

Publishing and making data available

Making data openly available is an effective way to disseminate the results of research and environmental monitroing and assessment. It allows other researchers, public authorities and businesses to build on existing studies, rather than starting from scratch.

Open data also increases the visibility of related publications. Studies show that scientific articles that link to openly available datasets are often cited more frequently than other articles.

Sharing data openly enhances the transparency of research, making it easier to review, reproduce and further develop the results.

A research data repository is a platform that stores and makes data available in a structured, searchable, reliable and long-term sustainable manner. These repositories may be discipline-, domain- or institution-specific, or more general in nature.

Wherever possible, data should be made available in repositories that provide persistent identifiers for data (e.g. a Digital Object Identifier, or DOI). Ideally, a certified repository should be used.

Swedish National Data Service's repository

SLU is involved in running the Swedish National Data Service (SND) research infrastructure, which offers a data repository that meets legal requirements and those of research funders, for example. The repository can handle sensitive data, including personal data, as SLU has a data processing agreement with SND, and the data published here is also archived at SLU.

The SND repository can be used for many types of data and research subjects, but it also has subject-specific features that support various subject profiles.

Data published in the SND repository is made visible through the national research data portal, Researchdata.se (and other data catalogues and search services including Google and Web of science).

SLU's data management support service can help you publish via SND.

Read our guide to publishing with SND: Publish data in the SND repository and Researchdata.se

Choosing a data repository

The most suitable data repository depends on the type of data to be published. For data from certain research disciplines (such as bioinformatics), a subject-specific repository may be recommended.

The re3data.org website lists repositories where you can share and find research data.

The researchdata.se website provides a guide to choosing one of the repositories that make data visible on researchdata.se: Sharing data: A quick guide

When selecting a repository:

Check your funder's and journal's recommendations and requirements regarding data sharing
Where possible, use a repository that provides data with a persistent identifier, as this makes the data easier to reference, find and reuse.

Data papers are articles published in peer-reviewed scientific journals that describe datasets and the methods used to collect the data. However, the actual data is often stored in a repository (see above) and linked to from the data paper. Publishing a data paper can increase the likelihood that your data will reach a relevant audience and gain recognition.

Data cannot be published openly if it contains any of the following:

Confidential information, such as sensitive personal data
Material protected by copyright, unless permission has been granted
Trade secrets

However, it is still possible to publish documentation and metadata, and this is a requirement under SLU’s policy. In many cases, it may also be possible to publish parts of the data openly.

Remember that even data which cannot be published openly must be archived, and that any request for a dataset will be reviewed.

Personal data

You may publish data containing non-sensitive personal information, provided that you have informed the participants of the research/environmental assessment in advance that this will take place, and provided that a data processing agreement is in place between SLU and the relevant repository.

SLU has a data processing agreement with the Swedish National Data Service. This means that you can deposit data with them even if it contains personal data. If necessary, the dataset can then be subject to restricted access, whilst documentation and metadata remain openly available.

Access to a dataset containing sensitive personal data may be restricted, while the documentation and metadata remain freely available. In many cases, however, it is possible to publish parts of the data openly or to aggregated the data before publication.

Clear terms and conditions that govern how data can be used are important for facilitating its reuse and are a key part of the FAIR principles (see Guidelines and principles).

The FAIR principles recommend applying a licence, but licences require the publisher of the dataset to be the copyright holder. Data itself is not protected by copyright, and licences can be difficult to apply to datasets because they presuppose the existence of such rights. Therefore, it may be better to use a marker instead.

You can find guidance and recommendations on licensing when publishing data in our FAQ:

What license should I choose when publishing data via SND (Swedish National Data Service)?

Please remember to include your affiliation in the metadata when publishing data. Follow SLU’s guidelines, which can be found here:

Many research funding bodies now require the results and underlying data from funded projects to be made openly available. Scientific journals are also increasingly requiring the underlying data to be made available and linked to from the article. In the case of SLU’s environmental monitoring and assessment, there may also be legal obligations to share data.

Sweden and other EU countries have decided that data from publicly funded research should be published as openly as possible, subject to restrictions where necessary for legal, ethical, security or commercial reasons.

Read more about SLU’s data management policy and the guidelines and principles we follow in the ‘Guidelines and principles’ section further down the page.

Archiving and preserving data

As a Swedish public authority, SLU is required by law to archive public documents, including research data. The purpose of archiving is twofold: to maintain order in public documents and to provide the public with access to them in accordance with the principle of public access (unless confidentiality applies; see 'The principle of public access' under Data processing in accordance with the law further down this page).

Each department is responsible for ensuring that public documents are properly managed (registered, described, and stored securely). The Head of Department has overall responsibility. Day-to-day work is carried out by the person in the department responsible for registration and archiving (the RA role). The head of department can inform you who this person is.

Every member of staff is responsible for ensuring archiving is possible. Anyone who handles applications, contracts, agreements, research data, reports or articles in the course of their duties must ensure that these can be registered and archived.

If you publish data in the Swedish National Data Service’s repository, archiving at SLU is included in the publication process.

SLU has local archives at some departments and faculties, but research data must be archived in the university’s central e-archive. While the RA at your department can help you register your data, the transfer to the central e-archive also requires the involvement of SLU’s Unit for Archives, Information Governance and Records. They can also advise you on preserving research material, including how to describe it and which file formats to use.

If you publish data in the Swedish National Data Service’s repository, archiving at SLU is included in the publication process.

Research data is not the only thing that should be archived within a research project. Applications, contracts, project budgets, data management plans, reports and articles should also be archived.

Contact details for the Unit for Archives, Information Governance and Records.
Read more in the manual for managing research material: Guide for data archival and appraisal (in Swedish).

In order to ensure that files can still be read in the future, it is important to choose a suitable file format. Ideally, this should be based on an open standard, be independent of any specific software and be openly documented.

Read more about file formats for long-term preservation at Researchdata.se: File Formats.

Data processing in accordance with the law

A wide range of laws, regulations and other rules apply to data management at SLU. These include the Freedom of the Press Act, the Public Access to Information and Secrecy Act, the General Data Protection Regulation, the Archives Act and the Open Data Directive. Familiarising yourself with the basic provisions is important for managing scientific data responsibly.

As a public authority, SLU is subject to the principle of public access to official records. This means that data from research as well as from environmental monitoring and assessment usually are public documents and are disclosed upon request. The only exception is where grounds for confidentiality exist under the Public Access to Information and Secrecy Act. SLU’s legal team determines whether confidentiality grounds apply at the time of the request. The person requesting public documents always has the right to have any refusal reviewed.

One reason for maintaining confidentiality may be that the data contains sensitive personal information.

Read more on the staff web: Legal affairs

Scientific data is one of SLU’s most valuable assets, and it is crucial that we protect it. Therefore, when handling data, it is essential to consider information security. Information from SLU must be accurate and accessible to authorised users, while remaining hidden from unauthorised individuals. While scientific data does not always need to be kept confidential, we must protect it to prevent data loss and breaches.

The level of protection required depends on various factors, including what is needed to restore or reconstruct a dataset. As research data may be sensitive for various reasons and to varying degrees, the need for protection may change during the course of the work, when the data is made available and subsequently preserved and archived. For example, legal or ethical reasons may prevent data from being made openly available. This could be because the data is subject to copyright (if it contains photographs, for instance), or because it requires confidentiality (for example, to protect endangered species, sensitive personal data, or trade secrets). While it is in active use, such information may need to be kept more secure, but it should be archived in the same way as other research data, with a clear description.

The first step in protecting scientific data is to carry out an information classification. Read more on the staff web (login required): Information security

Find out more about sensitive data at Researchdata.se: Protected data

Personal data refers to information relating to an identified or identifiable natural person. Research and environmental monitoring and assessment data often contains personal data.

However, this is not a problem, as SLU is permitted to collect, use and archive personal data, provided that we process it correctly and have a legal basis for doing so. Special requirements apply to the processing of sensitive personal data, such as information about a person’s political views. For example, an ethical review is required before sensitive personal data can be used in research.

Please note that a dataset is still considered to contain personal data, even if names have been replaced with IDs and retained internally. To ensure that the data no longer contains personal information, it must be anonymised. This involves irrevocably severing any link between the data and an individual. If it is possible to identify a person through supplementary information, such as a code key or register, the data is considered pseudonymised personal data.

Find out more about personal data in research data:

Guidelines and principles

SLU’s data management policy aims to improve the quality, dissemination, impact and innovative potential of the university’s research and environmental monitoring and assessment. It sets out principles for data management at SLU, including those relating to storage, publication and access. The policy also emphasises the importance of good data management in research and environmental monitroing and assessment.

The data management policy is based, among other things, on legal requirements and Sweden’s national guidelines for open science.

For example, the policy states that data from research and environmental monitoring and assessment should be made as open as possible, that the FAIR principles should be adhered to, and that a data management plan should be drawn up for new projects.

This policy applies to all types of digital data produced or processed during research or environmental monitoring and assessment at SLU.

Read a summary or download the full policy here: SLU's data management policy.

SLU is an advocate of open science, as reflected in its data and publication policies, among other things.

SLU is also a member of the Swedish National Data Service (SND), a national infrastructure which helps researchers make all types of digital research data available, including via the portal Researchdata.se.

Data management support at SLU

SLU's support team assists staff with data management in research and environmental monitoring and assessment. Our team has expertise in various aspects of data management, as well as in academic publishing, information management, archiving, IT, law, information security, research, environmental assessment and research funding.

Read more on the staff web: Data management support for research and environmental assessment

Data management guides on the SLU University Library website

Data protection and information security on the staff web

Frequently asked questions about data management

Data Management FAQ

SLU's data management support for research and environmental assessment

dms@slu.se

Introduction to data management at SLU

Content on page

What is data management?

The benefits of good data management

What are research data and EMA data?

The data lifecycle – from planning to long-term preservation and reuse of data

Prepare and plan data management

Data management plans

Requirements from research funders

Writing a data management plan

Reuse data

Considerations for reusing data

Document data

What needs to be documented?

How should the data be documented?

Where should documentation and metadata be made available?

SLU's research and environmental monitoring and assessment database

SLU's e-archive

Store data

Choosing a storage solution

Organise data

Files and folders

Versioning

Publishing and making data available

Publish in a data repository

Swedish National Data Service's repository

Choosing a data repository

Publish a data paper

Can all data be published openly?

Personal data

Licences and markings

State your affiliation with SLU

Requirements from funders, journals and legislation

Archiving and preserving data

Who is responsible for archiving?

How and where can I archive scientific data?

File formats for long-term preservation

Data processing in accordance with the law

The principle of public access to and disclosure of official documents

Information security and protected data

Handle personal data correctly

Guidelines and principles

SLU's data management policy

Open science

The FAIR principles provide guidance on optimising the reuse of data

Data management support at SLU

Guides and other resources

Data management guides on the SLU University Library website

Data protection and information security on the staff web

Frequently asked questions about data management

Contact