91青青草

Services

Artificial intelligence and copyright

For AI system users, the main questions concern the right to use works as input data in an AI system and how to cite the output generated by an AI system.

For developers of AI models and AI systems, copyright issues concern the right to use works as training data for AI models and the transparency obligations regarding the works used as training data.

Relevant AI Act and copyright legislation is further explained in the last section.

1. Using AI systems

Aalto AI Assistant, the improved UI in Fall 2024

Aalto AI Assistant

Your personal AI assistant.

Services
Aalto logo

Guidance for the use of artificial intelligence in teaching and learning at Aalto University

Guidance for the use of artificial intelligence in teaching and learning at Aalto University.

Services

2. For AI model and system developers

The use of text and data in AI systems and models is regulated within the EU by the text and data mining exception. The EU Directive on Copyright in the Digital Single Market (Directive 2019/790), known as the DSM Directive, includes provisions on text and data mining. The DSM Directive defines text and data mining as an automatic analytical technique aimed at analysing digital text and data to produce information such as patterns, trends, or correlations.

Text and Data Mining for Scientific Research at Universities 

The text and data mining exception in the DSM Directive and the Finnish Copyright Act allow for the use of works as training, validation, and testing data for AI models, as well as the use of works as input data for AI systems. Text and data mining can be conducted unless the authors have explicitly and appropriately reserved this right.

 If text and data mining is conducted for scientific research at a university or in other research or cultural heritage institutions, rights holders within the EU/EEA cannot contractually prohibit the university's right to text and data mining. An AI system can be used for text and data mining. The AI system used for text and data analysis can also be a service provided by a commercial provider. However, the mining exception cannot be used to grant a third party company providing the AI system the right to use mined works to train the AI model.

The text and data mining exception does not contain the right to publish the mined dataset. However, the dataset may be stored for scientific research and verification purposes. The mandatory exception for text and data mining applies to employees of Aalto University and researchers affiliated with the university. Therefore, the data mining exception benefits not only university employees but also students, emeritus/emerita researchers, and grant researchers. The research must be conducted in collaboration with Aalto University and meet the criteria for scientific research.

General Data Mining Can Be Prohibited  In addition to the exception for text and data mining for scientific research, the DSM Directive and Finnish Copyright Act also include a general exception that allows the use of material for training data and analysis using an AI system. Rightsholders can prohibit this  data mining. If it concerns a website, the prohibition to use the site's material for data mining must be expressed using machine-readable means. The European Commission has published a draft code of practice . This code of practice explains on a practical level  on how data miners should comply with prohibitions for text and data mining. Websites conducting data mining must comply with the prohibitions expressed using the  . The rightsholder can  ensure compliance for the reservation of rights when using robots.txt on their website.

Transparency requirements for general-purpose AI models in the AI Act

If a general-purpose AI model, developed in a research project, is placed on the  EU market  in the course of commercial activities, the provider must create and make publicly available a sufficiently detailed summary of the content used for training the general-purpose AI model in accordance with the model provided by the AI office. This obligation is outlined in Article 53 of the EU AI Act. The EU Commission has published a draft for the template   . The template will be ready before the Article 53 obligations for general-purpose AI models will apply 2.8.2025.

General-purpose AI model is defined in the AI Act as  an AI model, including where such an AI model is trained with a large amount of data using self-supervision at scale, that displays significant generality and is capable of competently performing a wide range of distinct tasks regardless of the way the model is placed on the market and that can be integrated into a variety of downstream systems or applications, except AI models that are used for research, development or prototyping activities before they are placed on the market;

The AI Act does not apply to AI systems or models developed and deployed exclusively for scientific research. The AI Act does not apply to research, testing, and development activities concerning AI systems or models  before their placing on the EU Market or before AI systems are put into service in the EU. However, the AI Act applies to testing  AI systems in real-life conditions.

3. AI Act, copyright legislation and AI

This service is provided by:

Legal Services

Did you find what you were looking for? If not, please contact us.
  • Updated:
  • Published:
Share
URL copied!