Amsterdam-based BREIN removes Dutch language dataset used in AI model training without permission, investigates involved parties.

Amsterdam-based copyright group BREIN removed a large Dutch language dataset containing info from books, news sites, and subtitles, without explicit permissions. The dataset was used in AI model training, and the group is investigating which AI models used it, with plans to hold the involved parties responsible. This incident raises concerns about the use of copyrighted works in large language models.

August 13, 2024
6 Articles

Further Reading