Summary
- Danish media outlets are demanding that Common Crawl remove their articles from past data sets
- Requests to redact data from Common Crawl have increased, with major news outlets blocking their web crawler
- Compliance with removal requests is driven by the need to keep the nonprofit afloat, despite disagreement
- The clash over copyright and the open web is intensifying, with implications for AI development