
Adobe, a leading player in the tech industry, has fully embraced artificial intelligence (AI) in recent years, launching various AI-driven services, including its media-generation suite, Firefly. However, the company is now embroiled in controversy following a proposed class-action lawsuit that accuses it of using unauthorized books for training one of its AI models. The lawsuit, filed by Oregon author Elizabeth Lyon, alleges that Adobe incorporated pirated versions of multiple books, including some of her own, into the training dataset for its SlimLM program. Adobe describes SlimLM as a small language model designed to enhance document-related tasks on mobile devices. According to the company, this model was initially trained on a dataset known as SlimPajama-627B, which was released by Cerebras in June 2023 and is characterized as a deduplicated, multi-corpora, open-source dataset. Lyon contends that her works were included in the pretraining dataset utilized by Adobe. The lawsuit, first reported by Reuters, asserts that the SlimPajama dataset was derived from manipulating the RedPajama dataset, which also includes Books3. This latter dataset is significant, as it contains around 191,000 books that have been utilized in training generative AI systems and has been at the center of various legal disputes within the tech sector. Legal challenges surrounding these datasets are becoming increasingly frequent. For instance, in September, a lawsuit against Apple claimed that the company used copyrighted materials to develop its Apple Intelligence model without permission. Similarly, in October, Salesforce faced a lawsuit for allegedly using RedPajama in its AI training. The trend of such lawsuits highlights a growing concern in the tech industry regarding the use of copyrighted material without proper authorization. In a notable case earlier this year, AI firm Anthropic agreed to a settlement of $1.5 billion with authors who accused it of incorporating pirated works into the training of its chatbot, Claude. This case could signal a pivotal moment in the ongoing legal landscape concerning the use of copyrighted content in AI training data.
Sonder, a fresh entrant in the dating app market, is shaking things up with a sign-up process that some might call inten...
TechCrunch | Apr 01, 2026, 21:30
SpaceX has taken a significant step with its confidential filing for what could be the largest initial public offering (...
Business Today | Apr 02, 2026, 05:45
The future of legacy software might be brighter than anticipated, according to OpenAI's Chief Operating Officer, Brad Li...
Business Insider | Apr 02, 2026, 07:15As technology companies expand their operations, their energy needs have grown significantly, leading to power demands c...
TechCrunch | Apr 01, 2026, 19:05
At the Kennedy Space Center in Florida, an exciting new chapter in space exploration has begun as three American astrona...
Ars Technica | Apr 02, 2026, 24:05