Data Scraping
-
New York Times Sues Perplexity Over Copyright Infringement
The New York Times sued AI startup Perplexity in the Southern District of New York, alleging it scraped the newspaper’s articles, videos and podcasts to produce answers that are “identical or substantially similar” to its copyrighted content. Perplexity, known for its AI‑powered search engine, denies wrongdoing, likening the case to historic tech disputes. The suit follows other high‑profile copyright actions against AI firms, raising questions about licensing, data‑sourcing practices and the future cost structure for generative‑AI development.
-
Reddit Accuses Perplexity of Plagiarism, Escalating AI Data Rights Dispute
Reddit has sued AI search engine Perplexity, data scraping service Oxylabs, AWMProxy, and SerpApi, alleging unauthorized scraping of user-generated content to train AI models. Reddit claims Perplexity illegally used its data, while Perplexity denies training models on Reddit content and accuses Reddit of “extortion.” This lawsuit, following a similar one against Anthropic, underscores the escalating battle over data rights in the AI era and the growing tension between content platforms and AI developers. The case could set precedents for data licensing and AI training practices.