Story Details

  • OpenAI says it has evidence DeepSeek used its model to train competitor

    Posted: 2025-01-29 04:21:20

    OpenAI alleges that DeepSeek AI, a Chinese AI company, improperly used its large language model, likely GPT-3 or a related model, to train DeepSeek's own competing large language model called "DeepSeek Coder." OpenAI claims to have found substantial code overlap and distinctive formatting patterns suggesting that DeepSeek scraped outputs from OpenAI's model and used them as training data. If confirmed, this unauthorized use would violate OpenAI's terms of service, and OpenAI is reportedly considering legal action. The incident highlights growing concerns about intellectual property protection in the rapidly evolving AI field.

    Summary of Comments (894)
    https://news.ycombinator.com/item?id=42861475

    Several Hacker News commenters express skepticism of OpenAI's claims against DeepSeek, questioning the strength of its evidence and suggesting the move is anti-competitive. Some argue that one model reproducing another model's output does not necessarily imply direct copying of the model weights, and point to the possibility of convergent evolution in training large language models. Others discuss the difficulty of proving copyright infringement in machine learning models and the broader implications for open-source development. A few commenters also raise concerns about the legal precedent this might set and the chilling effect it could have on future AI research. Several commenters call for OpenAI to release more details about its investigation and evidence.