Anthropic : We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models. - Kotaku In Action 2

Anthropic : We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models. (twitter.com)

posted 142 days ago by SophiesBoyfriend 142 days ago by SophiesBoyfriend +28 / -0

19 comments

19 comments share save hide report block hide replies

You're viewing a single comment thread. View all comments, or full comment thread.

Comments (19)

sorted by:

▲ 5 ▼

– 8BitArchitect 5 points 141 days ago +5 / -0

Do you know how LLMs work? There's no 'logic', it's all statistical modeling. What DeepSeek et. al. are doing is taking inputs from specific benchmarks, throwing them into Claude, then training their (smaller, cheaper, faster) models to match the output, bypassing the need to scrape the broader internet (or pay) for training data. Which is (IMO) why Anthropic is actually salty; they paid for publicly available data (to avoid being sued) and someone outside the legal system bypassed that process.

I guess you could call that 'backing out' or 'reverse engineering', except none of these companies actually understands the inner workings of their models (they have billions of parameters and are just too complicated), just the processes used to create them. It's a black box full of linear algebra.

permalink parent save report block reply

▲ 3 ▼

– undecidedmask2 3 points 141 days ago +3 / -0

There’s got to be some pattern they can acquire though? Some way Claude works a problem through a gazillion matrices that makes it superior and worth stealing from?