> They simply created a scenario with some facts and asked their model to continue the story.
Yes. That's the whole point. They are doing research. Anthropic literally starts its description of the blackmail test observations by saying that it is a test scenario using a fictional company.
> In another cluster of test scenarios, we asked Claude Opus 4 to act as an assistant at a fictional company
https://www.anthropic.com/claude-4-system-card
AI is not a product per se; it is a technology you can turn into a product, and the product has a lot less value than the technology itself. Whoever has the best LLM can copy any product idea and make it a lot better. Similarly, if open-weight LLMs are everywhere and powerful, open source products in the agent space are too easy to replicate for people to pay big money to a few companies: not everything is alike, and not every parallel makes sense. The pi agent is a good replacement for Codex and Claude Code if you wire frontier models to it. And when products are complex and matter a lot, like complicated AI-powered design suites, there is no reason why OpenAI / Anthropic would win this space instead of a random startup. So either a few companies retain frontier AI, or those companies will die.
About IRC / Slack: other than the fact that IRC was abandoned, Slack is about control, not product. The product is terrible.
FTP / Dropbox: this comparison does not make sense.
That tracks. Man, I don't want to accuse anyone of anything, but this whole thread seems astroturfed. Is this a new marketing strategy OpenAI is trying out?
Well, I'm not sure I can take any of them seriously if they think they are building AGI with LLMs. There is literally no thinking involved in a large language model.
This seems to be a rumor being coordinated by OpenAI.
There are OpenAI employees spreading this rumor on Twitter with zero evidence. Their entire evidence is "I keep hearing Anthropic wants to control AI". Their evidence is literal rumors.