This just made any closed LLM a huge supply chain risk. Everybody was aware of this possibility, but now it actually happened. It's like having nuclear weapons vs. firing a nuclear weapon.
Especially outside the US customers are going to be very hesitant to keep adopting LLMs from US companies.
> Especially outside the US customers are going to be very hesitant to keep adopting LLMs from US companies.
Not really. There aren't any other choices, and the PRC also heavily utilizes export controls [0].
This is why sovereign AI has become important, as can be seen with EU NatSec uses cases tending to use Mistral [1] and Indian governments starting to use Sarvam [2].
That said, for most commercial usecases, older generations of Opus as well as enterprise grade GPT and Gemini are fairly good.
The distilled OSS models are alright for hobbyists but if you have actually used unrestricted and enterprise grade versions of Claude, Mythos, GPT, and Gemini (most hobbyists don't get access to these) you see how far behind the open weight models are.
Even in China, traditionally open minded models teams like Alibaba's Qwen are looking to become more restricted given the org changes [3].
Also, Corporate RFCs now demand final say on model used and depending on the geo, this can be a dealbreaker (eg. An American financial institution will absolutely blacklist a vendor if they use a Chinese model and same in reverse and European defense vendors mandate sovereign EU models depending on the opportunity).
> if you have actually used unrestricted and enterprise grade versions of Claude, Mythos, GPT, and Gemini you see how far behind the open weight models are.
I really do feel like DeepSeek V4 Pro is often better than current Sonnet is, in the general case.
Opus 4.7 is a solid step above Sonnet, and Fable was a solid step above Opus 4.7. I've only had Fable for a few days, obviously, but I was decently impressed after Opus 4.8 being a downright disappointment for me (it's just too buggy; I had it go out of control 3 separate times on things Opus 4.7 never had any trouble with.) I still ran into limitations. It's not world-endingly great.
So, based on that, I think DeepSeek V4 Pro is, ignoring multi-modal capabilities, about a couple solid steps behind. Assuming model iteration will continue to decelerate, especially as Anthropic heads into IPO, I'm guessing that DeepSeek will probably be able to strike back with something further along. Of course we'll see how able and willing they are to stay open weight, but they've done well so far so, no reason to doubt them at the moment.
(There are some models that claim to be ahead of DeepSeek V4 Pro. I've tried some of them and really not been that impressed. Maybe it's a me issue.)
Now I reckon that most people just simply don't really need Mythos/Fable for most of what they do and using Mythos/Fable tokens in place of Sonnet-tier models would not make any sense. At my job we already mostly just use Sonnet as it is. I'm sure there is some cutting-edge research where you want the absolute best model available and sure, in that case, you're stuck with Anthropic for the moment.
But is that really everyone? After all, while Mythos was dominating the hype cycles, quite a lot of impressive LLM-assisted CVEs dropped that were not linked to Mythos.
Compute was constrained. There is a lot happening, especially with chinese chips which currently points to a massive upcoming increase in non-US capacity.
Also, the EU, Japan, SK, ASEAN, and India are not supportive of using Chinese tech after China export controlled rare earth exports last years [0].
Software supply chain regulations also make utilizing Chinese software risky for ExChina players and make using ExChina tech risky for Chinese players.
Expect to see RFCs now demanding visibility into what models are used and right of refusal - this is already the norm in F1000s. Similar ones are likely to arise in the EU as well with some of the upcoming industrial policy changes being proposed.
you think its that hard to get trade secrets from some openai or anthropic engineer if you promise them anonymity and a new better paid position? hell they might even give it away for free if they think what their company is doing is morally wrong. know how is not source code, you cant catch it with dlp or online leak scanners. you would need 24/7 combined human and electronic surveillance and thats something even the cia reserves for top level targets, it takes too much manpower to use it on everyone.
SMIC hired hundreds of TSMC employees and now its a couple years away from 3nm equivalent chips in full production. export controls only work against poor countries with less advanced industry like russia. china has the resources and export controls give the motivation. and if the eu/us relations get even worse i wouldnt be surprised if the dutch government let asml start selling euv machines to everyone just to get back at trump.
If you’re talking about TSMC Arizona they aren’t fabricating at N3 until end of next year at the earliest, N2 isn’t slated until “end of decade”. I think they’re manufacturing Blackwell there which is N4 / 4nm
They've already been labeled a "supply chain risk". Probably not a good idea to upset the regulators more. Maybe tomorrow Opus will be declared too dangerous for the public.
You're mistaken, this is a cratering of the userbase inside and outside of the US. The ban is on any foreigner whether abroad or living in the USA, so Anthropic has no choice but to completely shut down access to the model for the whole world including the US.
Their IPO is well and truly fucked now. This also means no other frontier lab in the US is allowed to exceed Opus 4.8 capabilities.
If you're a luddite or a decel you should literally be dancing in the streets right now. And, if you're a tankie you'll be dancing right next to them. And, if you were hoping for a Star Trek-like future, you just adjusted your timeline for the worse.
>this is a cratering of the userbase inside and outside of the US.
Is it really? It was limited release anyway (like hypebeast merch!). Everything people are gonna talk about for a week is gonna be about how Fable was so cool that it got banned by the feds. If it's just the Trump admin being the Trump admin, Amodei is just gonna have to pay up as a racket / marketing expense. Or it is like I'm suspecting and this was pre-bribed and the ban is kabuki theater.
>And, if you were hoping for a Star Trek-like future, you just adjusted your timeline for the worse.
The funny thing is that solar and batteries advancements are actually this, not LLMs, but your framing kinda fits anyways.
The main error of the AI bubble is expressed in the The Jetsons cartoon from the early 60s.
In the future, everyone obviously would be running nuclear powered cars. It was just an engineering problem to be solved. Ford made the Ford Nucleon prototype in 1958.
The nuclear optimism completely blinded people to the ridiculous idea of an individual handling nuclear material for personal use.
The AI bubble error is this idea that everyone is going to have "AGI" in their pocket. It is just a completely absurd idea that is not going to happen.
Fable was interesting from what I tried but nothing close to AGI yet here we are. The models don't get smarter and LESS restricted from here.
To me, right away it seemed that the "Mythos moment" was extraordinarily bearish for the assumptions the AI bubble is built on.
Is it now? From a company's point of view, does it really matter that some expensive tool is allegedly good or not if it's reliability/availability is poor and subject to completely arbitrary and unpredictable change?
And they don't have to actually serve expensive model compute and this all goes away once they contribute to the right charitable organizations and patriotic causes funneling money to the right people.
This is quite clearly corporate capture of the white house by a competitor influencing policy, but it's hard to imagine something that plays more into anthropic's hand. They now own the model that was so good the US government made them shut it off.
it may be really good pr, but it's really bad for their IPO. If their market for future models is usa only, their potential revenue is cut by 50% at least. (and it's even worse because it means Europe, India, and China will all have companies making their own models that anthropic needs to stay ahead of)
Another sibling thread already called this out, but mentioning here: it's not "USA only", it's "US citizens only" (and I'm not entirely sure how dual-citizenship interacts with this, but I assume you can't sell to them, either, since they are by definition also foreign nationals). A private company only being able to do business with folks they can verify are solely US citizens (who themselves are also willing to submit verification of said citizenship to a private company), has a relatively small pool of potential users.
And so if this policy holds, Anthropic has functionally had Fable killed by government intervention, and in a logically consistent world, this would imply all other US-based AI labs may also never exceed existing (read: Opus) capabilities.
Regarding the dual-citizenship, you are wrong to assume that. To US government you are US citizen and that is all that matters, even if you have 5 different citizenships government and justice system don't care, you need to follow the US laws and can't cherry pick what you want. Regarding users, for any of this big 3 (Alphabet, Anthropic, OpenAI) only important customers are enterprises, not individual users.
I don't think so, most of enterprise customers are US based companies. They will basically give Mythos to US citizens in R&D while others will use Opus. I hope this is not the actual intent.
How many entreprise customers that aren't in the defense sector currently have R&D departments entirely composed of US citizens?
And what does it mean for indirect access to the models, through say agents working off ticket systems.
The problem here is that the valuations of these AI companies was based on the fact that they'd keep improving models. A company that just serves the latest Opus isn't worth trillions.
You think Anthropic will ask all their enterprise customers to provide passports for all their employees and then setup individual Claude accounts for each and every employee to gatekeep access to Mythos? Because a plain ole api key no longer cuts it
People talking about "flow" and using AI as a "tutor" are missing the point entirely. The beauty of AI is that you don't need a tutor anymore, and you don't need flow. It's like asking how best to use a combustion engine to improve your horse riding ability.
HN rewards engagement in the form of upvotes. If you make a post or comment with the intention of getting upvotes, you're engagement farming. I hate to break it to you, but there are many people doing that on this website.
Why do you think all engagement farming would result in downvotes? You think HN posters are so keen at sniffing out bad actors that they would never reward them by accident?
It feels like you're being willfully dense about this. I can see why the other person accused you of being AI.
>It feels like you're being willfully dense about this. I can see why the other person accused you of being AI.
No, you fucking idiot. Not everybody who disagrees with you is AI.
"Engagement farming" is a term of art with a specific meaning, basically just the adult way of saying "ragebait". People engagement farming on IG or TikTok are typically doing so by posting deliberately controversial content designed to troll upset viewers into writing upset comments. It is inherent to the technique that it only works on platforms without downvotes.
You don't see anyone "engagement farming" on Reddit either, it's always specifically "karma farming" because farming karma requires positive engagement. Engagement farmers do not care if the engagement is positive or not, the required approaches are completely different.
You're conflating engagement farming and rage-baiting. Engagement can be both positive and negative. I can't find any online sources that agree with your definition.
Here's one that I think sums it up pretty well:
"Engagement farming refers to a range of deceptive practices on social media designed to artificially inflate engagement metrics such as likes, shares, comments, and followers."
"Engagement farming employs various tactics to exploit social media algorithms, with the intent to appear more popular than actual user interest would warrant. Examples include posting controversial content to provoke emotional responses, repurposing successful posts without originality, and using automated systems for mass liking or following." [1]
If you don't think this is happening on HN (especially to mass downvote posts) you're naive.
Maybe look at how people actually use it? Not sure why the Indian blogspam is worth looking at.
"Engagement" has a specific meaning. It's different from "like farming" or "karma farming". Some platforms specifically reward engagement, making it a reasonable thing to farm on those platforms.
> using automated systems for mass liking or following
Clearly the author was clueless, this has nothing whatsoever to do with engagement farming.
HN has been very pro-AI over the last several years. It's only swung back slightly the other way recently. I suspect this is due to tensions in the gulf causing some institutions to reallocate their investments, which results in reduced bot activity.
It's the opposite. There's documented evidence of China recently funding anti-AI bots in the west in an attempt to weaken the US's AI lead, I imagine that such bots also inhabit HN.
Think about it. Anthropic just reported that their codebase is now improving itself. We're moments away from every open source repo being able to do the same. Think of it like torrenting — you'll be able to open your repo to the public, and have a stream of code flow in from millions of contributors. More code than you could ever write in ten lifetimes, uploaded to your repository in a matter of days.
Ladybird doesn't know it yet, but they just left themselves in the dust.
> More code than you could ever write in ten lifetimes, uploaded to your repository in a matter of days
I wonder if any of you AI boosters have any actual knowledge of how software is written, why bugs exist
and how to mitigate tech debt. I wonder if you guys even know what tech debt is.
Somehow it’s always very young accounts which made me wonder if I’m talking to exuberant teenagers or people that have done a one week Python code camp 10 years ago and now think they’re John Carmack with their Claude subscription.
Is your point that Anthropic is now accepting code submissions to Opus 5.0 from “millions of contributors”? No?
Ladybird uses AI to code. This is not them banning AI. This is them not wanting to take responsibility for the code WE write with AI. They think outside code contributions are raising their risk and slowing them down.
I dislike this change but it does not track to what you are saying at all.
Like always with Open Source, if you really believe what you are saying, fork the project. Outrun them if you can.
> open your repo to the public, and have a stream of code flow in from millions of contributors. More code than you could ever write in ten lifetimes, uploaded to your repository in a matter of days.
It would be free labour! Truly crowd-funded development.
I'm picturing something like folding@home, but where people donate their spare tokens to a service, and those tokens get distributed amongst all open source projects on GitHub. You don't think that would be cool? Like, someone might initialise a repo with only a readme and a to do list before they go to sleep, and then wake up to a complete software ecosystem that looks as if it's been in development since before they were born. Like, so much code that no one person could possibly understand it, and it all happened overnight while they were sleeping!
Just donate your tokens to the project. The actual team that’s actually leading the project can direct the prompts better and evaluate the LLM-generated code better for their project than random drive-by contributors can. That’s the whole point of their announcement.
It's pretty crazy that a company like Anthropic no longer needs to hire Software Engineers, because their software engineers itself. If that's not a break through I don't know what is!
edit: it looks like I was wrong and they're still hiring many software engineers. Not completely sure why that is just yet.
> So there's a huge number of HN posters claiming that the price of tokens will go UP over time rather than down (that's how Moore's Law works, right???)
I mean, Github Copilot's pricing just went up considerably, so I guess they were right?
reply