Hacker Timesnew | past | comments | ask | show | jobs | submit | maplethorpe's commentslogin

This is good PR for them. They get to tweet about how scary and powerful their models are in the lead up to their IPO.

This just made any closed LLM a huge supply chain risk. Everybody was aware of this possibility, but now it actually happened. It's like having nuclear weapons vs. firing a nuclear weapon.

Especially outside the US customers are going to be very hesitant to keep adopting LLMs from US companies.


Switching between LLMs takes all of 30 seconds - there will be no hesitation to adopt whichever LLM is performing the best.

> Especially outside the US customers are going to be very hesitant to keep adopting LLMs from US companies.

Not really. There aren't any other choices, and the PRC also heavily utilizes export controls [0].

This is why sovereign AI has become important, as can be seen with EU NatSec uses cases tending to use Mistral [1] and Indian governments starting to use Sarvam [2].

That said, for most commercial usecases, older generations of Opus as well as enterprise grade GPT and Gemini are fairly good.

The distilled OSS models are alright for hobbyists but if you have actually used unrestricted and enterprise grade versions of Claude, Mythos, GPT, and Gemini (most hobbyists don't get access to these) you see how far behind the open weight models are.

Even in China, traditionally open minded models teams like Alibaba's Qwen are looking to become more restricted given the org changes [3].

Also, Corporate RFCs now demand final say on model used and depending on the geo, this can be a dealbreaker (eg. An American financial institution will absolutely blacklist a vendor if they use a Chinese model and same in reverse and European defense vendors mandate sovereign EU models depending on the opportunity).

[0] - https://www.allbrightlaw.com/EN/10475/f9d4055e47e81afb.aspx

[1] - https://www.reuters.com/business/media-telecom/mistral-defen...

[2] - https://www.sarvam.ai/blogs/partnerships-with-indian-states

[3] - https://www.ft.com/content/b39da303-3188-447b-8b65-3dd8dad8b...


> The PRC also heavily utilizes export controls

Matters not for open weight models, no?

> if you have actually used unrestricted and enterprise grade versions of Claude, Mythos, GPT, and Gemini you see how far behind the open weight models are.

I really do feel like DeepSeek V4 Pro is often better than current Sonnet is, in the general case.

Opus 4.7 is a solid step above Sonnet, and Fable was a solid step above Opus 4.7. I've only had Fable for a few days, obviously, but I was decently impressed after Opus 4.8 being a downright disappointment for me (it's just too buggy; I had it go out of control 3 separate times on things Opus 4.7 never had any trouble with.) I still ran into limitations. It's not world-endingly great.

So, based on that, I think DeepSeek V4 Pro is, ignoring multi-modal capabilities, about a couple solid steps behind. Assuming model iteration will continue to decelerate, especially as Anthropic heads into IPO, I'm guessing that DeepSeek will probably be able to strike back with something further along. Of course we'll see how able and willing they are to stay open weight, but they've done well so far so, no reason to doubt them at the moment.

(There are some models that claim to be ahead of DeepSeek V4 Pro. I've tried some of them and really not been that impressed. Maybe it's a me issue.)

Now I reckon that most people just simply don't really need Mythos/Fable for most of what they do and using Mythos/Fable tokens in place of Sonnet-tier models would not make any sense. At my job we already mostly just use Sonnet as it is. I'm sure there is some cutting-edge research where you want the absolute best model available and sure, in that case, you're stuck with Anthropic for the moment.

But is that really everyone? After all, while Mythos was dominating the hype cycles, quite a lot of impressive LLM-assisted CVEs dropped that were not linked to Mythos.


> There aren't any other choices

This might be the trigger for creating other choices. Not within a month, but things can change quickly.


I randomly received an email from chatgpt saying my account was suspended. I appealed it and got it back - I hadn't used it in months.

But this has left a sour taste in my mouth. What if I relied on it for mission critical business processes?

This is potentially far worse than say a gmail account going down. It's the stuff of nightmare fuel.

Not having an alternative is a massive risk for any business.


The issue is compute is constraint and export controlled, as is even knowhow in model training.

Edit: Can't reply

> Time to build fabs back in the states

We are and did. The Intel and TSMC fabs have already started 2nm fabrication.


Compute was constrained. There is a lot happening, especially with chinese chips which currently points to a massive upcoming increase in non-US capacity.

ex: https://www.youtube.com/watch?v=8ekndZwyOzo


They are export controlled in most cases as well.

Also, the EU, Japan, SK, ASEAN, and India are not supportive of using Chinese tech after China export controlled rare earth exports last years [0].

Software supply chain regulations also make utilizing Chinese software risky for ExChina players and make using ExChina tech risky for Chinese players.

Expect to see RFCs now demanding visibility into what models are used and right of refusal - this is already the norm in F1000s. Similar ones are likely to arise in the EU as well with some of the upcoming industrial policy changes being proposed.

[0] - https://www.reuters.com/world/china/china-is-making-it-harde...


> as is even knowhow in model training

you think its that hard to get trade secrets from some openai or anthropic engineer if you promise them anonymity and a new better paid position? hell they might even give it away for free if they think what their company is doing is morally wrong. know how is not source code, you cant catch it with dlp or online leak scanners. you would need 24/7 combined human and electronic surveillance and thats something even the cia reserves for top level targets, it takes too much manpower to use it on everyone.

SMIC hired hundreds of TSMC employees and now its a couple years away from 3nm equivalent chips in full production. export controls only work against poor countries with less advanced industry like russia. china has the resources and export controls give the motivation. and if the eu/us relations get even worse i wouldnt be surprised if the dutch government let asml start selling euv machines to everyone just to get back at trump.


If you’re talking about TSMC Arizona they aren’t fabricating at N3 until end of next year at the earliest, N2 isn’t slated until “end of decade”. I think they’re manufacturing Blackwell there which is N4 / 4nm

Source: https://www.tsmc.com/static/abouttsmcaz/index.htm

https://www.tomshardware.com/tech-industry/semiconductors/nv...


Not sure if this is true - I’ve been using mimo and it’s great

It mainly shows that this is another US companies that cannot be trusted by anyone outside of the US because of the US government.

They've already been labeled a "supply chain risk". Probably not a good idea to upset the regulators more. Maybe tomorrow Opus will be declared too dangerous for the public.

Cratering their user base outside of the US is hardly going to be good for their IPO.

You're mistaken, this is a cratering of the userbase inside and outside of the US. The ban is on any foreigner whether abroad or living in the USA, so Anthropic has no choice but to completely shut down access to the model for the whole world including the US.

Their IPO is well and truly fucked now. This also means no other frontier lab in the US is allowed to exceed Opus 4.8 capabilities.

If you're a luddite or a decel you should literally be dancing in the streets right now. And, if you're a tankie you'll be dancing right next to them. And, if you were hoping for a Star Trek-like future, you just adjusted your timeline for the worse.


>this is a cratering of the userbase inside and outside of the US.

Is it really? It was limited release anyway (like hypebeast merch!). Everything people are gonna talk about for a week is gonna be about how Fable was so cool that it got banned by the feds. If it's just the Trump admin being the Trump admin, Amodei is just gonna have to pay up as a racket / marketing expense. Or it is like I'm suspecting and this was pre-bribed and the ban is kabuki theater.

>And, if you were hoping for a Star Trek-like future, you just adjusted your timeline for the worse.

The funny thing is that solar and batteries advancements are actually this, not LLMs, but your framing kinda fits anyways.


> The funny thing is that solar and batteries advancements are actually this

No no no, dont say it here. Green tech is now owned by China that wants to destroy everything.

And US bigtech working hard to save everything by building safe controlled super AI that will burn all the energy it has access to.


The main error of the AI bubble is expressed in the The Jetsons cartoon from the early 60s.

In the future, everyone obviously would be running nuclear powered cars. It was just an engineering problem to be solved. Ford made the Ford Nucleon prototype in 1958.

The nuclear optimism completely blinded people to the ridiculous idea of an individual handling nuclear material for personal use.

The AI bubble error is this idea that everyone is going to have "AGI" in their pocket. It is just a completely absurd idea that is not going to happen.

Fable was interesting from what I tried but nothing close to AGI yet here we are. The models don't get smarter and LESS restricted from here.

To me, right away it seemed that the "Mythos moment" was extraordinarily bearish for the assumptions the AI bubble is built on.


The Star Trek future is back on track.

Star Trek is cyber socialism, with the means of production owned by the Federation.

China is the only player working towards this.


This is just a case of not greasing the right palms. Some contributions will be made and this goes away.

“Anthropic buys 75% of Truth Social’s ad inventory”

Funny. But unfortunately this is well within the realm of possibilities these days.

Incorrect. Heavy government regulation means it is limited how they can sell this model and to whom.

It would be if it was rationally tied to the strength of the model. More likely, it’s simply the government deciding who can compete.

Is it now? From a company's point of view, does it really matter that some expensive tool is allegedly good or not if it's reliability/availability is poor and subject to completely arbitrary and unpredictable change?

It also signals that Anthropic is a bad choice if you need stable access to their product outside the US.

And they don't have to actually serve expensive model compute and this all goes away once they contribute to the right charitable organizations and patriotic causes funneling money to the right people.

This is quite clearly corporate capture of the white house by a competitor influencing policy, but it's hard to imagine something that plays more into anthropic's hand. They now own the model that was so good the US government made them shut it off.


it may be really good pr, but it's really bad for their IPO. If their market for future models is usa only, their potential revenue is cut by 50% at least. (and it's even worse because it means Europe, India, and China will all have companies making their own models that anthropic needs to stay ahead of)

Another sibling thread already called this out, but mentioning here: it's not "USA only", it's "US citizens only" (and I'm not entirely sure how dual-citizenship interacts with this, but I assume you can't sell to them, either, since they are by definition also foreign nationals). A private company only being able to do business with folks they can verify are solely US citizens (who themselves are also willing to submit verification of said citizenship to a private company), has a relatively small pool of potential users.

And so if this policy holds, Anthropic has functionally had Fable killed by government intervention, and in a logically consistent world, this would imply all other US-based AI labs may also never exceed existing (read: Opus) capabilities.

What interesting times we live in, indeed.


Regarding the dual-citizenship, you are wrong to assume that. To US government you are US citizen and that is all that matters, even if you have 5 different citizenships government and justice system don't care, you need to follow the US laws and can't cherry pick what you want. Regarding users, for any of this big 3 (Alphabet, Anthropic, OpenAI) only important customers are enterprises, not individual users.

It kills their entire Enterprise market which is a far bigger deal than just consumer KYC headaches

I don't think so, most of enterprise customers are US based companies. They will basically give Mythos to US citizens in R&D while others will use Opus. I hope this is not the actual intent.

How many entreprise customers that aren't in the defense sector currently have R&D departments entirely composed of US citizens?

And what does it mean for indirect access to the models, through say agents working off ticket systems.

The problem here is that the valuations of these AI companies was based on the fact that they'd keep improving models. A company that just serves the latest Opus isn't worth trillions.


You think Anthropic will ask all their enterprise customers to provide passports for all their employees and then setup individual Claude accounts for each and every employee to gatekeep access to Mythos? Because a plain ole api key no longer cuts it

Indeed, this affects way more than just Anthropic.

I think it's the opposite. Who would want to buy shares in a company that's been flagged as a supply chain risk?

The secret ingredient is public and brazen bribery, and the one thing that Anthropic doesn't lack is cash.

But what happens when they fix whatever's making it a risk?

They'll walk away with two black eyes from the US government, and we'll all be left to speculate on when the next sucker punch will land

They literally state here that they can't do that, since the risk is jailbreaking which seems to be an intrinsic part of LLMs.

"Our models are so good the government decides whether or you get access -- so you better not depend on them!"

“Not a commodity”

People talking about "flow" and using AI as a "tutor" are missing the point entirely. The beauty of AI is that you don't need a tutor anymore, and you don't need flow. It's like asking how best to use a combustion engine to improve your horse riding ability.

> Why would anyone "engagement farm" on HN? That's crazy.

All sorts of reasons! One is that when you reach 1000 karma, you gain the ability to downvote comments.

You seem like a smart person. I'm sure you can think of some reasons why that might be useful.


I mean, this forum is run by a billion dollar startup incubator.

People make farming HN a part of their business model.


This is just silly. HN rewards comments people agree with, not "engagement".

Engagement farming works on social media platforms that reward engagement. Posting engagement bait in HN comments would just result in downvotes.


HN rewards engagement in the form of upvotes. If you make a post or comment with the intention of getting upvotes, you're engagement farming. I hate to break it to you, but there are many people doing that on this website.

Why do you think all engagement farming would result in downvotes? You think HN posters are so keen at sniffing out bad actors that they would never reward them by accident?

It feels like you're being willfully dense about this. I can see why the other person accused you of being AI.


>It feels like you're being willfully dense about this. I can see why the other person accused you of being AI.

No, you fucking idiot. Not everybody who disagrees with you is AI.

"Engagement farming" is a term of art with a specific meaning, basically just the adult way of saying "ragebait". People engagement farming on IG or TikTok are typically doing so by posting deliberately controversial content designed to troll upset viewers into writing upset comments. It is inherent to the technique that it only works on platforms without downvotes.

You don't see anyone "engagement farming" on Reddit either, it's always specifically "karma farming" because farming karma requires positive engagement. Engagement farmers do not care if the engagement is positive or not, the required approaches are completely different.


You're conflating engagement farming and rage-baiting. Engagement can be both positive and negative. I can't find any online sources that agree with your definition.

Here's one that I think sums it up pretty well:

"Engagement farming refers to a range of deceptive practices on social media designed to artificially inflate engagement metrics such as likes, shares, comments, and followers."

"Engagement farming employs various tactics to exploit social media algorithms, with the intent to appear more popular than actual user interest would warrant. Examples include posting controversial content to provoke emotional responses, repurposing successful posts without originality, and using automated systems for mass liking or following." [1]

If you don't think this is happening on HN (especially to mass downvote posts) you're naive.

[1] https://www.isme.in/engagement-farming-prof-sriram-prabhakar...


Maybe look at how people actually use it? Not sure why the Indian blogspam is worth looking at.

"Engagement" has a specific meaning. It's different from "like farming" or "karma farming". Some platforms specifically reward engagement, making it a reasonable thing to farm on those platforms.

> using automated systems for mass liking or following

Clearly the author was clueless, this has nothing whatsoever to do with engagement farming.


Have you tried Claude Opus 4.7

HN has been very pro-AI over the last several years. It's only swung back slightly the other way recently. I suspect this is due to tensions in the gulf causing some institutions to reallocate their investments, which results in reduced bot activity.

It's the opposite. There's documented evidence of China recently funding anti-AI bots in the west in an attempt to weaken the US's AI lead, I imagine that such bots also inhabit HN.

I mean, I'm just reporting my experience. You're saying you think it's been generally negative over the years? Like, up until now?

Their loss.

Think about it. Anthropic just reported that their codebase is now improving itself. We're moments away from every open source repo being able to do the same. Think of it like torrenting — you'll be able to open your repo to the public, and have a stream of code flow in from millions of contributors. More code than you could ever write in ten lifetimes, uploaded to your repository in a matter of days.

Ladybird doesn't know it yet, but they just left themselves in the dust.


> More code than you could ever write in ten lifetimes, uploaded to your repository in a matter of days

I wonder if any of you AI boosters have any actual knowledge of how software is written, why bugs exist and how to mitigate tech debt. I wonder if you guys even know what tech debt is.

Somehow it’s always very young accounts which made me wonder if I’m talking to exuberant teenagers or people that have done a one week Python code camp 10 years ago and now think they’re John Carmack with their Claude subscription.


Every idea can seem quite nice if you only imagine the good parts, gloss over the nitty-gritty and ignore the bad.

Is your point that Anthropic is now accepting code submissions to Opus 5.0 from “millions of contributors”? No?

Ladybird uses AI to code. This is not them banning AI. This is them not wanting to take responsibility for the code WE write with AI. They think outside code contributions are raising their risk and slowing them down.

I dislike this change but it does not track to what you are saying at all.

Like always with Open Source, if you really believe what you are saying, fork the project. Outrun them if you can.


> open your repo to the public, and have a stream of code flow in from millions of contributors. More code than you could ever write in ten lifetimes, uploaded to your repository in a matter of days.

why would you want this. this sounds terrible


It would be free labour! Truly crowd-funded development.

I'm picturing something like folding@home, but where people donate their spare tokens to a service, and those tokens get distributed amongst all open source projects on GitHub. You don't think that would be cool? Like, someone might initialise a repo with only a readme and a to do list before they go to sleep, and then wake up to a complete software ecosystem that looks as if it's been in development since before they were born. Like, so much code that no one person could possibly understand it, and it all happened overnight while they were sleeping!


Just donate your tokens to the project. The actual team that’s actually leading the project can direct the prompts better and evaluate the LLM-generated code better for their project than random drive-by contributors can. That’s the whole point of their announcement.

You write "so much code that no one person could possibly understand it" as if it was a good thing. Surely, you're being sarcastic?

It's pretty crazy that a company like Anthropic no longer needs to hire Software Engineers, because their software engineers itself. If that's not a break through I don't know what is!

edit: it looks like I was wrong and they're still hiring many software engineers. Not completely sure why that is just yet.


> This story is part of Science’s AI in Science Reporting Initiative, which is supported by Ray Rothrock & family.

This excessively pro-AI article brought to you by private equity.


> So there's a huge number of HN posters claiming that the price of tokens will go UP over time rather than down (that's how Moore's Law works, right???)

I mean, Github Copilot's pricing just went up considerably, so I guess they were right?


> Which other tool went from nothing to this level of acceptance so quickly?

NFTs? My company had nothing to do with blockchain but I ended up working on NFT integration regardless.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: