Hacker Timesnew | past | comments | ask | show | jobs | submit | impulser_'s commentslogin

Every model release is just proof that AGI will most likely only be for the rich. We are a few years into LLMs and majority of people are already getting priced out of intelligence from LLMs and these are no where near AGI.

This is like looking at mainframe pricing in 1990 and concluding that PCs will only be for the rich. The price of each new level of capability is going to drop like crazy very quickly. It won't be that long before practically any consumer use case will be possible on models that are dirt cheap.

This premise is based around the assumption that Moore's law is still working, which it very much isn't [0]

[0] https://cap.csail.mit.edu/death-moores-law-what-it-means-and...


Improvements in model performance aren't always strictly compute-constrained in a way that makes them reliant on Moore's Law. Open weight models-- in particular, from Chinese labs-- are optimizing model intelligence with less compute. They're "behind" frontier models by months, but as others have noted, it's possible to get Sonnet 4.5+ level performance at reduced cost, today, from open weight labs.

No, I'm not assuming Moore's law. The efficiency of AI datacenters will continue to improve even without Moore's law, but more importantly the efficiency of packing intelligence into gigabytes and FLOPS will improve by leaps and bounds over the coming years, just as it has for the past few years if not faster.

You are only priced out if you only care for SOTA right now and can't wait for the inevitable cheap model coming in 6 months. DeepSeek, Xiaomi and Moonshot are already really cheap and match frontier performance from 6 months ago.

But they’re artificially cheap. When will they be cheap while the company makes a profit.

They are not artificially cheap, they are still cheap even when hosted by independent inference providers. Are all providers subsidizing their open-weight models?

Nobody's making profits right now, not because they're selling tokens for less than their cost but because they're always investing in the next bigger model.

Hardware manufacturing hasn’t caught up yet. Once it does, especially in China these token prices are going to drop hard.

They are using Google Cloud.

https://security.apple.com/blog/expanding-pcc/?linkId=100000...

"Now, we are collaborating with Google and NVIDIA to run new Apple Intelligence workloads on Google Cloud, extending our industry-leading PCC privacy commitments to third-party data centers for the first time."


Per that link: I think there's an interesting question about whether a nefarious actor who's infiltrated a cloud provider with physical access to machines that are running signed operating systems, with signed binaries, with TDX remote attestation, and with hardware supply chain verification, has the ability to break the privacy guarantees of a tenant with Apple's sophistication.

Certainly, one could tamper with the hardware, but could one do it in a way that wouldn't get that machine immediately flagged, removed from the routing pool, and told to wipe its memory immediately, by a watchtower (perhaps even the routing layer itself) that runs in a separate secure Apple datacenter?


Those datacentres would be in the same position of trust as a VPN provider in that the data must be unencrypted at points in the process.

They could be making it very safe, and the things apple says they are doing would make it as safe as possible, but as a user there is no way of verifying the claims.


> as a user there is no way of verifying the claims

I think this sums up what it's like to be an Apple user pretty well. With their heavy proprietary and closed approach, all users can do is "trust" them.


Have you read the PCC whitepapers? Are you saying the user-facing verification methods in them are insufficient, or vulnerable, or just false?

Apple could simply be ordered to include a hardware backdoor, and legally be prevented from talking about it. Everything else in the architecture could work exactly the way they claim in the PCC paper.

>nefarious actor who's infiltrated a cloud provider

Google is buying that compute from xAI aka Musk


Spoiler alert; Google is the nefarious actor.

I think the last thing Google wants to do is get on the bad side of their largest partners.

their largest partner is probably the US government.

Which is...

Wrong answer. Or at least, obvious and not particularly useful.

Truth is, none of those parties are "nefarious" - they're all just not on your side. And "security" is never an unqualified good thing to have (it's not an unqualified bad thing either). It's just a framework of coercion.

The most important questions to answer about any security system is, what is being protected, for who, and from who. People don't ask that much, not even in the industry - it's an implicit assumption that everyone themselves is a "good person" and is on the protected side of security systems. And then they're confused because it turns out end-users are more often seen as threat actors. All the players mention, but perhaps especially Apple, in its own special way, is protecting the computer from the user just as much as they're protecting the user/user's data from third parties.


It's not.

Why bother with all that cloak and dagger stuff when they can just buy the data? You believe Apple and/or Google isn't selling it? I have some land in Florida I'd like to talk about.

Having worked at Apple, I will say I firmly believe they do not sell data. I worked in data science and we had the shittiest inference because we had essentially no access, even internally, to longitudinal or cross-app user data. Best we had was 15 minute rotating sessions for a single app. There are internal teams dedicated to deanonymizing data to try to narrow down users - if they can successfully do so, and relevant fields that lead to deanonymization get permanently purged from internal logging.

I can’t speak to the current architecture but Apple has shown a consistent willingness to sacrifice access to user data in the name of selling privacy instead at a premium price (you could argue precisely because no one of their competition have any meaningful posture on this). I do believe they are quite serious in their commitment to that, as they have found this strategy to be more valuable than the data itself.


But sending sensitive private audio recordings to the lowest bidder is par for the course?

https://www.bbc.com/news/technology-49502292


This comment makes it sound like they sold private recordings to whomever was willing to pay for them, but they paid third parties to evaluate Siri recordings.

Don't really agree with that, that would have been highest bidder if anything.

And it wouldn't have been much worse compared to be as careless as they have been.


> Having worked at Apple, I will say I firmly believe they do not sell data.

Selling data is so shabby! Why sell when you can just give it away to letter-soup friends?


Because that's not legal, so they sell it to third party data brokers and it gets resold to someone the TLAs can buy it legally from.

Illegal to share data with entities that are themselves law enforcement, and which they are known to be demanding, not just asking to share out of good will?

Apple's incentives don't align to sell private data as their whole thing is privacy. They do that they tank their business. If you have proof that they are doing it -- I'd love to see it. (*3rd party actors from an app re-selling data doesn't count)

Google is 100% doing that because thats their entire incentive for the business. They sell low cost software / subsidized hardware on the grounds that you pay with your sharing data. That's the implied cost.

Show me the incentives - I will show you the outcomes.


Apple/Google make less money if they sell the data because their ad product would no longer have an advantage. So no, I don't think they do that.

That’s not so special, though? There’s a difference between Google infra running Google services.

Versus any F500 company running their services on GCP.

It’s a bit whacky to think about because Apple will operate Google owned software on GCP. But it should be sandboxed just the same.

I’m not making a normative privacy argument here. Just pointing out that this is cloud business as usual. Perhaps it’s interesting Apple is doing it, but basically everything else is already using either AWS or GCP at this point.


I think the difference is scale. This is Apple, so it's an enormous amount of devices. And it's a seamless experience, to the user, going from local model to cloud models.

So the question about which model Apple was going to use and where has been highly anticipated, especially by the likes of OpenAI and Anthropic. Imagine if either one could say they have Apple as their customer?

Apple certainly has the cash to burn if they wanted to train their own model, but it also always seemed out of their core competency. This is a major win for Google.

So "business as usual" but with huge implications for the AI ecosystem in general.


Google Cloud, but, the way I read it, not Google’s AI offerings. They, basically, hire Google servers to run their software on it.

They also (claim to) ensure those servers run only software they have approved to run on it.

(Part of their software are models derived from Google Gemini, but that’s orthogonal to this)


>(Part of their software are models derived from Google Gemini, but that’s orthogonal to this)

You're right that it is orthogonal to the privacy promises Apple makes to its own users.

The moralistic and righteous undertone in their marketing material is questionable though given that these Apple services might not exist if Google didn't exploit Gemini app user data on Android the way it does.

That's fine with me. Users have a choice here. In fact, it's a big improvement over the search deal with Google where Apple sends its own users directly to Google.


They are not _only_ using Google Cloud. They continue to build and invest in their own datacenters. It's not a binary choice.

Yeah, but the models are running in Google Cloud which makes sense they are based on Gemini.

They appear to be running them on both GCP and in their datacenter.

That is news — I guess not very surprising that they'd need more data centres than before.

But again there is no Apple-to-Google transfer in the inference in the sense of the comment I was originally replying to (I am not suggesting you're implying otherwise, obviously)

But I stand happily corrected where I said they aren't in the picture at all.

That is an interesting press release because it outlines what they would have had to do with any data centre they were outsourcing to.


This is probably why Google had to rent compute from SpaceX. They needed to free up NVIDIA GPUs for Apple so they probably moved internal workloads to SpaceX compute.

Google likely won't rent compute from SpaceX, they have a substantial share of SpaceX (they own 5% of it) and need the IPO to be valued highly, so to prop up the IPO stock, they made this announcement, but if you read the fine print, both SpaceX and Google are allowed to cancel it at any time, as-in, after they cash out from the IPO.

iCloud already uses Google Cloud, so that still doesn't change the operational boundaries of where data goes

I hope they are still using PCC hardware rather than running private data through third-party servers.

I don't think you understand the size of the US capital market. We are talking probably ~150 trillion.

It's easy as fuck for Google to raise this money because they are a money printing business. They are the most profitable company in the world, so for anyone this is basically the same as buying US debt.


> We are talking probably ~150 trillion.

Yes, but we are talking about liquidity not valuations...


Believe it or not there is actually a shortage of assets and an excess of money. That's in part why valuations are so bonkers.

This is equities + bonds which are pretty liquid assets.

Yeah, but Google has the money for this. They are quite literally the most profitable company in the world. They are only raising because they don't want to harm there other businesses buy eating up their capital for this.

Why do you think there will only be one winner?


> Yeah, but Google has the money for this. They are quite literally the most profitable company in the world.

"Alphabet announced that its 2026 capital expenditures are expected to be $180-$190 billion, and that it expects 2027 capital expenditures to significantly increase [...] over the 12 months ended March 31, 2026, Alphabet generated $174 billion of operating cash flow"


"If all of this was done to better humanity, AI development would be done in public, data would be legally obtained, models would be released for free, access wouldn't be gatekept behind ever increasing subscription costs."

The vast majority of AI development is public. There are papers literally every single day to read. In fact everything you need to build Claude and GPT models is public. Thanks to Google, DeepSeek, and all the other research labs. There are more research labs than there are closed shops. In fact there really is only one Anthropic, and lately maybe OpenAI. Google still releases papers all the time on AI.

There are more open source models than closed source models and all of them are accessible without a subscription. Yeah you still need to pay for them, but hey as we build out infrastructure and more time is put into efficient models today will easily run on person compute of the future.


There are more ants than humans in the world, too.

But which one is driving major changes to the world?

Just because you can point to an absolute number of open source models doesn't mean much when the models that 99.9% of the world cares about aren't.


"models that 99.9% of the world cares about aren't."

Software engineers aren't 99.9% of the world.


Is there any mainstream model that is actually open-source (not just open-weight)?

What do you mean with "open-source"? Of course, the inference code for all the open weight models is publically available - see llama.cpp or hf transformers.

There are, however, very few models where also the full training pipeline is available. Olmo by AI2 comes to mind.


Harnesses aren't really going to change much of the performance on models like Opus, and GPT.

You literally can just give the model a bash tool and it will do just fine in fact it will most likely do better than majority of harnesses due to how well models are at bash.

The model do all the lifting. It really doesn't matter which harness you use.


People need to stop thinking that LLMs actually know what they are. They don't. They don't know their Qwen, They don't know they are Opus. They don't even know they are an LLM.

This is why literally every single model's system prompt starts with something like:

"You're Claude Opus a large language model from Anthropic"


They can easily add this "individuality" in RLHF. The base model won't know, correct, but the final user-facing model very much can if that's what they so desire.

Crazy they bring up honest, when Claude models are literally known for straight up lying about things it has done and tries to act like it did what you asked.

Which is why they brought it up as something they are trying to improve.

Less than other frontier models. Which is scary honestly.

No. GPT models follow instructions significantly better than Claude models.

You tell it too research a repo to find a piece of code it will. Claude will just read the README and guess.


I have a codex session I am using to vibe code a db thats being going for like 3 month. Still doing OK. Try that in CC.

What's the token usage at?

I have tested almost every language and Go is pretty good but for some reason LLMs get paranoid over races and just start spamming locks next thing you know every fucking struct has mutexes and every function has locks lol.

The best language I have seen an LLM use was Kotlin. It actually surprised me how well it wrote the language. I wrote a project in it and I think I didn't have to correct it once. Like I was seriously impressed. I just wish Kotlin had better tooling so I didn't have to use gradle or maven lol.


Jarred said this had nothing to do with Mythos or Anthropic.


I have a very, very hard time believing that. Surely the acquisition left his wealth largely in the form of Anthropic stock, so his personal definition of success is "rep Anthropic so my stock goes up" and at that point he has succeeded.

Me, I still have to be competent to succeed. I don't just get to declare that because I used AI the effort was a success, and I have 0 desire to work with those kinds of people.


The concept of a "useful fool" is apt here.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: