Hacker Timesnew | past | comments | ask | show | jobs | submit | hereme888's commentslogin

Any bets on how long now until GPT-5.6 announced on HN?

I say 1-2 weeks.


Early ArtificialAnalysis.ai results show GPT 5.5 is still the better bang-for-your-buck.

OpenAI solves tasks with about 50% less output tokens.

https://artificialanalysis.ai/?intelligence=coding-index&int...


I give Codex a try with every new version, and we don't match, so this isn't true for everyone.

Claude would need to be much more expensive for me to switch.


People be saying these things with certainity. 99% of the time one has just inspired more confidence through sycophancy, or just good varience in outputs for a session/prompt.

Slop heads be swearing by one slot machine one week and swearing it off the next like an addicted gambler describing their favorite slot machines from week to week.

This isn't a coincidence, these companies hire UX designers from mobile gaming and online gambling to help engineer their addictiveness.

Its all in your head, and the output is no matter what always going to be worse than learning how to do something yourself and putting care into it.

Handmade watches > mass manufactured watches. There's nothing special about the skills needed for the guy who runs a conveyer belt at a watch manufacturer in China. The watch made by the guy who makes one watch a month in Switzerland is prized and beloved.


So you're trying to convince a community of mostly engineers — using the example of a terribly outdated technology that stays overpriced purely through symbolization because the luxury industry's bubble keeps holding — that flashy looks, advertising, and fancy concepts aren't really beloved and worthy? Fascinating.

> the guy who makes one watch a month

That's the thing, though. Most people alive today will never be able to possess such an object, no matter how prized and beloved it is. Still, if people want to be able to tell the time from their wrist in a reliable fashion, there are _plenty_ of far cheaper options available to them. The craftsmanship does have inherent value, yes. That does not mean the practical solution is worthless.

There can be practices incorporated in the production of software, involving AI use in a responsible fashion (difficult, of course), that produces practical solutions to real world problems far faster than a group of industry-hardened veterans painstakingly polishing their codebase in pursuit of craftsmanship. Those who appreciate how it is made will pay for the crafstmanship. Those who cannot afford to do so, and only care about a solution working well enough for the tasks they want to accomplish, the production line is good enough.


Codex with 5.4/5.4 is great Idk havent seen anything more crazy with claude + more expensive

GPT 5.5 and 5.4 are such great models. I just tried opus 4.8 and took 30 minutes to be confronted with a bit laziness that makes me go crazy. 5.5 just doesn’t have this issue.

How do you compare them to 5.3 Codex? I am using 5.3 Codex for a while, I subjectively think it does better job than Opus 4.6/4.7, with a fraction of the cost, and I did give 5.5 a try and it seems a bit better but magnitudes more expensive.

5.3 is good but talks like a robot, it’s too hard to understand what exactly it’s talking about. When using droid I use it to act like worker model and does a great job.

All 5.x models suffer from weirdness in the way it writes but 5.5 and 5.4 are much better and now offer a good balance, direct but without being like Claude.


My 20$ OpenAI sub gets me the same as my 100$ Anthropic sub. It really is the better deal.

I have the Max $100 plan and have never hit a wall; so the number of tokens I consume has never been a factor in how I use the models or which ones I use.

It's not magic, but for the value I get, I have no problem paying $100/month.

If I was forced to use API-usage pricing, then all of a sudden, switching models, limiting token usage, using "lower effort thinking" modes, etc., would become a thing.


This is true but OpenAI has been slowly boiling the frog here, too. $20 and $100 on their plan doesn't get you nearly as far as it did two, three months ago.

I use their (newish) 5x $100 plan and I routinely run out of weekly limits about a two days before the end of the week.

This has also goaded me into upgrading to $200 once before... and then had them hand out limits resets to everyone. Argh.


I feel you. I also switched to $200 plan because I have agents running half of most days.

I was previously coasting on that 2x both Claude and OpenAI were offering.


same with $200 plan

they silently raised the costs

also feels degraded


I agree

lol, the start of my game, where I used an obvious name and company name:

"""

Company Bio

Sam Altman, a serial founder known for previously leading a Y Combinator that invested in just about every flavor of tech, naturally pivoted to an "AI-first" approach for his latest venture, OpenAI. Now, the company plans to secure a $99 trillion valuation by offering a chatbot that can generate a haiku about a paperclip.

"""

And the news: "Cerebras CS-4 unveiled. 4 trillion transistors, one chip". lol

My favorite: a protest from Greta Thunberg:

AI Environmental Coalition: '1000 protest outside OpenAI DC'

Local power and water use protests. Media cameras everywhere. The nerve of them, disrupting our quest for AGI with their 'sustainability'!

“Greta camped outside our data center last night. Again.”


Haha, that sounds about like it.

NVIDIA’s published specs imply much larger gains in NVFP4 inference compute and GPU memory bandwidth than in BOM cost.

That said, more intelligence and automation = higher costs.


Was just going to say.... I looked inside by accident, and it gives a better impression of intelligence and effort than the outside.

I stopped paying for a blue checkmark when I noticed so many paid accounts were managed by AI bots anyways, and my account is shadow-banned when I post unpopular opinions.

Free speech, for those that pay.

Interesting. I can see the utility if you're going to see a nurse practitioner. But if your physician doesn't pull the actual charts for your device and visually inspect them.... try finding someone else.


Only a 10% budget cut? Should have been way more. I hear victimization throughout the article. It was the school's choice to focus on politicizing and prioritizing foreigners, and looking as "accepting" as possible, rather than educating and funding our citizens which is what matters.


How do you mean politicizing and prioritizing foreigners?


Normalizing compute rental + no privacy vs. actual ownership.

Typical of Big Tech spirit.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: