> I'm not sure why more people aren't jumping on it
Simple: most of the people you’re talking to aren’t setting these things up. They’re running off-the-shelf software and configurations and calling it a day. Most of them aren’t working with custom harnesses, or even tweaking temperature or prompt templates.
The value prop for the Nvidia one is simple: playing with CUDA with enough RAM at okay-enough speeds, then running your actual workload on a server running the same (not really, lol, Blackwell does not mean Blackwell…) architecture.
They’re fine-tuning and teaching boxes, not inference boxes. IMO anyway; that’s what mine is for.
> then code quality just doesn’t really matter so much in the age of AI
Except at scale it really does, because garbage in, garbage out. The crappier the code you feed the current models, and the more confusing and leaky the broken abstractions, the more bugs the AI will generate.