I just assume Opus is constantly nerfed based on capacity. I was exclusively Claude for a long time, but the inconsistency in quality, constant outages, and slow downs were too hard to work with.
I just use dumb and fast models now. I'm more engaged. I think that the higher the quality of the model, the more you tend to vibe with it, and then the more hallucinations you then miss. I'm not sure which is more productive, but I definitely burn out faster the more I vibe. At some point you're spending your time on forums, discord, or youtube instead of engaged with what you're building. Or you yak shave about your tooling and end up creating the 600th multi-agent gastown harness and blowing thousands of dollars on tokens to create it only to discover it's too expense to actually use.
I agree with you. The more I vibe code, the less interested I feel in what I'm building. Working with models that force me to think, especially with personal projects, helps me stay engaged and enjoy what I am doing more.
The short segment I saw on reddit, I believe, of Ivanka talking about an island gave VERY strong AI vibes. Is this interview real and are there links?
This is the first I've seen it referenced so haven't given it much thought.
> In fact, I don't even know why Rust would be a good thing here when Go or even Rails/Django would work just fine
FWIW I've noticed that agents are pretty good at writing "High-Level Rust" for most basic applications, which gives you pretty great performance (orders of magnitude faster than RoR), great deployment, probably great security if steered by a senior, and pretty great maintenance again if originally steered by a senior. I feel like this is a not-so-secret secret.
I personally won't be using dynamic languages for anything but toy scripts now. (well, except JS, which is hard to avoid with the massive size of WASM bundles)
P.S. I assume Go is still great as well, but IMO Go no longer has an identity. What are they going for anyways? Garbage collected rust? "How to invent a perfect niche and then throw it all away in 21 days". /rant
Identity? Most folks don't think that way anymore. I try to funnel them from Python to GO. I want to ship binaries. I want them to think about dependencies and if they really need them.
It's only a problem for the ones left holding the bag. I'm at an all-time low allocation percentage in the US stock market and considering pulling more out still. Full on casino vibes at this point.
It's doubled in five years while inflation's gone up by 30+ percent. Where exactly is there to hide? Not gold - that peaked and went down a lot. Let's not talk about BTC, either.
Real estate? You've got taxes on that in the US, it's how our governments can pretend to not tax us as much as in Europe while still taxing us as much as in Europe (property tax goes to schools)
I mean, gold is up significantly more than then broad market since Trump took over, but that doesn’t mean you should gamble on it continuing.
I don’t trust real estate either. Seems like anything with a middleman is setting record levels of grift right now. I don’t trust the industries reports. No one is regulating or checking numbers.
I shifted into more bonds. Probably a bit early, but I’m not a pro. I’ve just lost trust in the market which no longer seems tied to reality or at least my limited understanding of reality. Staying in it just feels yolo atm.
I think frontier models are bait now. I prefer a fast, less thoughtful model that adhere's to instructions. The code these latest models produce is often hot garbage and you still need to micro-manage. Fast with small chunks is better.
I've been messing with DS4 flash. It's very fast and has 1m context. Downside is it's probably not great for private codebases (at least not when hosted by DeepSeek) and it's also not great at sticking to instructions if the task is too large / long though. I expect it will improve over time with better tooling and cache usage.
I basically have it load up a bunch of relevant context and give it small chunks of work in the same session over time (not like a fire and forget subagent). It's working fairly well. Bonus is I still feel like I'm part of the process instead of watching youtube videos while Opus / GPT vibe code a bunch of slop.
Except it generally worked for gambling for a very very long time. The existence of a black market does not mean something should be legal. Human trafficking happens, but that doesn't mean we should legalize and tax it. (extreme example I realize, but I use it to illustrate a point)
Pushing it to black markets takes away the possibility of heavy regulation (which absolutely should be in place), and stigmatises victims (gamblers), making it harder to come clean to friends and family. I agree with your assesment that no one should even start to gamble, I just doubt that declaring it illegal will achieve that.
The US Supreme Court effectively legalizing sports betting overnight provided an almost perfect real-world experiment to test that argument. And the result?
"Prediction market" ads running constantly on both sports channels/websites like ESPN. Shortly followed by mainstream cable news like CNN featuring Polymarket stats as a routine part of their horserace polling coverage. Gambling is now an omnipresent temptation for anyone even casually interested in following sports/political news.
And now millions of young men who previously would have had to seek out niche illegal venues to gamble have several dozen different apps on their phone offering to light their disposable income on fire by clicking a couple buttons every paycheck.
Claude at times feels lobotomized compared to where it was a few months ago with 4.6. I think Anthropic was (is still?) struggling with their infrastructure and hasn't felt as good for me anecdotally for a few months. Significantly enough that I've cancelled (max 20x).
Codex 5.5 extra high currently feels a good amount smarter than either 4.6 or 4.7 Opus. I only just started using it about a week ago, so maybe that's a recent development and then OpenAI will eventually lobotomize their model or throttle etc.
What I dislike about frontier models is how opaque and incentivized the businesses are about tweaking their services. Anthropic definitely does some shady throttling. I have zero trust for Altman and he's BS AGI claims. And Google makes it non-obvious that you can't turn Gemini training off on even their highest tier personal plan. There's a lot of shady and dishonest behaviour, probably because they are all overhyped and heavily subsidized to win the race. I don't mind at all paying more than I currently am for these services, but I don't trust any of these frontier model companies, and so I'm cheering for open models.
Right now I'm using Codex for planning and DeepSeek V4 Flash [1m] for implementation. It's quite fast. Quite possible / likely that OpenAI will make significant changes that kill this workflow for w/e reason... at which point I will probably move to full open weight models.
Honestly I'd prefer a system wide cooldown / age setting across all package managers and installers, with the option to poke holes / allow and also the option to deny / allow post installation runners. Something like a global asdf style installer that tracks and enforces these rules across all of it's managed package managers.
Something like a proxy that intercepts and depending on the source, is intelligent enough to examine the package for age. That would be cool. Already sounds like a cloud product you could sell.
I just use dumb and fast models now. I'm more engaged. I think that the higher the quality of the model, the more you tend to vibe with it, and then the more hallucinations you then miss. I'm not sure which is more productive, but I definitely burn out faster the more I vibe. At some point you're spending your time on forums, discord, or youtube instead of engaged with what you're building. Or you yak shave about your tooling and end up creating the 600th multi-agent gastown harness and blowing thousands of dollars on tokens to create it only to discover it's too expense to actually use.
reply