More

mgdev · 2026-01-26T13:10:08 1769433008

This thing is cool except:

1) It chews through tokens. If you're on a metered API plan I would avoid it. I've spent $300+ on this just in the last 2 days, doing what I perceived to be fairly basic tasks.

2) It's terrifying. No directory sandboxing, etc. On one hand, it's cool that this thing can modify anything on my machine that I can. On the other, it's terrifying that it can modify anything on my machine that I can.

That said, some really nice things that make this "click":

1) Dynamic skill creation is awesome.

2) Having the ability to schedule recurring and one-time tasks makes it terribly convenient.

3) Persistent agents with remote messaging makes it really feel like an assistant.

bronco21016 · 2026-01-26T22:39:15 1769467155

> It chews through tokens. If you're on a metered API plan I would avoid it. I've spent $300+ on this just in the last 2 days, doing what I perceived to be fairly basic tasks.

Didn’t Anthropic make it so you can’t use your Claude Code Pro/Max with other tools? Has anyone experienced a block because of that policy while using this tool?

Also really curious what kind of tasks ran up $300 in 2 days? Definitely believe it’s possible. Just curious.

esskay · 2026-01-27T12:03:11 1769515391

Seen a couple of people on X have posted about their Claude accounts being suspended after using this. All of them seem to have used it with Claude Code so yes looks like it violates their policy (not surprising really, it breaks their TOS).

I've tried it on Codex (ChatGPT Pro) and within an hour of just getting stuff set up and tested used half my weekly limit so I can see using $300 in a couple of days being very easy.

Until thats figured out this is basically a non starter, you can't use it if its going to cost $1k+ per week to use, and I'm not sure theres any local models that'd handle it without $10k+ in hardware costs.

bronco21016 · 2026-01-27T15:34:36 1769528076

I’ve been working on adapting Claude Code to do some repetitive “personal assistant” type tasks so I was really excited to try this tool.

One of my tasks is a skill that fetches my calendar via MCP and slots events into a JSON to be used for an OR-Tools constraint optimizer that finds a workable schedule for something. It then uploads those events to the calendar using MCP when I choose my favorite candidate solution.

I checked token usage for this task last time I ran it. It would’ve cost $29 in API usage with Opus 4.5.

So yea, you’re absolutely right that this stuff isn’t going to go mainstream at these rates.

mgdev · 2026-01-28T15:08:07 1769612887

One thing you can try is powering Clawdbot with a local model. My company recently wrote[0] about it.

Unclear what kind of quality you'll get out of it, but since the tokens are all local, kinda doesn't matter if it burns through 10x more for the same outcome.

[0]:https://www.docker.com/blog/clawdbot-docker-model-runner-pri...

mgdev · 2026-01-27T11:44:12 1769514252

I offhandedly set it up to do a weather alert every 4 hours during the big winter storm. Absent a well-specified API, I can only assume it was repeatedly doing a bunch of work to access some open API it discovered.

Very much the LLM equivalent of “to bake an apple pie you must first invent the universe”.

To its credit, it did a great job.

bronco21016 · 2026-01-27T13:44:03 1769521443

Wow, so it must have been spending a ton of reasoning tokens then writing code to go fetch the weather. Or maybe using a browser?

Hopefully one day we get self-hostable LLMs good enough for this.

mgdev · 2026-01-05T19:40:20 1767642020

It's the perfect honeypot.

mgdev · 2025-11-05T03:46:45 1762314405

After 20+ years with Apple, I'm 90% on Linux at this point.

Two desktops, two AI workstations, two laptops, and a handheld. Even my wife is running Linux.

My personal phone and work laptop are the last holdouts.

mgdev · 2025-11-05T03:30:41 1762313441

Very simple. Undermines their ad business - which is their fastest-growing profitable business.

mgdev · 2025-10-31T14:53:24 1761922404

Hear hear. Elixir is a dream for this kind of stuff. But it requires very different decisions "all the way down" to make it work outside of BEAM. And BEAM itself feels heavy to most systems devs.

(IMO it's not for many use cases, and to the extent it is I'm happy to see things like AtomVM start to address it.)

I'm just happy I can use Elixir + Zig for NIFs.

travisgriggs · 2025-10-31T21:50:37 1761947437

Indeed. Zigler is tres cool.

mgdev · 2025-10-27T01:47:20 1761529640

Yes. Obvious to anyone who writes AI garbage all day.

mgdev · 2025-10-27T01:47:00 1761529620

That makes zero sense.

mgdev · 2025-10-08T16:23:50 1759940630

This is, as they say, "The beginning of the end."

tyleo · 2025-10-08T16:25:09 1759940709

Beginning of the end of what? If I could have take a bet, “Will GitHub move to Azure?” a few years ago, I would have thrown money down.

This seems inevitable since the acquisition and not necessarily a bad thing. I see it as neutral.

tacker2000 · 2025-10-08T16:31:23 1759941083

The point is that they are prioritizing this over new features.

But since “new features” consists primarily of shoving the bloody copilot agent down everyones throat, it might not be such a bad thing.

dmix · 2025-10-08T16:43:53 1759941833

That plus the new React diff viewer in beta. The old one seemed to be a simpler Web Component inside a Rails turbo frame.

I've tested the beta one and like most SPAs it doesn't scale well to large amounts of data (large numbers of files / line counts). You can feel the DOM slowing down even on a high end macbook. It even blanked out the page a couple times, another common issue when browsers are overloaded. So I switched back to the old one.

dmart · 2025-10-08T17:01:09 1759942869

The new one also doesn’t consistently snap to a specific line in the URL fragment if the diff is too large, which makes sharing links problematic.

torgoguys · 2025-10-08T17:08:57 1759943337

>The point is that they are prioritizing this over new features.

Good! Shoring up infrastructure vs. delivering the latest hotness is something that is rarely prioritized. I'll take boring and reliable every day of the week.

tacker2000 · 2025-10-08T19:15:51 1759950951

Fair point, but I believe they are just migrating for the sake of pleasing their MS overlords.

Does anyone know what infra they are running on now? AWS?

dbbk · 2025-10-08T17:22:36 1759944156

You would be a fool to think the Copilot Coding Agent is not their most important feature at the moment. It's not particularly great, but it must become so.

walkabout · 2025-10-08T16:47:42 1759942062

The infrastructure behind serving git repos the way they do is pretty fiddly—I'd not be a bit surprised if this move reduces stability and/or performance.

stackskipton · 2025-10-08T17:04:03 1759943043

Sure but it also might make them fix some of that.

walkabout · 2025-10-08T19:44:51 1759952691

No, I mean inherently so. It's basically a whole stack of caching problems.

driverdan · 2025-10-08T17:29:49 1759944589

That started with MS and accelerated with Copilot. Word is that GH leadership doesn't care about anything other than Copilot/AI. All other features are receiving far less focus and fewer resources. I've heard this repeatedly from current and former employees.

aaronbrethorst · 2025-10-08T16:26:42 1759940802

nah, I'd say we're well past that. The beginning might have been Microsoft's acquisition of GitHub. Or the elimination of GitHub's independence.

rufo · 2025-10-08T17:09:59 1759943399

IMHO: the acceleration curve into point-of-no-return was when Microsoft decided to go hard on AI, and saw GitHub's Copilot as one of the key inflection points they were going to use to do so - even going so far to adopt the Copilot brand across the entire company.

Before that, it still felt like there _some_ degree of autonomy and ability to think about the developer experience on the platform as a whole. Once ChatGPT took off and MSFT decided that they were going to go hard on AI, though, Copilot (and therefore GitHub) became too important to Microsoft to leave alone.

I kinda suspect the slide was inevitable anyway, given how acquisitions tend to go. But IMO, Copilot was the tsunami that washed the octocat out to sea.

bediger4000 · 2025-10-08T20:16:57 1759954617

It does remind the oldsters of Hotmail.com

mgdev · 2025-08-09T01:44:34 1754703874

Alyssa Henry is former AWS, and an absolute monster of a leader.

nealabq · 2025-08-09T03:15:11 1754709311

Is being a monster a good or a bad thing?

adastra22 · 2025-08-09T18:12:52 1754763172

In this slang usage it is a good thing.

thejazzman · 2025-08-09T04:43:54 1754714634

i was really hoping apple was gonna drop a new chip called the M4 Monster to one up the Ultra

i share rhat to say, i think it's got positive connotations atm

mgdev · 2025-07-14T16:05:34 1752509134

it's pretentiousness thinly disguised as modesty.

trust me.

vinceguidry · 2025-07-14T18:07:23 1752516443

What I find pretentious is the legion of commenters who can't find anything better to comment on and instead pretend they're smart by nitpicking some stylistic choice in the most low-effort way possible.

happytoexplain · 2025-07-14T18:33:11 1752517991

Classic case of "you're pretentious", "no, you're pretentious". It's exhausting how often we reach for the word "pretentious" when we have bitter feelings about one person's opinion of another person or their work.

vinceguidry · 2025-07-15T13:05:56 1752584756

I just used it because he did. My real feeling was exhaustion. More of those comments than comments about the subject matter of the post. Like going to a swimming meet as a pro and finding it full of kids instead.

mgdev · 2025-07-16T00:03:41 1752624221

I was trying to be ironical.