More

gertjandewilde · 2026-03-16T15:25:36 1773674736

We built a unified API with a large surface area and ran into a problem when building our MCP server: tool definitions alone burned 50,000+ tokens before the agent touched a single user message.

The fix that worked for us was giving agents a CLI instead. ~80 tokens in the system prompt, progressive discovery through --help, and permission enforcement baked into the binary rather than prompts.

The post covers the benchmarks (Scalekit's 75-run comparison showed 4-32x token overhead for MCP vs CLI), the architecture, and an honest section on where CLIs fall short (streaming, delegated auth, distribution).

OsrsNeedsf2P · 2026-03-16T15:37:06 1773675426

How is progressive discovery not more expensive due to the increased number of steps?

BeefySwain · 2026-03-16T15:44:00 1773675840

I assume because the discovery is branching. If the an agent using the CLI for for GitHub needs to make an issue, it can check the help message for the issue sub-command and go from there, doesn't need to know anything about pull requests, or pipelines, or account configuration, etc, so it doesn't query those subcommands.

Compare this to an MCP, where my understanding is that the entire API usage is injected into the context.

zamalek · 2026-03-16T15:41:36 1773675696

In short: JSON. Plan prose or markdown is way more token efficient than JSON. I think that responding in JSON was always a mistake in the spec; it should have been free-form text (which could then be JSON if required).

iamjackg · 2026-03-16T15:42:16 1773675736

It depends on what your "currency" is: inference cost vs. models getting dumber/slower with a fuller context.

gertjandewilde · 2026-02-25T20:50:40 1772052640

Most APIs were designed for human developers, not autonomous agents. As LLMs start selecting endpoints and generating arguments directly from your schema, ambiguity and weak error semantics become production issues. This post outlines practical API design patterns that make APIs more reliable for agent-driven workflows.

gertjandewilde · 2025-03-16T10:54:45 1742122485

Analyze codebases using AI - generate architectural overviews, documentation, explanations, bug reports and more

Would love to hear your thoughts, feedback, and ideas for improvement!

gertjandewilde · on Jan 16, 2025

We’ve overhauled our SDKs with Speakeasy, leaving the limitations of our old OpenAPI generator behind. The new versions deliver major upgrades in usability, error handling, and performance.

gertjandewilde · on Dec 26, 2021

Great use case. Is the demo broken?

gertjandewilde · on Dec 26, 2021

Yes, Portman can do this https://github.com/apideck-libraries/portman

gertjandewilde · on Aug 3, 2021

Hehe, fair point! Most of those products have startup programs not to break the bank

In case you want to go the open-source route you have this handy overview https://www.btw.so/open-source-alternatives

gertjandewilde · on Aug 3, 2021

Thanks for the feedback!

I agree some categories are indeed thin, which is one of the reasons why we made the submission so potentially valuable tools can get listed.

gertjandewilde · on July 16, 2021

Thanks for sharing. RAFT looks great

gertjandewilde · on July 16, 2021

Good question! To transform the OpenAPI spec to a Postman collection, we're using the handy openapi-to-postman package from Postman.

The test automation is where the real magic happens.