est · 2026-05-31T15:43:36 1780242216

> just plain HTML and some basic CSS

Or even better: XML + XSLT.

True separation of representation and data.

Are thousands of nested <div>s really a good idea?

est · 2026-05-30T14:28:48 1780151328

MCP is based on a lie: Machines are good at reading and generating machine-parsable protocols.

Turns out LLMs are shit with JSON, especially JSON strings embedded inside another JSON key-value pair.

Why do smart people design a schema that escapes JSON into a string and embeds it in another JSON document?

It's based on another lie: AIs favor static typed languages.
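
To illustrate the nested-JSON point above, here is a minimal Python sketch. The payload and its field names (`name`, `arguments`, `path`, `text`) are made up for illustration and are not taken from any real MCP schema. It only shows what happens when a JSON object is serialized to a string and then embedded as a value in another JSON object.

```python
import json

# Hypothetical tool-call arguments; the field names are assumptions.
arguments = {"path": "notes/today.txt", "text": 'He said "hi"\n'}

# Double encoding: the inner object becomes a string, then that string is
# embedded as a value, so every inner quote and backslash is escaped again.
envelope = json.dumps({"name": "write_file", "arguments": json.dumps(arguments)})
print(envelope)

# Reading it back takes two parsing passes.
outer = json.loads(envelope)
inner = json.loads(outer["arguments"])

# Alternative: nest the object directly, so one pass is enough.
flat = json.dumps({"name": "write_file", "arguments": arguments})
print(flat)
```

Printing both forms side by side shows how much extra escaping the double-encoded version carries, which is the part a text generator has to get exactly right.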

est · 2026-05-29T05:03:37 1780031017

> I'm going to feed all of my business's data to it

Your business data is probably worthless, and might even be considered harmful for the pretraining corpus.

Your interactions and decision-making process are the most valuable parts of the whole business.

bandrami · 2026-05-29T05:52:16 1780033936

I assure you my business's data is not remotely worthless, which is why there are pretty strict laws and regulations about what we can do with it.

TZubiri · 2026-05-29T06:52:17 1780037537

>Your business data is probably worthless

please tell me you are not in charge of the data of any business I'm a client of

elpocko · 2026-05-29T08:46:10 1780044370

Could be! Let's check. I just need your name and address, your SSN, a list of businesses you are a client of, and a DNA sample.

est · 2026-05-29T07:26:50 1780039610

To clarify: probably worthless to AI vendors, but it might be useful to third parties.

TZubiri · 2026-05-29T07:35:22 1780040122

Third parties that can be clients of the AI vendor...

selcuka · 2026-05-29T09:40:49 1780047649

If it's worthless to AI vendors, they won't include it in the training corpus, so third parties won't have access to it.

TZubiri · 2026-05-29T18:06:37 1780077997

But it isn't worthless, because the user is paying for it, and third parties are paying for it as well. Unless your input and output are completely different from everyone else's, which they're not, because you are human, and I bet you have a profession which other humans have, and many other qualities which you share with other humans.

In any case, relying on the chance that the LLM provider won't train on your data because of its presumably low value is as good a strategy as crossing your fingers or venerating the god of rain. You should be relying on contractual clauses, at least when including professional and client data.

estearum · 2026-05-29T10:55:15 1780052115

They're alluding to something more like espionage, or just selling the interesting stuff you put in the text box.

TZubiri · 2026-05-29T18:00:27 1780077627

Wow I thought this was quite obvious, apparently not, so I'll explain.

An LLM provider sells usage of its model. You use it to write code. Other clients use it to write code as well. If the LLM provider trains on user data, then your usage benefits other users. If you pay the company to generate code, then by definition it is useful, and it is highly likely that other customers care about it.

Replace writing code with anything, a lawyer, a psychologist, a confessional. The IO is inherently useful to users of the same category.

That is to say nothing of adversarial use, that is, data being useful because a counterparty might find it useful: an attacker might find common code patterns, a lawyer might see what the opposition is being advised, a boy might see what a girl asks or gets advised, etc.

If this sounds too complex to you, just think of training on data as exfiltration with added steps, because that's what it is

estearum · 2026-05-29T18:26:52 1780079212

Oh well this is a bad argument. I made a mistake by assuming you made a good argument instead.

bandrami · 2026-05-29T12:39:28 1780058368

The worry is direct exfiltration, not training

est · 2026-05-27T16:13:03 1779898383

There are basically two kinds of people in the world: ones that create stuff, and ones that destroy stuff.

Defense is a totally different game, and requires a completely different mindset from creativity. In security, if you miss one thing, you lose everything.

AIs are good at choosing a good candidate based on a reward model, but they suck hard at enumerating mundane attack surfaces and combining them into an exploit.

beardedwizard · 2026-05-27T16:25:56 1779899156

Good engineering is good engineering. Belief that someone else uniquely possesses the skill to engineer some critical part of a system you built is, for me, just abdicating responsibility. It's a learned helplessness.

Someone else blindly operating an llm on a corpus you created with an llm is comical.

drfloyd51 · 2026-05-27T17:05:22 1779901522

Are you the best choice to engineer everything your system does? There is no one in your company that might do a better job than you for a specific part of the system?

There is nothing wrong with asking for help or bouncing ideas off people with stronger skills.

I still have the responsibility to code XYZ well. But I don’t have to do it in a clean room.

est · 2026-05-27T14:14:21 1779891261

sounds like a Monty Python sketch ...

est · 2026-05-27T07:21:23 1779866483

> The founding tenet of AI, “saving us” from all sorts of difficult things: climate, disease, poverty, conflict is falling, fast

No, those are just marketing slogans. The founding tenet of AI is to best match the next token according to a reward model.

est · 2026-05-26T02:40:29 1779763229

I am almost certain this layout is generated by AI, because I vibe coded the exact same newspaper-like style weeks ago.

oliviergg · 2026-05-26T05:50:37 1779774637

Yes! Building a news website with Claude design gave me the same design, background color, text size…

t3r · 2026-05-26T09:40:11 1779788411

Somehow, Claude seems to have developed a default nostalgic newspaper aesthetic despite being so young.

celltalk · 2026-05-26T04:54:33 1779771273

me too… this felt awkward: https://duobook.co/explore-stories

est · 2026-05-19T09:11:22 1779181882

Rows 5-7 of your DNS config are the culprit.

Don't point a wildcard domain at GitHub. A wildcard record there is dangerous.
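
As a rough illustration of the advice above, here is a small Python sketch that flags wildcard records pointing at a GitHub Pages host. The zone data is entirely made up (it is not the config under discussion), and the `Record` type and `risky_wildcards` helper are hypothetical names. The assumed concern is that with a wildcard in place, any subdomain you have not claimed yourself can end up serving someone else's Pages site.

```python
from typing import NamedTuple


class Record(NamedTuple):
    name: str   # e.g. "*.example.com"
    rtype: str  # e.g. "CNAME"
    value: str  # target host or address


def risky_wildcards(records, suffix="github.io"):
    """Return wildcard records whose target ends with the given suffix."""
    return [
        r for r in records
        if r.name.startswith("*.") and r.value.rstrip(".").endswith(suffix)
    ]


# Made-up zone for illustration only.
records = [
    Record("example.com", "A", "192.0.2.10"),
    Record("www.example.com", "CNAME", "someuser.github.io."),
    Record("*.example.com", "CNAME", "someuser.github.io."),
]

flagged = risky_wildcards(records)
for r in flagged:
    print(f"wildcard points at GitHub Pages: {r.name} -> {r.value}")
```

Listing each subdomain explicitly, as the `www` row does, avoids the catch-all behavior.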

rmeertens · 2026-05-19T09:23:01 1779182581

Yep! Fixed it already!

est · 2026-05-19T05:05:54 1779167154

clickbait.

rohansood15 · 2026-05-19T05:09:50 1779167390

Nope, HN changed the title.

https://imgur.com/a/UgJqWEh

est · 2026-05-15T13:44:16 1778852656

Why can't you use a web page instead?

ryandrake · 2026-05-15T14:35:49 1778855749

Came here to ask the same question. This could be a static HTML web page with a table.