Cool stuff. I think there have been projects recently that use LLMs to encode messages in plain text by manipulating the choices of output tokens. Someone with the same version of the LLM can decode. Not sure where to find these projects though.
I created something similar a long, long time ago, but much simpler, using Markov chains: basically just encoding data via the choice of the next word tuple given the current word tuple. It generated mostly gibberish, but it was fun 25 years ago.
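A minimal sketch of that Markov-chain trick (my own reconstruction of the idea, not the original code — the tiny corpus and bit-per-branch scheme are assumptions for illustration): wherever the chain offers more than one next word, the choice between the first two candidates carries one bit of the secret.

```python
# Toy Markov-chain steganography: data lives in the *choice* of next
# word at branch points. Anyone with the same chain can decode.

CORPUS = ("the cat sat on the mat the dog sat on the rug "
          "the cat ran to the dog the dog ran to the mat").split()

# Bigram model: word -> sorted list of possible next words.
chain = {}
for a, b in zip(CORPUS, CORPUS[1:]):
    chain.setdefault(a, set()).add(b)
chain = {w: sorted(s) for w, s in chain.items()}

def encode(bits, start="the"):
    out, i, word = [start], 0, start
    while i < len(bits):
        options = chain.get(word)
        if not options:
            raise ValueError("chain dead-ended before all bits were encoded")
        if len(options) >= 2:
            word = options[bits[i]]  # bit selects among the first two successors
            i += 1
        else:
            word = options[0]        # forced move, carries no data
        out.append(word)
    return " ".join(out)

def decode(text, nbits):
    words, bits = text.split(), []
    for a, b in zip(words, words[1:]):
        if len(chain[a]) >= 2:       # only branch points carry bits
            bits.append(chain[a].index(b))
        if len(bits) == nbits:
            break
    return bits
```

With this corpus, `encode([1, 0, 1, 1, 0])` yields a grammatical-ish but mostly meaningless sentence, and `decode` recovers the bits exactly — which matches the "gibberish, but fun" experience described above.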
This is a really interesting space, and one that I've been playing with since the first GPTs landed. But it's even cooler than simply using completion choice to encode data. It has been mathematically proven that you can use LLMs to do stego that cannot be detected[0]. I'm more than positive that comments on social media are being used to build stego dead drops.
What I find really interesting about this approach is that it's one of the less obvious ways LLMs might be used by the general public to defend themselves against LLM capabilities used by bad actors — here, semantic search. (Compare the more obvious case: LLMs making it easier to find bugs is good for blackhats, but maybe better for whitehats.)
The reasoning in my head is that it creates a statistical firewall: eavesdroppers with privileged access can't use cheap statistical methods to detect that a hidden message even exists (undetectability to such methods is effectively what crypto _is_, ipso facto this is effectively undetectable crypto).
ETA, the abstract for a paper I've been working on related to this:
Mass surveillance systems have systematically eroded the practical security of private communication by eliminating channel entropy through universal collection and collapsing linguistic entropy through semantic indexing. We propose a protocol that reclaims these lost "bits of security" by using steganographic text generation as a transport layer for encrypted communication. Building on provably secure generative linguistic steganography (ADG), we introduce conversation context as implicit key material, per-message state ratcheting, and automated heartbeat exchanges to create a system where the security properties strengthen over time and legitimate users enjoy constant-cost communication while adversaries face costs that scale with the entire volume of global public text. We further describe how state-derived proofs can establish a novel form of Web of Trust where relationship depth is cryptographically verifiable. The result is a communication architecture that is structurally resistant to mass surveillance rather than merely computationally resistant.
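A toy illustration of why generative stego can be provably undetectable (my sketch of the core intuition, not the ADG algorithm from the paper — the function names and power-of-two grouping are assumptions): when the model rates k candidate tokens equally likely, letting the next log2(k) secret bits pick among them is identical to sampling the model honestly, so the output distribution is unchanged and no statistical test on the text alone can reveal the embedding.

```python
import math

def embed_step(candidates, bits):
    """candidates: tokens the model rates equally likely (len = power of 2).
    Consumes log2(len) secret bits; returns (chosen_token, remaining_bits).
    Because the choice is uniform over candidates, it is statistically
    indistinguishable from honest sampling."""
    n = int(math.log2(len(candidates)))
    index = int("".join(map(str, bits[:n])), 2)
    return candidates[index], bits[n:]

def extract_step(candidates, token):
    """Given the same model state (same candidate list), the receiver
    recovers the bits from which token was chosen."""
    n = int(math.log2(len(candidates)))
    return [int(b) for b in format(candidates.index(token), f"0{n}b")]
```

Real schemes have to handle non-uniform distributions (ADG does this by adaptively grouping tokens into near-equal-probability bins), but the uniform case shows where the "provably secure" claim comes from: the stegotext is a genuine sample from the model.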
If Claude Code is written by Claude Code, and AI outputs are not currently considered copyrightable, then how is Anthropic asserting copyright over the leak?
Not saying this gets through to people, but copyright is purely about the legal ability to restrict what other people do. Whereas property rights are about not allowing others to restrict what you do (e.g. by taking your stuff).
Interesting. I don't quite agree. It's one thing to predict what general topics will be hot and popular this year. But that's not the same as what particular research problem will be important and have lasting influence.
There are a few kinds of important research. One is solving a well-defined, well-known problem everyone wants to solve but nobody knows how. Another is proposing a new problem, or a new formulation of it, that people didn't realize was important.
There is also highly-cited research that isn't necessarily important, such as being the next paper to slightly lower a benchmark through some tweaks (you get cited by all the subsequent papers that slightly lower the benchmark even further).
I agree that (while the ethics of this are a different issue) the copyright question is not obviously clear-cut. Though IANAL.
As the LGPL says:
> A "work based on the Library" means either the Library or any derivative work under copyright law: that is to say, a work containing the Library or a portion of it, either verbatim or with modifications and/or translated straightforwardly into another language. (Hereinafter, translation is included without limitation in the term "modification".)
Is v7.0.0 a [derivative work](https://en.wikipedia.org/wiki/Derivative_work)? It seems to depend on the details of the source code (implementing the same API is not copyright infringement).
This is not how computer science publishing works, however. Post it on arxiv, submit to a conference, get 3 peer reviews, accepted, “published”. 99% of papers are effectively open access for free.
I thought the point of passkey security is that you don't have to send the private key around, it can stay on your device. Different passkey per device. Lose or destroy a device, delete that passkey and move on.
None of the password managers (including but not limited to the ones built into iOS/Android) work that way. The Apple one (and I think Google's is the same) keeps the private key inside the secure enclave (security processor), but it is still copied to each new device, though it is end-to-end encrypted during that transmission.
The issue there is that there's a big usability headache with enrolling multiple devices. You really want one device to be able to enroll all your devices (including not-present and offline ones), but there's no mechanism for this in the way the WebAuthn spec works at the moment.
That’s how I use them: passkeys on two Yubikeys. And I tag in my password manager which credentials have which form of auth: UP, TOTP (also stored on the two Yubikeys), WebAuthn, or passkeys (the former indicating 2FA).
https://xkcd.com/2754/