scrollop · 2026-06-08T05:48:27 1780897707

https://artificialanalysis.ai/evaluations/omniscience

scrollop · 2026-06-08T05:47:49 1780897669

I'd recommend carefully looking at a few benchmarks (even though generally relying on benchmarks is problematic)

https://artificialanalysis.ai/evaluations/omniscience

Esp check the Hallucination rate for Deepseek - it's not good.

overfeed · 2026-06-08T07:06:48 1780902408

> Esp check the Hallucination rate for Deepseek - it's not good.

For strongly-typed coding tasks - and I imagine other tasks that have cheap validity checks - agentic harnesses and thinking tokens are an effective foil against hallucinations, at the expense of time. If a model hallucinates an API, compilation will fail and the error is fed back into the machine so it can try again, in a two-steps-forward-one-step-back dance that is unreasonably effective. Given the price delta, it is often more cost effective to let the weaker model spiral towards a solution with many "Oh, wait..." turns.
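A rough sketch of that feedback loop, for illustration only. The names `repair_loop`, `syntax_check` and `make_fake_model` are hypothetical, the "model" is a stub that returns canned attempts, and Python's built-in `compile()` stands in for a real compiler or type checker as the cheap validity check:

```python
from typing import Callable, Optional, Tuple


def repair_loop(
    generate: Callable[[str], str],
    check: Callable[[str], Optional[str]],
    task: str,
    max_turns: int = 5,
) -> Tuple[Optional[str], int]:
    """Ask for a candidate, validate it, and feed any error back into the next prompt."""
    prompt = task
    for turn in range(1, max_turns + 1):
        candidate = generate(prompt)
        error = check(candidate)
        if error is None:
            return candidate, turn  # passed the cheap check
        # The "Oh, wait..." turn: show the model its attempt and the checker's complaint.
        prompt = (
            f"{task}\n\nPrevious attempt:\n{candidate}\n\nChecker error:\n{error}"
        )
    return None, max_turns  # gave up without a valid candidate


def syntax_check(source: str) -> Optional[str]:
    """Cheap validity check: return None if the source compiles, else the error text."""
    try:
        compile(source, "<candidate>", "exec")
    except SyntaxError as exc:
        return f"{exc.msg} (line {exc.lineno})"
    return None


def make_fake_model() -> Callable[[str], str]:
    """Stub standing in for an LLM call: first attempt is broken, second is fixed."""
    attempts = iter(
        [
            "def add(a, b) return a + b",          # missing colon, fails to compile
            "def add(a, b):\n    return a + b\n",  # corrected attempt
        ]
    )

    def fake_model(prompt: str) -> str:
        return next(attempts)

    return fake_model


if __name__ == "__main__":
    code, turns = repair_loop(make_fake_model(), syntax_check, "Write add(a, b).")
    print(f"accepted after {turns} turn(s)")
```

The check only catches what the checker can see. A syntax or type check rejects an invented API, but it says nothing about whether the accepted code does the right thing.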

scrollop · 2026-06-07T17:17:09 1780852629

Imagine if cutting-edge AI companies could decide to use their world-best AI to

1) Develop software for Linux

2) Provide decent support

zombot · 2026-06-07T17:39:21 1780853961

If there were any truth to the marketing stories that say they have such a thing, then they could indeed.

gessha · 2026-06-08T01:54:50 1780883690

Still not enough to establish the year of Linux on the desktop.

kskdkwjdkwkkds · 2026-06-07T18:16:52 1780856212

It’s almost as if this whole productivity increase promised by AI is just marketing spiel, huh? Crazy.

scrollop · 2026-06-02T20:42:38 1780432958

Out of interest - do you trust Google to read all your emails? What do you think about privacy?

awkwardpotato · 2026-06-02T21:07:47 1780434467

95% of the people I interact with over email are on Gmail (or Outlook). Google/Microsoft still have those emails either way, even if I switch off.

glerk · 2026-06-02T23:02:34 1780441354

I used to care, but I don't anymore. They can read my emails, my code, track what websites I visit and what music I listen to, be my guest. I'd let them read my thoughts directly if we can build technology to do that lol. I realized that ultimately, these corporations are too stupid to do anything of value with all that data, so I don't feel threatened.

wyclif · 2026-06-03T03:45:46 1780458346

The danger isn't that they'll do anything of value; the danger is that they'll do something stupid with your data.

glerk · 2026-06-03T06:13:35 1780467215

You're probably right, I'm an idiot. I just think there's not much we can do about it, so might as well not take it too seriously. At least for the innocuous type of surveillance like reading my emails to learn how to sell me product. Things that you really want to keep for yourself shouldn't really touch the internet at all.

Meekro · 2026-06-02T23:14:57 1780442097

This doesn't strike me as "reading" your emails any more than a router is "reading" your packets when it forwards them. As far as I know, Google employees (even high-ranking ones) can't randomly start going through people's messages-- that's the privacy that matters.

conductr · 2026-06-03T07:12:25 1780470745

No but they can train a model to know everything about you and sell it.

They actually have precedent for that, as it's their legacy ad business.

I could absolutely see them getting more proactive with their ad business. Something like mortgage brokers wanting to know you executed an offer on a new home (a strong indication you will be shopping for a lender). Then that turns into your employer wanting to know you're talking to other employers. Then of course there are many more nefarious examples that people would consider more invasive, without even realizing the leak came from their email provider.

scrollop · 2026-06-02T20:42:03 1780432923

I find it odd how many tech-involved people here use Gmail - is privacy not a concern for them?

I moved to mailbox.org years ago. Pay a few pounds a year for private email with webtools and drive and don't have google snooping my emails and sending me targeted ads.

lukan · 2026-06-02T20:47:47 1780433267

Convenience. Also I don't really communicate private stuff over gmail, I have signal for that.

danielhep · 2026-06-02T20:49:40 1780433380

I did the same except switched to fastmail. I love it, it’s such a great service.

AgentME · 2026-06-02T21:58:21 1780437501

Gmail stopped using email contents for ad targeting in 2017.

scrollop · 2026-05-31T11:32:30 1780227150

Agreed - 5% fees is quite high considering the volumes of tokens involved.

scrollop · 2026-05-31T11:31:22 1780227082

Though you pay 5% fees? Not worth it for me with the volume of tokens used.

scrollop · 2026-05-26T17:59:42 1779818382

Rates have also increased, or we're now aware of higher rates, in long-distance runners.

adaml_623 · 2026-05-26T18:05:09 1779818709

I'd love to know the causation for that correlation

elric · 2026-05-26T18:43:00 1779820980

There was some discussion about this on HN recently. Supposedly something to do with less blood going to the bowels during prolonged exercise. Apparently the risk was largest in people who ran 5+ marathons.

saladdays · 2026-05-26T18:07:50 1779818870

Perhaps higher sugar consumption from fueling techniques?

nradov · 2026-05-26T18:48:02 1779821282

No one knows for sure. One hypothesis is chronic inflammation, perhaps linked to diet or mechanical stress.

array_key_first · 2026-05-27T16:14:34 1779898474

My understanding is that exercise lowers chronic inflammation. Basically, you trade off acute inflammation during the exercise itself for less inflammation when you're not exercising. But, maybe long distance running is too long or something.

scrollop · 2026-05-21T16:09:13 1779379753

You should try hosting it yourself in docker. Absurdly easy to do if you get an llm to do it and it works very, very well.

Hope they don't change the self-hosting option.

BrandoElFollito · 2026-05-21T16:54:43 1779382483

You mean it is absurdly easy to fire off the Docker container.

Because you need to back up, verify backups, monitor availability, manage updates, manage MFA, and a zillion things.

Don't get me wrong, I have worked in hardcore, high-tech IT for 30 years and I self-host two dozen or so services. It is far, very far from "absurdly easy" when you start.

Sure, you can run a container on your PC and hope for the best.
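To make the "verify backups" item above concrete, here is a minimal sketch, assuming the data to protect is a single file. The helper names `sha256_of` and `backup_and_verify` are made up for illustration and are not part of any particular self-hosting tool:

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Hash a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def backup_and_verify(source: Path, backup_dir: Path) -> Path:
    """Copy source into backup_dir and fail loudly if the copy differs."""
    backup_dir.mkdir(parents=True, exist_ok=True)
    target = backup_dir / source.name
    shutil.copy2(source, target)  # copies contents and file metadata
    if sha256_of(source) != sha256_of(target):
        raise RuntimeError(f"backup of {source} does not match the original")
    return target
```

A matching checksum only shows the copy is faithful at that moment. It does not show that a restore works, and copying a database file while the service is writing to it may still capture an inconsistent state, which is part of why the list above is longer than "run the container".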

Esophagus4 · 2026-05-21T17:55:06 1779386106

Exactly.

I’ve seen this idea so many times on HN. “Just stand up a docker container and self-host”. Or even worse: “why does anyone need GitHub - just host Bitbucket yourself”

Ok, then what?

mvdtnz · 2026-05-21T19:07:57 1779390477

This seems crazy to me. I have a home server and host lots of my own stuff. But a password manager is tier-0, it cannot fail me.

I need to access my accounts while I'm overseas - in fact I'm prompted for passwords far more often when I cross borders. I need my passwords at urgent moments like when I need to make a large bank transfer. I need passwords unexpectedly at all times when sessions expire or I need a new session for a device I've never logged in with.

If my home server went down for any reason at these critical moments it could be extremely bad. There are some kinds of outages I can't recover from without physically attending my server. And if I'm not very very careful there are some kinds of failures I cannot recover from at all - I have a working backup solution but so did every company that lost customer data before.

And this doesn't even touch on the security risk of hosting a database of credentials on a publicly available endpoint.

I need a trusted hosted solution.

arikrahman · 2026-05-21T17:05:20 1779383120

You can get rid of the element of hope by using KeepassXC and syncthing. Bonus is you can use this FOSS stack completely offline.

omnimus · 2026-05-21T17:22:00 1779384120

And not be able to use it on your phone or share it with people you work with.

Vaultwarden is the way. Easy to host in Docker. Solid. And if Bitwarden blocks the clients there will be a fork.

It's leading to it anyway.

rirze · 2026-05-21T17:38:03 1779385083

I really hope the community gets together and creates a better browser extension. Vaultwarden + that would be perfect.

xstr305 · 2026-05-22T10:20:53 1779445253

Syncthing works on Android just fine, though I'm not familiar with iOS. There are also several KeePass-compatible clients, and some support sync via cloud storage. You don't need to host anything. But I admit, for corporate shared secrets storage it is not the right tool.

ndsipa_pomu · 2026-05-22T23:17:51 1779491871

I self-host Vaultwarden and it's great, but I'm not so sure that we can rely on trustworthy forks of the phone app and browser extensions.

arikrahman · 2026-05-22T01:59:31 1779415171

KeepassDX works great on my phone. I use LocalSend to move around keyfiles fully offline as well.

yinksta · 2026-05-21T18:29:40 1779388180

You can use it on your phone. What are you talking about?

arikrahman · 2026-05-22T02:10:21 1779415821

That's what I'm saying, a lot of people are coping with a product they admit will need a fork.

Not only is it incurring the cost of project fragmentation, but also incurring an always online cost with overly-complicated docker solutions, when a fully offline and airgapped solution already exists.

Furthermore, staying with the same ecosystem invokes the sunk cost fallacy. But the migration from Bitwarden couldn't be simpler (just export the Bitwarden JSON file). It's almost a form of battered woman syndrome people are inflicting on themselves when quite simply they can hop onto an already proven ecosystem that doesn't bait and switch.

omnimus · 2026-05-22T16:06:24 1779465984

I was on keepass before bitwarden. Bitwarden just solves more things for me. I am sure the keepass ecosystem improved a lot over the years but fundamentally i find vaultwarden docker to be far easier. Especially for my work and family members that i convinced to use bitwarden. If they were also in charge of the sync it wouldn't be possible.

Afaik vaultwarden and bitwarden clients are as proven as keepass.

arikrahman · 2026-05-23T08:39:01 1779525541

Proven to bait and switch, as it turns out, very much unlike KeePass.

horsawlarway · 2026-05-21T16:29:25 1779380965

If you're going to the trouble of self-hosting, I'd suggest just running vaultwarden.

https://github.com/dani-garcia/vaultwarden

It's entirely compatible with the clients. It also removes a lot of "rug-pull" potential, and gives you the ability to access all the nice features (ex - multi-org, multi-user, shared vaults, totp, etc...)

Honestly - part of the reason I like Bitwarden is that if they ever go full "enshittification", it's going to be relatively easy and straight-forward to just move entirely off their projects and onto open-source forks.

prism56 · 2026-05-21T18:08:58 1779386938

Can't tell if this is satire. But I'm not self-hosting my passwords unless I fully understand exactly what's happening. Trusting that to an LLM without really understanding what's happening seems very risky to me.

scrollop · 2026-05-21T13:32:04 1779370324

Since it appears that LLMs can't achieve AGI or shed hallucinations, I presume a new company will appear with a new architecture that can - what happens to the current behemoths and their stock prices? Will they jump architectures?

Splendidly interesting times.