Hacker Times | new | past | comments | ask | show | jobs | submit | robotswantdata's comments | login

Why are people still using Ollama? Seriously.

Lemonade or even llama.cpp are much better optimised and arguably just as easy to use.


`ollama serve` and `ollama run`

The devex is great and familiar to folks who have used Docker. Reading through the Lemonade documentation, it seems like a natural migration, but we're talking about two steps for getting started versus just one. So I'd need a reason to make that much change when I'm happy enough with Ollama.


Why not? Also serious.

It seems to just work every time I try to use it, the API is easy to work with, the model library is convenient. I've never hit any kind of snag that makes me look elsewhere.


Serious answer: I don't use it that much, it's what I happened to download like 1.5 years ago, and it works fine. Happy to see what may be a speed boost, and have little interest in switching to something else (unless my situation changes, of course).

I like Ollama, mostly because the CLI is pretty nice. Its desktop app makes some stupid choices, though: if a model supports tools, the UI should give me the "search" option, but it only shows up for cloud models.

I ran LM Studio for a while, but I don't really use local models much other than to mess about.


You can also use OpenWebUI locally which should give you a nice friendly UX once you set it up.

Don’t really get the purpose of this apart from throwaway projects.

For vibe coders, is it really “hours” setting up a database these days? GCP Cloud SQL + Drizzle ORM is minutes and actually scales, unlike a spreadsheet. Heck, Claude can even write you a deployment script and run it via the gcloud CLI.


Cloud SQL costs gazillions, sheet is free (other than selling your data)

>sheet is free (other than selling your data)

Except the sheets-to-api SaaS charges $9/month if you want more than 250 requests.


Cloud SQL's lowest tier is pennies a day, and this Ninja platform is also not free.

A spreadsheet is a misclick away from corruption, why not spend another prompt on getting Claude to configure a db?


Which works out to $100 USD / year. You might think that's trivial, but when you start provisioning multiple environments across multiple projects, it adds up.

It's a shame that Google hasn't managed to come up with a scale-to-zero option or a compatible serverless alternative.


Sheet Ninja is 108 USD / year and has tiny capacities on every metric. SQLite is free and would stomp this in every aspect on low-budget hosting. Even a tiny API that stores CSV would be orders of magnitude more efficient.

But what would scare me the most is that Google can easily shut this thing down.
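For scale: a zero-cost SQLite "backend" is a few lines of standard-library Python. (The `signups` table here is invented purely for illustration.)

```python
import sqlite3

# Zero-dependency local database; swap ":memory:" for a file path to persist.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE signups (id INTEGER PRIMARY KEY, email TEXT NOT NULL)")
conn.executemany(
    "INSERT INTO signups (email) VALUES (?)",
    [("alice@example.com",), ("bob@example.com",)],
)
conn.commit()

count = conn.execute("SELECT COUNT(*) FROM signups").fetchone()[0]
print(count)  # 2
```

No request caps, no monthly fee, and no third party that can revoke your access.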


It is trivial to set up a database on GCP if you know what you are doing, and I would pay Google for that stability and for support in setting up multi-tenancy and regions.

Using Google spreadsheets as a backend will just cause them to charge everyone later.

Sheet Ninja isn't free. Even on their side, "free" does not mean what you think it means.


Set up a DB project and use the same Cloud SQL instance for all your DBs. I did that for years on non-prod and experimental projects. $100 is a bargain for what you get in terms of resiliency.

> Cloud sql lowest tier is pennies a day

Unless things have improved, it's also hideously slow: trivial queries on a small table take tens of milliseconds. Though I guess that if the alternative is Google Sheets, that's not really a concern.


You can fire up a burstable postgres for about $20/mo

Most are lucky to get a few sign ups.

> Cloud SQL costs gazillions,

WTF is "Cloud SQL"?

I have a postgresql server running on a $5/m VPS that I add DBs to as and when I explore some new idea.



SQLite is enough for 98% of all of these use cases, and 100% of the ones this would appeal to

Don't do this, guy; Cloud SQL costs a lot.

Costs a lot? It’s a bargain for globally resilient infrastructure.

db-f1-micro is about $10/month including storage for something that just works and can scale, be shifted on-prem, etc. You can run all your vibe-coded slop on one instance.


I think it can be useful if you want to use an existing Google Sheet, or if your users want to modify the database directly in Google Sheets, even though it seems pretty risky.

DGX workstations: expensive, but they allow PCIe cards as well.

https://marketplace.nvidia.com/en-us/enterprise/personal-ai-...


It's hilarious that not a single one of these has pricing listed anywhere public.

I don't think they expect anyone to actually buy these.

Most companies looking to buy these for developers would ideally have multiple people share one machine, and that sort of arrangement works much more naturally with a managed cloud machine than with the tower format presented here.

Confirming my hypothesis, this category of device is more or less absent from the used market. The only DGX workstation on eBay has a GPU from 2017, several generations ago.


Nvidia doesn’t list prices because they don’t sell the machines themselves. If you click through each of those links, the prices are listed on the distributor’s website. For example the Dell Pro Max with GB10 is $4,194.34 and you can even click “Add to Cart.”

I don't mean the small GB10s.

If you try to find the pricing of the GB300 towers even on the manufacturer sites, you'll see that it's not listed for any of the six or so models.


Because that's a different price point, getting near $100K, and availability is very limited. I don't think they're even selling it openly, just to a bunch of partners...

The MSI workstation is the one with some pricing floating around. Some distributors are quoting USD 96K with a wait time of 4 to 6 weeks [0]; others say 90K and also out of stock [1].

--

  0: https://www.cdw.com/product/msi-nvidia-gb300-wkstn-72c-grace-cpu/9087313?pfm=srh
  1: https://www.centralcomputer.com/msi-ct60-s8060-nvidia-dgx-station-cpu-memory-up-to-496gb-lpddr5x-nvidia-blackwell-ultra-gpu-1x-10-gbe-2x-400-gbe.html

> I don't think they're even selling it openly, just to a bunch of partners...

Yes, that's my point.


Isn't that because nobody has released one yet? They are brand new.

I don't think it's so odd; very few products above ~$50k have final prices listed for anyone to buy with one click.

Workstations above $50k are not that uncommon.

Older Xeon-based workstations easily reach that number.


If you put a 50 or 80K workstation in the HP store, it will say:

"Purchasing limit reached. To complete your order and provide you with the best customer experience, please call 1-877-888-8235"


'Important' people in organizations get them. They either ask for them, or the team that manages the shared GPU resources gets tired of their shit and they just give them one.

Yes, I agree this is the use case.

Since the user here is not paying for it directly, the manufacturer does not have any incentive to list prices anywhere.


There were plenty of them around when I worked at Nvidia. They definitely exist.

You have seen plenty of third party GB300 DGX workstations?

How much do those workstations cost? All of the different manufacturers links on that page lack pricing info and you have to contact them for pricing.

Cheapest I know of is around $96k.

$4000

$4k is for GB10 (DGX Spark reference design). $90-100k is for GB300 (DGX Station reference design).

Ignore the expected negativity; many here have not used the latest generation of voice agents in development. Even if it's used just as a router, I'd prefer that to waiting to get through.

I was agreeing with all the nay-saying comments, but yours made me see the idea as good. I guess the word "luxury" ruined it for OP.

But a speech-to-text and text-to-speech system that I know is "understanding" me would be great, rather than hold music. The shop could even sell it as "As a small shop, most of our employees are busy fixing cars, so we are using AI to help with calls" (although then people who are anxious about AI stealing jobs might hang up). The robot can ask me what I need, and then say "So for [this service], the price would be..." (to tell the caller what it has understood).

If the AI can even look at gaps in the shop's schedule and set an appointment time, the customer might even be happy that they just spent a minute on the phone instead of 10+...
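The scheduling part, at least, is ordinary interval arithmetic rather than anything AI-specific. A sketch under that assumption, where `find_free_slots` and the example times are hypothetical, not any vendor's API:

```python
from datetime import datetime, timedelta

def find_free_slots(booked, day_start, day_end, job_length):
    """Return start times where a job of job_length fits between bookings."""
    slots = []
    cursor = day_start
    for start, end in sorted(booked):
        if start - cursor >= job_length:
            slots.append(cursor)
        cursor = max(cursor, end)  # tolerate overlapping bookings
    if day_end - cursor >= job_length:
        slots.append(cursor)
    return slots

day = datetime(2025, 1, 6)
booked = [
    (day.replace(hour=9), day.replace(hour=11)),
    (day.replace(hour=13), day.replace(hour=15)),
]
free = find_free_slots(booked, day.replace(hour=8), day.replace(hour=17),
                       timedelta(hours=2))
print([t.hour for t in free])  # [11, 15]
```

The hard part the agent adds is turning "my brakes are squealing, can you fit me in Thursday?" into those arguments.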


I would rather just be sent to a regular old answering machine. Dealing with an AI is dehumanizing. In almost every case where I actually need to call a place, it's because I need to talk to them about something an automated system, like one for booking appointments, can't handle.

Congrats..?

A friend of mine worked for a call center that did car rentals; old people would call them and ask to rent a car.

Maybe the AI system should have "Press 1 to talk to AI, press 2 to leave a message" so experts like you can press 2.


I know it's intended to be dismissive, but I would appreciate the choice.

Even if the new model that came out last week totally fixed all the problems, this time for real, most people's experience with chatbots is that they are prone to misunderstanding or making false statements ("hallucinations").

I have yet to experience any degree of confidence in any output from an LLM, so I'd rather leave the message. I don't know how common this point of view is.


A brutal market for lemons: the last 100 times they heard a robovoice on the phone they had a terrible experience, and any money you spend fixing this is wasted, because the customer can't tell your robovoice is actually honest and capable of making commitments. They all sound perfectly confident and correct, even the ones that know nothing and will promise anything.

Sounds like the typical dealer experience, minus the AI.

The current deployments of chatbots are not the bar to compare with. There’s an incoming wave of extremely capable agents and process reimagining that is going to be highly disruptive.

Been in this space over a decade and this time really is different. It’s hard for humans to perceive the exponential, it will be slow then sudden.


Let's go back to waterfall even harder and write the super-correct and detailed design doc.

You jest, but this is precisely what we have done. Our customers have outright rejected Scrum. It is considered a waste of money.

At a recent AI workshop, management made clear that they see AI as rendering sprints and scrums obsolete, that Kanban makes a lot more sense, and that estimating effort/story points is also becoming meaningless. Which is a strong silver lining, if you ask me.

I want to understand how AI leads to this outcome.

I think it's to do with the bottleneck shifting away from code generation and towards specifying and reviewing and integrating code. The process of working with AI agents to produce specs, tech specs, code, and reviews lends itself more to a flow-based structure (like kanban).

Bear in mind this is a B2B enterprise company with a mix of legacy and greenfield. And management has invested heavily into designing a robust spec/context-based workflow for using agents. Might be different elsewhere.

Personally I don't think scrums, planning, retros etc were better than kanban even before AI, at least if you have switched-on, motivated and smart people on your team. They actually made things less agile, and story-points give a false sense of predictability. Imo the crucial factor may be that AI agents are smart and switched-on (with the right context).


It's a good excuse to move away from a shitty process; I'll take it! Fuck Scrum, fuck Agile. No one was doing it anyway. I had to quit an Agile job because I was shipping shit without ever getting a lick of feedback, and this was not some low-stakes webdev work; it was for planning expensive real-world installations.

The next AI I’m working on is going to be amazing and will change the world. Please back my Series A; you won’t regret it.

(Let’s not talk about my blockchain startup and my VR startup and my NFT startup). My house is nice though.


Are you sure about that one?

What exactly will these agents be able to do with enough consistency, accuracy, and reliability that people will want to hire them over humans?

In my experience with even the most basic implementation of agents, i.e. customer service chat bots, I literally cannot stand interacting with them even once. They are extremely unhelpful and I will hang up or immediately ask to speak to a human.


Obviously your support chatbot will talk to your flavor of clawd, which will call Claude Code, which will code a solution, which will be reviewed by Codex, which will merge and release it and then ping clawd to send an email to the user announcing that their issue has been fixed. /s, just in case.

I’ve been involved in building a system that reads structured data from a special form of contract used in a specific industry: prices, clauses, pick-up, delivery, etc. A couple hundred datapoints per contract.

We had many discussions around how to present and sell an imperfect system. The thing is, the potential customers today transcribe the contracts manually, and we quickly realized that people make a ton of mistakes doing that. It became obvious when we were working on assertion datasets ourselves.

It’s not a perfect system, and you have to consider how you use the data (aggregating for price indexing, for instance), but we’re actually doing better than what people achieve when they have to transcribe data for hours a day.
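One reason aggregate uses like price indexing tolerate imperfect extraction: robust statistics such as the median largely ignore occasional bad datapoints, while the mean does not. A toy illustration with invented numbers, not the actual contract data:

```python
from statistics import mean, median

# Suppose the true per-contract price is ~100, but two extraction
# errors (a dropped digit, an extra zero) slip into the dataset.
extracted = [100, 101, 99, 100, 1000, 98, 102, 100, 10, 101]

print(round(mean(extracted), 1))  # 181.1 -- badly skewed by two outliers
print(median(extracted))          # 100.0 -- unaffected
```

The same logic applies whether the errors come from an LLM or from a tired human transcriber.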

The voice agents in development right now feel 100x better than the chatbots companies currently have deployed.

I had the same opinion until a few months ago; now I would prefer the [redacted company so as to not give free marketing] AI agent. You’ll start seeing this wave in around 3-6 months, as most are in trials.


Just sounds like gassing because you are invested in it yourself.

Most support agents lack... well, agency. If you connect a chatbot to an FAQ, that's exactly what you get. Just another instance of enterprise software being badly designed, badly written etc. It doesn't mean that it's actually an impossible problem.

They won't ever give agents the ability to actually do things for customers that can impact the company in some kind of negative fashion. At least not willingly.

That's sort of the whole point of talking to customer service though. Getting something done that you want that involves them having to do work for you. AKA you taking value from the company.

So yeah they're basically always going to be useless garbage if put together according to business requirements.

Other services should just be automated already.


They'll do the same thing we do in software development - proper sandboxing, context curation, reviews on high impact actions. I presume real customer service is really expensive, as I've seen many companies prefer to just quickly refund, or drop you as a customer entirely, rather than fix your problem. It can't get much worse than that.

It's always different this time. It always will only take a couple more months or years. And then people move on to the next hype topic.

> It’s hard for humans to perceive the exponential, it will be slow then sudden.

True, but also there are perception biases that lead us to believe progress is exponential, even though it might as well be an S-curve.

I'm having a hard time finding the right terms, but I'm sure there is some bias to think that "the line goes up".


Well ain't this a chronological oddity. Always 6 months away!

I don't want Codex dammit! I'm a Claude Code man.


Wasn’t the point of openclaw to YOLO your credentials to the internet?

Only ever a creative prompt injection away from a leak.

Saw some smarter people using credential proxies, but no one acknowledges the very real risk that their “claws” will commit cybercrime on their behalf once breached.


Are we sure Claude Scale™ won’t appear next month? A specialist agent that turns your vibe coded mess into a production grade scaled solution on their infrastructure.

Expect Anthropic to want to capture more of the supply chain over time.


If they could they would, and if they can they will. Maybe it will appear next month, maybe five years from now; we don't know which is more likely. But I think that if agents could actually produce good, reliable software that can evolve over time, there's little they couldn't do, even beyond software. So it won't be (just) software developers being replaced, but also software users.

Yeah, which is why the solution has to be legislative. These companies are trying to take over the entire industry, and even if they won’t have as good a solution as someone who focuses on only one thing, they have the capital, distribution, and name recognition to kill any upstarts.

12-channel DDR5-5600 ECC is around 500 GB/s, which in the real world works very well for large MoE models.

You mean 500 GB/s, not Gb/s (actually 537 GB/s).
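That 537 GB/s figure is just the theoretical peak: channels × transfer rate × 8 bytes per 64-bit DDR5 channel. A quick sanity check:

```python
channels = 12
transfers_per_s = 5600e6   # DDR5-5600: 5600 MT/s
bytes_per_transfer = 8     # each channel is 64 bits wide

peak_gb_s = channels * transfers_per_s * bytes_per_transfer / 1e9
print(peak_gb_s)  # 537.6
```

Real-world sustained bandwidth will land below this peak, but it is the right ballpark for estimating MoE token throughput.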

Unfortunately that does not matter. Even on a cheap desktop motherboard, the system memory bandwidth is higher than that of 16-lane PCIe 5.0.

Therefore the memory bandwidth available to a discrete GPU is determined by its PCIe slot, not by the system memory.

If you install multiple GPUs, in many MBs that will halve the bandwidth of the PCIe slots, for an even lower memory throughput.


> in many MBs that will halve the bandwidth of the PCIe slots

Not on boards that have 12 channels of DDR5.

But yeah, squeezing an LLM from RAM through the PCIe bus is silly. I would expect it to be faster to just run a portion of the model on the CPU, llama.cpp-fashion.


It is much faster, yeah. llama.cpp supports swapping between system memory and GPU, but it’s recommended that you don’t use that feature, because it’s rarely the right call versus using the CPU to do inference on the model parts held in system memory.

Edit: the setting is "GGML_CUDA_ENABLE_UNIFIED_MEMORY=1"... useful if you have unified memory, very slow if you do not.


llama.cpp, AFAIK, does not run a portion of the model on the CPU. --cpu-moe just offloads weights to RAM, but they are still loaded onto the GPU for compute.

Talking about a dual-socket SP5 EPYC with 24 DIMM slots and 128 PCIe 5.0 lanes.

It’s fast for hybrid inference, if you get the KV and MoE layers tuned between the Blackwell card(s) and offloading.

We have a prototype unit and it’s very fast with large MoEs


Where’s the opt out ?

Hacker News is very upfront that they do not really care about deletion requests or anything of that sort, so the opt-out is to not use Hacker News.

Time to sue them to oblivion :D.

By posting comments on this site, you are relinquishing your right to that content. It belongs to YC and it is theirs to enforce, not yours. https://www.ycombinator.com/legal/

There is no such thing at https://qht.co/ when you create your account.

Max Schrems would like a word

Is this legal advice?

Do you want it to be? I think it's safe to assume that most comments are _not_ legal advice.

Create a new account every so often, don’t leave any identifying information, occasionally switch up the way you spell words (British/US English), and alternate using different slang words and shorthand.

And do what I do - paste everything into ChatGPT and have it rephrase it. Not because I need help writing, but because I’d rather not have my writing style used against me.

I can't stand this and will actively discriminate against comments I notice in that voice. Even this one has "Not because [..], but because [..]"

I get your sentiment, though I think it's likely that people, on average, are going to organically start writing more and more like LLMs.

It's already begun.

The good rephrasing will not include that voice.

This just gives OpenAI that data.

Perhaps you could use a local translation model to rephrase (such as TranslateGemma). If translating English to English doesn't achieve this effect then use an intermediate language, one the model is good at to not mangle meaning too much.


I run Qwen 3 locally, but I mention OpenAI on HN so people understand what I’m referring to.

Do the following: sample content from users on this page, https://qht.co/leaders, and ask the LLM to rephrase it in their voice.

I'm actually working on a browser extension to do just this, using adversarial stylometry techniques.

Look up "adversarial stylometry"
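For a sense of what adversarial stylometry is working against: attribution classifiers often lean on distributions of spelling variants and filler words, exactly the features the grandparent suggests varying. A toy fingerprint extractor (the marker set is invented for illustration; real systems use hundreds of features):

```python
import re
from collections import Counter

# Hypothetical markers: British/US spelling variants and slang.
MARKERS = {"colour", "color", "favourite", "favorite", "whilst", "while", "gonna"}

def style_fingerprint(text):
    """Count occurrences of a few spelling/slang markers in lowercased text."""
    words = re.findall(r"[a-z']+", text.lower())
    return dict(Counter(w for w in words if w in MARKERS))

print(style_fingerprint("My favourite colour, whilst I was gonna say grey."))
# {'favourite': 1, 'colour': 1, 'whilst': 1, 'gonna': 1}
```

Consistently alternating these markers, as suggested above, is precisely what degrades such a classifier.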

Funnily enough, if everyone did this (or at least made a new account often), it would be more destructive to what HN (purposefully) wants to do than deleting the occasional account's data.

The back button

Then one day forgetting to close the door of the crate…

But the dog is so used to the crate…
