Hacker Timesnew | past | comments | ask | show | jobs | submit | kristjansson's commentslogin

I think MTP Gemma4 support is still WIP https://github.com/ggml-org/llama.cpp/pull/23398 ?

This has been my impression.

The underlying LiteRT-LM framework used in the edge gallery does support the MTP drafters for the smaller models, but according to:

https://developers.google.com/edge/litert-lm/models/gemma-4

> Note: LiteRT-LM supports E2B and E4B models today, with support for larger models coming soon.

So even Google aren't shipping MTP support for the 26B and 31B models yet.


> quantization

12b means 12G @ 8 bits/param (basically lossless) and 6G at 4 b/p (generally accepted 'pretty close' level). Not too bad?

But TBD how well the base model performs before thinking too much about quantization


Smaller models are less forgiving to quantization. For a 12B model I wouldn't expect Q4 to be "pretty close", unless it underwent quantization aware training (QAT). Of course it's not set in stone, there's a huge variance between models, so this might surprise.

yes yes the solution to some debatably valid social science research is to burn the entire epistemological method and practice to the ground. We'll surely find something better to replace it in its absence.

> 137B-A5B

Yeah, not a 5B param model as the earlier title implied!


GOOG was a $20B company against a $10T S&P 500. SpaceX is going to be a $2T company against a $60T S&P 500.

Sure, it'll only be ~2% of the index if it opens where they want to. But in the downside case where it meanders long enough for significant amounts of its stock to make it in to public hands and then goes to 10x revenue (i.e. down 90%) , you've allowed a company to engineer dramatic changes in index rules resulting in a transfer ~1% of S&P 500 market cap from index funder holders to its bagholders^W privileged insiders^W^W investors.

Yes a -1% day should be nothing to a long term holder. Yes they're buying the market; if the market is wrong they shouldn't really have any recourse. But one can also understand that a -1% day that accrues ~entirely to the benefit a small group, who appear to have engineered that outcome has much more emotional valence than a typical down down. It doesn't feel like a bad day on the market, it feels like a heist.


Noting for reference that Gemma4 MTP work is in progress[0] on llama.cpp; similar work for Qwen3.6 landed recently and has been great thus far.

[0]: https://github.com/ggml-org/llama.cpp/pull/23398


I mean catch and release is better than being dinner, and the prevalence of fly fishing as a hobby goes a long way toward garnering public and financial support for restoration activities.

Internship etc. proposals miss that status quo job seeking is a parallel process for applicants. An internship model means each candidate can only “consider” one employer at a time. I think this is what Steve’s campfire proposal is trying to counter by making samples/internship outputs public?

I think the real solution is something like the Bar Exam. Apply the hard filter once, publicly, administered by a trusted neutral party. Anyone that gets through is assumed to be good enough, and interviews can assess only firm/team specifics


This is what Triplebyte tried to do, but unfortunately they failed in the market. Maybe they were ahead of their time and a similar concept would work now.

Yeah I wrote “by a third party” then thought of them and revised to “trusted neutral party”. Has to be someone with ~no financial interest in the candidates success

The real solution is stop proposing solutions like a bar exam. Software engineering evolves too quickly for anything but a work sample to work.

The Bar is also a not-great signal of actual ability. It is a great signal for 1) base knowledge and reasoning capacity and 2) sticktoitiveness / willingness to meet ~arbitrary expectations to work in a given industry ... which is all these 'technical' interviews are really measuring. By revealed preferences that's what management wants at those companies, so better to save people time and stress and make it a one time thing?

Per TFA Steve got hired by Amazon and then kicked off the acquisition of his former company.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: