Smaller models are less forgiving to quantization. For a 12B model I wouldn't expect Q4 to be "pretty close", unless it underwent quantization aware training (QAT). Of course it's not set in stone, there's a huge variance between models, so this might surprise.
yes yes the solution to some debatably valid social science research is to burn the entire epistemological method and practice to the ground. We'll surely find something better to replace it in its absence.
Sure, it'll only be ~2% of the index if it opens where they want to. But in the downside case where it meanders long enough for significant amounts of its stock to make it in to public hands and then goes to 10x revenue (i.e. down 90%) , you've allowed a company to engineer dramatic changes in index rules resulting in a transfer ~1% of S&P 500 market cap from index funder holders to its bagholders^W privileged insiders^W^W investors.
Yes a -1% day should be nothing to a long term holder. Yes they're buying the market; if the market is wrong they shouldn't really have any recourse. But one can also understand that a -1% day that accrues ~entirely to the benefit a small group, who appear to have engineered that outcome has much more emotional valence than a typical down down. It doesn't feel like a bad day on the market, it feels like a heist.
I mean catch and release is better than being dinner, and the prevalence of fly fishing as a hobby goes a long way toward garnering public and financial support for restoration activities.
Internship etc. proposals miss that status quo job seeking is a parallel process for applicants. An internship model means each candidate can only “consider” one employer at a time. I think this is what Steve’s campfire proposal is trying to counter by making samples/internship outputs public?
I think the real solution is something like the Bar Exam. Apply the hard filter once, publicly, administered by a trusted neutral party. Anyone that gets through is assumed to be good enough, and interviews can assess only firm/team specifics
This is what Triplebyte tried to do, but unfortunately they failed in the market. Maybe they were ahead of their time and a similar concept would work now.
Yeah I wrote “by a third party” then thought of them and revised to “trusted neutral party”. Has to be someone with ~no financial interest in the candidates success
The Bar is also a not-great signal of actual ability. It is a great signal for 1) base knowledge and reasoning capacity and 2) sticktoitiveness / willingness to meet ~arbitrary expectations to work in a given industry ... which is all these 'technical' interviews are really measuring. By revealed preferences that's what management wants at those companies, so better to save people time and stress and make it a one time thing?
reply