LIMA: Less Is More for Alignment https://arxiv.org/abs/2305.11206 AlpaGasus: Tra... | Hacker News

Hacker Timesnew | past | comments | ask | show | jobs | submit

		isaacfung on Sept 22, 2023 \| parent \| context \| favorite \| on: Outperforming larger language models with less tra... LIMA: Less Is More for Alignment https://arxiv.org/abs/2305.11206 AlpaGasus: Training A Better Alpaca with Fewer Data https://arxiv.org/abs/2307.08701 Textbooks Are All You Need II: phi-1.5 technical report https://arxiv.org/abs/2309.05463 Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models https://arxiv.org/abs/2307.14430

eru on Sept 23, 2023 [–]

Thanks for the link!

I can very well believe that empirical this can work. (I haven't checked the literature.) My point was merely that given my priors, this isn't intuitive.

Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact