
Are there any "small" LMs that can be deployed locally, understand formats and specs, and generate outputs?


Alpaca [1], perhaps. It's based on Facebook's model (LLaMA) and it has been trained in a conversational style, same as ChatGPT. I don't know if it can produce code, though.

[1] https://github.com/antimatter15/alpaca.cpp


I mean, has anybody had success fine-tuning it? How hard is it, and how long does it take?


Yeah, anybody can clone Alpaca and fine-tune it.

From what I can tell, setting things up to train the model is easy. Then you have to generate the training data (this can be done using GPT-3's API), fine-tune Alpaca, and then evaluate it.
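To make the data step a bit more concrete, here is a minimal sketch of how generated examples could be shaped into prompt/completion pairs and split so that some are held back for evaluation. The instruction/input/output record layout follows what the Alpaca write-up describes; the prompt wording, the helper names (build_prompt, to_training_pairs, split_train_eval, to_jsonl) and the sample records are my own assumptions, not Alpaca's actual code. It only covers preparing the data, not the training run itself.

```python
import json
import random

# Hypothetical prompt templates; the real project may word these differently.
PROMPT_WITH_INPUT = (
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n"
)
PROMPT_NO_INPUT = "### Instruction:\n{instruction}\n\n### Response:\n"


def build_prompt(record):
    """Turn one instruction record into the text the model is asked to complete."""
    if record.get("input"):
        return PROMPT_WITH_INPUT.format(
            instruction=record["instruction"], input=record["input"]
        )
    return PROMPT_NO_INPUT.format(instruction=record["instruction"])


def to_training_pairs(records):
    """Pair each prompt with the target completion used for fine-tuning."""
    return [
        {"prompt": build_prompt(r), "completion": r["output"]} for r in records
    ]


def split_train_eval(pairs, eval_fraction=0.1, seed=0):
    """Hold out a slice of the data so the fine-tuned model can be evaluated."""
    shuffled = pairs[:]
    random.Random(seed).shuffle(shuffled)
    n_eval = max(1, int(len(shuffled) * eval_fraction))
    return shuffled[n_eval:], shuffled[:n_eval]


def to_jsonl(pairs):
    """Serialize pairs as JSON Lines, one example per line."""
    return "\n".join(json.dumps(p) for p in pairs)


# Made-up sample records, standing in for data generated through an LLM API.
records = [
    {
        "instruction": "Convert the date to ISO 8601.",
        "input": "March 13, 2023",
        "output": "2023-03-13",
    },
    {"instruction": "Name a JSON value type.", "input": "", "output": "string"},
    {
        "instruction": "Write the value as a JSON number.",
        "input": "forty-two",
        "output": "42",
    },
    {"instruction": "Name a YAML collection type.", "input": "", "output": "mapping"},
]

pairs = to_training_pairs(records)
train_pairs, eval_pairs = split_train_eval(pairs, eval_fraction=0.25)
train_jsonl = to_jsonl(train_pairs)
```

The train_jsonl string is what you would hand to whatever fine-tuning script you end up using, and eval_pairs stays untouched until the evaluation step.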

Haven't done it myself, but I believe you can find more info here [1].

[1] https://crfm.stanford.edu/2023/03/13/alpaca.html



