TBH people are kinda ignoring the LLaMA license now, and Meta seems to be ignoring it too. I see some pseudo-commercial services (encouraging donations and such) and a few straight-up commercial ones using a LLaMA backend.
llama.cpp specifically has Falcon on its roadmap, and some other quantized implementations already work with it. But the transition will be slow.