
Its graph execution is still full of busy loops, e.g.:

https://github.com/ggerganov/llama.cpp/blob/44f906e8537fcec9...

I wonder how much more efficient it would be if the Taskflow library were used instead, or even Intel TBB.
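
To make the concern concrete, here is a minimal sketch of the trade-off, not ggml's actual code. The function names and the toy workload (each worker adds its index plus one to a shared total) are made up for illustration. The first version has idle workers spin on an atomic flag, which keeps a core busy while waiting. The second parks them on a condition variable, roughly what a task scheduler does for idle threads.

```cpp
#include <atomic>
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical example: workers busy-wait on an atomic flag until released.
// Wake-up latency is low, but every waiting thread burns CPU while idle.
int sum_with_spin_wait(int n_workers) {
    assert(n_workers >= 0);
    std::atomic<bool> go{false};
    std::atomic<int> total{0};
    std::vector<std::thread> workers;
    for (int i = 0; i < n_workers; ++i) {
        workers.emplace_back([&go, &total, i] {
            while (!go.load(std::memory_order_acquire)) {
                // Busy loop: nothing to do except poll the flag again.
            }
            total.fetch_add(i + 1, std::memory_order_relaxed);
        });
    }
    go.store(true, std::memory_order_release);
    for (auto& t : workers) t.join();
    return total.load();
}

// Hypothetical example: same work, but idle workers sleep on a condition
// variable, so they use no CPU until notified.
int sum_with_blocking_wait(int n_workers) {
    assert(n_workers >= 0);
    std::mutex m;
    std::condition_variable cv;
    bool go = false;  // protected by m
    std::atomic<int> total{0};
    std::vector<std::thread> workers;
    for (int i = 0; i < n_workers; ++i) {
        workers.emplace_back([&m, &cv, &go, &total, i] {
            {
                std::unique_lock<std::mutex> lock(m);
                cv.wait(lock, [&go] { return go; });  // sleeps until go is true
            }
            total.fetch_add(i + 1, std::memory_order_relaxed);
        });
    }
    {
        std::lock_guard<std::mutex> lock(m);
        go = true;
    }
    cv.notify_all();
    for (auto& t : workers) t.join();
    return total.load();
}
```

Both functions compute the same result. The difference is only in what the waiting threads cost. Spinning can still win when the wait is very short, which is presumably why it shows up in hot paths like this one.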



Someone ought to be along with a PR eventually.


It's not a very good library IMO.


ggml or Intel TBB?


Is graph execution used only for training, or for inference as well?


Inference. It's a big bottleneck for RWKV.cpp, second only to the matrix multiplies.


Does TBB work with Apple Silicon?




