Open
Description
Motivation
Kubernetes is widely used in the industry to deploy product and application at scale.
It can be useful for the community to have a llama.cpp
helm chart for the server.
I have started several weeks ago, I will continue when I have more time, meanwhile any help is welcomed:
https://github.com/phymbert/llama.cpp/tree/example/kubernetes/examples/kubernetes