Issues: ggml-org/llama.cpp

examples : add configuration presets
#10932 opened Dec 21, 2024 by ggerganov
Open 3
changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 14

Issues list

Support start strings, the opposite of stop tokens.
Labels: examples, python, server
#13214 opened Apr 30, 2025 by matteoserva
server : crash when -b > -ub with embeddings
Labels: bug, embeddings, good first issue, server
#12836 opened Apr 8, 2025 by ggerganov
common: add partial regex support
Labels: examples, server, testing, tool calling
#12808 opened Apr 7, 2025 by ochafik
WIP: Add support for CogAgent
Labels: examples, python, server
#12679 opened Mar 31, 2025 by Tianyue-Zhao Draft
server: streaming of tool calls and thoughts when --jinja is on
Labels: documentation, examples, python, script, server, testing, tool calling
#12379 opened Mar 14, 2025 by ochafik Draft
4 of 10 tasks
tool-call: Phi-4 support
Labels: android, Apple Metal, devops, documentation, examples, ggml, Nvidia GPU, python, server, SYCL, testing, Vulkan
#12288 opened Mar 9, 2025 by jpohhhh
Server: openai-style lookup decoding
Labels: examples, python, server
#12127 opened Mar 1, 2025 by eeroel Draft
Cache based tokenization for the server input prompts
Labels: demo, examples, server
#12067 opened Feb 25, 2025 by vnicolici
server webui easy config selection
Labels: demo, examples, server
#12031 opened Feb 22, 2025 by poulphunter
llama : add llama_batch_ext
Labels: android, examples, python, server
#11875 opened Feb 14, 2025 by ngxson
Update CMakeLists.txt
Labels: examples, server
#11558 opened Jan 31, 2025 by magicse
tool-call: add support for tool-calls using Model Context Protocol
Labels: build, examples, server, testing
#11556 opened Jan 31, 2025 by bandoti
8 of 12 tasks
llama : second attempt to refactor vision API
Labels: examples, python, server
#11292 opened Jan 18, 2025 by ngxson Draft
1 of 5 tasks