Skip to content

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

Bug: Failed to run qwen2-57b-a14b-instruct-fp16. bug Something isn't working good first issue Good for newcomers high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#9628 opened Sep 24, 2024 by tang-t21
Bug: non-chat completions not respecting the max_tokens parameter using the OpenAI api bug Something isn't working bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8634 opened Jul 22, 2024 by cloud11665
Bug: abort on Android (pixel 8 pro) android Issues specific to Android bug Something isn't working high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8109 opened Jun 25, 2024 by nivibilla
Bug: server crashes on startup is ckt ctv specified. bug Something isn't working high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#7639 opened May 30, 2024 by 0wwafa
ProTip! What’s not been updated in a month: updated:<2025-05-02.