Skip to content

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

finetune.cpp command-line arg build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#13873 opened May 28, 2025 by graehl Loading…
cmake : set RPATH to $ORIGIN on Linux (#13740) build Compilation issues
#13741 opened May 24, 2025 by sunhaitao Loading…
Move page cache via mbind to prevent cross-NUMA access build Compilation issues
#13731 opened May 23, 2025 by vishalc-ibm Loading…
PR: Refine ggml-hexagon backend(Qualcomm Hexagon NPU backend) for latest ggml,whisper.cpp,llama.cpp build Compilation issues ggml changes relating to the ggml tensor library for machine learning Qualcomm NPU script Script related
#12326 opened Mar 11, 2025 by zhouwg Loading…
1 task done
fix: AVX2 intrinsics, const correctness, and SIMD headers build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#12186 opened Mar 4, 2025 by sandboxyer Loading…
[WIP]backend: Integrating QNN (Qualcomm AI Engine Direct) as a dedicated backend for Qualcomm NPUs build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#12063 opened Feb 25, 2025 by chraac Draft
Compile bug: Emulated Linux ARM64 CPU build fails bug Something isn't working build Compilation issues
#10933 opened Dec 21, 2024 by SamuelTallet
Refactor/tinyblas build Compilation issues demo Demonstrate some concept or idea, not intended to be merged documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning
#10343 opened Nov 16, 2024 by Djip007 Draft
2 of 4 tasks
docs: add doxygen documentation build Compilation issues
#10209 opened Nov 8, 2024 by sparkleholic Loading…
2 of 4 tasks
add FP8 support to gguf/llama: build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning script Script related Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes testing Everything test related
#10055 opened Oct 26, 2024 by Djip007 Draft
1 of 3 tasks
Fix for Debian CMake package creation build Compilation issues Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#9206 opened Aug 27, 2024 by Eugeniusz-Gienek Loading…
2 of 4 tasks
Revert "ggml : remove OpenCL (#7735) + (#8235)" Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment python python script changes script Script related SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#8986 opened Aug 11, 2024 by okias Draft
2 of 4 tasks
server: Windows 7 compatibility build Compilation issues examples Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix server
#8208 opened Jun 29, 2024 by Zor-X-L Loading…
2 of 4 tasks
WIP: Use DirectStorage with CUDA interop to more efficient load tensors build Compilation issues ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#7796 opened Jun 6, 2024 by mtavenrath Draft
ggml-threading.cpp build Compilation issues ggml changes relating to the ggml tensor library for machine learning Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#7576 opened May 27, 2024 by kunnis Draft
fix performance regression on woa build Compilation issues Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7555 opened May 27, 2024 by ReinForce-II Loading…
Portability: use the ccache path detected in cmake when setting the compiler launch rule build Compilation issues Review Complexity : Low Trivial changes to code that most beginner devs (or those who want a break) can tackle. e.g. UI fix
#7535 opened May 25, 2024 by s-daveb Loading…
ggml : unified CMake build build Compilation issues enhancement New feature or request refactoring Refactoring roadmap Part of a roadmap project
#6913 opened Apr 25, 2024 by ggerganov
ci : fix Docker workflow build Compilation issues help wanted Extra attention is needed
#3628 opened Oct 15, 2023 by ggerganov
[MPI] Add support for per-node options, thread counts, and layer allocations build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning server
#3334 opened Sep 26, 2023 by AutonomicPerfectionist Draft
2 of 5 tasks
ProTip! Follow long discussions with comments:>50.