
Issues: ggml-org/llama.cpp

examples : add configuration presets
#10932 opened Dec 21, 2024 by ggerganov
Open 3
changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 14

Issues list

Feature Request: Allow -hf to be used offline enhancement New feature or request
#13128 opened Apr 26, 2025 by ngxson
4 tasks done
Feature Request: Add C api for mtmd enhancement New feature or request
#13124 opened Apr 26, 2025 by chinshou
4 tasks done
Feature Request: Kimi-Audio-7B enhancement New feature or request
#13114 opened Apr 25, 2025 by wrapss
4 tasks done
Feature Request: Tensor parallelism (--split-mode row) over rpc enhancement New feature or request
#13083 opened Apr 23, 2025 by tobi97h
4 tasks done
Feature Proposal: Server Model Switching at Runtime enhancement New feature or request
#13027 opened Apr 19, 2025 by christopherthompson81
4 tasks done
Feature Request: Add kv-quant fa kernel variants for head sizes other than 128 enhancement New feature or request
#12989 opened Apr 17, 2025 by pl752
4 tasks done
Feature Request: Improve model load time when using the RPC backend enhancement New feature or request
#12954 opened Apr 15, 2025 by rgerganov
4 tasks done
Feature Request: support for image input in llama-server (and web ui) enhancement New feature or request
#12792 opened Apr 7, 2025 by khimaros
4 tasks done
Feature Request: Method that counts the number of image tokens in LLAVA_API enhancement New feature or request
#12689 opened Apr 1, 2025 by clarinevong
4 tasks done
Feature Request: Qwen2.5-Omni enhancement New feature or request
#12673 opened Mar 31, 2025 by Kreijstal
4 tasks done
Feature Request: Add support for StarVector-8b/1b enhancement New feature or request
#12666 opened Mar 31, 2025 by roflsunriz
4 tasks done
Feature Request: Add support of convert.py for model Qwen2.5-Omni-7B enhancement New feature or request
#12641 opened Mar 29, 2025 by nickhuang99
4 tasks done
tts : add support for SparkTTS enhancement New feature or request stale
#12495 opened Mar 21, 2025 by ecyht2
4 tasks done
Feature Request: Qwen2.5 0.5b OpenCL backend support enhancement New feature or request stale
#12463 opened Mar 19, 2025 by Francis235
4 tasks done
Feature Request: Cache nix builds on a public cache server? enhancement New feature or request stale
#12453 opened Mar 18, 2025 by Azeirah
4 tasks done