-
Notifications
You must be signed in to change notification settings - Fork 11.5k
Issues: ggml-org/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Feature Request: Allow New feature or request
-hf
to be used offline
enhancement
#13128
opened Apr 26, 2025 by
ngxson
4 tasks done
Feature Request: Add C api for mtmd
enhancement
New feature or request
#13124
opened Apr 26, 2025 by
chinshou
4 tasks done
Feature Request: Kimi-Audio-7B
enhancement
New feature or request
#13114
opened Apr 25, 2025 by
wrapss
4 tasks done
Feature Request: define key bindings for quick deletion of the previous conversation.
enhancement
New feature or request
#13111
opened Apr 25, 2025 by
gnusupport
4 tasks done
Feature Request: Tensor paralellism (--split-mode row) over rpc
enhancement
New feature or request
#13083
opened Apr 23, 2025 by
tobi97h
4 tasks done
Feature Request: Ability to pack multiple GGUFs into single one
enhancement
New feature or request
#13028
opened Apr 19, 2025 by
ngxson
Feature Proposal: Server Model Switching at Runtime
enhancement
New feature or request
#13027
opened Apr 19, 2025 by
christopherthompson81
4 tasks done
Feature Request: Add kv-quant fa kernel variants for head sizes other than 128
enhancement
New feature or request
#12989
opened Apr 17, 2025 by
pl752
4 tasks done
Feature Request: Improve model load time when using the RPC backend
enhancement
New feature or request
#12954
opened Apr 15, 2025 by
rgerganov
4 tasks done
Feature Request: support for image input in llama-server (and web ui)
enhancement
New feature or request
#12792
opened Apr 7, 2025 by
khimaros
4 tasks done
Feature Request: Method that counts the number of image tokens in LLAVA_API
enhancement
New feature or request
#12689
opened Apr 1, 2025 by
clarinevong
4 tasks done
Feature Request: Qwen2.5-Omni
enhancement
New feature or request
#12673
opened Mar 31, 2025 by
Kreijstal
4 tasks done
Feature Request: Add support for StarVector-8b/1b
enhancement
New feature or request
#12666
opened Mar 31, 2025 by
roflsunriz
4 tasks done
Feature Request: Splitting layers according to VRAM usage on multi GPUs setups
enhancement
New feature or request
#12654
opened Mar 30, 2025 by
goodglitch
4 tasks done
Feature Request: convert_hf_to_gguf.py to support model type Qwen2_5_VLForConditionalGeneration
enhancement
New feature or request
stale
#12642
opened Mar 29, 2025 by
nickhuang99
4 tasks done
Feature Request: Add support of convert.py for model Qwen2.5-Omni-7B
enhancement
New feature or request
#12641
opened Mar 29, 2025 by
nickhuang99
4 tasks done
Feature Request: Interleaved sliding window attention support for gemma 2 and 3
enhancement
New feature or request
#12637
opened Mar 29, 2025 by
ymcki
4 tasks done
Feature Request: Add "trust_remote_code support" to 'convert_hf_to_gguf.py' for compatibility with modern HF models
enhancement
New feature or request
stale
#12610
opened Mar 27, 2025 by
joeyama
4 tasks done
[New Bitnet Model Support Request] Deepgrove model Bonsai 0.5B - Add Channel Scales
enhancement
New feature or request
stale
#12598
opened Mar 27, 2025 by
zephrus9
tts : add support for SparkTTS
enhancement
New feature or request
stale
#12495
opened Mar 21, 2025 by
ecyht2
4 tasks done
Feature Request: deep/ recurrent processing like "thinking", but script based.
enhancement
New feature or request
stale
#12486
opened Mar 21, 2025 by
David-AU-github
4 tasks done
Feature Request: New sampling method that boosts reasoning performance - looks too good?
enhancement
New feature or request
stale
#12479
opened Mar 20, 2025 by
mirek190
4 tasks done
Feature Request: Qwen2.5 0.5b OpenCL backend support
enhancement
New feature or request
stale
#12463
opened Mar 19, 2025 by
Francis235
4 tasks done
Feature Request: Cache nix builds on a public cache server?
enhancement
New feature or request
stale
#12453
opened Mar 18, 2025 by
Azeirah
4 tasks done
Feature Request: allow mmap to take advantage of hugepage feature which has 10x speedup
enhancement
New feature or request
stale
#12444
opened Mar 18, 2025 by
nickhuang99
4 tasks done
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.