ggml-org / llama.cpp Public

Issues: ggml-org/llama.cpp

examples : add configuration presets

#10932 opened Dec 21, 2024 by ggerganov

Open 3

changelog : libllama API

#9289 opened Sep 3, 2024 by ggerganov

Open 9

changelog : llama-server REST API

#9291 opened Sep 3, 2024 by ggerganov

Open 14


112 Open 887 Closed


Issues list

Feature Request: Allow -hf to be used offline enhancement

New feature or request

#13128 opened Apr 26, 2025 by ngxson

4 tasks done

Feature Request: Add C api for mtmd enhancement

New feature or request

#13124 opened Apr 26, 2025 by chinshou

4 tasks done

Feature Request: Kimi-Audio-7B enhancement

New feature or request

#13114 opened Apr 25, 2025 by wrapss

4 tasks done

Feature Request: define key bindings for quick deletion of the previous conversation. enhancement

New feature or request

#13111 opened Apr 25, 2025 by gnusupport

4 tasks done

Feature Request: Tensor parallelism (--split-mode row) over rpc enhancement

New feature or request

#13083 opened Apr 23, 2025 by tobi97h

4 tasks done

Feature Request: Ability to pack multiple GGUFs into single one enhancement

New feature or request

#13028 opened Apr 19, 2025 by ngxson

Feature Proposal: Server Model Switching at Runtime enhancement

New feature or request

#13027 opened Apr 19, 2025 by christopherthompson81

4 tasks done

Feature Request: Add kv-quant fa kernel variants for head sizes other than 128 enhancement

New feature or request

#12989 opened Apr 17, 2025 by pl752

4 tasks done

Feature Request: Improve model load time when using the RPC backend enhancement

New feature or request

#12954 opened Apr 15, 2025 by rgerganov

4 tasks done

Feature Request: support for image input in llama-server (and web ui) enhancement

New feature or request

#12792 opened Apr 7, 2025 by khimaros

4 tasks done

Feature Request: Method that counts the number of image tokens in LLAVA_API enhancement

New feature or request

#12689 opened Apr 1, 2025 by clarinevong

4 tasks done

Feature Request: Qwen2.5-Omni enhancement

New feature or request

#12673 opened Mar 31, 2025 by Kreijstal

4 tasks done

Feature Request: Add support for StarVector-8b/1b enhancement

New feature or request

#12666 opened Mar 31, 2025 by roflsunriz

4 tasks done

Feature Request: Splitting layers according to VRAM usage on multi GPUs setups enhancement

New feature or request

#12654 opened Mar 30, 2025 by goodglitch

4 tasks done

Feature Request: convert_hf_to_gguf.py to support model type Qwen2_5_VLForConditionalGeneration enhancement

New feature or request

stale

#12642 opened Mar 29, 2025 by nickhuang99

4 tasks done

Feature Request: Add support of convert.py for model Qwen2.5-Omni-7B enhancement

New feature or request

#12641 opened Mar 29, 2025 by nickhuang99

4 tasks done

Feature Request: Interleaved sliding window attention support for gemma 2 and 3 enhancement

New feature or request

#12637 opened Mar 29, 2025 by ymcki

4 tasks done

Feature Request: Add "trust_remote_code support" to 'convert_hf_to_gguf.py' for compatibility with modern HF models enhancement

New feature or request

stale

#12610 opened Mar 27, 2025 by joeyama

4 tasks done

[New Bitnet Model Support Request] Deepgrove model Bonsai 0.5B - Add Channel Scales enhancement

New feature or request

stale

#12598 opened Mar 27, 2025 by zephrus9

tts : add support for SparkTTS enhancement

New feature or request

stale

#12495 opened Mar 21, 2025 by ecyht2

4 tasks done

Feature Request: deep/ recurrent processing like "thinking", but script based. enhancement

New feature or request

stale

#12486 opened Mar 21, 2025 by David-AU-github

4 tasks done

Feature Request: New sampling method that boosts reasoning performance - looks too good? enhancement

New feature or request

stale

#12479 opened Mar 20, 2025 by mirek190

4 tasks done

Feature Request: Qwen2.5 0.5b OpenCL backend support enhancement

New feature or request

stale

#12463 opened Mar 19, 2025 by Francis235

4 tasks done

Feature Request: Cache nix builds on a public cache server? enhancement

New feature or request

stale

#12453 opened Mar 18, 2025 by Azeirah

4 tasks done

Feature Request: allow mmap to take advantage of hugepage feature which has 10x speedup enhancement

New feature or request

stale

#12444 opened Mar 18, 2025 by nickhuang99

4 tasks done
