cturan/llama.cpp

نویسنده	SHA1 پیام	تاریخ
Georgi Gerganov	08ea539df2 unicode : improve naming style (#10838)	1 سال پیش
Georgi Gerganov	644fd71b44 sampling : refactor + optimize penalties sampler (#10803)	1 سال پیش
Bartowski	4ddd199f6f llava : Allow locally downloaded models for QwenVL (#10833)	1 سال پیش
Valentin Mamedov	a0974156f3 llama : add Deepseek MoE v1 & GigaChat models (#10827)	1 سال پیش
Georgi Gerganov	87cf323cef scripts : change build path to "build-bench" for compare-commits.sh (#10836)	1 سال پیش
Vinesh Janarthanan	5478bbcd17 server: (UI) add syntax highlighting and latex math rendering (#10808)	1 سال پیش
Georgi Gerganov	b5ae1ddff9 gguf-py : bump to v0.13.0	1 سال پیش
Michelle Tan	89d604f2c8 server: Fix `has_next_line` in JSON response (#10818)	1 سال پیش
Evgeny Kurnevsky	e52aba537a nix: allow to override rocm gpu targets (#10794)	1 سال پیش
HimariO	ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361)	1 سال پیش
cduk	56eea0781c Removes spurious \r in output that causes logging in journalctl to treat lines as binary and therefore hidden by default (#10771)	1 سال پیش
lhez	a76c56fa1a Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (#10693)	1 سال پیش
Eric Curtin	c27ac678dd Opt class for positional argument handling (#10508)	1 سال پیش
Corentin REGAL	11e07fd63b fix: graceful shutdown for Docker images (#10815)	1 سال پیش
Jett Janiak	4601a8bb67 gguf-py : numpy 2 newbyteorder fix (#9772)	1 سال پیش
谢乃闻	9f35e44592 Fix crash caused by ggml_backend_load_all when launching on Android Activity (#10812)	1 سال پیش
Eve	64ae065511 vulkan: small mul_mat_vec optimizations (#10665)	1 سال پیش
Akarshan Biswas	83ed24a97b SYCL: Reduce most of the compiler warnings (#10748)	1 سال پیش
Karol Kontny	d583cd03f6 ggml : Fix compilation issues on ARM platform when building without fp16 (#10811)	1 سال پیش
Xuan Son Nguyen	adffa6ffd5 common : improve -ctv -ctk CLI arguments (#10806)	1 سال پیش
Xuan Son Nguyen	274ec65af6 contrib : add ngxson as codeowner (#10804)	1 سال پیش
a3sh	8faa1d4dd4 CUDA: faster non-contiguous concat (#10760)	1 سال پیش
Diego Devesa	cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)	1 سال پیش
0cc4m	4064c0e3b6 Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (#10798)	1 سال پیش
0cc4m	dc5301d565 Vulkan: Add VK_EXT_subgroup_size_control support to ensure full subgroups for coopmats (#10721)	1 سال پیش
Xuan Son Nguyen	9fdb124304 common : add missing env var for speculative (#10801)	1 سال پیش
CentricStorm	5555c0c1f6 docs: update server streaming mode documentation (#9519)	1 سال پیش
Georgi Gerganov	973f328b1e Merge pull request #10788 from ggerganov/gg/gguf-py-0.11.0	1 سال پیش
Georgi Gerganov	fb18934a97 gguf-py : bump version to 0.11.0	1 سال پیش
Xuan Son Nguyen	235f6e14bf server : (UI) add tok/s, get rid of completion.js (#10786)	1 سال پیش

جدیدتر قدیمی‌تر

تاریخچه Commit ها یافتن

تاریخچه Commit ها