cturan/llama.cpp

Author	SHA1 Message	Date
standby24x7	fe163d5bf3 common : Fix a typo in help (#11899)	11 months ago
Xuan-Son Nguyen	818a340ea8 ci : fix (again) arm64 build fails (#11895)	11 months ago
Jeff Bolz	bf42a23d0a vulkan: support multi/vision rope, and noncontiguous rope (#11902)	11 months ago
Hale Chan	c2ea16f260 metal : fix the crash caused by the lack of residency set support on Intel Macs. (#11904)	11 months ago
Johannes Gäßler	6dde178248 scripts: fix compare-llama-bench commit hash logic (#11891)	11 months ago
708-145	fc10c38ded examples: fix typo in imatrix/README.md (#11884)	11 months ago
Adrian Kretz	22885105a6 metal : optimize dequant q6_K kernel (#11892)	11 months ago
Georgi Gerganov	c2cd24fbfd readme : add notice about new package registry (#11890)	11 months ago
Georgi Gerganov	68ff663a04 repo : update links to new url (#11886)	11 months ago
Olivier Chafik	f355229692 server: fix type promotion typo causing crashes w/ --jinja w/o tools (#11880)	11 months ago
Rémy O	fc1b0d0936 vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)	11 months ago
Michał Moskal	89daa2564f llguidance build fixes for Windows (#11664)	11 months ago
lhez	300907b211 opencl: Fix rope and softmax (#11833)	11 months ago
Diego Devesa	94b87f87b5 cuda : add ampere to the list of default architectures (#11870)	11 months ago
Georgi Gerganov	dbc2ec59b5 docker : drop to CUDA 12.4 (#11869)	11 months ago
Daniel Bevenius	3d68f034da llama : add completion for --chat-template-file (#11860)	11 months ago
Jinyang He	38e32eb6a0 ggml: optimize some vec dot functions for LoongArch ASX (#11842)	11 months ago
Eve	a4f011e8d0 vulkan: linux builds + small subgroup size fixes (#11767)	11 months ago
theraininsky	a7b8ce2260 llama-bench : fix unexpected global variable initialize sequence issue (#11832)	11 months ago
Georgi Gerganov	04045bb842 readme : minor	11 months ago
Jeffrey Morgan	8a8c4ceb60 llamafile: use member variable instead of constant for iq4nlt (#11780)	11 months ago
Reza Rahemtola	c1f958c038 server : (docs) Update wrong tool calling example (#11809)	11 months ago
Daniel Bevenius	c48f630d1c llama : add --completion-bash option (#11846)	11 months ago
R0CKSTAR	bd6e55bfd3 musa: bump MUSA SDK version to rc3.1.1 (#11822)	11 months ago
Olivier Chafik	c7f460ab88 `server`: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless `--reasoning-format none` (#11607)	11 months ago
Vinesh Janarthanan	27e8a23300 sampling: add Top-nσ sampler (#11223)	11 months ago
Oleksandr Kuvshynov	e4376270d9 llama.cpp: fix warning message (#11839)	11 months ago
Daniel Bevenius	3e69319772 llama : update llama_decode_internal ref [no ci] (#11840)	11 months ago
Diego Devesa	a394039db0 ggml-cpu : add chunking support to mul_mat_id (#11666)	11 months ago
Xuan-Son Nguyen	be3bbd6215 ggml : x2 speed for WASM by optimizing SIMD (#11453)	11 months ago


Commit History
