cturan/llama.cpp

Yazar	SHA1 Mesaj	Tarih
Georgi Gerganov	4d0a7cbc61 ci : adjust params for less runtime (#16167)	4 ay önce
Ruben Ortlam	9073a73d82 vulkan: vec dot matrix multiplication fix (#16151)	4 ay önce
lhez	51f5a45fbe opencl: fix concat crash on win arm64 with Adreno (#15944)	4 ay önce
lhez	c4510dc937 opencl: initial `q8_0` mv support (#15732)	4 ay önce
Georgi Gerganov	da30ab5f86 ci : add label for the RISC-V runner (#16150)	4 ay önce
Georgi Gerganov	28baac9c9f ci : migrate ggml ci to self-hosted runners (#16116)	4 ay önce
Giuseppe Scrivano	1eeb523c3e vulkan: optimize UMA buffer operations and fix driver hangs (#16059)	4 ay önce
Jeff Bolz	5bb4a3edec vulkan: fix validation error about VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR (#16086)	4 ay önce
Georgi Gerganov	7f766929ca sync : ggml	4 ay önce
Daniel Bevenius	405921dcef ggml : introduce semantic versioning (ggml/1336)	4 ay önce
Gregor Jasny	fa6383ca7e CUDA : conditionally add cuda architectures (ggml/1341)	4 ay önce
Ruben Ortlam	803dac2e48 vulkan: use vec dot for matrix matrix multiplications (#16056)	4 ay önce
Benni	459c0c2c1a server: fix SSE and OpenAI compatibility for error messages when streaming (#16109)	4 ay önce
ssweens	be79d9fdd9 llama-bench: add --devices and --list-devices support (#16039)	4 ay önce
shun095	f432d8d83e chat: Fix streaming parser for granite models (#15682)	4 ay önce
Aleksander Grygier	4067f07fc5 feat: Improve mobile UI for Settings Dialog (#16084)	4 ay önce
Xuan-Son Nguyen	4b8560ab56 chat : fix build on arm64 (#16101)	4 ay önce
Xuan-Son Nguyen	0dd58b6877 ggml : refactor forward_dup for cpu backend (#16062)	4 ay önce
Adrien Gallouët	69ffd89163 ggml-amx : fix ggml_amx_init() on generic Linux (#16049)	4 ay önce
Adrien Gallouët	246c0d9c79 cmake : fix static linking for OpenMP on Unix-like systems (#16031)	4 ay önce
Shawn Gu	3edd87cd05 opencl: optimize mxfp4 kernels (#16037)	4 ay önce
Jeff Bolz	c0b45097c3 rename optimize_graph to graph_optimize (#16082)	4 ay önce
Bowen Han	38dbdf4c05 CUDA: Optimize PAD_REFLECT_1D (#15957)	4 ay önce
Johannes Gäßler	368560a1e3 CUDA: fix compilation on CC 6.0 (#16091)	4 ay önce
Eric Curtin	4ca088b036 Add resumable downloads for llama-server model loading (#15963)	4 ay önce
Georgi Gerganov	703f9e32c4 metal : use function constants for mul_mv_ext kernels (#16074)	4 ay önce
Sigbjørn Skjæret	ad6bd9083b cuda : add missing F32<->I32 entries in ggml_cuda_cpy_fn (#16060)	4 ay önce
Radoslav Gerganov	2b6b55a59f server : include usage statistics only when user request them (#16052)	4 ay önce
Georgi Gerganov	e58174cecb llama : bump max seq limit from 64 to 256 (#15916)	4 ay önce
Georgi Gerganov	b213fce89b metal : improve F32, F16 and BF16 mat-vec multiplication (#16057)	4 ay önce

Daha yeni Daha Eski

Geçmişin Kaydedilmesi Bul

Geçmişin Kaydedilmesi