cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Xuan Son Nguyen	235f6e14bf server : (UI) add tok/s, get rid of completion.js (#10786)	há 1 ano atrás
qingy1337	1a31d0dc00 Update README.md (#10772)	há 1 ano atrás
Xuan Son Nguyen	92f77a640f ci : pin nodejs to 22.11.0 (#10779)	há 1 ano atrás
kallewoof	484d2f31ae bug-fix: snprintf prints NULL in place of the last character (#10419)	há 1 ano atrás
CentricStorm	4b4d92b098 docs: fix server documentation formatting (#10776)	há 1 ano atrás
Gilad S.	43041d2eb3 ggml: load all backends from a user-provided search path (#10699)	há 1 ano atrás
Jeff Bolz	b685daf386 vulkan: request round-to-even for fp16 in im2col/rope_head (#10767)	há 1 ano atrás
Eve	dafae66cc2 vulkan: dynamic subgroup size for the remaining k quants (#10745)	há 1 ano atrás
Bartowski	ae4b922614 imatrix : Add imatrix to --no-context-shift (#10766)	há 1 ano atrás
Andreas Kieslinger	750cb3e246 CUDA: rename macros to avoid conflicts with WinAPI (#10736)	há 1 ano atrás
Yüg	a86ad841f1 server : add flag to disable the web-ui (#10762) (#10751)	há 1 ano atrás
Jeff Bolz	a05e2afcc2 vulkan: disable spirv-opt for coopmat shaders (#10763)	há 1 ano atrás
Johannes Gäßler	26a8406ba9 CUDA: fix shared memory access condition for mmv (#10740)	há 1 ano atrás
Srihari-mcw	c37fb4cf62 Changes to CMakePresets.json to add ninja clang target on windows (#10668)	há 1 ano atrás
Jeff Bolz	3d98b4cb22 vulkan: fix compile warnings (#10731)	há 1 ano atrás
Borislav Stanimirov	1a05004743 cmake : simplify msvc charsets (#10672)	há 1 ano atrás
Xuan Son Nguyen	ce8784bdb1 server : fix format_infill (#10724)	há 1 ano atrás
Xuan Son Nguyen	e52522b869 server : bring back info of final chunk in stream mode (#10722)	há 1 ano atrás
stduhpf	06d70147e6 Vulkan: fix NaN in tanh.comp with AMD proprietary driver on Windows (#10723)	há 1 ano atrás
Diego Devesa	43ed389a3f llama : use cmake for swift build (#10525)	há 1 ano atrás
Jeff Bolz	ecc93d0558 vulkan: compile a test shader in cmake to check for coopmat2 support (#10713)	há 1 ano atrás
Robert Collins	62e84d9848 llama : add 128k yarn context for Qwen (#10698)	há 1 ano atrás
Xuan Son Nguyen	3573fa8e7b server : (refactor) no more json in server_task input (#10691)	há 1 ano atrás
Georgi Gerganov	d9c3ba2b77 ggml : disable iq4_nl interleave size 8 (#10709)	há 1 ano atrás
Georgi Gerganov	ce4a7b8493 server : various fixes (#10704)	há 1 ano atrás
Djip007	19d8762ab6 ggml : refactor online repacking (#10446)	há 1 ano atrás
Georgi Gerganov	c2a16c0bdb server : fix free of spec context and batch (#10651)	há 1 ano atrás
0cc4m	3df784b305 Vulkan: VK_KHR_cooperative_matrix support to speed up prompt processing (#10597)	há 1 ano atrás
Robert Ormandi	86a1934978 metal : Extend how Llama.cpp locates metal resources (#10676)	há 1 ano atrás
Sukriti Sharma	784a14aa49 convert : add support for Roberta embeddings (#10695)	há 1 ano atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits