cturan/llama.cpp

Autor	SHA1 Mensaje	Fecha
Gaurav Garg	c262beddf2 CUDA: Prefer vector flash decoding kernel for Gemma models (#12738)	hace 9 meses
yumeyao	5dd5d1ab00 vocab : use string_view::find() to avoid unnecessary looking up beyond the fragment range (#12706)	hace 9 meses
Jeff Bolz	1c059995e0 vulkan: Fix missing cmake logic for dot product extension (#12721)	hace 9 meses
Atharva Dubey	2004644b7a ci : add env variable in ggml-ci and document the same in SYCL.md (#12736)	hace 9 meses
R0CKSTAR	5f696e88e0 sync : minja (inclusionAI/Ling) and update tests (#12699)	hace 9 meses
a3sh	193c3e03a6 fix MUSA compiler warning (#12704)	hace 9 meses
Chenguang Li	65cfe136a0 CANN: Support operator SIN COS ARGMAX (#12709)	hace 9 meses
Alan Gray	3f9da22c2b Simplify and improve CUDA graphs through use of indirect copy pointers (#9017)	hace 9 meses
hipudding	2a0dc97e56 CANN: Fix failed test cases (#12708)	hace 9 meses
lhez	97a20c012b opencl: use `max_alloc_size` in backend ctx instead of querying again (#12705)	hace 9 meses
Jeff Bolz	f01bd02376 vulkan: Implement split_k for coopmat2 flash attention. (#12627)	hace 9 meses
bandoti	6f3bd38640 cmake: remove caching from vulkan coopmat checks (#12719)	hace 9 meses
Jeff Bolz	be0a0f8cae vulkan: Implement grouped query attention in the coopmat2 FA shader (#12559)	hace 9 meses
0cc4m	92e3006bb6 Vulkan: Fix mmq int dot float cache size (#12722)	hace 9 meses
Georgi Gerganov	833e2b7409 model : print tensor size during load (#12711)	hace 9 meses
Diego Devesa	e0e912f49b llama : add option to override model tensor buffers (#11397)	hace 9 meses
Georgi Gerganov	a10b36c91a llama : refactor kv cache guard (#12695)	hace 9 meses
Sigbjørn Skjæret	83a88bd6af vocab : BailingMoE : change possessive quantifiers to greedy (#12677)	hace 9 meses
Xuan-Son Nguyen	42eb248f46 common : remove json.hpp from common.cpp (#12697)	hace 9 meses
Chenguang Li	9bacd6b374 [CANN] get_rows and dup optimization (#12671)	hace 9 meses
Xuan-Son Nguyen	267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694)	hace 9 meses
Junil Kim	f423981ac8 opencl : fix memory allocation size (#12649)	hace 9 meses
jklincn	e39e727e9a llama : use LLM_KV_GENERAL_FILE_TYPE instead of gguf_find_key (#12672)	hace 9 meses
Sigbjørn Skjæret	5936a616e4 convert : BailingMoE : fix qkv split when head_dim is 0 (#12687)	hace 9 meses
Georgi Gerganov	3fd072a540 metal : use F32 prec in FA kernels (#12688)	hace 9 meses
R0CKSTAR	a6f32f0b34 Fix clang warning in gguf_check_reserved_keys (#12686)	hace 9 meses
Wagner Bruna	2bb3597e42 vulkan: fix build when glslc doesn't support coopmat (#12683)	hace 9 meses
Romain Biessy	8293970542 SYCL: Rename oneMKL to oneMath (#12192)	hace 9 meses
Akarshan Biswas	8bbf26083d SYCL: switch to SYCL namespace (#12674)	hace 9 meses
Sigbjørn Skjæret	35782aeedb convert : BailingMoE : avoid setting rope_dim to 0 (#12678)	hace 9 meses

Posterior Anterior

Historial de Commits Buscar

Historial de Commits