cturan/llama.cpp

Autor	SHA1 Mensaje	Fecha
Johannes Gäßler	10d2af0eaa llama/ggml: add LLM training support (#10544)	hace 8 meses
David Huang	7f323a589f Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)	hace 8 meses
Johannes Gäßler	9070365020 CUDA: fix logic for clearing padding with -ngl 0 (#13320)	hace 8 meses
mgroeber9110	5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150)	hace 10 meses
William Tambellini	70680c48e5 ggml : upgrade init_tensor API to return a ggml_status (#11854)	hace 10 meses
Diego Devesa	017cc5f446 ggml-backend : only offload from host buffers (fix) (#11124)	hace 1 año
Diego Devesa	a3d50bc022 ggml-backend : only offload from host buffers (#11120)	hace 1 año
Daniel Bevenius	db68c93b57 ggml : improve inputs log sched_print_assignments (ggml/1053)	hace 1 año
Diego Devesa	7cc2d2c889 ggml : move AMX to the CPU backend (#10570)	hace 1 año
slaren	59b9172822 ggml/sched : do not skip views in pre-assignments	hace 1 año
Johannes Gäßler	02e4eaf22f ggml-opt: fix data corruption (ggml/1022)	hace 1 año
Diego Devesa	be5caccef9 llama : only use default buffer types for the KV cache (#10358)	hace 1 año
Diego Devesa	eda7e1d4f5 ggml : fix possible buffer use after free in sched reserve (#9930)	hace 1 año
Johannes Gäßler	8a43e940ab ggml: new optimization interface (ggml/988)	hace 1 año
Diego Devesa	ae8de6d50a ggml : build backends as libraries (#10256)	hace 1 año
Diego Devesa	9f40989351 ggml : move CPU backend to a separate file (#10144)	hace 1 año
Diego Devesa	c02e5ab2a6 llama : fix buffer checks for mamba and rwk (#10111)	hace 1 año
Sergio López	61408e7fad kompute: add backend registry / device interfaces (#10045)	hace 1 año
Diego Devesa	c5b0f4b5d9 llama : refactor model loader with backend registry (#10026)	hace 1 año
leo-pony	6b8447352d [CANN] Adapt to dynamically loadable backends mechanism (#9970)	hace 1 año
Ouadie EL FAROUKI	87421a23e8 [SYCL] Add SYCL Backend registry, device and Event Interfaces (#9705)	hace 1 año
Ma Mingfei	60ce97c9d8 add amx kernel for gemm (#8998)	hace 1 año
Diego Devesa	f010b77a37 vulkan : add backend registry / device interfaces (#9721)	hace 1 año
Gilad S.	2194200278 fix: allocating CPU buffer with size `0` (#9917)	hace 1 año
Gilad S.	73afe681aa fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875)	hace 1 año
Diego Devesa	96776405a1 ggml : move more prints to the ggml log system (#9839)	hace 1 año
Diego Devesa	0e9f760eb1 rpc : add backend registry / device interfaces (#9812)	hace 1 año
Diego Devesa	dca1d4b58a ggml : fix BLAS with unsupported types (#9775)	hace 1 año
Diego Devesa	6374743747 ggml : add backend registry / device interfaces to BLAS backend (#9752)	hace 1 año
Georgi Gerganov	d5ac8cf2f2 ggml : add metal backend registry / device (#9713)	hace 1 año

Posterior Anterior

Historial de Commits Buscar

Historial de Commits