cturan/llama.cpp

Author	SHA1 Message	Date
compilade	5d46babdc2 llama : initial Mamba-2 support (#9126)	6 months ago
Georgi Gerganov	ec68e84c32 ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (#14435)	7 months ago
Jeff Bolz	6a746cf9c4 vulkan: Split large mul_mat_id to fit in shared memory (#14451)	6 months ago
Acly	431b2c24f3 ggml-cpu : "align corners" for bilinear upscale/downscale (ggml/1285)	6 months ago
Diego Devesa	eb3fa2913e test-backend-ops : disable llama test (#14461)	6 months ago
Sigbjørn Skjæret	a0535ffa0d ggml : implement REGLU/GEGLU/SWIGLU ops (#14158)	6 months ago
Jeff Bolz	bd9c981d72 vulkan: Add fusion support for RMS_NORM+MUL (#14366)	6 months ago
Aman Gupta	27208bf657 CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)	7 months ago
Radoslav Gerganov	8d94219a4a ggml : add ggml_set_rows (#14274)	7 months ago
Georgi Gerganov	e8215dbb96 metal : add special-case mat-vec mul for ne00 == 4 (#14385)	7 months ago
Aman Gupta	aa064b2eb7 CUDA: add mean operation (#14313)	7 months ago
Aman Gupta	c959f462a0 CUDA: add conv_2d_transpose (#14287)	7 months ago
Ervin Áron Tasnádi	0d3984424f ggml-vulkan: adds support for op CONV_TRANSPOSE_1D (#13813)	7 months ago
Johannes Gäßler	10d2af0eaa llama/ggml: add LLM training support (#10544)	8 months ago
Georgi Gerganov	b34443923c sync : ggml (#13268)	8 months ago
Johannes Gäßler	b0ecbd434b test: non-cont. b in test-backend-ops -o MUL_MAT (#13187)	8 months ago
Johannes Gäßler	e1e8e0991f CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199)	8 months ago
Xuan-Son Nguyen	edb18b6e8f clip : fix pixtral on some GPU backends (#13097)	9 months ago
Johannes Gäßler	658987cfc9 CUDA: noncont MMVQ + batched bs1 MUL_MAT_ID (#13014)	9 months ago
Georgi Gerganov	2f74c354c0 graph : make FA compatible with MLA + add initial Metal kernels (#12953)	9 months ago
Jeff Bolz	015022bb53 vulkan: enable coopmat2 FA gqa and split_k optimizations more often (#12931)	9 months ago
Georgi Gerganov	1d2b613445 tests : fix init order (#0)	9 months ago
Diego Devesa	fe92821ea9 ggml : add bilinear upscale support (ggml/1185)	9 months ago
Jeff Bolz	f01bd02376 vulkan: Implement split_k for coopmat2 flash attention. (#12627)	9 months ago
Georgi Gerganov	b4ae50810e metal : improve FA + improve MoE (#12612)	10 months ago
Jeff Bolz	9b169a4d4e vulkan: fix mul_mat_vec failure in backend tests (#12529)	10 months ago
Georgi Gerganov	ba932dfb50 ggml : fix quantized cpy op (#12310)	10 months ago
Jeff Bolz	eddfb43850 vulkan: Optimize mul_mat_vec p021 and nc shaders (#12505)	10 months ago
Gaurav Garg	517b5ddbf0 CUDA: Improve flash decoding kernel GPU occupancy for BS=1 case (#12183)	10 months ago
Molly Sophia	7dfad387e3 llama: Add support for RWKV v7 architecture (#12412)	10 months ago

