cturan/llama.cpp

Author	SHA1 Message	Date
Henry Linjamäki	f79243992c opencl : fix `ulong` kernel args were set from `int` variables (#12174)	10 months ago
simon886212	ed4ce0dda2 opencl : fix profile-related errors (#12095)	10 months ago
Rémy O	07d1572347 ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (#12154)	10 months ago
Akarshan Biswas	5e43f104cc SYCL: Disable f16 Unary OPs as not supported by the kernels (#12201)	10 months ago
Plamen Minev	16e4b22c5e ggml : fix GGMLMetalClass ODR (#12200)	10 months ago
Daniel Bevenius	074c4fd39d ci : add fetch-depth to xcframework upload (#12195)	10 months ago
Olivier Chafik	669912d9a5 `tool-call`: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars (#12034)	10 months ago
Daniel Bevenius	fa31c438e0 ci : fix xcframework artifact tag (#12191)	10 months ago
Daniel Bevenius	3ccbfe5a71 ci : remove xframework upload (#12190)	10 months ago
Clauszy	06a92a193a server : fix cache reuse logic (#12161)	10 months ago
Daniel Bevenius	a057897ad4 llama : add xcframework build script (#11996)	10 months ago
mgroeber9110	5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150)	10 months ago
Georgi Gerganov	20a9b8f5e1 readme : fix roadmap link (#12185)	10 months ago
Sigbjørn Skjæret	56d7a9f812 main: allow preloading conversation with -p and add -st / --single-turn (#12145)	10 months ago
Olivier Chafik	1a24c4621f `server`: fix deadly typo in response_format.json_schema.schema handling (#12168)	10 months ago
David Huang	becade5de7 HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)	10 months ago
Georgi Gerganov	dfd6b2c0be sync : ggml	10 months ago
cmdr2	b64d7cc272 cuda: unary ops as float + de-duplicate (ggml/1130)	10 months ago
Georgi Gerganov	3d1cf3cf33 sync : ggml	10 months ago
cmdr2	0cbee131ad cuda/vulkan: specify fp32-only support for some operations in supports_op (ggml/1129)	10 months ago
Georgi Gerganov	8371d44595 sync : ggml	10 months ago
cmdr2	87abb7e903 cuda/cpu: Increase support for fp16 unary operations (ggml/1125)	10 months ago
Diego Devesa	6d4c23b81b whisper : support GGML_BACKEND_DL (whisper/2843)	10 months ago
midnight	6512a90037 cmake : fix compile assumptions for power9/etc (whisper/2777)	11 months ago
petterreinholdtsen	4512055792 Told cmake to install ggml-cpp.h as a public header file. (ggml/1126)	10 months ago
cmdr2	f54a4ba11e Support pure float16 add/sub/mul/div operations in the CUDA (and CPU) backend (ggml/1121)	10 months ago
Georgi Gerganov	aede2074f6 scripts : sync-ggml-am.sh fix	10 months ago
Daniel Bevenius	2679c3b55d ci : set GITHUB_ACTION env var for server tests (#12162)	10 months ago
dm4	c43af9276b tts: add speaker file support (#12048)	10 months ago
Diego Devesa	d5c63cd7f9 test-backend-ops : add option -p to filter by op params (#12155)	10 months ago

Newer Older

Commit History Find

Commit History