cturan/llama.cpp

Author	SHA1 Message	Date
Djip007	0bb2919335 llama : change cpu_buft_list order: ACCEL -> GPU host -> CPU extra -> CPU (#12632)	9 months ago
Jay	a69f846351 cmake : fix ccache conflict (#12522)	9 months ago
hipudding	d07a0d7a79 CANN : remove clang-format in ggml-cann (#12607)	9 months ago
Sigbjørn Skjæret	3714c3ee1a llama : fix incorrect Qwen2Moe ffn_moe_out graph callback (#12631)	9 months ago
Georgi Gerganov	b4ae50810e metal : improve FA + improve MoE (#12612)	9 months ago
Icenowy Zheng	b86f600723 vulkan: fix coopmat shader generation when cross-compiling (#12272)	9 months ago
Johannes Gäßler	dd373dd3bf llama: fix error on bad grammar (#12628)	9 months ago
Benson Wong	5d01670266 server : include speculative decoding stats when timings_per_token is enabled (#12603)	9 months ago
Radoslav Gerganov	ef03229ff4 rpc : update README for cache usage (#12620)	9 months ago
amritahs-ibm	13731766db llamafile : ppc64le GEMV forwarding for FP32. (#12594)	9 months ago
Radoslav Gerganov	ab6ab8f809 rpc : send hash when tensor data is above some fixed threshold (#12496)	9 months ago
Piotr	2099a9d5db server : Support listening on a unix socket (#12613)	9 months ago
Georgi Gerganov	2969019837 media : add SVG logo [no ci] (#12616)	9 months ago
lhez	5dec47dcd4 opencl: add multi and vision rope, `gelu_quick` and `im2col` (#12600)	9 months ago
Si1w	f125b8dccf llama : add PLM GGUF Conversion & Inference Support (#12457)	9 months ago
HighDoping	953c2a62cf model : restore support for T5Encoder (#12590)	9 months ago
Csaba Kecskemeti	d5c6309d91 convert : Support Qwen2_5_VLForConditionalGeneration (#12595)	9 months ago
Georgi Gerganov	029c693fdc sync : ggml	9 months ago
Georgi Gerganov	771d84371c scripts : update sync + fix cmake merge	9 months ago
Georgi Gerganov	df0665a483 sync : ggml	9 months ago
Georgi Gerganov	0306aad1ca cmake : sync/merge PowerPC build commands (#0)	9 months ago
amritahs-ibm	c7b43ab608 llamafile : ppc64le MMA implementation for Q4_0. (#12489)	9 months ago
xctan	24feaec057 ggml : riscv: add 128-bit RVV support (#12530)	9 months ago
Georgi Gerganov	f28bc4c286 llama : make loras compatible with repacking (#12593)	9 months ago
Akarshan Biswas	f17a3bb4e8 SYCL: implement memset ggml backend buffer interface (#12580)	9 months ago
Slobodan Josic	bd40678df7 HIP: Add support for RDNA4 targets (#12372)	9 months ago
Georgi Gerganov	b3298fa47a metal : refactor mat-vec code (#12569)	10 months ago
Michał Moskal	2447ad8a98 upgrade to llguidance 0.7.10 (#12576)	10 months ago
Ivy233	02082f1519 clip: Fix llama-llava-clip-quantize-cli quantization error under CUDA backend (#12566)	10 months ago
Georgi Gerganov	df4d20cd53 convert : fix squeeze for ssm_conv tensors (#12573)	10 months ago
