cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Georgi Gerganov	36319dec5d tts : small QoL for easy model fetch (#10903)	hai 1 ano
Xuan Son Nguyen	57bb2c40cd server : fix logprobs, make it OAI-compatible (#10783)	hai 1 ano
Adrien Gallouët	a3c33b1dce ggml: fix arm build with gcc (#10895)	hai 1 ano
Sukriti Sharma	2fffc52b50 llama : fix Roberta embeddings (#10856)	hai 1 ano
fairydreaming	7585edbdeb convert : Add support for Microsoft Phi-4 model (#10817)	hai 1 ano
Johannes Gäßler	cd920d0ac3 tests: disable GGUF test for bad value size (#10886)	hai 1 ano
Eric Curtin	7909e8588d llama-run : improve progress bar (#10821)	hai 1 ano
Diego Devesa	9177484f58 ggml : fix arm build (#10890)	hai 1 ano
Georgi Gerganov	0bf2d10c55 tts : add OuteTTS support (#10784)	hai 1 ano
Gaetan Bisson	7bbb5acf12 server: avoid overwriting Authorization header (#10878)	hai 1 ano
Georgi Gerganov	152610eda9 server : output embeddings for all tokens when pooling = none (#10861)	hai 1 ano
Georgi Gerganov	0e70ba686e server : add "tokens" output (#10853)	hai 1 ano
Xuan Son Nguyen	46828872c3 server : (embeddings) using same format for "input" and "content" (#10872)	hai 1 ano
redbeard	6b064c92b4 docs: Fix HIP (née hipBLAS) in README (#10880)	hai 1 ano
Diego Devesa	4da69d1abd Revert "llama : add Falcon3 support (#10864)" (#10876)	hai 1 ano
DAN™	d62b532c52 Use model->gguf_kv for loading the template instead of using the C API. (#10868)	hai 1 ano
Johannes Gäßler	081b29bd2a tests: add tests for GGUF (#10830)	hai 1 ano
Georgi Gerganov	5437d4aaf5 sync : ggml	hai 1 ano
Georgi Gerganov	78f766768d cmake : fix "amd64" processor string (whisper/2638)	hai 1 ano
gn64	8dd19a4812 vulkan : fix soft_max.comp division by zero (whisper/2633)	hai 1 ano
Daniel Bevenius	130d0c90bd ggml : remove return from ggml_gallocr_allocate_node (ggml/1048)	hai 1 ano
Daniel Bevenius	3919da8e33 ggml : add check for grad_accs (ggml/1046)	hai 1 ano
Georgi Gerganov	0006f5a74a ggml : update ggml_backend_cpu_device_supports_op (#10867)	hai 1 ano
krystiancha	05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852)	hai 1 ano
Billel Mokeddem	382bc7f2e8 llama : add Falcon3 support (#10864)	hai 1 ano
Ruan	4f51968aca readme : update typos (#10863)	hai 1 ano
Xuan Son Nguyen	227d7c5a7f server : (UI) fix missing async generator on safari (#10857)	hai 1 ano
Eve	7b1ec53f56 vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809)	hai 1 ano
Zhiyuan Li	160bc039c8 rwkv6: add wkv6 support for Vulkan backend (#10829)	hai 1 ano
Georgi Gerganov	08ea539df2 unicode : improve naming style (#10838)	hai 1 ano

Posterior Anterior

Commit History Buscar

Commit History