cturan/llama.cpp

Author	SHA1 Message	Date
Sukriti Sharma	2fffc52b50 llama : fix Roberta embeddings (#10856)	1 year ago
fairydreaming	7585edbdeb convert : Add support for Microsoft Phi-4 model (#10817)	1 year ago
Johannes Gäßler	cd920d0ac3 tests: disable GGUF test for bad value size (#10886)	1 year ago
Eric Curtin	7909e8588d llama-run : improve progress bar (#10821)	1 year ago
Diego Devesa	9177484f58 ggml : fix arm build (#10890)	1 year ago
Georgi Gerganov	0bf2d10c55 tts : add OuteTTS support (#10784)	1 year ago
Gaetan Bisson	7bbb5acf12 server: avoid overwriting Authorization header (#10878)	1 year ago
Georgi Gerganov	152610eda9 server : output embeddings for all tokens when pooling = none (#10861)	1 year ago
Georgi Gerganov	0e70ba686e server : add "tokens" output (#10853)	1 year ago
Xuan Son Nguyen	46828872c3 server : (embeddings) using same format for "input" and "content" (#10872)	1 year ago
redbeard	6b064c92b4 docs: Fix HIP (née hipBLAS) in README (#10880)	1 year ago
Diego Devesa	4da69d1abd Revert "llama : add Falcon3 support (#10864)" (#10876)	1 year ago
DAN™	d62b532c52 Use model->gguf_kv for loading the template instead of using the C API. (#10868)	1 year ago
Johannes Gäßler	081b29bd2a tests: add tests for GGUF (#10830)	1 year ago
Georgi Gerganov	5437d4aaf5 sync : ggml	1 year ago
Georgi Gerganov	78f766768d cmake : fix "amd64" processor string (whisper/2638)	1 year ago
gn64	8dd19a4812 vulkan : fix soft_max.comp division by zero (whisper/2633)	1 year ago
Daniel Bevenius	130d0c90bd ggml : remove return from ggml_gallocr_allocate_node (ggml/1048)	1 year ago
Daniel Bevenius	3919da8e33 ggml : add check for grad_accs (ggml/1046)	1 year ago
Georgi Gerganov	0006f5a74a ggml : update ggml_backend_cpu_device_supports_op (#10867)	1 year ago
krystiancha	05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852)	1 year ago
Billel Mokeddem	382bc7f2e8 llama : add Falcon3 support (#10864)	1 year ago
Ruan	4f51968aca readme : update typos (#10863)	1 year ago
Xuan Son Nguyen	227d7c5a7f server : (UI) fix missing async generator on safari (#10857)	1 year ago
Eve	7b1ec53f56 vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809)	1 year ago
Zhiyuan Li	160bc039c8 rwkv6: add wkv6 support for Vulkan backend (#10829)	1 year ago
Georgi Gerganov	08ea539df2 unicode : improve naming style (#10838)	1 year ago
Georgi Gerganov	644fd71b44 sampling : refactor + optimize penalties sampler (#10803)	1 year ago
Bartowski	4ddd199f6f llava : Allow locally downloaded models for QwenVL (#10833)	1 year ago
Valentin Mamedov	a0974156f3 llama : add Deepseek MoE v1 & GigaChat models (#10827)	1 year ago

