cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Çetin	45ada635f0 Corrected Implementation of Qwen3Next Support	há 3 semanas atrás
Sigbjørn Skjæret	74e05131e9 ci : remove non-windows zip artifacts (#18201)	há 4 semanas atrás
Sigbjørn Skjæret	f74747d886 ci : only save ccache on master (#18207)	há 4 semanas atrás
Alfred	ce734a8a2f ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (#17977)	há 4 semanas atrás
Pascal	14931a826e arg: fix order to use short form before long form (#18196)	há 4 semanas atrás
Julius Tischbein	f99ef53d2a llama : Changing off_t to size_t for Windows (#18204)	há 4 semanas atrás
Aman Gupta	cc0a04343e server: friendlier error msg when ctx < input (#18174)	há 4 semanas atrás
Xuan-Son Nguyen	98c1c7a7bf presets: refactor, allow cascade presets from different sources, add global section (#18169)	há 4 semanas atrás
Aleksander Grygier	acb73d8340 webui: Add editing attachments in user messages (#18147)	há 4 semanas atrás
Daniel Bevenius	0a271d82b4 model-conversion : add verbose flag in run-org-model.py (#18194)	há 4 semanas atrás
Naco Siren	52fc7fee8a android: fix missing screenshots for Android.md (#18156)	há 4 semanas atrás
Jeff Bolz	cdbada8d10 vulkan: Add perf logger mode with concurrency (#17944)	há 4 semanas atrás
Xuan-Son Nguyen	8ea958d4d9 model : add ASR support for LFM2-Audio-1.5B (conformer) (#18106)	há 4 semanas atrás
Pascal	f9ec8858ed webui: display prompt processing stats (#18146)	há 4 semanas atrás
Taimur Ahmad	f716588e63 ggml-cpu: extend support for RVV floating-point kernels (#17318)	há 4 semanas atrás
Xuan-Son Nguyen	4d1316c440 arg: fix ASAN error on sampler_type_names empty (#18167)	há 4 semanas atrás
Sigbjørn Skjæret	ec7b9329ae gguf-py : use copy-on-write mode for localtensor (#18162)	há 4 semanas atrás
yulo	54189c0d39 remove i_major_dual (#18157)	há 4 semanas atrás
Aleksander Grygier	9ce64aed7d webui: Fix selecting generated output issues during active streaming (#18091)	há 1 mês atrás
Kim S.	900316da4e webui: fix chat screen shadow width (#18010)	há 1 mês atrás
Johannes Gäßler	57c1e05643 llama: offload output layer to GPU first (#18148)	há 1 mês atrás
Sigbjørn Skjæret	9cff4cc554 convert : sort and use file parts from model index if present (#18043)	há 1 mês atrás
Julius Tischbein	4d4f4cacd1 llama : Async DirectIO model loading on Linux (#18012)	há 1 mês atrás
Shouyu	0a0bba05e8 ggml-hexagon: swiglu_oai operation (#18114)	há 1 mês atrás
Sigbjørn Skjæret	5166aaf868 convert : force patch_merger tensors to f16/f32 (#18124)	há 1 mês atrás
Pascal	6ce3d85796 server: (webui) add --webui-config (#18028)	há 1 mês atrás
Xuan-Son Nguyen	e85e9d7637 server: (router) disable SSL on child process (#18141)	há 1 mês atrás
Johannes Gäßler	8dcc3662a2 llama-fit-params: fix memory print (#18136)	há 1 mês atrás
Kim S.	d37fc93505 webui: fix chat header width when sidebar is closed (#17981)	há 1 mês atrás
Shouyu	4470a0764a ggml-hexagon: gelu operation (#17921)	há 1 mês atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits