cturan/llama.cpp

Author	SHA1 Message	Date
Xuan-Son Nguyen	9e39a1e6a9 server: support load model on startup, support preset-only options (#18206)	4 weeks ago
Pascal	14931a826e arg: fix order to use short form before long form (#18196)	4 weeks ago
Xuan-Son Nguyen	98c1c7a7bf presets: refactor, allow cascade presets from different sources, add global section (#18169)	4 weeks ago
Xuan-Son Nguyen	8ea958d4d9 model : add ASR support for LFM2-Audio-1.5B (conformer) (#18106)	4 weeks ago
Xuan-Son Nguyen	4d1316c440 arg: fix ASAN error on sampler_type_names empty (#18167)	4 weeks ago
Pascal	6ce3d85796 server: (webui) add --webui-config (#18028)	1 month ago
Pascal	487674fbb3 common: fix --override-kv to support comma-separated values (#18056)	1 month ago
TrevorS	4b2a4778f8 arg: allow -kvu flag for llama-perplexity (#18117)	1 month ago
Xuan-Son Nguyen	7b1db3d3b7 arg: clarify auto kvu/np being set on server (#17997)	1 month ago
Johannes Gäßler	b1f3a6e5db llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)	1 month ago
Georgi Gerganov	254098a279 common : refactor common_sampler + grammar logic changes (#17937)	1 month ago
Xuan-Son Nguyen	4d5ae24c0a arg: fix common_params_parse not accepting negated arg (#17991)	1 month ago
Sigbjørn Skjæret	8e4d678528 common : skip model validation when --completion-bash is requested (#17975)	1 month ago
Sigbjørn Skjæret	2bc94e7928 add llama-completion to completion-bash executables (#17976)	1 month ago
Xuan-Son Nguyen	380b4c984e common: support negated args (#17919)	1 month ago
Xuan-Son Nguyen	54a0fee4b7 arg: add -mm and -mmu as short form of --mmproj and --mmproj-url (#17958)	1 month ago
Xuan-Son Nguyen	34a6d86982 cli: enable jinja by default (#17911)	1 month ago
Pascal	f32ca51bfe server: add presets (config) when using multiple models (#17859)	1 month ago
Xuan-Son Nguyen	6c2131773c cli: new CLI experience (#17824)	1 month ago
Sigbjørn Skjæret	22577583a3 common : change --color to accept on/off/auto, default to auto (#17827)	1 month ago
Daniel Bevenius	bd4ef13476 common : skip model validation when --help is requested (#17755)	1 month ago
Reese Levine	7ca5991d2b ggml webgpu: add support for emscripten builds (#17184)	1 month ago
Xuan-Son Nguyen	13628d8bdb server: add --media-path for local media files (#17697)	1 month ago
Xuan-Son Nguyen	a96283adc4 mtmd: fix --no-warmup (#17695)	1 month ago
Xuan-Son Nguyen	ec18edfcba server: introduce API for serving / loading / unloading multiple models (#17470)	1 month ago
Xuan-Son Nguyen	7733409734 common: improve verbosity level definitions (#17630)	1 month ago
Aaron Teo	def5404f26 common: add LLAMA_LOG_FILE env var (#17609)	1 month ago
ddh0	5a6241feb0 common: update env var name (#17588)	1 month ago
Xuan-Son Nguyen	e509411cf1 server: enable jinja by default, update docs (#17524)	1 month ago
Aaron Teo	877566d512 llama: introduce support for model-embedded sampling parameters (#17120)	1 month ago

Newer Older

Commit History Find

Commit History