Xuan-Son Nguyen
|
9e39a1e6a9
server: support load model on startup, support preset-only options (#18206)
|
4 weeks ago |
Pascal
|
14931a826e
arg: fix order to use short form before long form (#18196)
|
4 weeks ago |
Xuan-Son Nguyen
|
98c1c7a7bf
presets: refactor, allow cascade presets from different sources, add global section (#18169)
|
4 weeks ago |
Xuan-Son Nguyen
|
8ea958d4d9
model : add ASR support for LFM2-Audio-1.5B (conformer) (#18106)
|
4 weeks ago |
Xuan-Son Nguyen
|
4d1316c440
arg: fix ASAN error on sampler_type_names empty (#18167)
|
4 weeks ago |
Pascal
|
6ce3d85796
server: (webui) add --webui-config (#18028)
|
1 month ago |
Pascal
|
487674fbb3
common: fix --override-kv to support comma-separated values (#18056)
|
1 month ago |
TrevorS
|
4b2a4778f8
arg: allow -kvu flag for llama-perplexity (#18117)
|
1 month ago |
Xuan-Son Nguyen
|
7b1db3d3b7
arg: clarify auto kvu/np being set on server (#17997)
|
1 month ago |
Johannes Gäßler
|
b1f3a6e5db
llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653)
|
1 month ago |
Georgi Gerganov
|
254098a279
common : refactor common_sampler + grammar logic changes (#17937)
|
1 month ago |
Xuan-Son Nguyen
|
4d5ae24c0a
arg: fix common_params_parse not accepting negated arg (#17991)
|
1 month ago |
Sigbjørn Skjæret
|
8e4d678528
common : skip model validation when --completion-bash is requested (#17975)
|
1 month ago |
Sigbjørn Skjæret
|
2bc94e7928
add llama-completion to completion-bash executables (#17976)
|
1 month ago |
Xuan-Son Nguyen
|
380b4c984e
common: support negated args (#17919)
|
1 month ago |
Xuan-Son Nguyen
|
54a0fee4b7
arg: add -mm and -mmu as short form of --mmproj and --mmproj-url (#17958)
|
1 month ago |
Xuan-Son Nguyen
|
34a6d86982
cli: enable jinja by default (#17911)
|
1 month ago |
Pascal
|
f32ca51bfe
server: add presets (config) when using multiple models (#17859)
|
1 month ago |
Xuan-Son Nguyen
|
6c2131773c
cli: new CLI experience (#17824)
|
1 month ago |
Sigbjørn Skjæret
|
22577583a3
common : change --color to accept on/off/auto, default to auto (#17827)
|
1 month ago |
Daniel Bevenius
|
bd4ef13476
common : skip model validation when --help is requested (#17755)
|
1 month ago |
Reese Levine
|
7ca5991d2b
ggml webgpu: add support for emscripten builds (#17184)
|
1 month ago |
Xuan-Son Nguyen
|
13628d8bdb
server: add --media-path for local media files (#17697)
|
1 month ago |
Xuan-Son Nguyen
|
a96283adc4
mtmd: fix --no-warmup (#17695)
|
1 month ago |
Xuan-Son Nguyen
|
ec18edfcba
server: introduce API for serving / loading / unloading multiple models (#17470)
|
1 month ago |
Xuan-Son Nguyen
|
7733409734
common: improve verbosity level definitions (#17630)
|
1 month ago |
Aaron Teo
|
def5404f26
common: add LLAMA_LOG_FILE env var (#17609)
|
1 month ago |
ddh0
|
5a6241feb0
common: update env var name (#17588)
|
1 month ago |
Xuan-Son Nguyen
|
e509411cf1
server: enable jinja by default, update docs (#17524)
|
1 month ago |
Aaron Teo
|
877566d512
llama: introduce support for model-embedded sampling parameters (#17120)
|
1 month ago |