David Huang
|
7f323a589f
Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)
|
8 months ago |
Xuan-Son Nguyen
|
7fef11766c
arg : add env var to control mmproj (#13416)
|
8 months ago |
Xuan-Son Nguyen
|
33eff40240
server : vision support via libmtmd (#12898)
|
8 months ago |
Bartowski
|
efb8b47eda
imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (#13389)
|
8 months ago |
Georgi Gerganov
|
51fb96b1ff
context : remove logits_all flag (#13284)
|
8 months ago |
Georgi Gerganov
|
4773d7a02f
examples : remove infill (#13283)
|
8 months ago |
Xuan-Son Nguyen
|
9b61acf060
mtmd : rename llava directory to mtmd (#13311)
|
8 months ago |
Diego Devesa
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
8 months ago |
Georgi Gerganov
|
fab647e884
server : add cache reuse card link to help (#13230)
|
8 months ago |
Xuan-Son Nguyen
|
13c9a3319b
arg : remove CURLINFO_EFFECTIVE_METHOD (#13228)
|
8 months ago |
Xuan-Son Nguyen
|
6f67cf1f48
arg : -hf do not fail if url mismatch (#13219)
|
8 months ago |
Olivier Chafik
|
3b127c7385
common : add -jf / --json-schema-file flag (#12011)
|
8 months ago |
Xuan-Son Nguyen
|
5933e6fdc9
arg : allow using -hf offline (#13202)
|
8 months ago |
Georgi Gerganov
|
43f2b07193
common : fix noreturn compile warning (#13151)
|
8 months ago |
Xuan-Son Nguyen
|
85f36e5e71
arg : fix unused variable (#13142)
|
8 months ago |
Xuan-Son Nguyen
|
2d451c8059
common : add common_remote_get_content (#13123)
|
8 months ago |
Georgi Gerganov
|
13b4548877
cmake : do not include ./src as public for libllama (#13062)
|
8 months ago |
Xuan-Son Nguyen
|
7c727fbe39
arg : add --no-mmproj-offload (#13093)
|
8 months ago |
Xuan-Son Nguyen
|
80982e815e
arg : clean up handling --mmproj with -hf (#13082)
|
8 months ago |
Xuan-Son Nguyen
|
243453533e
llava : update documentations (#13055)
|
9 months ago |
Xuan-Son Nguyen
|
84a9bf2fc2
mtmd : merge llava, gemma3 and minicpmv CLI into single `llama-mtmd-cli` (#13012)
|
9 months ago |
tastelikefeet
|
b2034c2b55
contrib: support modelscope community (#12664)
|
9 months ago |
Prajwal B Mehendarkar
|
1d343b4069
arg : Including limits file on AIX (#12822)
|
9 months ago |
Sergey Fedorov
|
f1e3eb4249
common : fix includes in arg.cpp and gemma3-cli.cpp (#12766)
|
9 months ago |
エシュナヴァリシア
|
c6ff5d2a8d
common: custom hf endpoint support (#12769)
|
9 months ago |
Diego Devesa
|
e0e912f49b
llama : add option to override model tensor buffers (#11397)
|
9 months ago |
Xuan-Son Nguyen
|
267c1399f1
common : refactor downloading system, handle mmproj with -hf option (#12694)
|
9 months ago |
Piotr
|
2099a9d5db
server : Support listening on a unix socket (#12613)
|
9 months ago |
marcoStocchi
|
f4c3dd5daa
llama-tts : add '-o' option (#12398)
|
10 months ago |
Sigbjørn Skjæret
|
774973b8f3
main : add -sysf / --system-prompt-file (#12249) (#12250)
|
10 months ago |