Georgi Gerganov
|
13b4548877
cmake : do not include ./src as public for libllama (#13062)
|
8 mesiacov pred |
Xuan-Son Nguyen
|
7c727fbe39
arg : add --no-mmproj-offload (#13093)
|
8 mesiacov pred |
Xuan-Son Nguyen
|
80982e815e
arg : clean up handling --mmproj with -hf (#13082)
|
8 mesiacov pred |
Xuan-Son Nguyen
|
243453533e
llava : update documentations (#13055)
|
9 mesiacov pred |
Xuan-Son Nguyen
|
84a9bf2fc2
mtmd : merge llava, gemma3 and minicpmv CLI into single `llama-mtmd-cli` (#13012)
|
9 mesiacov pred |
tastelikefeet
|
b2034c2b55
contrib: support modelscope community (#12664)
|
9 mesiacov pred |
Prajwal B Mehendarkar
|
1d343b4069
arg : Including limits file on AIX (#12822)
|
9 mesiacov pred |
Sergey Fedorov
|
f1e3eb4249
common : fix includes in arg.cpp and gemma3-cli.cpp (#12766)
|
9 mesiacov pred |
エシュナヴァリシア
|
c6ff5d2a8d
common: custom hf endpoint support (#12769)
|
9 mesiacov pred |
Diego Devesa
|
e0e912f49b
llama : add option to override model tensor buffers (#11397)
|
9 mesiacov pred |
Xuan-Son Nguyen
|
267c1399f1
common : refactor downloading system, handle mmproj with -hf option (#12694)
|
9 mesiacov pred |
Piotr
|
2099a9d5db
server : Support listening on a unix socket (#12613)
|
9 mesiacov pred |
marcoStocchi
|
f4c3dd5daa
llama-tts : add '-o' option (#12398)
|
10 mesiacov pred |
Sigbjørn Skjæret
|
774973b8f3
main : add -sysf / --system-prompt-file (#12249) (#12250)
|
10 mesiacov pred |
Xuan-Son Nguyen
|
be7c303410
arg : no n_predict = -2 for examples except for main and infill (#12364)
|
10 mesiacov pred |
marcoStocchi
|
6ef79a67ca
common : refactor '-o' option (#12278)
|
10 mesiacov pred |
Georgi Gerganov
|
1e2f78a004
server : add speculative decoding presets for FIM (#12287)
|
10 mesiacov pred |
Sigbjørn Skjæret
|
56d7a9f812
main: allow preloading conversation with -p and add -st / --single-turn (#12145)
|
10 mesiacov pred |
dm4
|
c43af9276b
tts: add speaker file support (#12048)
|
10 mesiacov pred |
Sigbjørn Skjæret
|
45a8e76745
common : add --system-prompt parameter, replace behavior of -p in conversation mode (#12131)
|
10 mesiacov pred |
Daniel Bevenius
|
d07c621393
common : add llama.vim preset for Qwen2.5 Coder (#11945)
|
11 mesiacov pred |
Olivier Chafik
|
63e489c025
tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)
|
11 mesiacov pred |
standby24x7
|
fe163d5bf3
common : Fix a typo in help (#11899)
|
11 mesiacov pred |
Georgi Gerganov
|
68ff663a04
repo : update links to new url (#11886)
|
11 mesiacov pred |
Daniel Bevenius
|
3d68f034da
llama : add completion for --chat-template-file (#11860)
|
11 mesiacov pred |
Daniel Bevenius
|
c48f630d1c
llama : add --completion-bash option (#11846)
|
11 mesiacov pred |
Olivier Chafik
|
c7f460ab88
`server`: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless `--reasoning-format none` (#11607)
|
11 mesiacov pred |
Vinesh Janarthanan
|
27e8a23300
sampling: add Top-nσ sampler (#11223)
|
11 mesiacov pred |
Maxim Evtush
|
7b891bdc86
fix: typos in documentation files (#11791)
|
11 mesiacov pred |
Daniel Bevenius
|
b7552cfcbc
common : add default embeddings presets (#11677)
|
11 mesiacov pred |