Michał Moskal
|
ff227703d6
sampling : support for llguidance grammars (#10224)
|
11 сар өмнө |
piDack
|
0cec062a63
llama : add support for GLM-Edge and GLM-Edge-V series models (#10573)
|
11 сар өмнө |
Olivier Chafik
|
53debe6f3c
ci: use sccache on windows HIP jobs (#11553)
|
11 сар өмнө |
Olivier Chafik
|
cfd74c86db
`sync`: minja (https://github.com/google/minja/commit/418a2364b56dc9be4ed9a1a2b0fb16fb53a7a22e) (#11574)
|
11 сар өмнө |
Eric Curtin
|
ecef206ccb
Implement s3:// protocol (#11511)
|
11 сар өмнө |
Olivier Chafik
|
5bbc7362cb
ci: simplify cmake build commands (#11548)
|
11 сар өмнө |
Olivier Chafik
|
aa6fb13213
`ci`: use sccache on windows instead of ccache (#11545)
|
11 сар өмнө |
Olivier Chafik
|
a83f528688
`tool-call`: fix llama 3.x and functionary 3.2, play nice w/ pydantic_ai package, update readme (#11539)
|
11 сар өмнө |
Olivier Chafik
|
b1bcd309fc
fix stop regression (#11543)
|
11 сар өмнө |
Olivier Chafik
|
5783575c9d
Fix chatml fallback for unsupported builtin templates (when --jinja not enabled) (#11533)
|
11 сар өмнө |
Olivier Chafik
|
4a2b196d03
server : fix --jinja when there's no tools or schema (typo was forcing JSON) (#11531)
|
11 сар өмнө |
Steve Grubb
|
1bd3047a93
common: Add missing va_end (#11529)
|
11 сар өмнө |
Daniel Bevenius
|
a2df2787b3
server : update help metrics processing/deferred (#11512)
|
11 сар өмнө |
Olivier Chafik
|
553f1e46e9
`ci`: ccache for all github worfklows (#11516)
|
11 сар өмнө |
Olivier Chafik
|
8b576b6c55
Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639)
|
11 сар өмнө |
uvos
|
27d135c970
HIP: require at least HIP 5.5
|
11 сар өмнө |
uvos
|
6af1ca48cb
HIP: Prepare reduction operators for wave 64
|
11 сар өмнө |
uvos
|
c300e68ef4
CUDA/HIP: add warp_size to cuda_device_info
|
11 сар өмнө |
Olivier Chafik
|
3d804dec76
sync: minja (#11499)
|
11 сар өмнө |
mgroeber9110
|
ffd0821c57
vocab : correctly identify LF token for GPT-2 style BPE tokenizer (#11496)
|
11 сар өмнө |
Daniel Bevenius
|
4314e56c4f
server : use lambda instead of std::bind (#11507)
|
11 сар өмнө |
Isaac McFadyen
|
496e5bf46b
server : (docs) added response format for /apply-template [no ci] (#11503)
|
11 сар өмнө |
Guspan Tanadi
|
7919256c57
readme : reference examples relative links (#11505)
|
11 сар өмнө |
Daniel Bevenius
|
e0449763a4
server : update json snippets in README.md [no ci] (#11492)
|
11 сар өмнө |
Nigel Bosch
|
eb7cf15a80
server : add /apply-template endpoint for additional use cases of Minja functionality (#11489)
|
11 сар өмнө |
Rémy Oudompheng
|
66ee4f297c
vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)
|
11 сар өмнө |
Daniel Bevenius
|
e51c47b401
server : update auto gen files comments [no ci] (#11484)
|
11 сар өмнө |
Jeff Bolz
|
2711d0215f
vulkan: Catch pipeline creation failure and print an error message (#11436)
|
11 сар өмнө |
Eric Curtin
|
f0d4b29edf
Parse https://ollama.com/library/ syntax (#11480)
|
11 сар өмнө |
Georgi Gerganov
|
815857791d
sync : ggml
|
11 сар өмнө |