Olivier Chafik
|
669912d9a5
`tool-call`: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars (#12034)
|
vor 10 Monaten |
Daniel Bevenius
|
fa31c438e0
ci : fix xcframework artifact tag (#12191)
|
vor 10 Monaten |
Daniel Bevenius
|
3ccbfe5a71
ci : remove xframework upload (#12190)
|
vor 10 Monaten |
Clauszy
|
06a92a193a
server : fix cache reuse logic (#12161)
|
vor 10 Monaten |
Daniel Bevenius
|
a057897ad4
llama : add xcframework build script (#11996)
|
vor 10 Monaten |
mgroeber9110
|
5bbe6a9fe9
ggml : portability fixes for VS 2017 (#12150)
|
vor 10 Monaten |
Georgi Gerganov
|
20a9b8f5e1
readme : fix roadmap link (#12185)
|
vor 10 Monaten |
Sigbjørn Skjæret
|
56d7a9f812
main: allow preloading conversation with -p and add -st / --single-turn (#12145)
|
vor 10 Monaten |
Olivier Chafik
|
1a24c4621f
`server`: fix deadly typo in response_format.json_schema.schema handling (#12168)
|
vor 10 Monaten |
David Huang
|
becade5de7
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)
|
vor 10 Monaten |
Georgi Gerganov
|
dfd6b2c0be
sync : ggml
|
vor 10 Monaten |
cmdr2
|
b64d7cc272
cuda: unary ops as float + de-duplicate (ggml/1130)
|
vor 10 Monaten |
Georgi Gerganov
|
3d1cf3cf33
sync : ggml
|
vor 10 Monaten |
cmdr2
|
0cbee131ad
cuda/vulkan: specify fp32-only support for some operations in supports_op (ggml/1129)
|
vor 10 Monaten |
Georgi Gerganov
|
8371d44595
sync : ggml
|
vor 10 Monaten |
cmdr2
|
87abb7e903
cuda/cpu: Increase support for fp16 unary operations (ggml/1125)
|
vor 10 Monaten |
Diego Devesa
|
6d4c23b81b
whisper : support GGML_BACKEND_DL (whisper/2843)
|
vor 10 Monaten |
midnight
|
6512a90037
cmake : fix compile assumptions for power9/etc (whisper/2777)
|
vor 11 Monaten |
petterreinholdtsen
|
4512055792
Told cmake to install ggml-cpp.h as a public header file. (ggml/1126)
|
vor 10 Monaten |
cmdr2
|
f54a4ba11e
Support pure float16 add/sub/mul/div operations in the CUDA (and CPU) backend (ggml/1121)
|
vor 10 Monaten |
Georgi Gerganov
|
aede2074f6
scripts : sync-ggml-am.sh fix
|
vor 10 Monaten |
Daniel Bevenius
|
2679c3b55d
ci : set GITHUB_ACTION env var for server tests (#12162)
|
vor 10 Monaten |
dm4
|
c43af9276b
tts: add speaker file support (#12048)
|
vor 10 Monaten |
Diego Devesa
|
d5c63cd7f9
test-backend-ops : add option -p to filter by op params (#12155)
|
vor 10 Monaten |
ag2s20150909
|
9660ffef58
ggml : fix kleidiai build (#12159)
|
vor 10 Monaten |
Eric Curtin
|
c950a1f692
Adding UTF-8 support to llama.cpp (#12111)
|
vor 10 Monaten |
Xuan-Son Nguyen
|
7b69003af7
webui : add ?m=... and ?q=... params (#12148)
|
vor 10 Monaten |
Akarshan Biswas
|
ece9745bb8
SYCL: Move CPY kernels to a separate file and add few missing kernels (#12133)
|
vor 10 Monaten |
Diego Devesa
|
cc473cac7c
ggml-backend : keep paths in native string type when possible (#12144)
|
vor 10 Monaten |
Sigbjørn Skjæret
|
14dec0c2f2
main: use jinja chat template system prompt by default (#12118)
|
vor 10 Monaten |