Sigbjørn Skjæret
|
b2ba81dbe0
ci : fix ccache key for ubuntu-cpu-cmake (#16355)
|
3 months ago |
Adrien Gallouët
|
bf6f3b3a19
common : disable progress bar without a tty (#16352)
|
3 months ago |
lhez
|
7c156df414
opencl: support pad_ext (#15888)
|
3 months ago |
Pascal
|
16b0ca0d2e
Chatapi ignore empty sampling (#16330)
|
3 months ago |
Reese Levine
|
8d78cd2613
ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187)
|
3 months ago |
lhez
|
d1c84a662d
opencl: support ne3 in get_rows (#15866)
|
3 months ago |
Adrien Gallouët
|
364a7a6d4a
common : remove common_has_curl() (#16351)
|
3 months ago |
Sigbjørn Skjæret
|
2df5bcf357
ci : disable ccache for android (#16348)
|
3 months ago |
Georgi Gerganov
|
075c01567b
ggml : bump version to 0.9.4 (ggml/1363)
|
3 months ago |
anavp-nvidia
|
a014310374
cuda : Enable CUDA Graph usage for Nemotron Nano v2 (NemotronH) (#16328)
|
3 months ago |
Georgi Gerganov
|
35fb82497e
metal : dynamic simdgroups for MV kernels (#16340)
|
3 months ago |
Adrien Gallouët
|
3c62aed89f
common : simplify etag tracking by removing json (#16342)
|
3 months ago |
Charles Xu
|
f1eb1cb1eb
kleidiai : fix work size and threads sync for fp16 (#16246)
|
3 months ago |
lhez
|
de41f2b7bf
codeowners: add codeowners for opencl backend (#16344)
|
3 months ago |
Jeff Bolz
|
a74a0d69f3
tests: override test_set_rows::max_nmse_err to allow for occasional rounding differences (#16295)
|
3 months ago |
Pascal
|
5f7e166cbf
Fix thinking blocks with quotes + add handling `[THINK]...[/THINK]` blocks (#16326)
|
3 months ago |
Georgi Gerganov
|
d72f5f7ba2
ci : add AMD runners and workflows (#16249)
|
3 months ago |
alex-spacemit
|
b77e6c18e1
ggml: riscv: add riscv spacemit backend (#15288)
|
3 months ago |
Georgi Gerganov
|
2ddd3f2356
sync : ggml
|
3 months ago |
Georgi Gerganov
|
4d3d455d3c
sync : whisper.cpp (ggml/1359)
|
3 months ago |
Daniel Bevenius
|
c9b1c06467
ggml : remove -dev suffix from release version (ggml/1355)
|
3 months ago |
Daniel Bevenius
|
b6ae75afb4
ggml : bump version to 0.9.3 (ggml/1353)
|
3 months ago |
Georgi Gerganov
|
b6dff20e2f
ggml : prepare for development of 0.9.2-dev
|
4 months ago |
Georgi Gerganov
|
2db78c75e4
ggml : bump version to 0.9.1
|
4 months ago |
Rafal Lewczuk
|
02463ab27b
ggml-backend : add root cause in error message if loading backend library fails (#16172)
|
3 months ago |
Sigbjørn Skjæret
|
adc76347d7
ggml : check cuda and metal argsort limits and add test (#16323)
|
3 months ago |
Aleksander Grygier
|
3a2bdcda0b
Improve Mobile UI for dialogs and action dropdowns (#16222)
|
3 months ago |
Pascal
|
66bb7985c3
fix: preserved zero values in chat settings inputs and textareas by switching to nullish coalescing for field values and default placeholders (#16312)
|
3 months ago |
Vinkal
|
2f61c0f5bf
llama-cli: prevent spurious assistant token (#16202)
|
3 months ago |
ddh0
|
3ffd0fae47
perplexity : show more kl-divergence data (#16321)
|
3 months ago |