Eve
|
f09aefaa84
ci: update vulkan ci (#16294)
|
3 months ago |
Georgi Gerganov
|
bbd32bc038
ci : fix clean-up of old logs (#16381)
|
3 months ago |
Neo Zhang Jianyu
|
2be72c2b12
SYCL: Update to oneAPI 2025.2 (#16371)
|
3 months ago |
uvos
|
95ce098544
HIP: add IMbackK to codeowner (#16375)
|
3 months ago |
uvos
|
c8dedc9999
CI: reenable cdna in rocm docker builds (#16376)
|
3 months ago |
uvos
|
e95fec640f
HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0 (#16221)
|
3 months ago |
Shunta Saito
|
ded67b9444
llama : parameter conversion and loading fixes for PLaMo2 variants (#16075)
|
3 months ago |
uvos
|
1fe4e38cc2
ci: Properly install rocwmma for hip builds (#16305)
|
3 months ago |
Adrien Gallouët
|
4201deae9c
common: introduce http.h for httplib-based client (#16373)
|
3 months ago |
Aleksander Grygier
|
764799279f
Conversation action dialogs as singletons from Chat Sidebar + apply conditional rendering for Actions Dropdown for Chat Conversation Items (#16369)
|
3 months ago |
Aleksander Grygier
|
2a9b63383a
Improve code block color theming (#16325)
|
3 months ago |
Sigbjørn Skjæret
|
1104ca1a1c
ci : use registry cache for docker builds (#16366)
|
3 months ago |
Aleksander Grygier
|
4f1575921c
Add optional setting for showing "Model used:" information (#16337)
|
3 months ago |
Eve
|
132d673554
vulkan: make ggml_vk_default_dispatcher support older vulkan headers (#16345)
|
3 months ago |
Aleksander Grygier
|
aa9538a63a
webui: Remove running `llama-server` within WebUI `dev.sh` script (#16363)
|
3 months ago |
Bartowski
|
e74c92e842
model : support GLM 4.6 (make a few NextN/MTP tensors not required) (#16359)
|
3 months ago |
Sigbjørn Skjæret
|
b2ba81dbe0
ci : fix ccache key for ubuntu-cpu-cmake (#16355)
|
3 months ago |
Adrien Gallouët
|
bf6f3b3a19
common : disable progress bar without a tty (#16352)
|
3 months ago |
lhez
|
7c156df414
opencl: support pad_ext (#15888)
|
3 months ago |
Pascal
|
16b0ca0d2e
Chatapi ignore empty sampling (#16330)
|
3 months ago |
Reese Levine
|
8d78cd2613
ggml webgpu: support for rope,div,sub,glu,scale,cont operators (#16187)
|
3 months ago |
lhez
|
d1c84a662d
opencl: support ne3 in get_rows (#15866)
|
3 months ago |
Adrien Gallouët
|
364a7a6d4a
common : remove common_has_curl() (#16351)
|
3 months ago |
Sigbjørn Skjæret
|
2df5bcf357
ci : disable ccache for android (#16348)
|
3 months ago |
Georgi Gerganov
|
075c01567b
ggml : bump version to 0.9.4 (ggml/1363)
|
3 months ago |
anavp-nvidia
|
a014310374
cuda : Enable CUDA Graph usage for Nemotron Nano v2 (NemotronH) (#16328)
|
3 months ago |
Georgi Gerganov
|
35fb82497e
metal : dynamic simdgroups for MV kernels (#16340)
|
3 months ago |
Adrien Gallouët
|
3c62aed89f
common : simplify etag tracking by removing json (#16342)
|
3 months ago |
Charles Xu
|
f1eb1cb1eb
kleidiai : fix work size and threads sync for fp16 (#16246)
|
3 months ago |
lhez
|
de41f2b7bf
codeowners: add codeowners for opencl backend (#16344)
|
3 months ago |