Dmytro Minochkin
|
0499b29c6f
vulkan: throw system error instead of SIGABRT during init on older devices (#16156)
|
3 months ago |
Adrien Gallouët
|
234e2ff8ed
server : remove old LLAMA_SERVER_SSL (#16290)
|
3 months ago |
Jeff Bolz
|
3f81b4e91c
vulkan: support GET_ROWS for k-quants (#16235)
|
3 months ago |
Adrien Gallouët
|
ace6a54565
build : add LLAMA_OPENSSL option (#16287)
|
3 months ago |
Vinkal
|
72b24d96c6
model : make minicpm embedding_scale, residual_scale and logit_scale optional with legacy defaults (#16273)
|
3 months ago |
Aaron Teo
|
624207e676
devops: add s390x & ppc64le CI (#15925)
|
3 months ago |
Aleksander Grygier
|
807e8c6d31
Enhance text file detection logic for file attachments (#16199)
|
3 months ago |
Aleksander Grygier
|
1a18927894
Allow viewing conversations even when llama server is down (#16255)
|
3 months ago |
Isaac McFadyen
|
e0539eb6ae
webui: switch to hash-based routing (alternative of #16079) (#16157)
|
3 months ago |
Aleksander Grygier
|
5d0a40f390
Always show message actions for mobile UI + improvements for user message sizing (#16076)
|
3 months ago |
Radoslav Gerganov
|
d12a983659
codeowners : add rgerganov as owner of RPC [no ci] (#16279)
|
3 months ago |
Aleksei Nikiforov
|
cc1cfa277b
mtmd : fix uninitialized variable in bicubic_resize (#16275)
|
3 months ago |
Georgi Gerganov
|
54dbc37053
metal : report OOM errors (#16274)
|
3 months ago |
Adrien Gallouët
|
b995a10760
common : use cpp-httplib as a cURL alternative for downloads (#16185)
|
3 months ago |
Adrien Gallouët
|
4710dd31bb
build : fix build-ios-device (#16257)
|
3 months ago |
Aaron Teo
|
9b26511857
ggml-cpu: implement MXFP4 SIMD for s390x (#16193)
|
3 months ago |
Radoslav Gerganov
|
00217cd413
ci : create git tags for released docker images (#16008)
|
3 months ago |
Daniel Bevenius
|
3b337b01a1
codeowners : add danbev as owner of build-xcframework.sh [no ci] (#16268)
|
3 months ago |
R0CKSTAR
|
a86a580a66
musa: upgrade musa sdk to 4.3.0 (#16240)
|
3 months ago |
R0CKSTAR
|
0f7c69689f
musa: fix build warnings (#15611)
|
3 months ago |
Sigbjørn Skjæret
|
835b2b915c
model : add GroveMoE support (#15510)
|
3 months ago |
Aaron Teo
|
b05a9d650f
vendors: update miniaudio version (#16212)
|
3 months ago |
rtaluyev
|
27052978e4
readme : update bindings (#16144)
|
3 months ago |
Aman Gupta
|
077c94d0ca
CUDA: add a fused top-K MoE kernel (#16130)
|
3 months ago |
Daniel Bevenius
|
aa3ee0eb0b
model-conversion : add embedding prompt file support (#15871)
|
3 months ago |
Daniel Bevenius
|
d0991da39d
server : add support for external server for tests (#16243)
|
3 months ago |
junchao-zhao
|
aa719c2f88
ggml : fix loongarch lsx compilation error (#15864)
|
3 months ago |
Johannes Gäßler
|
4cdd0bb453
docs: fix typo [no ci] (#16244)
|
3 months ago |
Douglas Hanley
|
b5bd037832
llama : add support for qwen3 reranker (#15824)
|
3 months ago |
Georgi Gerganov
|
dfcd53f7ec
metal : fuse NORM + MUL + ADD, support non-multiples of 4 (#16220)
|
3 months ago |