Djip007
|
2cd43f4900
ggml : more perfo with llamafile tinyblas on x86_64 (#10714)
|
1 year ago |
NeverLucky
|
09fe2e7613
server: allow filtering llama server response fields (#10940)
|
1 year ago |
Georgi Gerganov
|
30caac3a68
llama : the WPM vocabs use the CLS token as BOS (#10930)
|
1 year ago |
Diego Devesa
|
60cfa728e2
ggml : use wstring for backend search paths (#10960)
|
1 year ago |
Diego Devesa
|
3327bb0f8d
ggml : fix arm enabled features check (#10961)
|
1 year ago |
Diego Devesa
|
32d6ee6385
ggml : fix const usage in SSE path (#10962)
|
1 year ago |
Xuan Son Nguyen
|
14b699ecde
server : fix missing model id in /model endpoint (#10957)
|
1 year ago |
Xuan Son Nguyen
|
485dc01214
server : add system_fingerprint to chat/completion (#10917)
|
1 year ago |
Radoslav Gerganov
|
86bf31cfe6
rpc-server : add support for the SYCL backend (#10934)
|
1 year ago |
Yun Dou
|
b92a14a841
llama : support InfiniAI Megrez 3b (#10893)
|
1 year ago |
ymcki
|
6f0c9e034b
llama : support for Llama-3_1-Nemotron-51B (#10669)
|
1 year ago |
Eric Curtin
|
dab76c92cc
llama-run : include temperature option (#10899)
|
1 year ago |
yuri@FreeBSD
|
7024d59e6a
ggml : fix run-time on FreeBSD in get_executable_path() (#10948)
|
1 year ago |
Rudi Servo
|
7c0e285858
devops : add docker-multi-stage builds (#10832)
|
1 year ago |
Billel Mokeddem
|
7ae33a616f
llama : add Falcon3 support (#10883)
|
1 year ago |
Jeff Bolz
|
ebdee9478c
vulkan: build fixes for 32b (#10927)
|
1 year ago |
Georgi Gerganov
|
5cd85b5e00
convert : add BertForMaskedLM (#10919)
|
1 year ago |
Jeff Bolz
|
a91a41364b
vulkan: optimize coopmat2 dequant functions (#10855)
|
1 year ago |
Adrien Gallouët
|
e34c5af43f
ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (#10874)
|
1 year ago |
Akarshan Biswas
|
eb5c3dc64b
SYCL: Migrate away from deprecated ggml_tensor->backend (#10840)
|
1 year ago |
Xuan Son Nguyen
|
0ca416c91a
server : (UI) fix copy to clipboard function (#10916)
|
1 year ago |
Diego Devesa
|
21ae3b9be8
ggml : add test for SVE and disable when it fails (#10906)
|
1 year ago |
Molly Sophia
|
0a11f8b7b5
convert : fix RWKV v6 model conversion (#10913)
|
1 year ago |
Georgi Gerganov
|
d408bb9268
clip : disable GPU support (#10896)
|
1 year ago |
Georgi Gerganov
|
5cab3e4aaa
llama : minor grammar refactor (#10897)
|
1 year ago |
Georgi Gerganov
|
36319dec5d
tts : small QoL for easy model fetch (#10903)
|
1 year ago |
Xuan Son Nguyen
|
57bb2c40cd
server : fix logprobs, make it OAI-compatible (#10783)
|
1 year ago |
Adrien Gallouët
|
a3c33b1dce
ggml: fix arm build with gcc (#10895)
|
1 year ago |
Sukriti Sharma
|
2fffc52b50
llama : fix Roberta embeddings (#10856)
|
1 year ago |
fairydreaming
|
7585edbdeb
convert : Add support for Microsoft Phi-4 model (#10817)
|
1 year ago |