Isaac McFadyen
|
f865ea149d
server: added more docs for response_fields field (#10995)
|
пре 1 година |
Alexey Parfenov
|
16cdce7b68
server : fix token duplication when streaming with stop strings (#10997)
|
пре 1 година |
Eve
|
d79d8f39b4
vulkan: multi-row k quants (#10846)
|
пре 1 година |
Peter
|
d283d02bf2
examples, ggml : fix GCC compiler warnings (#10983)
|
пре 1 година |
Reza Kakhki
|
9ba399dfa7
server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967)
|
пре 1 година |
Djip007
|
2cd43f4900
ggml : more perfo with llamafile tinyblas on x86_64 (#10714)
|
пре 1 година |
NeverLucky
|
09fe2e7613
server: allow filtering llama server response fields (#10940)
|
пре 1 година |
Georgi Gerganov
|
30caac3a68
llama : the WPM vocabs use the CLS token as BOS (#10930)
|
пре 1 година |
Diego Devesa
|
60cfa728e2
ggml : use wstring for backend search paths (#10960)
|
пре 1 година |
Diego Devesa
|
3327bb0f8d
ggml : fix arm enabled features check (#10961)
|
пре 1 година |
Diego Devesa
|
32d6ee6385
ggml : fix const usage in SSE path (#10962)
|
пре 1 година |
Xuan Son Nguyen
|
14b699ecde
server : fix missing model id in /model endpoint (#10957)
|
пре 1 година |
Xuan Son Nguyen
|
485dc01214
server : add system_fingerprint to chat/completion (#10917)
|
пре 1 година |
Radoslav Gerganov
|
86bf31cfe6
rpc-server : add support for the SYCL backend (#10934)
|
пре 1 година |
Yun Dou
|
b92a14a841
llama : support InfiniAI Megrez 3b (#10893)
|
пре 1 година |
ymcki
|
6f0c9e034b
llama : support for Llama-3_1-Nemotron-51B (#10669)
|
пре 1 година |
Eric Curtin
|
dab76c92cc
llama-run : include temperature option (#10899)
|
пре 1 година |
yuri@FreeBSD
|
7024d59e6a
ggml : fix run-time on FreeBSD in get_executable_path() (#10948)
|
пре 1 година |
Rudi Servo
|
7c0e285858
devops : add docker-multi-stage builds (#10832)
|
пре 1 година |
Billel Mokeddem
|
7ae33a616f
llama : add Falcon3 support (#10883)
|
пре 1 година |
Jeff Bolz
|
ebdee9478c
vulkan: build fixes for 32b (#10927)
|
пре 1 година |
Georgi Gerganov
|
5cd85b5e00
convert : add BertForMaskedLM (#10919)
|
пре 1 година |
Jeff Bolz
|
a91a41364b
vulkan: optimize coopmat2 dequant functions (#10855)
|
пре 1 година |
Adrien Gallouët
|
e34c5af43f
ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (#10874)
|
пре 1 година |
Akarshan Biswas
|
eb5c3dc64b
SYCL: Migrate away from deprecated ggml_tensor->backend (#10840)
|
пре 1 година |
Xuan Son Nguyen
|
0ca416c91a
server : (UI) fix copy to clipboard function (#10916)
|
пре 1 година |
Diego Devesa
|
21ae3b9be8
ggml : add test for SVE and disable when it fails (#10906)
|
пре 1 година |
Molly Sophia
|
0a11f8b7b5
convert : fix RWKV v6 model conversion (#10913)
|
пре 1 година |
Georgi Gerganov
|
d408bb9268
clip : disable GPU support (#10896)
|
пре 1 година |
Georgi Gerganov
|
5cab3e4aaa
llama : minor grammar refactor (#10897)
|
пре 1 година |