Eric Curtin
|
dab76c92cc
llama-run : include temperature option (#10899)
|
пре 1 година |
yuri@FreeBSD
|
7024d59e6a
ggml : fix run-time on FreeBSD in get_executable_path() (#10948)
|
пре 1 година |
Rudi Servo
|
7c0e285858
devops : add docker-multi-stage builds (#10832)
|
пре 1 година |
Billel Mokeddem
|
7ae33a616f
llama : add Falcon3 support (#10883)
|
пре 1 година |
Jeff Bolz
|
ebdee9478c
vulkan: build fixes for 32b (#10927)
|
пре 1 година |
Georgi Gerganov
|
5cd85b5e00
convert : add BertForMaskedLM (#10919)
|
пре 1 година |
Jeff Bolz
|
a91a41364b
vulkan: optimize coopmat2 dequant functions (#10855)
|
пре 1 година |
Adrien Gallouët
|
e34c5af43f
ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (#10874)
|
пре 1 година |
Akarshan Biswas
|
eb5c3dc64b
SYCL: Migrate away from deprecated ggml_tensor->backend (#10840)
|
пре 1 година |
Xuan Son Nguyen
|
0ca416c91a
server : (UI) fix copy to clipboard function (#10916)
|
пре 1 година |
Diego Devesa
|
21ae3b9be8
ggml : add test for SVE and disable when it fails (#10906)
|
пре 1 година |
Molly Sophia
|
0a11f8b7b5
convert : fix RWKV v6 model conversion (#10913)
|
пре 1 година |
Georgi Gerganov
|
d408bb9268
clip : disable GPU support (#10896)
|
пре 1 година |
Georgi Gerganov
|
5cab3e4aaa
llama : minor grammar refactor (#10897)
|
пре 1 година |
Georgi Gerganov
|
36319dec5d
tts : small QoL for easy model fetch (#10903)
|
пре 1 година |
Xuan Son Nguyen
|
57bb2c40cd
server : fix logprobs, make it OAI-compatible (#10783)
|
пре 1 година |
Adrien Gallouët
|
a3c33b1dce
ggml: fix arm build with gcc (#10895)
|
пре 1 година |
Sukriti Sharma
|
2fffc52b50
llama : fix Roberta embeddings (#10856)
|
пре 1 година |
fairydreaming
|
7585edbdeb
convert : Add support for Microsoft Phi-4 model (#10817)
|
пре 1 година |
Johannes Gäßler
|
cd920d0ac3
tests: disable GGUF test for bad value size (#10886)
|
пре 1 година |
Eric Curtin
|
7909e8588d
llama-run : improve progress bar (#10821)
|
пре 1 година |
Diego Devesa
|
9177484f58
ggml : fix arm build (#10890)
|
пре 1 година |
Georgi Gerganov
|
0bf2d10c55
tts : add OuteTTS support (#10784)
|
пре 1 година |
Gaetan Bisson
|
7bbb5acf12
server: avoid overwriting Authorization header (#10878)
|
пре 1 година |
Georgi Gerganov
|
152610eda9
server : output embeddings for all tokens when pooling = none (#10861)
|
пре 1 година |
Georgi Gerganov
|
0e70ba686e
server : add "tokens" output (#10853)
|
пре 1 година |
Xuan Son Nguyen
|
46828872c3
server : (embeddings) using same format for "input" and "content" (#10872)
|
пре 1 година |
redbeard
|
6b064c92b4
docs: Fix HIP (née hipBLAS) in README (#10880)
|
пре 1 година |
Diego Devesa
|
4da69d1abd
Revert "llama : add Falcon3 support (#10864)" (#10876)
|
пре 1 година |
DAN™
|
d62b532c52
Use model->gguf_kv for loading the template instead of using the C API. (#10868)
|
пре 1 година |