Georgi Gerganov
|
6d1616944d
ggml : prevent builds with -ffinite-math-only (#7726)
|
1 year ago |
Radoslav Gerganov
|
bde7cd3cd9
llama : offload to RPC in addition to other backends (#7640)
|
1 year ago |
Masaya, Kato
|
a5735e4426
ggml : use OpenMP as a thread pool (#7606)
|
1 year ago |
Johannes Gäßler
|
0b832d53ba
make: fix debug options not being applied to NVCC (#7714)
|
1 year ago |
0cc4m
|
3d7ebf6312
Vulkan Mixture of Experts (MoE) support (#7628)
|
1 year ago |
Andy Tai
|
a10cda58d3
cmake : add pkg-config spec file for llama.cpp (#7702)
|
1 year ago |
zhangkaihuo
|
6f28a333c1
llama : MiniCPM support tied embeddings (#7664)
|
1 year ago |
Georgi Gerganov
|
549279d804
llama : avoid double token-to-piece cache (#7654)
|
1 year ago |
woachk
|
9e405b6e2e
kompute : implement op_getrows_f32 (#6403)
|
1 year ago |
Dave Airlie
|
3413ae2193
fix bug introduced in using calloc (#7701)
|
1 year ago |
Georgi Gerganov
|
1669810d7c
flake.lock: Update (#7686)
|
1 year ago |
Austin
|
7c4e5b7eae
chore : add ignore rule for generated server themes (#7689)
|
1 year ago |
nickp27
|
9422c5e34b
[SYCL] Update rpc-server.cpp to include SYCL backend (#7682)
|
1 year ago |
Johannes Gäßler
|
e141ce624a
Fix FlashAttention debug test, FP32 assert (#7684)
|
1 year ago |
Yazan Agha-Schrader
|
2e666832e6
server : new UI (#7633)
|
1 year ago |
HanishKVC
|
2ac95c9d56
SimpleChat: Simple histogram/repeatMatching driven garbageTrimming, Settings UI, Streaming mode, OpenAi Compat (Model, Authorization Bearer), Save/Restore session, Auto Settings UI (#7548)
|
1 year ago |
Johannes Gäßler
|
750f60c03e
CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (#7681)
|
1 year ago |
Johannes Gäßler
|
9b596417af
CUDA: quantized KV support for FA vec (#7527)
|
1 year ago |
Georgi Gerganov
|
a323ec60af
server : update js (#7670)
|
1 year ago |
Galunid
|
0515ad93f4
convert-hf : Handle NotImplementedError in convert-hf-to-gguf (#7660)
|
1 year ago |
Johannes Gäßler
|
c8047d538f
scripts: update compare_llama_bench.py [no ci] (#7673)
|
1 year ago |
Daniele
|
30e238b246
Improve HIP compatibility (#7672)
|
1 year ago |
Georgi Gerganov
|
16926dff92
readme : link homebrew discussion
|
1 year ago |
Georgi Gerganov
|
0c27e6f62e
ggml : fix loongson compile warnings (#7537)
|
1 year ago |
Galunid
|
2e32f874e6
Somehow '**' got lost (#7663)
|
1 year ago |
Galunid
|
1af511fc22
Add convert.py removal to hot topics (#7662)
|
1 year ago |
Sertaç Özercan
|
0541f06296
[no ci] docs: add aikit to readme (#7650)
|
1 year ago |
JohnnyB
|
9022c33646
Fixed painfully slow single process builds. (#7326)
|
1 year ago |
Georgi Gerganov
|
5921b8f089
llama : cache llama_token_to_piece (#7587)
|
1 year ago |
Martin Delille
|
5dcdf94676
Fix conan badge display [no ci] (#7645)
|
1 year ago |