Georgi Gerganov
|
5437d4aaf5
sync : ggml
|
1 سال پیش |
Georgi Gerganov
|
78f766768d
cmake : fix "amd64" processor string (whisper/2638)
|
1 سال پیش |
gn64
|
8dd19a4812
vulkan : fix soft_max.comp division by zero (whisper/2633)
|
1 سال پیش |
Daniel Bevenius
|
130d0c90bd
ggml : remove return from ggml_gallocr_allocate_node (ggml/1048)
|
1 سال پیش |
Daniel Bevenius
|
3919da8e33
ggml : add check for grad_accs (ggml/1046)
|
1 سال پیش |
Georgi Gerganov
|
0006f5a74a
ggml : update ggml_backend_cpu_device_supports_op (#10867)
|
1 سال پیش |
krystiancha
|
05c3a444b8
server : fill usage info in embeddings and rerank responses (#10852)
|
1 سال پیش |
Billel Mokeddem
|
382bc7f2e8
llama : add Falcon3 support (#10864)
|
1 سال پیش |
Ruan
|
4f51968aca
readme : update typos (#10863)
|
1 سال پیش |
Xuan Son Nguyen
|
227d7c5a7f
server : (UI) fix missing async generator on safari (#10857)
|
1 سال پیش |
Eve
|
7b1ec53f56
vulkan: bugfixes for small subgroup size systems + llvmpipe test (#10809)
|
1 سال پیش |
Zhiyuan Li
|
160bc039c8
rwkv6: add wkv6 support for Vulkan backend (#10829)
|
1 سال پیش |
Georgi Gerganov
|
08ea539df2
unicode : improve naming style (#10838)
|
1 سال پیش |
Georgi Gerganov
|
644fd71b44
sampling : refactor + optimize penalties sampler (#10803)
|
1 سال پیش |
Bartowski
|
4ddd199f6f
llava : Allow locally downloaded models for QwenVL (#10833)
|
1 سال پیش |
Valentin Mamedov
|
a0974156f3
llama : add Deepseek MoE v1 & GigaChat models (#10827)
|
1 سال پیش |
Georgi Gerganov
|
87cf323cef
scripts : change build path to "build-bench" for compare-commits.sh (#10836)
|
1 سال پیش |
Vinesh Janarthanan
|
5478bbcd17
server: (UI) add syntax highlighting and latex math rendering (#10808)
|
1 سال پیش |
Georgi Gerganov
|
b5ae1ddff9
gguf-py : bump to v0.13.0
|
1 سال پیش |
Michelle Tan
|
89d604f2c8
server: Fix `has_next_line` in JSON response (#10818)
|
1 سال پیش |
Evgeny Kurnevsky
|
e52aba537a
nix: allow to override rocm gpu targets (#10794)
|
1 سال پیش |
HimariO
|
ba1cb19cdd
llama : add Qwen2VL support + multimodal RoPE (#10361)
|
1 سال پیش |
cduk
|
56eea0781c
Removes spurious \r in output that causes logging in journalctl to treat lines as binary and therefore hidden by default (#10771)
|
1 سال پیش |
lhez
|
a76c56fa1a
Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (#10693)
|
1 سال پیش |
Eric Curtin
|
c27ac678dd
Opt class for positional argument handling (#10508)
|
1 سال پیش |
Corentin REGAL
|
11e07fd63b
fix: graceful shutdown for Docker images (#10815)
|
1 سال پیش |
Jett Janiak
|
4601a8bb67
gguf-py : numpy 2 newbyteorder fix (#9772)
|
1 سال پیش |
谢乃闻
|
9f35e44592
Fix crash caused by ggml_backend_load_all when launching on Android Activity (#10812)
|
1 سال پیش |
Eve
|
64ae065511
vulkan: small mul_mat_vec optimizations (#10665)
|
1 سال پیش |
Akarshan Biswas
|
83ed24a97b
SYCL: Reduce most of the compiler warnings (#10748)
|
1 سال پیش |