Georgi Gerganov
|
eb420e1148
sync : ggml
|
преди 9 месеца |
cmdr2
|
cb79c2e7fa
ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187)
|
преди 9 месеца |
Diego Devesa
|
fe92821ea9
ggml : add bilinear upscale support (ggml/1185)
|
преди 9 месеца |
Diego Devesa
|
459895c326
ggml : add more generic custom op, remove deprecated custom ops (ggml/1183)
|
преди 9 месеца |
Georgi Gerganov
|
e4bf72d631
scripts : fix sync-ggml-am.sh
|
преди 9 месеца |
Xuan-Son Nguyen
|
8b9cc7cdd8
llava : introduce libmtmd (#12849)
|
преди 9 месеца |
Xuan-Son Nguyen
|
64eda5deb9
convert : ability to lazy-load safetensors remotely without downloading to disk (#12820)
|
преди 9 месеца |
Chenguang Li
|
fe5b78c896
CANN: Support more ops (#12841)
|
преди 9 месеца |
Prajwal B Mehendarkar
|
11d07e1e69
Fixes #12823 (#12830)
|
преди 9 месеца |
Rudi Servo
|
b0091ecc1e
docker : added all CPU to GPU images (#12749)
|
преди 9 месеца |
Piotr Kubaj
|
31f7803bc4
ggml-cpu-impl.h: do not redefine bool on POWER9 (#12856)
|
преди 9 месеца |
Piotr Kubaj
|
2391506ace
ggml-impl.h: fix build on POWER9 (#12855)
|
преди 9 месеца |
Bo Zheng
|
d3bd7193ba
llama : Support Qwen3 and Qwen3MoE (#12828)
|
преди 9 месеца |
R0CKSTAR
|
d9a63b2f2e
musa: enable freediskspace for docker image build (#12839)
|
преди 9 месеца |
Romain Biessy
|
8ed71242f4
sycl: update documentation to use -no-cnv (#12845)
|
преди 9 месеца |
Plamen Minev
|
381603a775
ci: detach common from the library (#12827)
|
преди 9 месеца |
Xuan-Son Nguyen
|
65a69e6e1b
clip : do not print ftype (#12832)
|
преди 9 месеца |
Georgi Gerganov
|
47277d6d1d
readme : add rpc backend (#12842)
|
преди 9 месеца |
Chenguang Li
|
6e1c4cebdb
CANN: Support Opt CONV_TRANSPOSE_1D and ELU (#12786)
|
преди 9 месеца |
Jeff Bolz
|
0090950f67
vulkan: In coopmat2 mmq, load q4_k/q5_k scales through shared memory (#12833)
|
преди 9 месеца |
Jeff Bolz
|
7ecd780b1a
vulkan: Use fp16 for the flash attention P*V multiplication (#12783)
|
преди 9 месеца |
Sigbjørn Skjæret
|
7538246e7c
cuda : add f32 to bf16 copy op (#12806)
|
преди 9 месеца |
Matt Clayton
|
b32efad2bc
llava: improve clip_ctx destructor to not memleak load_image_size (#12834)
|
преди 9 месеца |
Georgi Gerganov
|
a19b5cef16
llama : fix FA when KV cache is not used (i.e. embeddings) (#12825)
|
преди 9 месеца |
Xuan-Son Nguyen
|
78a1ba0a4f
server : fix thread.join() on exit (#12831)
|
преди 9 месеца |
dm4
|
2dabf759e7
llava: add more helper functions to check projector types in clip context (#12824)
|
преди 9 месеца |
Prajwal B Mehendarkar
|
1d343b4069
arg : Including limits file on AIX (#12822)
|
преди 9 месеца |
characharm
|
8ca6e1c3a4
server : webui : Improve Chat Input with Auto-Sizing Textarea (#12785)
|
преди 9 месеца |
Neo Zhang Jianyu
|
656babd6c2
Revert "sycl:remove redundant memcopy in function ggml_backend_sycl_buffer_set_tensor" (#12812)
|
преди 9 месеца |
compilade
|
a226bc7a9a
gguf-py : support lazy tensor splitting (#12809)
|
преди 9 месеца |