Jeff Bolz
|
bbbf5ecccb
vulkan: handle large sizes for get_rows (#15686)
|
4 months ago |
Jeff Bolz
|
c37052ab4d
vulkan: mul_mat_id coopmat2 optimizations (#15546)
|
4 months ago |
Daniel Bevenius
|
5c16b9c87d
vulkan : remove unused portability_enumeration_ext variable (#15679)
|
4 months ago |
Jeff Bolz
|
b97c9edc59
vulkan: Allow fallback to sysmem memory when vidmem is full (#15649)
|
4 months ago |
Jeff Bolz
|
94e82c7ead
vulkan: clamp matmul and FA results to the max finite value (#15652)
|
4 months ago |
Charles Xu
|
4d74393bcc
ggml: update kleidiai to v1.13.0 (#15663)
|
4 months ago |
Diego Devesa
|
dd892555b0
Update build.md to remove MSVC arm64 notes (#15684)
|
4 months ago |
Johannes Gäßler
|
e81b8e4b7f
llama: use FA + max. GPU layers by default (#15434)
|
4 months ago |
Johannes Gäßler
|
38ad381f9f
CUDA: use FP32 arithmetic for conv2d (#15683)
|
4 months ago |
Jeff Bolz
|
696fccf354
vulkan: Skip syncing for prealloc_y when it is reused (#15544)
|
4 months ago |
Chenguang Li
|
ef476916bb
CANN: FIx compiler warnings (#15661)
|
4 months ago |
Sergey Alirzaev
|
d82f6aa34a
server : removed obsolete doc (#15670)
|
4 months ago |
Johannes Gäßler
|
3d16b29c3b
scripts: strip "AMD Instinct" from GPU name (#15668)
|
4 months ago |
ExtReMLapin
|
792b44f2ed
server : add documentation for `parallel_tool_calls` param (#15647)
|
4 months ago |
Aman Gupta
|
81017865ee
CUDA: fix bug in rms_norm fusion (#15660)
|
4 months ago |
Piotr Wilkin (ilintar)
|
60e5eee31f
chat : Seed OSS thinking + tool call support (#15552)
|
4 months ago |
Aman Gupta
|
009b709d6e
CUDA: fuse adds, fuse add with rms norm (#15631)
|
4 months ago |
Gabe Goodhart
|
e8d99dd0b6
nvidia nemotron nano v2 (nemotronh) (#15507)
|
4 months ago |
Gabe Goodhart
|
a8bca68f72
fix: Compute the full sum in llama-eval-callback, not just the sum of printed values (#15637)
|
4 months ago |
mnehete32
|
c97dc09391
CUDA: add conv2d (#15635)
|
4 months ago |
Aaron Teo
|
6c442f42ff
ggml-cpu: fix invalid hsum build in debug s390x (#15634)
|
4 months ago |
compilade
|
73804145ab
ggml : fix SSM_SCAN for n_groups > 1 (#15625)
|
4 months ago |
Georgi Gerganov
|
c8d0d14e77
kv-cache : fix find_slot to not search for continuous slot (#15638)
|
4 months ago |
Sigbjørn Skjæret
|
84ab83cc0b
model : jina-embeddings-v3 support (#13693)
|
4 months ago |
Aman Gupta
|
55042b3692
scripts: add sqlite3 check for compare-commits.sh (#15633)
|
4 months ago |
Georgi Gerganov
|
8a4280ce43
kv-cache : remove LLAMA_SET_ROWS checks (#15505)
|
4 months ago |
Aleksei Nikiforov
|
64387f6e95
gguf-py: byteswapping improvements (#12851)
|
4 months ago |
Joshua Cogliati
|
d35a1e8c41
cli : change log to warning to explain reason for stopping (#15604)
|
4 months ago |
Daniel Bevenius
|
46d9caa27a
model-conversion : add mmproj conversion target (#15628)
|
4 months ago |
matiaslin
|
5a0e3ef6f0
cuda: Add cublasLt_static linking when GGML_STATIC is enabled (#15622)
|
4 months ago |