Xuan-Son Nguyen
|
27aa259532
mtmd : add C public API (#13184)
|
8 months ago |
Diego Devesa
|
9fdfcdaedd
rpc : use backend registry, support dl backends (#13304)
|
8 months ago |
Aaron Teo
|
6eb7d25c70
ggml : activate s390x simd for Q3_K (#13301)
|
8 months ago |
Diego Devesa
|
86bd60d3fe
llava/mtmd : fixes to fully support dl backends (#13303)
|
8 months ago |
Diego Devesa
|
9f2da5871f
llama : build windows releases with dl backends (#13220)
|
8 months ago |
Johannes Gäßler
|
93c4e23905
CUDA: fix race condition in MMQ stream-k fixup (#13299)
|
8 months ago |
Johannes Gäßler
|
8afbd96818
CUDA: fix race condition in MMQ ids_dst (#13294)
|
8 months ago |
Jeff Bolz
|
8ae5ebcf85
vulkan: Additional type support for unary, binary, and copy (#13266)
|
8 months ago |
Johannes Gäßler
|
3e959f0976
imatrix: fix oob writes if src1 is not contiguous (#13286)
|
8 months ago |
Xuan-Son Nguyen
|
36667c8edc
clip : revert the change of BOI/EOI token for GLM-edge (⚠️ breaking change) (#13259)
|
8 months ago |
ymcki
|
3bf785f3ef
llama : Llama-3_1-Nemotron-Ultra-253B-v1 support (#12843)
|
8 months ago |
Diego Devesa
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
8 months ago |
Georgi Gerganov
|
b34443923c
sync : ggml (#13268)
|
8 months ago |
Georgi Gerganov
|
a75cb30dc9
context : fix reorder logic (#13267)
|
8 months ago |
shalinib-ibm
|
3f3769ba76
ggml : Enable MMA for BF16 in llamafile_sgemm (#13148)
|
8 months ago |
Jared Van Bortel
|
2f567611c0
llama-model : support Qwen2 embedding models and pooling_mode_lasttoken (#13245)
|
8 months ago |
Jared Van Bortel
|
7d2123484e
convert : use correct context length for nomic-embed-text-v2 (#13216)
|
8 months ago |
Xuan-Son Nguyen
|
074e42ab31
convert : converting mmproj for Qwen2/2.5VL from convert_hf_to_gguf (#13209)
|
8 months ago |
Georgi Gerganov
|
c642bc014c
kv-cache : separate recurrent vs non-recurrent impl (#12799)
|
8 months ago |
Sigbjørn Skjæret
|
cb06a3c363
llama : orion rope type is neox (#13261)
|
8 months ago |
Sigbjørn Skjæret
|
626083faf7
llama : plamo rope type is neox (#13260)
|
8 months ago |
piDack
|
2af6880178
llama-chat : reset glmedge chat template (#13253)
|
8 months ago |
Shakil Ahmed
|
e84773ab60
mtmd-cli : fix out_of_range when input image path is empty (#13244)
|
8 months ago |
Georgi Gerganov
|
fab647e884
server : add cache reuse card link to help (#13230)
|
8 months ago |
Xuan-Son Nguyen
|
dcf886007d
convert : explicitly disable trust_remote_code for AutoConfig (#13246)
|
8 months ago |
bandoti
|
d24d592808
ci: fix cross-compile sync issues (#12804)
|
8 months ago |
Justin Santa Barbara
|
8efbdadc61
rpc : avoid uninitialized memory in serialize_tensor (#13210)
|
8 months ago |
Jesse Gross
|
f057808ffa
ggml: Don't assert fail when tensor data changes (#13222)
|
8 months ago |
Diego Devesa
|
d7a14c42a1
build : fix build info on windows (#13239)
|
8 months ago |
Loïc Carrère
|
b6e4ff69b8
clip : (minicpmv) Re-enable upscaling of images smaller than the CLIP image size (#13237)
|
8 months ago |