zhangkaihuo
|
2c90da4c7e
llama : use llm_build_granite for minicpm (#13911)
|
7 месяцев назад |
Christian Kastner
|
ec9e0301fe
cmake: Guard GGML_CPU_ALL_VARIANTS by architecture (#13890)
|
7 месяцев назад |
Sigbjørn Skjæret
|
e83ba3e460
llama : add support for jina-reranker-v2 (#13900)
|
7 месяцев назад |
Sigbjørn Skjæret
|
2b131621e6
gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method (#13561)
|
7 месяцев назад |
Yibo Cai
|
54a2c7a8cd
arm64: optimize q4_k_q8_k kernel with i8mm (#13886)
|
7 месяцев назад |
Christian Kastner
|
21fcc21ad5
cmake: Factor out CPU architecture detection (#13883)
|
7 месяцев назад |
Vineel Abhinav
|
dd8ba93416
ggml: aarch64: Implement SVE F32 kernels for Mamba Sequential Scan Algorithm (#13882)
|
7 месяцев назад |
Georgi Gerganov
|
66c92061f5
tests : remove json.hpp from a test (#13880)
|
7 месяцев назад |
Sigbjørn Skjæret
|
5ca82fc1d7
convert : workaround for AutoConfig dummy labels (#13881)
|
7 месяцев назад |
Sigbjørn Skjæret
|
6385b843a8
llama : add RobertaForSequenceClassification reranker support (#13875)
|
7 месяцев назад |
Vineel Abhinav
|
1b8fb8152d
ggml: aarch64: Implement SVE F32 kernels for vector functions (#13843)
|
7 месяцев назад |
Beinsezii
|
53ae30640e
gguf-py : fix SafetensorRemote return on undefined size (< 0) (#13841)
|
7 месяцев назад |
Xuan-Son Nguyen
|
763d06edb7
llama : fix KV shift for qwen2vl (#13870)
|
7 месяцев назад |
Xuan-Son Nguyen
|
10961339b2
mtmd : move helpers to dedicated library (⚠️ breaking change) (#13866)
|
7 месяцев назад |
bandoti
|
d98f2a35fc
ci: disable LLAMA_CURL for Linux cross-builds (#13871)
|
7 месяцев назад |
Đinh Trọng Huy
|
e0e3aa231d
llama : add support for BertForSequenceClassification reranker (#13858)
|
7 месяцев назад |
Đinh Trọng Huy
|
aa6dff05be
convert: small addition to support LlamaModel (#13838)
|
7 месяцев назад |
Sky
|
c962ae3382
server: fix remove 'image_url'/'input_audio' json-object effectlly for 'llama_params' in multimodal-model-mode (#13853)
|
7 месяцев назад |
Xuan-Son Nguyen
|
a3938fb53d
convert : fix qwen omni conversion (#13859)
|
7 месяцев назад |
Alex Fanthome
|
f7873fc698
tests : change umlaut test (#11600)
|
7 месяцев назад |
Johannes Gäßler
|
a68247439b
CUDA: fix FA tg at long context for CC >= 8.9 (#13852)
|
7 месяцев назад |
Xuan-Son Nguyen
|
26b79b6cb3
convert : fix tensor naming conflict for llama 4 vision (#13836)
|
7 месяцев назад |
leo-pony
|
1e8659e65a
CANN: Add SOC TYPE printing in cmake configuration (#13837)
|
7 месяцев назад |
lhez
|
a3c30846e4
opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (#13787)
|
7 месяцев назад |
lhez
|
1701d4c54f
opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (#13790)
|
7 месяцев назад |
Jeff Bolz
|
bef8176387
vulkan: use timestamp queries for GGML_VULKAN_PERF (#13817)
|
7 месяцев назад |
Georgi Gerganov
|
34b7c0439e
cmake : add llama-cparams.cpp to build (#13832)
|
7 месяцев назад |
Akarshan Biswas
|
f3101a8cc6
SYCL: add gelu_erf kernel (#13749)
|
7 месяцев назад |
Georgi Gerganov
|
1c49c70d07
sync : ggml
|
7 месяцев назад |
Xuan-Son Nguyen
|
a8ea03d8ad
ggml : add ggml_repeat_4d (#13824)
|
7 месяцев назад |