Johannes Gäßler
|
e789095502
llama: print memory breakdown on exit (#15860)
|
3 months ago |
Acly
|
f2a789e334
ggml : split graph allocations according to backend max buffer size (#15815)
|
3 months ago |
Tarek Dakhran
|
3a59971967
model : add label for LiquidAI LFM2-2.6B model (#16204)
|
3 months ago |
Jie Fu (傅杰)
|
63b54c81a6
model-conversion : make causal-verify-logits fails with model names containing "." (#16215)
|
3 months ago |
Uilian Ries
|
152729f884
common : add missing chrono header for common.cpp (#16211)
|
3 months ago |
Sigbjørn Skjæret
|
c0c59c1157
codeowners : match all requirements files (#16214)
|
3 months ago |
Jie Fu (傅杰)
|
7735706b93
model-conversion : run-org-model.py fails to run on mac m1 (#16213)
|
3 months ago |
Daniel Bevenius
|
4d9ea03d17
codeowners : use slash prefix for root files [no ci] (#16210)
|
3 months ago |
Jie Fu (傅杰)
|
8ba548dae2
model-conversion : fix the make targets in the README.md (#16209)
|
3 months ago |
Georgi Gerganov
|
f505bd83ca
ci : disable AMD workflows + update NVIDIA workflows (#16200)
|
3 months ago |
Georgi Gerganov
|
0889589dbe
ci : enable Vulkan workflow on Mac (#16194)
|
3 months ago |
Xiangyan Sun
|
4e29084ba4
ggml-cpu: Respect cpumask settings (#16164)
|
3 months ago |
Sigbjørn Skjæret
|
f6b4af3d04
ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (#15928)
|
4 months ago |
Aaron Teo
|
264f1b5187
zdnn: refactor codebase + add docs (#16178)
|
4 months ago |
Daniel Bevenius
|
0bc7cc7154
codeowners : add @danbev to model-conversion example [no ci] (#16190)
|
4 months ago |
Aaron Teo
|
4b9f4cb0f8
devops: add s390x containers (#15915)
|
4 months ago |
Daniel Bevenius
|
85e72271ba
ggml-cpu : fix typo in gemm comments [no ci] (#16189)
|
4 months ago |
Gabe Goodhart
|
1d0125bcf1
feat: Add conversion support in GraniteHybrid for non-hybrid (all attn) (#16177)
|
4 months ago |
Haiyue Wang
|
351f3da39c
clang-tidy : disable warning about performance enum size (#16127)
|
4 months ago |
Sigbjørn Skjæret
|
3ecb2f671a
ggml : implement set_rows with i32 index (#16159)
|
4 months ago |
Georgi Gerganov
|
432cf4304c
codeowners : update + cleanup (#16174)
|
4 months ago |
Adrien Gallouët
|
37a23c17bd
common : enable `--offline` mode without curl support (#16137)
|
4 months ago |
Quentin Bramas
|
138c87ce8b
webui : fix handling incomplete chunks (#16107)
|
4 months ago |
GideonSerf
|
c6db9a1027
embedding : fix typos in README (#16171)
|
4 months ago |
Haiyue Wang
|
d05affbab7
common : remove unused local variables (#16140)
|
4 months ago |
Georgi Gerganov
|
4f324a556c
ggml : extend ggml_can_fuse to work with non-sequential nodes (#16123)
|
4 months ago |
Georgi Gerganov
|
a71ae3ba7a
ggml : add ggml_op_is_empty (#16122)
|
4 months ago |
Xuan-Son Nguyen
|
05a2458121
codeowners : update ownership for @ngxson and @allozuar (#16128)
|
4 months ago |
Shin-myoung-serp
|
96fdca043b
Vulkan: add conv_transpose_2d operation (#16022)
|
4 months ago |
Sigbjørn Skjæret
|
b2d980fce0
codeowners : claim responsibility for ci, models, gguf-py and convert (#16124)
|
4 months ago |