Diego Devesa
|
360d6533db
ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device type (#15797)
|
пре 4 месеци |
Johannes Gäßler
|
0e6ff0046f
CUDA: larger SRAM reads for tile FA, AMD FP16 dot (#15927)
|
пре 4 месеци |
ddh0
|
df082f5630
nitpick : correct MB to MiB (#15934)
|
пре 4 месеци |
Daniel Bevenius
|
24a6734daf
ggml-cpu : add check for ARM MATMUL_INT8/i8mm support (#15922)
|
пре 4 месеци |
Charles Xu
|
2b3efea9a4
kleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed (#15614)
|
пре 4 месеци |
hipudding
|
c0389dba43
CANN: Disable acl_graph for prefill stage (#15933)
|
пре 4 месеци |
Oliver Simons
|
00681dfc16
CUDA: Add `fastdiv` to `k_bin_bcast*`, giving 1-3% E2E performance (#15872)
|
пре 4 месеци |
Jie Fu (傅杰)
|
4f658855fa
llama : support T5 models with unequal number of encoder-decoder layers (#15909)
|
пре 4 месеци |
Sigbjørn Skjæret
|
6ab397e12b
graph : support non-contiguous Q in build_attn_mha (#15908)
|
пре 4 месеци |
Daniel Bevenius
|
9de447d94e
ggml-cpu : fix padding in ggml_timestep_embedding (#15917)
|
пре 4 месеци |
Georgi Gerganov
|
0f0a3c2851
metal : make the backend async (#15906)
|
пре 4 месеци |
Daniel Bevenius
|
33daece86b
ci : add caching for ROCm installation in release workflow (#15924)
|
пре 4 месеци |
Daniel Bevenius
|
e7b6d83b52
tests : filter out no-ops from coverage report (#15900)
|
пре 4 месеци |
j-k
|
2cfef4d117
media : add transparent icon svg and png [no ci] (#15891)
|
пре 4 месеци |
Jesse
|
09e72a037c
gitignore : Ignore vim swap files in tests (#15901)
|
пре 4 месеци |
Chenguang Li
|
10d8b2b6b0
CANN: Add ROPE sin/cos cache for reuse (#15912)
|
пре 4 месеци |
Chenguang Li
|
28b5f190ef
CANN: implement LRU cache for ACL graphs (#15814)
|
пре 4 месеци |
Daniel Bevenius
|
86587da03b
llama : check returned fn ptrs from ggml_backend_reg_get_proc_address (#15893)
|
пре 4 месеци |
Daniel Bevenius
|
ff02caf9ee
ci : cache ROCm installation in windows-latest-cmake-hip (#15887)
|
пре 4 месеци |
Ruben Ortlam
|
ae355f6f71
vulkan: throw the oom error instead of no memory type found (#15905)
|
пре 4 месеци |
Jeff Bolz
|
4f63cd705c
vulkan: Fix OOB accesses in soft_max_back (#15861)
|
пре 4 месеци |
Johannes Gäßler
|
17bc5a815f
HIP: use v_dot2_f32_f16 instruction for FA (#15884)
|
пре 4 месеци |
lksj92hs
|
ed54e32558
Workaround for subgroup arithmetic failing on MoltenVK with AMD GPUs (issue 15846) (#15886)
|
пре 4 месеци |
Aman Gupta
|
a972faebed
CUDA: Add mul_mat_id support for the mmf kernel (#15767)
|
пре 4 месеци |
Johannes Gäßler
|
550cf726e1
CUDA: fix GET_ROWS for large tensors (#15882)
|
пре 4 месеци |
Georgi Gerganov
|
c252ce67c4
contrib : add notes about merging PRs (#15881)
|
пре 4 месеци |
Daniel Bevenius
|
70cd37dbbe
requirements : update transformers/torch for Embedding Gemma (#15828)
|
пре 4 месеци |
Piotr Wilkin (ilintar)
|
acc1b008cf
model-conversion : add extra debugging support for model conversion (#15877)
|
пре 4 месеци |
Aldehir Rojas
|
7057faf64b
json : support `enum` values within `allOf` (#15830)
|
пре 4 месеци |
j-k
|
fe1c92cd7b
media : add llama1 icon (#15878)
|
пре 4 месеци |