Johannes Gäßler
|
8a43e940ab
ggml: new optimization interface (ggml/988)
|
1 year ago |
Daniel Bevenius
|
cd60b88bf7
ggml-alloc : remove buffer_id from leaf_alloc (ggml/987)
|
1 year ago |
Diego Devesa
|
96776405a1
ggml : move more prints to the ggml log system (#9839)
|
1 year ago |
slaren
|
d09770cae7
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)
|
1 year ago |
slaren
|
2b1f616b20
ggml : reduce hash table reset cost (#8698)
|
1 year ago |
Johannes Gäßler
|
a15ef8f8a0
CUDA: fix partial offloading for ne0 % 256 != 0 (#8572)
|
1 year ago |
Georgi Gerganov
|
f3f65429c4
llama : reorganize source code + improve CMake (#8006)
|
1 year ago |