Georgi Gerganov
|
f55538c3cc
metal : fix memory leak (#2762)
|
2 år sedan |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 år sedan |
Shouzheng Liu
|
fc8ef549e5
metal : enable ggml-alloc (#2627)
|
2 år sedan |
Shouzheng Liu
|
1aa18ef994
metal : concurrently dispatch commands (#2358)
|
2 år sedan |
Qingyou Meng
|
1d656d6360
ggml : change ggml_graph_compute() API to not require context (#1999)
|
2 år sedan |
Georgi Gerganov
|
ce2c7d72e2
metal : handle buffers larger than device's maxBufferLength (#1826)
|
2 år sedan |
Georgi Gerganov
|
4bfcc855ab
metal : parallel command buffer encoding (#1860)
|
2 år sedan |
Georgi Gerganov
|
ecb217db4f
llama : Metal inference (#1642)
|
2 år sedan |