cturan/llama.cpp

Author	SHA1 Message Date
slaren	344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290)	1 year ago
Borislav Stanimirov	ef0d5e3ec9 build: fix and ignore msvc warnings (ggml/805)	1 year ago
agray3	928e0b7013 Reset schedule earlier to allow overlap with ggml graph computation on device (#6933)	1 year ago
Dave Airlie	e931888d50 ggml : fix calloc argument ordering. (#6820)	1 year ago
Georgi Gerganov	b9cc76d87e ggml : fix ggml_backend_cpu_supports_op() for CPY (#0)	1 year ago
slaren	280345968d cuda : rename build flag to LLAMA_CUDA (#6299)	1 year ago
slaren	5e1b7f94a0 backend : set max split inputs to GGML_MAX_SRC (#6137)	1 year ago
slaren	2bf8d0f7c4 backend : offload large batches to GPU (#6083)	1 year ago
slaren	f30ea47a87 llama : add pipeline parallelism support (#6017)	1 year ago
Michael Podvitskiy	9fa2627347 ggml : introduce ggml_status (ggml/750)	1 year ago
UEXTM.com	5f70671856 Introduce backend GUIDs (ggml/743)	1 year ago
Kawrakow	bd2d4e393b 1.5 bit quantization (#5453)	1 year ago
Georgi Gerganov	8f1be0d42f ggml : add ALiBi support for ggml_soft_max_ext (#5488)	1 year ago
Ananta Bastola	6e4e973b26 ci : add an option to fail on compile warning (#3952)	1 year ago
AT	f5ca054855 Early return for zero size calls to get_tensor. (#5482)	1 year ago
Georgi Gerganov	3b169441df sync : ggml (#5452)	1 year ago
Michael Podvitskiy	4633d93af0 ggml : add abort_callback for cpu backend (ggml/725)	1 year ago
Jared Van Bortel	fbf1ddec69 Nomic Vulkan backend (#4456)	2 years ago
0cc4m	2307523d32 ggml : add Vulkan backend (#2059)	2 years ago
Abhilash Majumder	0f648573dd ggml : add unified SYCL backend for Intel GPUs (#2690)	2 years ago
slaren	62fead3ea0 cuda : fix tensor size calculation for non-split buffer (#5145)	2 years ago
slaren	6df465a91d llama : run all KQV ops on the CPU with no KV offload (#5049)	2 years ago
Georgi Gerganov	38566680cd ggml : add IQ2 to test-backend-ops + refactoring (#4990)	2 years ago
Georgi Gerganov	44a1a4a41a backend : add eval callback (#4935)	2 years ago
Justine Tunney	a0b3ac8c48 ggml : introduce GGML_CALL function annotation (#4850)	2 years ago
slaren	fa5c1fb44a backend_sched : fix assignments	2 years ago
slaren	e7e4df031b llama : ggml-backend integration (#4766)	2 years ago
Finn Voorhees	1bf681f90e ggml : add error handling to graph_compute (whisper/1714)	2 years ago
bssrdf	afc8c19291 ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669)	2 years ago
slaren	5bf3953d7e cuda : improve cuda pool efficiency using virtual memory (#4606)	2 years ago

Commit History