cturan/llama.cpp

작성자	SHA1 메시지	날짜
jaime-m-p	c90dbe026b Fix per token atrributes bits (#7749)	1 년 전
agray3	b90dc566c1 Allow number of nodes in CUDA graph to change (#7738)	1 년 전
Georgi Gerganov	1442677f92 common : refactor cli arg parsing (#7675)	1 년 전
Georgi Gerganov	554c247caf ggml : remove OpenCL (#7735)	1 년 전
Georgi Gerganov	0cd6bd3483 llama : remove beam search (#7736)	1 년 전
Georgi Gerganov	5ca0944a15 readme : remove obsolete Zig instructions (#7471)	1 년 전
slaren	adc9ff3841 llama-bench : allow using a different printer for stderr with -oe (#7722)	1 년 전
Daniele	987d743d6b Improve hipBLAS support in CMake (#7696)	1 년 전
zhouwg	b226c1227b refine .gitignore (#7688)	1 년 전
jaime-m-p	3b38d48609 Per token attributes (#7685)	1 년 전
Georgi Gerganov	6d1616944d ggml : prevent builds with -ffinite-math-only (#7726)	1 년 전
Radoslav Gerganov	bde7cd3cd9 llama : offload to RPC in addition to other backends (#7640)	1 년 전
Masaya, Kato	a5735e4426 ggml : use OpenMP as a thread pool (#7606)	1 년 전
Johannes Gäßler	0b832d53ba make: fix debug options not being applied to NVCC (#7714)	1 년 전
0cc4m	3d7ebf6312 Vulkan Mixture of Experts (MoE) support (#7628)	1 년 전
Andy Tai	a10cda58d3 cmake : add pkg-config spec file for llama.cpp (#7702)	1 년 전
zhangkaihuo	6f28a333c1 llama : MiniCPM support tied embeddings (#7664)	1 년 전
Georgi Gerganov	549279d804 llama : avoid double token-to-piece cache (#7654)	1 년 전
woachk	9e405b6e2e kompute : implement op_getrows_f32 (#6403)	1 년 전
Dave Airlie	3413ae2193 fix bug introduced in using calloc (#7701)	1 년 전
Georgi Gerganov	1669810d7c flake.lock: Update (#7686)	1 년 전
Austin	7c4e5b7eae chore : add ignore rule for generated server themes (#7689)	1 년 전
nickp27	9422c5e34b [SYCL] Update rpc-server.cpp to include SYCL backend (#7682)	1 년 전
Johannes Gäßler	e141ce624a Fix FlashAttention debug test, FP32 assert (#7684)	1 년 전
Yazan Agha-Schrader	2e666832e6 server : new UI (#7633)	1 년 전
HanishKVC	2ac95c9d56 SimpleChat: Simple histogram/repeatMatching driven garbageTrimming, Settings UI, Streaming mode, OpenAi Compat (Model, Authorization Bearer), Save/Restore session, Auto Settings UI (#7548)	1 년 전
Johannes Gäßler	750f60c03e CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (#7681)	1 년 전
Johannes Gäßler	9b596417af CUDA: quantized KV support for FA vec (#7527)	1 년 전
Georgi Gerganov	a323ec60af server : update js (#7670)	1 년 전
Galunid	0515ad93f4 convert-hf : Handle NotImplementedError in convert-hf-to-gguf (#7660)	1 년 전

최신 이전

커밋 기록 찾기

커밋 기록