cturan/llama.cpp

Author	SHA1 Message	Date
Molly Sophia	2a63caaa69 RWKV v6: RWKV_WKV op CUDA implementation (#9454)	1 year ago
slaren	d09770cae7 ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (#9573)	1 year ago
agray3	41f477879f Update CUDA graph on scale change plus clear nodes/params (#9550)	1 year ago
Huang Qi	e948a7da7a CI: Provide prebuilt windows binary for hip (#9467)	1 year ago
slaren	63351143b2 quantize : improve type name parsing (#9570)	1 year ago
Georgi Gerganov	d13edb17ed ggml : fix builds (#0)	1 year ago
Georgi Gerganov	27609c49b9 ggml : fix trailing whitespace (#0)	1 year ago
Georgi Gerganov	4301535326 sync : ggml	1 year ago
Johannes Gäßler	424c5d00a9 ggml/examples: add backend support for numerical optimization (ggml/949)	1 year ago
Georgi Gerganov	a6809c6a2e examples : add null threadpool args where needed (ggml/0)	1 year ago
Johannes Gäßler	5cb12f6839 CUDA: fix sum.cu compilation for CUDA < 11.7 (#9562)	1 year ago
Georgi Gerganov	d39e26741f examples : flush log upon ctrl+c (#9559)	1 year ago
Sigbjørn Skjæret	722ec1eb51 perplexity : do not escape input data by default (#9548)	1 year ago
Georgi Gerganov	6026da52d6 server : clean-up completed tasks from waiting list (#9531)	1 year ago
Sigbjørn Skjæret	eca0fab44e imatrix : disable prompt escape by default (#9543)	1 year ago
slaren	64c6af3195 ggml : fix n_threads_cur initialization with one thread (#9538)	1 year ago
Georgi Gerganov	0d2f22e45c scripts : verify py deps at the start of compare (#9520)	1 year ago
Daniel Bevenius	6443ddd985 llama : use reserve/emplace_back in sampler_sample (#9534)	1 year ago
Vinesh Janarthanan	8a308354f6 server : match OAI structured output response (#9527)	1 year ago
Eric Zhang	f799155ab8 server : fix OpenSSL build (remove obsolete `LOG_INFO`) (#9529)	1 year ago
Neo Zhang Jianyu	faf67b3de4 [SYCL]set context default value to avoid memory issue, update guide (#9476)	1 year ago
Michael Podvitskiy	7be099fa81 llama-bench: correct argument parsing error message (#9524)	1 year ago
Bert Wagner	8b836ae731 arg : add env variable for parallel (#9513)	1 year ago
Michael Podvitskiy	8344ef58f8 llama : fix n_vocab init for 'no_vocab' case (#9511)	1 year ago
Max Krasnyansky	0226613853 threadpool : skip polling for unused threads (#9461)	1 year ago
Yuri Khrustalev	503147a9f9 unicode : add <algorithm> (#9508)	1 year ago
Gabe Goodhart	0d2ec43833 llama : support IBM Granite architecture (#9412)	1 year ago
Michael Podvitskiy	37f3a3810e llama : add llama_n_head() (#9512)	1 year ago
slaren	23e0d70bac ggml : move common CPU backend impl to new header (#9509)	1 year ago
Daniel Bevenius	acb2c32c33 llama : rename n_embed to n_embd in rwkv6_time_mix (#9504)	1 year ago
