cturan/llama.cpp

Author	SHA1 Message	Date
Evan Jones	604b8bdfa6 Fix unicode in grammars (fixes #2501) (#2553)	2 years ago
staviq	10151bee2e server : support for saving templates in browser LocalStorage (#2486)	2 years ago
Johannes Gäßler	0992a7b8b1 README: fix LLAMA_CUDA_MMV_Y documentation (#2647)	2 years ago
Henri Vasserman	6ddeefad9b [Zig] Fixing Zig build and improvements (#2554)	2 years ago
Kerfuffle	8dae7ce684 Add --cfg-negative-prompt-file option for examples (#2591)	2 years ago
Georgi Gerganov	a73ccf1aa3 llama : replace (permute + reshape + view_1d) with (view_3d) (#2538)	2 years ago
drbh	7cf54e1f74 tests : adds simple llama grammar tests (#2618)	2 years ago
Shouzheng Liu	a872a2b28e ggml-alloc : fix discrepency between measure&eval (#2639)	2 years ago
Kolen Cheung	0919a0f73d cmake : install ggml-meta.metal if LLAMA_METAL (#2449)	2 years ago
Jhen-Jie Hong	ed53db86c3 metal : print error of load pipeline state (#2564)	2 years ago
Shouzheng Liu	fc8ef549e5 metal : enable ggml-alloc (#2627)	2 years ago
Shouzheng Liu	bf83bff674 metal : matrix-matrix multiplication kernel (#2615)	2 years ago
Georgi Gerganov	b5ffb2849d scripts : add helper script to get wikitext	2 years ago
Jhen-Jie Hong	3ebb00935f server : add missing /json-schema-to-grammar.mjs (#2616)	2 years ago
Jhen-Jie Hong	d783f7982e metal : return null instead of exit(1) (#2573)	2 years ago
Cheng Shao	d75561df20 server : add --numa support (#2524)	2 years ago
Kamil Tomšík	348acf188c llama : add missing enum keyword in function signatures (#2610)	2 years ago
Johannes Gäßler	1cd06fa25e CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)	2 years ago
Jhen-Jie Hong	2feb8934eb server : fix default grammar by use empty string in the UI (#2604)	2 years ago
Jhen-Jie Hong	5517d6e692 server : implement json-schema-to-grammar.mjs & add grammar param in the UI (#2588)	2 years ago
vxiiduu	f31b539714 Enhance Windows 7 and below compatibility. (#2592)	2 years ago
drbh	ee77efea2a test : add simple grammar parsing tests (#2594)	2 years ago
Johannes Gäßler	f64d44a9b9 CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)	2 years ago
byte-6174	b19edd54d5 Adding support for llama2.c models (#2559)	2 years ago
Equim	53dc399472 server: fixed wrong variable name in timing json (#2579)	2 years ago
DannyDaemonic	9ca4abed89 Handle `ENABLE_VIRTUAL_TERMINAL_PROCESSING` more gracefully on earlier versions of Windows.	2 years ago
Christian Demsar	e59fcb2bc1 Add --n-predict -2 for stopping generation on full context (#2565)	2 years ago
Martin Krasser	1638757767 Fix grammar-based sampling issue in server (#2566)	2 years ago
Sam Spilsbury	916a9acdd0 ggml-alloc: Don't try to re-use buffers of external tensors (#2562)	2 years ago
grahameth	ea04a4ca19 add log_callback to llama_context_params for custom logging. (#2234)	2 years ago
