cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Shouzheng Liu	fc8ef549e5 metal : enable ggml-alloc (#2627)	%!s(int64=2) %!d(string=hai) anos
Shouzheng Liu	bf83bff674 metal : matrix-matrix multiplication kernel (#2615)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	b5ffb2849d scripts : add helper script to get wikitext	%!s(int64=2) %!d(string=hai) anos
Jhen-Jie Hong	3ebb00935f server : add missing /json-schema-to-grammar.mjs (#2616)	%!s(int64=2) %!d(string=hai) anos
Jhen-Jie Hong	d783f7982e metal : return null instead of exit(1) (#2573)	%!s(int64=2) %!d(string=hai) anos
Cheng Shao	d75561df20 server : add --numa support (#2524)	%!s(int64=2) %!d(string=hai) anos
Kamil Tomšík	348acf188c llama : add missing enum keyword in function signatures (#2610)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	1cd06fa25e CUDA: launch_bounds, small q4_K, q5_K mmq refactor (#2596)	%!s(int64=2) %!d(string=hai) anos
Jhen-Jie Hong	2feb8934eb server : fix default grammar by use empty string in the UI (#2604)	%!s(int64=2) %!d(string=hai) anos
Jhen-Jie Hong	5517d6e692 server : implement json-schema-to-grammar.mjs & add grammar param in the UI (#2588)	%!s(int64=2) %!d(string=hai) anos
vxiiduu	f31b539714 Enhance Windows 7 and below compatibility. (#2592)	%!s(int64=2) %!d(string=hai) anos
drbh	ee77efea2a test : add simple grammar parsing tests (#2594)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	f64d44a9b9 CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)	%!s(int64=2) %!d(string=hai) anos
byte-6174	b19edd54d5 Adding support for llama2.c models (#2559)	%!s(int64=2) %!d(string=hai) anos
Equim	53dc399472 server: fixed wrong variable name in timing json (#2579)	%!s(int64=2) %!d(string=hai) anos
DannyDaemonic	9ca4abed89 Handle `ENABLE_VIRTUAL_TERMINAL_PROCESSING` more gracefully on earlier versions of Windows.	%!s(int64=2) %!d(string=hai) anos
Christian Demsar	e59fcb2bc1 Add --n-predict -2 for stopping generation on full context (#2565)	%!s(int64=2) %!d(string=hai) anos
Martin Krasser	1638757767 Fix grammar-based sampling issue in server (#2566)	%!s(int64=2) %!d(string=hai) anos
Sam Spilsbury	916a9acdd0 ggml-alloc: Don't try to re-use buffers of external tensors (#2562)	%!s(int64=2) %!d(string=hai) anos
grahameth	ea04a4ca19 add log_callback to llama_context_params for custom logging. (#2234)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	25d43e0eb5 CUDA: tuned mul_mat_q kernels (#2546)	%!s(int64=2) %!d(string=hai) anos
Martin Krasser	f5bfea0580 Allow passing grammar to completion endpoint (#2532)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	acfc5478ff CUDA: tighter VRAM scratch size for 65b/70b (#2551)	%!s(int64=2) %!d(string=hai) anos
chaihahaha	7ed8d1fe7f llm.vim : multiline autocompletion, get rid of "^@" (#2543)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	e7f94d6fdc vim : bring back simple llm.vim example	%!s(int64=2) %!d(string=hai) anos
AustinMroz	2d7baaf50f vim : streaming and more (#2495)	%!s(int64=2) %!d(string=hai) anos
klosax	f3c3b4b167 Add --rope-scale parameter (#2544)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	93356bdb7a ggml : mul mat tweaks (#2372)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	60baff7c85 ggml : pad result of ggml_nbytes()	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	9082b5dfbf ggml : change params pointer (style change) (#2539)	%!s(int64=2) %!d(string=hai) anos

Posterior Anterior

Commit History Buscar

Commit History