cturan/llama.cpp

mirror of https://github.com/cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	16bcc1259d kv-cache : pad the cache size to 256 for performance (#17046)	2 months ago
Johannes Gäßler	e81b8e4b7f llama: use FA + max. GPU layers by default (#15434)	4 months ago
Georgi Gerganov	d2fcd91cf9 server : disable context shift by default (#15416)	5 months ago
Diego Devesa	1d36b3670b llama : move end-user examples to tools directory (#13249)	8 months ago