cturan/llama.cpp

mirror of https://github.com/cturan/llama.cpp

Author	SHA1	Message	Date
Georgi Gerganov	d2fcd91cf9	server : disable context shift by default (#15416)	5 months ago
Lukas Straub	a9f77a8be3	server : add openai-style logit_bias support (#14946)	5 months ago
Olivier Chafik	f13847cfb5	server: fix regression on streamed non-chat completion w/ stops (#13785)	7 months ago
Xuan-Son Nguyen	360a9c98e1	server : fix cache_tokens bug with no cache_prompt (#13533)	8 months ago
Diego Devesa	1d36b3670b	llama : move end-user examples to tools directory (#13249)	8 months ago