cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Georgi Gerganov	8d8ff71536 llama : remove Tail-Free sampling (#10071)	há 1 ano atrás
Georgi Gerganov	8125e6cbfc server : don't overfill the batch during infill (#10018)	há 1 ano atrás
Xuan Son Nguyen	958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023)	há 1 ano atrás
VoidIsVoid	a89f75e1b7 server : handle "logprobs" field with false value (#9871)	há 1 ano atrás
Georgi Gerganov	c7181bd294 server : reuse cached context chunks (#9866)	há 1 ano atrás
Diego Devesa	7eee341bee common : use common_ prefix for common library functions (#9805)	há 1 ano atrás
Xuan Son Nguyen	458367a906 server : better security control for public deployments (#9776)	há 1 ano atrás
Georgi Gerganov	f4d2b8846a llama : add reranking support (#9510)	há 1 ano atrás
Vinesh Janarthanan	8a308354f6 server : match OAI structured output response (#9527)	há 1 ano atrás
Georgi Gerganov	6262d13e0b common : reimplement logging (#9418)	há 1 ano atrás
Mathijs Henquet	78203641fe server : Add option to return token pieces in /tokenize endpoint (#9108)	há 1 ano atrás
Xuan Son Nguyen	6e7d133a5f server : refactor multitask handling (#9274)	há 1 ano atrás
ardfork	978ba3d83d Server: Don't ignore llama.cpp params (#8754)	há 1 ano atrás
Georgi Gerganov	4e24cffd8c server : handle content array in chat API (#8449)	há 1 ano atrás
Xuan Son Nguyen	48e6b92cc3 Add chat template support for llama-cli (#8068)	há 1 ano atrás
sasha0552	7a16ce7db2 server : smart slot selection using Longest Common Prefix (#7728)	há 1 ano atrás
Georgi Gerganov	1442677f92 common : refactor cli arg parsing (#7675)	há 1 ano atrás
Benjamin Findley	e586ee4259 change default temperature of OAI compat API from 0 to 1 (#7226)	há 1 ano atrás
Johannes Gäßler	c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)	há 1 ano atrás
Xuan Son Nguyen	1fd9c1741d clean up json_value & server_log (#7142)	há 1 ano atrás
Pedro Cuenca	b97bc3966e llama : support Llama 3 HF conversion (#6745)	há 1 ano atrás
Pierrick Hymbert	75cd4c7729 ci: bench: support sse and fix prompt processing time / server: add tokens usage in stream OAI response (#6495)	há 1 ano atrás
JH23X	60cdf40cc3 server : handle exception on wrong type in request (#6452)	há 1 ano atrás
Xuan Son Nguyen	ad3a0505e3 Server: clean up OAI params parsing function (#6284)	há 1 ano atrás
Pierrick Hymbert	1b26aebe4d server: flush stdout after logging in both text and json layout (#6253)	há 1 ano atrás
Olivier Chafik	72114edf06 json-schema-to-grammar : fix order of props + non-str const/enum (#6232)	há 1 ano atrás
Olivier Chafik	5b7b0ac8df json-schema-to-grammar improvements (+ added to server) (#5978)	há 1 ano atrás
Karthick	47cc7a7bf9 Server: Handle n_keep parameter in the request (#6174)	há 1 ano atrás
Xuan Son Nguyen	99b71c068f Server: Use multi-task for embeddings endpoint (#6001)	há 1 ano atrás
Xuan Son Nguyen	caa106d4e0 Server: format error to json (#5961)	há 1 ano atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits