cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Alexey Parfenov	a2d60c9158 server : allow to get default generation settings for completion (#5307)	há 1 ano atrás
Michael Klimenko	52bb63c708 refactor : switch to emplace_back to avoid extra object (#5291)	há 1 ano atrás
Georgi Gerganov	5cb04dbc16 llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240)	há 1 ano atrás
Georgi Gerganov	e6f291d158 server : fix context shift (#5195)	há 1 ano atrás
Wu Jian Ping	c82d18e863 server : embeddings compatibility for OpenAI (#5190)	há 1 ano atrás
Abhilash Majumder	0f648573dd ggml : add unified SYCL backend for Intel GPUs (#2690)	há 2 anos atrás
Michael Klimenko	35a2ee9143 Remove unused data and add fixes (#5154)	há 2 anos atrás
Maximilian Winter	ec903c0341 server : add self-extend support (#5104)	há 2 anos atrás
Xuan Son Nguyen	48c857aa10 server : refactored the task processing logic (#5065)	há 2 anos atrás
Xuan Son Nguyen	821f0a271e server : defer tasks when "slot unavailable" (#5018)	há 2 anos atrás
Georgi Gerganov	0ea069b87b server : fix prompt caching with system prompt (#4914)	há 2 anos atrás
Ziad Ben Hadj-Alouane	356327feb3 server : fix deadlock that occurs in multi-prompt scenarios (#4905)	há 2 anos atrás
makomk	ee8243adaa server : fix crash with multimodal models without BOS token (#4904)	há 2 anos atrás
slaren	e7e4df031b llama : ggml-backend integration (#4766)	há 2 anos atrás
Georgi Gerganov	1d118386fe server : fix infill when prompt is empty (#4833)	há 2 anos atrás
Laura	4330bd83fe server : implement credentialed CORS (#4514)	há 2 anos atrás
Michael Coppola	27379455c3 server : support for multiple api keys (#4864)	há 2 anos atrás
Behnam M	eab6795006 server : add `LOG_INFO` when model is successfully loaded (#4881)	há 2 anos atrás
Isaac McFadyen	2f043328e3 server : fix typo in model name (#4876)	há 2 anos atrás
Georgi Gerganov	5c1980d8d4 server : fix build + rename enums (#4870)	há 2 anos atrás
Behnam M	cd108e641d server : add a `/health` endpoint (#4860)	há 2 anos atrás
Georgi Gerganov	67984921a7 server : fix n_predict check (#4798)	há 2 anos atrás
Georgi Gerganov	012cf349ae server : send token probs for "stream == false" (#4714)	há 2 anos atrás
Georgi Gerganov	32866c5edd editorconfig : fix whitespace and indentation #4710	há 2 anos atrás
minarchist	5d7002d437 server : add --override-kv parameter (#4710)	há 2 anos atrás
Georgi Gerganov	9fbda719de clip : refactor + bug fixes (#4696)	há 2 anos atrás
Justine Tunney	db49ff8ed7 server : replace sleep with condition variables (#4673)	há 2 anos atrás
SakuraUmi	60f55e888c server : fix OpenAI server sampling w.r.t. penalty. (#4675)	há 2 anos atrás
Karthik Sethuraman	b93edd22f5 server : allow to generate multimodal embeddings (#4681)	há 2 anos atrás
Justine Tunney	65e5f6dadb Fix OpenAI server sampling w.r.t. temp and seed (#4668)	há 2 anos atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits