cturan/llama.cpp

نویسنده	SHA1 پیام	تاریخ
Georgi Gerganov	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 سال پیش
Georgi Gerganov	727368c60f llama : use LLAMA_TOKEN_NULL (#11062)	1 سال پیش
Georgi Gerganov	f66f582927 llama : refactor `src/llama.cpp` (#10902)	1 سال پیش
Xuan Son Nguyen	0da5d86026 server : allow using LoRA adapters per-request (#10994)	1 سال پیش
Xuan Son Nguyen	45095a61bf server : clean up built-in template detection (#11026)	1 سال پیش
Xuan Son Nguyen	5896c65232 server : add OAI compat for /v1/completions (#10974)	1 سال پیش
Reza Kakhki	9ba399dfa7 server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967)	1 سال پیش
NeverLucky	09fe2e7613 server: allow filtering llama server response fields (#10940)	1 سال پیش
Xuan Son Nguyen	485dc01214 server : add system_fingerprint to chat/completion (#10917)	1 سال پیش
Xuan Son Nguyen	57bb2c40cd server : fix logprobs, make it OAI-compatible (#10783)	1 سال پیش
Xuan Son Nguyen	46828872c3 server : (embeddings) using same format for "input" and "content" (#10872)	1 سال پیش
krystiancha	05c3a444b8 server : fill usage info in embeddings and rerank responses (#10852)	1 سال پیش
Michelle Tan	89d604f2c8 server: Fix `has_next_line` in JSON response (#10818)	1 سال پیش
kallewoof	484d2f31ae bug-fix: snprintf prints NULL in place of the last character (#10419)	1 سال پیش
Xuan Son Nguyen	3573fa8e7b server : (refactor) no more json in server_task input (#10691)	1 سال پیش
Georgi Gerganov	ce4a7b8493 server : various fixes (#10704)	1 سال پیش
Xuan Son Nguyen	6c5bc0625f server : (refactoring) do not rely on JSON internally (#10643)	1 سال پیش
haopeng	64ed2091b2 server: Add "tokens per second" information in the backend (#10548)	1 سال پیش
Georgi Gerganov	d9d54e498d speculative : refactor and add a simpler example (#10362)	1 سال پیش
sasha0552	42cadc74bd server : fix slot selection by lru (#10126)	1 سال پیش
sasha0552	d865d1478c server : fix smart selection of available slot (#10120)	1 سال پیش
Georgi Gerganov	8d8ff71536 llama : remove Tail-Free sampling (#10071)	1 سال پیش
Georgi Gerganov	8125e6cbfc server : don't overfill the batch during infill (#10018)	1 سال پیش
Xuan Son Nguyen	958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023)	1 سال پیش
VoidIsVoid	a89f75e1b7 server : handle "logprobs" field with false value (#9871)	1 سال پیش
Georgi Gerganov	c7181bd294 server : reuse cached context chunks (#9866)	1 سال پیش
Diego Devesa	7eee341bee common : use common_ prefix for common library functions (#9805)	1 سال پیش
Xuan Son Nguyen	458367a906 server : better security control for public deployments (#9776)	1 سال پیش
Georgi Gerganov	f4d2b8846a llama : add reranking support (#9510)	1 سال پیش
Vinesh Janarthanan	8a308354f6 server : match OAI structured output response (#9527)	1 سال پیش

جدیدتر قدیمی‌تر

تاریخچه Commit ها یافتن

تاریخچه Commit ها