Robin Davidsson
|
0d5c742161
server : Add the endpoints /api/tags and /api/chat (#13659)
|
8 hónapja |
Dorin-Andrei Geman
|
42158ae2e8
server : fix first message identification (#13634)
|
8 hónapja |
Georgi Gerganov
|
797f2ac062
kv-cache : simplify the interface (#13660)
|
8 hónapja |
Georgi Gerganov
|
e298d2fbd0
kv-cache : add SWA support (#13194)
|
8 hónapja |
Isaac McFadyen
|
6a2bc8bfb7
server : added --no-prefill-assistant flag (#13608)
|
8 hónapja |
Xuan-Son Nguyen
|
6aa892ec2a
server : do not return error out of context (with ctx shift disabled) (#13577)
|
8 hónapja |
Olivier Chafik
|
3198405e98
`common`: add partial regex support (#12808)
|
8 hónapja |
Georgi Gerganov
|
053174436f
server : passthrough the /models endpoint during loading (#13535)
|
8 hónapja |
Xuan-Son Nguyen
|
360a9c98e1
server : fix cache_tokens bug with no cache_prompt (#13533)
|
8 hónapja |
Anthony Umfer
|
9a390c4829
tools : fix uninitialized llama_batch in server (#13436)
|
8 hónapja |
Xuan-Son Nguyen
|
33eff40240
server : vision support via libmtmd (#12898)
|
8 hónapja |
Georgi Gerganov
|
6562e5a4d6
context : allow cache-less context for embeddings (#13108)
|
8 hónapja |
oobabooga
|
233461f812
sampling : Integrate Top-nσ into main sampling chain (and add it to the server) (#13264)
|
8 hónapja |
Diego Devesa
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
8 hónapja |