Georgi Gerganov
|
a19b5cef16
llama : fix FA when KV cache is not used (i.e. embeddings) (#12825)
|
преди 9 месеца |
Reza Kakhki
|
9ba399dfa7
server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967)
|
преди 1 година |
Xuan Son Nguyen
|
57bb2c40cd
server : fix logprobs, make it OAI-compatible (#10783)
|
преди 1 година |
Georgi Gerganov
|
152610eda9
server : output embeddings for all tokens when pooling = none (#10861)
|
преди 1 година |
Xuan Son Nguyen
|
46828872c3
server : (embeddings) using same format for "input" and "content" (#10872)
|
преди 1 година |
krystiancha
|
05c3a444b8
server : fill usage info in embeddings and rerank responses (#10852)
|
преди 1 година |
Xuan Son Nguyen
|
45abe0f74e
server : replace behave with pytest (#10416)
|
преди 1 година |