Xuan Son Nguyen 958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023) 1 год назад
..
steps 958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023) 1 год назад
ctx_shift.feature 1bde94dd02 server : remove self-extend features (#9860) 1 год назад
embeddings.feature f4d2b8846a llama : add reranking support (#9510) 1 год назад
environment.py bd60d82d0c server tests : more pythonic process management; fix bare `except:` (#6146) 1 год назад
infill.feature 958367bf53 server : refactor slot input data, move tokenizer to HTTP thread (#10023) 1 год назад
issues.feature 9731134296 server: tests: passkey challenge / self-extend with context shift demo (#5832) 1 год назад
lora.feature 1e6f6554aa server : add lora hotswap endpoint (WIP) (#8857) 1 год назад
parallel.feature 9b2c24c099 server : simplify state machine for slot (#9283) 1 год назад
passkey.feature 9b2c24c099 server : simplify state machine for slot (#9283) 1 год назад
rerank.feature f4d2b8846a llama : add reranking support (#9510) 1 год назад
results.feature 3bc10cb485 server : fix temperature + disable some tests (#7409) 1 год назад
security.feature 458367a906 server : better security control for public deployments (#9776) 1 год назад
server.feature 78203641fe server : Add option to return token pieces in /tokenize endpoint (#9108) 1 год назад
slotsave.feature d7e852c1bc Tokenizer SPM fixes for phi-3 and llama-spm (bugfix) (#7425) 1 год назад
wrong_usages.feature 6e7d133a5f server : refactor multitask handling (#9274) 1 год назад