cturan/llama.cpp

mirror of https://github.com/cturan/llama.cpp

Author	SHA1	Message	Date
Xuan Son Nguyen	0da5d86026	server : allow using LoRA adapters per-request (#10994)	1 year ago
Georgi Gerganov	1da7b76569	server : fix speculative decoding with context shift (#10641)	1 year ago
Xuan Son Nguyen	b782e5c7d4	server : add more test cases (#10569)	1 year ago