Commit History

Author SHA1 Message Date
  Georgi Gerganov 254098a279 common : refactor common_sampler + grammar logic changes (#17937) 1 month ago
  Georgi Gerganov 3c6391e748 speculative-simple : free batch on exit (#17985) 1 month ago
  Copilot d8914fc47e common : add --override-tensor-draft, --cpu-moe-draft and --n-cpu-moe-draft parameters (#15191) 5 months ago
  g2mt 94933c8c2e server : implement universal assisted decoding (#12635) 5 months ago
  Georgi Gerganov 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
  Xuan-Son Nguyen 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) 9 months ago
  Georgi Gerganov e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) 10 months ago
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago
  Georgi Gerganov ab96610b1e cmake : enable warnings in llama (#10474) 1 year ago
  Georgi Gerganov 811872a59d speculative : simplify the implementation (#10504) 1 year ago
  Diego Devesa 10bce0450f llama : accept a list of devices to use to offload a model (#10497) 1 year ago
  Georgi Gerganov d9d54e498d speculative : refactor and add a simpler example (#10362) 1 year ago