Commit History

Author SHA1 Message Date
  Xuan Son Nguyen 49b0e3cec4 server : fix cleaning up stream task (#11418) 1 year ago
  Diego Devesa 20a758155b docker : fix CPU ARM build (#11403) 1 year ago
  Georgi Gerganov 00c24acb2a ci : fix line breaks on windows builds (#11409) 1 year ago
  jiahao su 466ea66f33 CANN: Add Ascend CANN build ci (#10217) 1 year ago
  uvos 5f0db9522f hip : Add hipGraph and VMM support to ROCM (#11362) 1 year ago
  Johannes Gäßler c5d9effb49 CUDA: fix FP16 cuBLAS GEMM (#11396) 1 year ago
  uvos 9fbadaef4f rocBLAS: Avoid fp32->fp16->fp32 conversion on cdna (#11356) 1 year ago
  Georgi Gerganov 9755129c27 release : pack /lib in the packages (#11392) 1 year ago
  Jafar Uruç a07c2c8a52 docs : Update readme to build targets for local docker build (#11368) 1 year ago
  Johannes Gäßler 8137b4bb2b CPU/CUDA: fix (GQA) mul mat back, add CUDA support (#11380) 1 year ago
  Bernhard M. Wiedemann 1af6945eb0 cmake : avoid -march=native when reproducible build is wanted (#11366) 1 year ago
  Eric Curtin 01f37edf1a Update llama-run README.md (#11386) 1 year ago
  stduhpf c07e87f38b server : (webui) put DeepSeek R1 CoT in a collapsible <details> element (#11364) 1 year ago
  Jeff Bolz 564804b79b tests: fix some mul_mat test gaps (#11375) 1 year ago
  Eric Curtin 05f63cc9ee Update documentation (#11373) 1 year ago
  Eric Curtin f7fb43cd0b Add -ngl (#11372) 1 year ago
  Xuan Son Nguyen 5845661640 server : add more clean up when cancel_tasks is called (#11340) 1 year ago
  Eric Curtin f211d1dc10 Treat hf.co/ prefix the same as hf:// (#11350) 1 year ago
  amd-dwang 955a6c2d91 Vulkan-run-test: fix mmq_wg_denoms (#11343) 1 year ago
  Jeff Bolz 1971adf55e vulkan: sort shaders for more deterministic binary (#11315) 1 year ago
  Jeff Bolz 5245729e33 vulkan: fix diag_mask_inf (#11323) 1 year ago
  Diego Devesa 6152129d05 main : update README documentation for batch size (#11353) 1 year ago
  Georgi Gerganov 16d3df7ab0 readme : add plugin links (#11355) 1 year ago
  Diego Devesa 12c2bdf2de server : fix draft context not being released (#11354) 1 year ago
  Olivier Chafik c64d2becb1 `minja`: sync at https://github.com/google/minja/commit/0f5f7f2b3770eb682fbc11763266d45204173686 (#11352) 1 year ago
  Jiří Podivín 96f4053934 Adding logprobs to /v1/completions (#11344) 1 year ago
  Olivier Chafik a94f3b2727 `common`: utils to split / join / repeat strings (from json converter) (#11342) 1 year ago
  tc-mb 3e3357fd77 llava : support Minicpm-omni (#11289) 1 year ago
  Olivier Chafik 6171c9d258 Add Jinja template support (#11016) 1 year ago
  Xuan Son Nguyen e28245f35f export-lora : fix tok_embd tensor (#11330) 1 year ago