1
0

Коммит түүх

Эзэн SHA1 Мессеж Огноо
  Diego Devesa 20a758155b docker : fix CPU ARM build (#11403) 11 сар өмнө
  Georgi Gerganov 00c24acb2a ci : fix line breaks on windows builds (#11409) 11 сар өмнө
  jiahao su 466ea66f33 CANN: Add Ascend CANN build ci (#10217) 11 сар өмнө
  uvos 5f0db9522f hip : Add hipGraph and VMM support to ROCM (#11362) 11 сар өмнө
  Johannes Gäßler c5d9effb49 CUDA: fix FP16 cuBLAS GEMM (#11396) 11 сар өмнө
  uvos 9fbadaef4f rocBLAS: Avoid fp32->fp16->fp32 conversion on cdna (#11356) 11 сар өмнө
  Georgi Gerganov 9755129c27 release : pack /lib in the packages (#11392) 11 сар өмнө
  Jafar Uruç a07c2c8a52 docs : Update readme to build targets for local docker build (#11368) 11 сар өмнө
  Johannes Gäßler 8137b4bb2b CPU/CUDA: fix (GQA) mul mat back, add CUDA support (#11380) 11 сар өмнө
  Bernhard M. Wiedemann 1af6945eb0 cmake : avoid -march=native when reproducible build is wanted (#11366) 11 сар өмнө
  Eric Curtin 01f37edf1a Update llama-run README.md (#11386) 11 сар өмнө
  stduhpf c07e87f38b server : (webui) put DeepSeek R1 CoT in a collapsible <details> element (#11364) 11 сар өмнө
  Jeff Bolz 564804b79b tests: fix some mul_mat test gaps (#11375) 11 сар өмнө
  Eric Curtin 05f63cc9ee Update documentation (#11373) 11 сар өмнө
  Eric Curtin f7fb43cd0b Add -ngl (#11372) 11 сар өмнө
  Xuan Son Nguyen 5845661640 server : add more clean up when cancel_tasks is called (#11340) 11 сар өмнө
  Eric Curtin f211d1dc10 Treat hf.co/ prefix the same as hf:// (#11350) 11 сар өмнө
  amd-dwang 955a6c2d91 Vulkan-run-test: fix mmq_wg_denoms (#11343) 11 сар өмнө
  Jeff Bolz 1971adf55e vulkan: sort shaders for more deterministic binary (#11315) 11 сар өмнө
  Jeff Bolz 5245729e33 vulkan: fix diag_mask_inf (#11323) 11 сар өмнө
  Diego Devesa 6152129d05 main : update README documentation for batch size (#11353) 1 жил өмнө
  Georgi Gerganov 16d3df7ab0 readme : add plugin links (#11355) 1 жил өмнө
  Diego Devesa 12c2bdf2de server : fix draft context not being released (#11354) 1 жил өмнө
  Olivier Chafik c64d2becb1 `minja`: sync at https://github.com/google/minja/commit/0f5f7f2b3770eb682fbc11763266d45204173686 (#11352) 1 жил өмнө
  Jiří Podivín 96f4053934 Adding logprobs to /v1/completions (#11344) 1 жил өмнө
  Olivier Chafik a94f3b2727 `common`: utils to split / join / repeat strings (from json converter) (#11342) 1 жил өмнө
  tc-mb 3e3357fd77 llava : support Minicpm-omni (#11289) 1 жил өмнө
  Olivier Chafik 6171c9d258 Add Jinja template support (#11016) 1 жил өмнө
  Xuan Son Nguyen e28245f35f export-lora : fix tok_embd tensor (#11330) 1 жил өмнө
  Radoslav Gerganov 6da5bec81c rpc : better caching of the base buffer pointer (#11331) 1 жил өмнө