Commit History

Autor SHA1 Mensaxe Data
  minarchist 5d7002d437 server : add --override-kv parameter (#4710) %!s(int64=2) %!d(string=hai) anos
  Nam D. Tran 26f3071d71 py : re-enable mmap in convert hf (#4732) %!s(int64=2) %!d(string=hai) anos
  Daniel Bevenius 775ac8712a finetune: fix typo in README.md (#4733) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 58ba655af0 metal : enable shader debugging (cmake option) (#4705) %!s(int64=2) %!d(string=hai) anos
  Someone Serge edd1ab7bc3 flake.lock: update %!s(int64=2) %!d(string=hai) anos
  Someone Serge 198ed7ebfc flake.nix: suggest the binary caches %!s(int64=2) %!d(string=hai) anos
  Someone Serge d836174731 workflows: nix-ci: add a qemu job for jetsons %!s(int64=2) %!d(string=hai) anos
  Someone Serge 06f2a5d190 workflows: nix-flakestry: drop tag filters %!s(int64=2) %!d(string=hai) anos
  Someone Serge c5239944ba workflows: weekly `nix flake update` %!s(int64=2) %!d(string=hai) anos
  Someone Serge 1e9ae54cf2 workflows: nix-ci: add a job for eval %!s(int64=2) %!d(string=hai) anos
  Someone Serge 7adedecbe3 workflows: nix-ci: init; build flake outputs %!s(int64=2) %!d(string=hai) anos
  Someone Serge 356ea17e0f flake.nix: expose checks %!s(int64=2) %!d(string=hai) anos
  Someone Serge a5c088d8c6 flake.nix: rocm not yet supported on aarch64, so hide the output %!s(int64=2) %!d(string=hai) anos
  Someone Serge 1e3900ebac flake.nix: expose full scope in legacyPackages %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov e39106c055 ggml : add ggml_vdotq_s32 alias (#4715) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 9fbda719de clip : refactor + bug fixes (#4696) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 39d8bc71ed CUDA: fixed tensor cores not being used on RDNA3 (#4697) %!s(int64=2) %!d(string=hai) anos
  automaticcat 24a447e20a ggml : add ggml_cpu_has_avx_vnni() (#4589) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler a20f3c7465 CUDA: fix tensor core logic for Pascal and HIP (#4682) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 0235b9b571 clip : use ggml_backend_buffer_is_host (#4205) %!s(int64=2) %!d(string=hai) anos
  Steward Garcia ce18d727a4 clip : enable gpu backend (#4205) %!s(int64=2) %!d(string=hai) anos
  hydai 91bb39cec7 cuda: fix vmm oom issue on NVIDIA AGX Orin (#4687) %!s(int64=2) %!d(string=hai) anos
  crasm 04ac0607e9 python : add check-requirements.sh and GitHub workflow (#4585) %!s(int64=2) %!d(string=hai) anos
  Philip Taron 68eccbdc5b flake.nix : rewrite (#4605) %!s(int64=2) %!d(string=hai) anos
  Cuong Trinh Manh 97bbca6e85 cmake : fix ld warning duplicate libraries libllama.a (#4671) %!s(int64=2) %!d(string=hai) anos
  Justine Tunney 4af4801566 llava-cli : refactor to use sampling library (#4669) %!s(int64=2) %!d(string=hai) anos
  Justine Tunney db49ff8ed7 server : replace sleep with condition variables (#4673) %!s(int64=2) %!d(string=hai) anos
  SakuraUmi 60f55e888c server : fix OpenAI server sampling w.r.t. penalty. (#4675) %!s(int64=2) %!d(string=hai) anos
  Karthik Sethuraman b93edd22f5 server : allow to generate multimodal embeddings (#4681) %!s(int64=2) %!d(string=hai) anos
  andrijdavid 82d6eab224 main-cmake-pkg : fix build issue (#4665) %!s(int64=2) %!d(string=hai) anos