Commit History

Autor SHA1 Mensaxe Data
  Georgi Gerganov d061bf9405 ggml : fix q2_k bpw in comments (ggml/680) %!s(int64=2) %!d(string=hai) anos
  Finn Voorhees 1bf681f90e ggml : add error handling to graph_compute (whisper/1714) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov c1d7cb28d3 ggml : do not sched_yield when calling BLAS (#4761) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 3681f22443 examples : add few-shot translation example (#4783) %!s(int64=2) %!d(string=hai) anos
  Daniel Bevenius b3a7c20b5c finetune : remove unused includes (#4756) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 012cf349ae server : send token probs for "stream == false" (#4714) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler a91928014f Print backend name on test-backend-ops failure (#4751) %!s(int64=2) %!d(string=hai) anos
  singularity 3c0b585561 llama.swiftui : support loading custom model from file picker (#4767) %!s(int64=2) %!d(string=hai) anos
  Michael Coppola e5804313a1 server : fix options in README.md (#4765) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov dc891b7f7a ggml : include stdlib.h before intrin.h (#4736) %!s(int64=2) %!d(string=hai) anos
  singularity 46cea79e1f llama.swiftui : fix build of ggml.metallib (#4754) %!s(int64=2) %!d(string=hai) anos
  Daniel Bevenius cb1e2818e0 train : fix typo in overlapping-samples help msg (#4758) %!s(int64=2) %!d(string=hai) anos
  Ashraful Islam ece9a45e8f swift : update Package.swift to use ggml as dependency (#4691) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 7bed7eba35 cuda : simplify expression %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov d55356d3ba cuda : mark I16 and I32 ops as unsupported %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 75e3fd8581 sync : ggml %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 289313716f metal : add kernel_get_rows_i32 %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ab62fc3e55 scripts : fix sync order + metal sed %!s(int64=2) %!d(string=hai) anos
  Guillaume Wenzek 5f66ebca9c ggml : extend ggml_get_rows, ggml_repeat, ggml_concat (ggml/639) %!s(int64=2) %!d(string=hai) anos
  Justin Parker f2eb19bd8b server : throw an error when `slot unavailable` (#4741) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov f3f62f0d83 metal : optimize ggml_mul_mat_id (faster Mixtral PP) (#4725) %!s(int64=2) %!d(string=hai) anos
  Phil H 0ef3ca2ac6 server : add token counts to html footer (#4738) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 540938f890 llama : llama_model_desc print number of experts %!s(int64=2) %!d(string=hai) anos
  Marcus Dunn 0040d42eeb llama : replace all API facing `int`'s with `int32_t` (#4577) %!s(int64=2) %!d(string=hai) anos
  postmasters 83e633c27e llama : differentiate the KV dims in the attention (#4657) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 32866c5edd editorconfig : fix whitespace and indentation #4710 %!s(int64=2) %!d(string=hai) anos
  minarchist 5d7002d437 server : add --override-kv parameter (#4710) %!s(int64=2) %!d(string=hai) anos
  Nam D. Tran 26f3071d71 py : re-enable mmap in convert hf (#4732) %!s(int64=2) %!d(string=hai) anos
  Daniel Bevenius 775ac8712a finetune: fix typo in README.md (#4733) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 58ba655af0 metal : enable shader debugging (cmake option) (#4705) %!s(int64=2) %!d(string=hai) anos