Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| slaren | a249843d89 | common : restore --n-gpu-layers (#9371) | 1 year ago |
| slaren | 19f4a7b296 | llama : refactor samplers internal implementation (#9370) | 1 year ago |
| Neo Zhang Jianyu | 2a358fb0c4 | [SYCL] add check malloc result on device (#9346) | 1 year ago |
| slaren | eae597182c | llama : sanitize tokens in the upper bound (#9359) | 1 year ago |
| Xuan Son Nguyen | 00b02bb249 | imatrix : fix arg parser for imatrix (#9366) | 1 year ago |
| Georgi Gerganov | a876861455 | metal : update support condition for im2col + fix warning (#0) | 1 year ago |
| Georgi Gerganov | 385decbd63 | sync : ggml | 1 year ago |
| Georgi Gerganov | 60a3107ccd | scripts : option to increase git patch context | 1 year ago |
| Salvatore Mesoraca | 406c1a32a1 | vulkan: add dryrun support to sin and cos ops (ggml/947) | 1 year ago |
| Salvatore Mesoraca | 9cb9260861 | vulkan: correctly report support for OP_CONT (ggml/946) | 1 year ago |
| Johannes Gäßler | 202084d31d | tests: add gradient tests for all backends (ggml/932) | 1 year ago |
| Johannes Gäßler | dbbebcab33 | ggml: fix ggml_graph_cpy undefined behavior (ggml/943) | 1 year ago |
| Georgi Gerganov | ba1cf846ed | cann : fix doxy (ggml/0) | 1 year ago |
| Mengqing Cao | d2d3200b38 | cann : add Ascend NPU support (whisper/2336) | 1 year ago |
| Georgi Gerganov | 51d964a4ef | cuda : mark BF16 CONT as unsupported | 1 year ago |
| Salvatore Mesoraca | efe6a83e30 | ggml : fix cont with transposed tensors when one dimension is 1 (ggml/934) | 1 year ago |
| Kevin Gibbons | fbb7fcffbc | llama : set attrs of mislabelled EOT/EOM tokens (#9348) | 1 year ago |
| Georgi Gerganov | a5b5d9a101 | llama.android : fix build (#9350) | 1 year ago |
| Georgi Gerganov | f12295b8a9 | llama : fix empty ring buffer push (#9358) | 1 year ago |
| Georgi Gerganov | faf69d4237 | llama : sanitize invalid tokens (#9357) | 1 year ago |
| Eve | e536426ded | llamafile : disable sgemm for batch-size 1 (#9330) | 1 year ago |
| Xuan Son Nguyen | 1b9ae5189c | common : refactor arg parser (#9308) | 1 year ago |
| slaren | e32d0816ed | ggml : always check bounds on get_rows operations (#9354) | 1 year ago |
| Georgi Gerganov | df270ef745 | llama : refactor sampling v2 (#9294) | 1 year ago |
| Xuan Son Nguyen | 947538acb8 | ggml : fix missing `cpu_set_t` on emscripten (#9336) | 1 year ago |
| slaren | 6c89eb0b47 | ci : disable rocm image creation (#9340) | 1 year ago |
| Xuan Son Nguyen | 9b2c24c099 | server : simplify state machine for slot (#9283) | 1 year ago |
| Aarni Koskela | 134bc38ecf | llama-bench : log benchmark progress (#9287) | 1 year ago |
| Aarni Koskela | 815b1fb20a | batched-bench : add `--output-format jsonl` option (#9293) | 1 year ago |
| Changyeon Kim | 409dc4f8bb | ggml : fix build break for the vulkan-debug (#9265) | 1 year ago |