Commit Verlauf

Autor SHA1 Nachricht Datum
  Eric Curtin ef19c71769 run: de-duplicate fmt and format functions and optimize (#11596) vor 10 Monaten
  Dan Johansson 053b3f9aae ggml-cpu : update KleidiAI to v1.5.0 (#12568) vor 10 Monaten
  Akarshan Biswas e2f560175a SYCL: disable Q4_0 reorder optimization (#12560) vor 10 Monaten
  Dan Johansson 36ee06dd2d docs : add build instructions for KleidiAI (#12563) vor 10 Monaten
  R0CKSTAR 3cd3a39532 ci: [MUSA] add CI and update doc (#12562) vor 10 Monaten
  Georgi Gerganov 2d77d88e70 context : fix worst-case reserve outputs (#12545) vor 10 Monaten
  Akarshan Biswas c95fa362b3 ci: [SYCL] ggml-ci Use main GPU and enable sysman (#12547) vor 10 Monaten
  lhez 2b65ae3029 opencl: simplify kernel embedding logic in cmakefile (#12503) vor 10 Monaten
  Akarshan Biswas 48d7021c61 CI: fix SYCL build (#12546) vor 10 Monaten
  Tei Home 3361e2deba docs: update: improve the Fedoa CUDA guide (#12536) vor 10 Monaten
  compilade 00d53800e0 llama-vocab : add SuperBPE pre-tokenizer (#12532) vor 10 Monaten
  R0CKSTAR 7ea75035b6 CUDA: Fix clang warnings (#12540) vor 10 Monaten
  Prajwal B Mehendarkar c54f6b7988 mmap : skip resource limit checks on AIX (#12541) vor 10 Monaten
  Jeff Bolz 9b169a4d4e vulkan: fix mul_mat_vec failure in backend tests (#12529) vor 10 Monaten
  Marius Gerdes 77f9c6bbe5 server : Add verbose output to OAI compatible chat endpoint. (#12246) vor 10 Monaten
  Lars Sonchocky-Helldorf 18b663d8e4 install : add macports (#12518) vor 10 Monaten
  Xuan-Son Nguyen fbdfefe74e llama : gemma3 : use output tensor if it exists in model weight (#12506) vor 10 Monaten
  Georgi Gerganov ba932dfb50 ggml : fix quantized cpy op (#12310) vor 10 Monaten
  R0CKSTAR fac63a3d78 musa: refine compute capability (#12493) vor 10 Monaten
  Jeff Bolz eddfb43850 vulkan: Optimize mul_mat_vec p021 and nc shaders (#12505) vor 10 Monaten
  stduhpf 4375415b4a Vulkan: RTE rounding for cpy to quant (#12480) vor 10 Monaten
  Eve 30c42ef5cb vulkan: workaround for AMD Windows driver 16 bit unpack8 bug (#12472) vor 10 Monaten
  Georgi Gerganov af04481e6b model : do not repack if a GPU device is present (#12498) vor 10 Monaten
  Sigbjørn Skjæret 960e726077 chore : cleanup llama_model_loader::TENSOR_ usage (#12492) vor 10 Monaten
  marcoStocchi ea1518e839 llama-tts : avoid crashes related to bad model file paths (#12482) vor 10 Monaten
  蕭澧邦 1aa87ee53d [SYCL] Fix build on Windows when ccache enabled (#9954) (#9976) vor 10 Monaten
  Svetlozar Georgiev 9ffcc9e374 sycl: cleanup oneDNN related code (#12097) vor 10 Monaten
  Woof Dog e04643063b webui : Prevent rerendering on textarea input (#12299) vor 10 Monaten
  Sigbjørn Skjæret dbb3a4739e llama : make Qwen2MoE QKV bias optional (#12477) vor 10 Monaten
  Srihari-mcw 3d82dbcbce ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (#12332) vor 10 Monaten