Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| fanyang | 456af35eb7 | build : suppress gcc15 compile warnings (#14261) | 7 months ago |
| Anton Mitkov | 600e3e9b50 | sycl: Cleanup codepaths in Get Rows in sycl backend (#14215) | 7 months ago |
| bashayer hijji | fffcce535e | llama-bench : add --no-warmup flag (#14224) (#14270) | 7 months ago |
| pqnet | 5fc7856815 | convert : fix remote option in Windows (#14100) | 7 months ago |
| Aaron Teo | faed5a5f5d | llamafile : support s390x SIMD instruction set (#14273) | 7 months ago |
| 0cc4m | 10bb545c5b | Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (#14249) | 7 months ago |
| Gabe Goodhart | edc4a29eff | memory : Hybrid recurrent cache (#13979) | 7 months ago |
| Georgi Gerganov | ed3290ab34 | metal : add mean kernel (#14267) | 7 months ago |
| Aaron Teo | 8d94713654 | docs: add s390x build documentation (#14264) | 7 months ago |
| Aaron Teo | 50d2227953 | ggml-cpu: reduce asm calls for hsum (#14037) | 7 months ago |
| Aaron Teo | 6231c5cd6d | ggml-cpu: fix uncaught underscore terminators (#14023) | 7 months ago |
| Charles Xu | ef035803eb | ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (#14258) | 7 months ago |
| Xuan-Son Nguyen | 413977de32 | mtmd : refactor llava-uhd preprocessing logic (#14247) | 7 months ago |
| Xuan-Son Nguyen | 95402553a5 | llama-chat : fix multiple system message for gemma, orion (#14246) | 7 months ago |
| Sigbjørn Skjæret | 3865cff4f5 | convert : fix null head_dim AutoConfig regression (#14248) | 7 months ago |
| Georgi Gerganov | d03172cc79 | sync : ggml | 7 months ago |
| Daniel Bevenius | dd8e59f443 | ggml : disable warnings for tests when using MSVC (ggml/1273) | 7 months ago |
| Daniel Bevenius | bbe98d2784 | ggml : remove unused ggml_context_container (ggml/1272) | 7 months ago |
| Daniel Bevenius | c2056ed6d4 | examples : include examples in msvc disable warn (ggml/1270) | 7 months ago |
| bandoti | c46503014d | cmake: remove shader-gen step-targets from ggml-vulkan (#14226) | 7 months ago |
| xctan | 860a9e4eef | ggml-cpu : remove the weak alias trick (#14221) | 7 months ago |
| R0CKSTAR | fe9d60e74a | musa: fix build warning (unused variable) (#14231) | 7 months ago |
| Sigbjørn Skjæret | e434e69183 | common : suggest --jinja when autodetection fails (#14222) | 7 months ago |
| Georgi Gerganov | 89fea80d29 | server : fix incorrect usage of llama_get_embeddings() (#14225) | 7 months ago |
| Diego Devesa | 6adc3c3ebc | llama : add thread safety test (#14035) | 7 months ago |
| bandoti | 0dbcabde8c | cmake: clean up external project logic for vulkan-shaders-gen (#14179) | 7 months ago |
| Đinh Trọng Huy | ad590be98c | model : add NeoBERT (#14164) | 7 months ago |
| uvos | 7d6d91babf | HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202) | 7 months ago |
| Georgi Gerganov | d3e64b9f49 | llama : rework embeddings logic (#14208) | 7 months ago |
| Charles Xu | 3ba0d843c6 | ggml: Add Android support for GGML_CPU_ALL_VARIANTS (#14206) | 7 months ago |