Commit History

Author SHA1 Message Date
  Juk Armstrong daa422881a llama : DeepSeek V2/V3 MLA implementation (#12801) 9 months ago
  Srihari-mcw eccc7a1602 ggml : Add AVX512 implementation of GEMM - Q4_Kx8 (#12829) 9 months ago
  Chenguang Li 0019279bb5 CANN: Opt ROPE optimization (#12865) 9 months ago
  Xinpeng Dou b0c75ac9f9 CANN: Optimize CANN buffer pool memory management (#12875) 9 months ago
  Russyyds d6d2c2ab8c Add performance print for gemma3 in example (#12929) 9 months ago
  Akarshan Biswas 75afa0ae31 SYCL: Fix im2col (#12910) 9 months ago
  Radoslav Gerganov c772d54926 rpc : use ggml_context_ptr (#12938) 9 months ago
  Neo Zhang Jianyu 81c7e64fc2 disable curl lib check, this action is missed by commit bd3f59f81289b920bcc597a208c14f55e39ed37e (#12761) (#12937) 9 months ago
  Georgi Gerganov 526739b879 sync : ggml 9 months ago
  cmdr2 a25355e264 cpu: fix cpu backend's supports-op for GET_ROWS_BACK. fixes a fatal when running test-backend-ops with only the CPU backend (ggml/1190) 9 months ago
  SXX e959d32b1c ggml: use _mm[512/256]_dpbusd[_avx]_epi32 to directly accumulate into the result register (#12773) 9 months ago
  Alan Gray 307bfa253d ggml: disable CUDA graphs for unsupported DUP and CONT node types (#12891) 9 months ago
  Ed Addario 71e90e8813 quantize: Handle user-defined quantization levels for additional tensors (#12511) 9 months ago
  Prajwal B Mehendarkar bc091a4dc5 common : Define cache directory on AIX (#12915) 9 months ago
  Jeff Bolz a4837577aa vulkan: use aligned loads for flash attention mask (#12853) 9 months ago
  Matt Clayton e59ea539b8 llava: Fix cpu-only clip image encoding segfault (#12907) 9 months ago
  Georgi Gerganov c94085df28 server : add VSCode's Github Copilot Chat support (#12896) 9 months ago
  yuri@FreeBSD e8a62631b3 rpc : Set cache directory in rpc-server.cpp on FreeBSD (#12903) 9 months ago
  Olivier Chafik b6930ebc42 `tool-call`: fix non-tool-calling grammar crashes w/ Qwen / Hermes 2 templates (#12900) 9 months ago
  yuri@FreeBSD 68b08f36d0 common : Define cache directory on FreeBSD (#12892) 9 months ago
  Ewan Crawford 578754b315 sycl: Support sycl_ext_oneapi_limited_graph (#12873) 9 months ago
  tastelikefeet b2034c2b55 contrib: support modelscope community (#12664) 9 months ago
  Yuxuan Zhang 06bb53ad9b llama-model : add Glm4Model implementation for GLM-4-0414 (#12867) 9 months ago
  Xuan-Son Nguyen 0c50923944 clip : use smart pointer (⚠️ breaking change) (#12869) 9 months ago
  Akarshan Biswas fccf9cae83 SYCL: Add fp16 type support to unary op kernels (#12788) 9 months ago
  Daniel Han ec6c09d0fa convert : Llama4 RoPE fix (#12889) 9 months ago
  R0CKSTAR 8ac9f5d765 ci : Replace freediskspace to free_disk_space in docker.yml (#12861) 9 months ago
  Daniel Bevenius 12e9158f25 xcf : add check for visionos build version (#12854) 9 months ago
  Xuan-Son Nguyen 5b1f13cb64 convert : proper tensor name mapping for llama4 (#12870) 9 months ago
  Xuan-Son Nguyen 8b91d5355a llama : correct rms norm for llama 4 (#12882) 9 months ago