Commit History

Autor SHA1 Mensaxe Data
  musoles 7a689c415e README : added kalavai to infrastructure list (#11216) hai 1 ano
  Jeff Bolz bd38ddea01 vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (#11166) hai 1 ano
  Jeff Bolz 466300fe14 vulkan: optimize coopmat2 q4_k/q5_k dequant functions. (#11206) hai 1 ano
  Jeff Bolz 206bc53422 vulkan: optimize coopmat2 q2_k dequant function (#11130) hai 1 ano
  RunningLeon 4dbc8b9cb7 llama : add internlm3 support (#11233) hai 1 ano
  Johannes Gäßler 9c8dcefe17 CUDA: backwards pass for misc. ops, add tests (#11257) hai 1 ano
  Xuan Son Nguyen 681149ced2 llama : add `llama_model_load_from_splits` (#11255) hai 1 ano
  fj-y-saito c67cc9837d ggml: aarch64: implement SVE kernels for q4_K_q8_K vector dot (#11227) hai 1 ano
  Eve adc5dd92e8 vulkan: scale caching for k quants + misc fixes (#11081) hai 1 ano
  Georgi Gerganov f11cfdfd7f ci : use -no-cnv in gguf-split tests (#11254) hai 1 ano
  Junil Kim 1d8504338e fix: ggml: fix vulkan-shaders-gen build (#10448) hai 1 ano
  Johannes Gäßler 432df2d5f9 RoPE: fix back, CUDA support for back + noncont. (#11240) hai 1 ano
  Daniel Bevenius 0ccd7f3eb2 examples : add embd_to_audio to tts-outetts.py [no ci] (#11235) hai 1 ano
  Akarshan Biswas f446c2cf6a SYCL: Add gated linear attention kernel (#11175) hai 1 ano
  Xuan Son Nguyen b4d92a59a2 ci : add -no-cnv for tests (#11238) hai 1 ano
  Georgi Gerganov bbf3e55e35 vocab : add dummy tokens for "no_vocab" type (#11231) hai 1 ano
  ebraminio c5bf0d1bd7 server : Improve code snippets direction between RTL text (#11221) hai 1 ano
  Olivier Chafik 091592d758 Refactor test-chat-template.cpp (#11224) hai 1 ano
  Georgi Gerganov 44d1e796d0 sync : ggml hai 1 ano
  Georgi Gerganov a4f3f5d8e6 scripts : sync gguf (cont) hai 1 ano
  Georgi Gerganov 48e1ae0e61 scripts : sync gguf hai 1 ano
  Georgi Gerganov d00a80e89d scripts : sync opencl hai 1 ano
  ebraminio 504af20ee4 server : (UI) Improve messages bubble shape in RTL (#11220) hai 1 ano
  Xuan Son Nguyen 84a44815f7 cli : auto activate conversation mode if chat template is available (#11214) hai 1 ano
  Andreas Kieslinger 39509fb082 cuda : CUDA Graph Compute Function Refactor (precursor for performance improvements) (#11042) hai 1 ano
  Georgi Gerganov a29f0870d4 contrib : add naming guidelines (cont) (#11177) hai 1 ano
  ebraminio 437e05f714 server : (UI) Support for RTL text as models input or output (#11208) hai 1 ano
  Georgi Gerganov ca001f6656 contrib : add naming guidelines (cont) (#11177) hai 1 ano
  Xuan Son Nguyen 00b4c3da62 common : support tag-based --hf-repo like on ollama (#11195) hai 1 ano
  Georgi Gerganov 7426a26b24 contrib : add naming guidelines (#11177) hai 1 ano