Commit History

Author SHA1 Message Date
  Radoslav Gerganov af6f91db47 ggml : remove ggml_graph_import and ggml_graph_export declarations (ggml/1247) 7 months ago
  Georgi Gerganov a7b8d35f78 sync : whisper.cpp (ggml/1250) 7 months ago
  Radoslav Gerganov 6eba72b71c ggml : install dynamic backends (ggml/1240) 7 months ago
  Daniel Tang fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) 7 months ago
  ddh0 8726392d3d readme : update bindings (#13950) 7 months ago
  Georgi Gerganov c04621711a parallel : fix n_junk == 0 (#13952) 7 months ago
  Georgi Gerganov 0fc16b42e8 kv-cache : split implementation in separate sources (#13920) 7 months ago
  Max Krasnyansky 053b1539c0 threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling (#12995) 7 months ago
  Jiří Podivín b3a89c3d9e docs : Note about necessity of having libcurl installed for standard build. (#13945) 7 months ago
  Olivier Chafik e15898d1c7 server: allow unclosed thinking tags (#13931) 7 months ago
  Georgi Gerganov 803f8baf4f llama : deprecate explicit kv_self defrag/update calls (#13921) 7 months ago
  Georgi Gerganov 3600cc2886 llama : use n_swa + n_ubatch cells for SWA cache (#13833) 7 months ago
  igardev c7e0a2054b webui : Replace alert and confirm with custom modals. (#13711) 7 months ago
  Georgi Gerganov 3f55f781f1 llama : auto-batch preparation (#13845) 7 months ago
  Xuan-Son Nguyen 51fa76f172 mtmd : drop `_shared` from `libmtmd` name, merge helpers into libmtmd (⚠️ breaking change) (#13917) 7 months ago
  Georgi Gerganov 12d0188c0d kv-cache : refactor + add llama_memory_state_i (#13746) 7 months ago
  Shawn yang eb3949938e CUDA: add a prop in ggml_cuda_device_infor for distinguish iGPU or dGPU in cuda (#13856) (#13895) 7 months ago
  Johannes Gäßler e562eece7c CUDA: fix typo in FlashAttention code (#13926) 7 months ago
  Diego Devesa b47ab7b8e9 sched : avoid changing cur_copy when a graph is already allocated (#13922) 7 months ago
  Georgi Gerganov dd665cc9d4 parallel : increase the variability of the prompt lengths (#13927) 7 months ago
  Diego Devesa df0c0c7d02 cuda : prevent using split buffers with 3d/4d matrices (#13919) 7 months ago
  Akarshan Biswas b49a8ff96b SYCL: Add mrope kernel (#13755) 7 months ago
  Georgi Gerganov 53f925074d sync : vendor (#13901) 7 months ago
  Sigbjørn Skjæret db38704f01 convert : fix rwkv bos/eos token (#13844) 7 months ago
  Xuan-Son Nguyen 07e4351ce6 convert : allow partial update to the chkhsh pre-tokenizer list (#13847) 7 months ago
  Đinh Trọng Huy 291f2b6913 llama : add support for DistilBert (#13907) 7 months ago
  zhangkaihuo 2c90da4c7e llama : use llm_build_granite for minicpm (#13911) 7 months ago
  Christian Kastner ec9e0301fe cmake: Guard GGML_CPU_ALL_VARIANTS by architecture (#13890) 7 months ago
  Sigbjørn Skjæret e83ba3e460 llama : add support for jina-reranker-v2 (#13900) 7 months ago
  Sigbjørn Skjæret 2b131621e6 gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method (#13561) 7 months ago