Commit History

Autor SHA1 Mensaxe Data
  Andrew Aladjev 4a4f7e6550 cli: fixed dead links to tools/main for cli and completion, fixed code owners (#17993) hai 1 mes
  Thomas Jarosch e73d548659 webui: add "delete all conversations" button to import/export tab (#17444) hai 1 mes
  Johannes Gäßler b1f3a6e5db llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (#16653) hai 1 mes
  Neo Zhang Jianyu 4aced7a631 [SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai (#17826) hai 1 mes
  piDack 745fa0e78b model : add glm-asr support (#17901) hai 1 mes
  Xuan-Son Nguyen 52392291b2 preset: handle negated arg, reverse the meaning if needed (#18041) hai 1 mes
  Sigbjørn Skjæret 5c8a717128 convert : refactor rope scaling handling (#18013) hai 1 mes
  Haowei Wu 37f5a1093b mtmd: enhance image resizing in llava_uhd (#18014) hai 1 mes
  Ruben Ortlam 9e6649ecf2 vulkan: fix mul_mat_vec_iq1_s formatting (#18026) hai 1 mes
  Xuan-Son Nguyen 0759b09c90 graph: add f_attn_temp_offset (#18025) hai 1 mes
  Georgi Gerganov 254098a279 common : refactor common_sampler + grammar logic changes (#17937) hai 1 mes
  Jeff Bolz 3238b1400c vulkan: Fix data race/hang in scalar/cm1 flash attention (#17887) hai 1 mes
  lovedheart 4722671641 vulkan: improve mul_mat_vec_iq1_s speed (#17874) hai 1 mes
  Eve d15d177f43 vulkan: faster q6_k matmul (#17813) hai 1 mes
  Georgi Gerganov 77ad8542bd model-conversion : cast logits to float32 (#18009) hai 1 mes
  Georgi Gerganov 609a2d0268 models : fix YaRN regression + consolidate logic (#18006) hai 1 mes
  Georgi Gerganov a63cbafbbc ggml : arm repack fix build hai 1 mes
  Georgi Gerganov 0e59224990 sync : ggml hai 1 mes
  Georgi Gerganov 71fdcf0616 ggml : arm repack fix build (whisper/0) hai 1 mes
  Congcong Cai 615655aafe cmake : set `CMAKE_RUNTIME_OUTPUT_DIRECTORY` for non standalone build (ggml/1394) hai 1 mes
  Xuan-Son Nguyen c00ff929dc scripts: add script to compare logprobs of llama.cpp against other frameworks (#17947) hai 1 mes
  Sergey Fedorov 4ed2bae50d server-models.cpp: add missing <filesystem> (#18000) hai 1 mes
  Jeff Bolz 5266379bca llama_context: synchronize before reallocating output buffer (#17974) hai 1 mes
  Xuan-Son Nguyen 4d5ae24c0a arg: fix common_params_parse not accepting negated arg (#17991) hai 1 mes
  Gustavo Rocha Dias 66ba51252e cmake: correct scope - link ws2_32 for MinGW/w64devkit builds in cpp-httplib (#17972) hai 1 mes
  Jeff Bolz 36255a2268 vulkan: support get_rows for i32 (#17941) hai 1 mes
  Jeff Bolz 3229a23fa6 vulkan: support GGML_OP_DIAG (#17893) hai 1 mes
  Jeff Bolz 303f8615e9 vulkan: Multi-pass softmax for large number of cols (#17892) hai 1 mes
  Georgi Gerganov 3c6391e748 speculative-simple : free batch on exit (#17985) hai 1 mes
  Sigbjørn Skjæret 8e4d678528 common : skip model validation when --completion-bash is requested (#17975) hai 1 mes