Commit History

Autor SHA1 Mensaxe Data
  Aleksei Nikiforov 64387f6e95 gguf-py: byteswapping improvements (#12851) hai 4 meses
  Joshua Cogliati d35a1e8c41 cli : change log to warning to explain reason for stopping (#15604) hai 4 meses
  Daniel Bevenius 46d9caa27a model-conversion : add mmproj conversion target (#15628) hai 4 meses
  matiaslin 5a0e3ef6f0 cuda: Add cublasLt_static linking when GGML_STATIC is enabled (#15622) hai 4 meses
  Johannes Gäßler fbef0fad7a server: higher timeout for tests (#15621) hai 4 meses
  Georgi Gerganov da54f9f1a2 presets : add qwen3-30B-a3b FIM (#15616) hai 4 meses
  uvos 47373271f9 HIP: Enable support for ggml_backend_cuda_register_host_buffer (#15615) hai 4 meses
  Georgi Gerganov 1bded5a3b3 kv-cache : better estimate of n_kv for multi-sequence batches (#15610) hai 4 meses
  Chenguang Li 1e7489745a CANN: refactor mask handling and improve performance in FA (#15561) hai 4 meses
  xctan 1cf123a343 ggml-cpu : add basic RVV support for vector f32 ops (#15057) hai 4 meses
  Daniel Bevenius fcca2182a1 common : add -m to bash completion for --model [no ci] (#15591) hai 4 meses
  rmatif 86076f92de OpenCL: add fused group_norm/norm, mul, add (#15314) hai 4 meses
  Diego Devesa bcbddcd54f tests : fix test-opt with GGML_BACKEND_DL (#15599) hai 4 meses
  Akarshan Biswas 8b69686136 SYCL: fix rms_norm_mul_add for tensor dim not a multiple of sg_size (#15592) hai 4 meses
  fidoriel 8ce3ff1d91 mtmd : fix mtmd ios build (#15579) hai 4 meses
  Eve 44b1efa41a tests: add performance test for mul mat id (#15543) hai 4 meses
  shalinib-ibm a6a58d6478 llamafile: PowerPC Sgemm Optimization (#15558) hai 4 meses
  Georgi Gerganov 0373486dbc graph : fix assert in memory-less build_attn (#15590) hai 4 meses
  Daniel Bevenius 62cef26ac5 model-conversion : add qat-q4 quantization targets (#15588) hai 4 meses
  Johannes Gäßler 8f5afa94c4 CUDA: return -1 for nonexistent compiled arch (#15587) hai 4 meses
  Georgi Gerganov b3964c1e89 metal : optimize FA vec for large sequences and BS <= 8 (#15566) hai 4 meses
  Xuan-Son Nguyen 79a546220c mtmd : support Kimi VL model (#15458) hai 4 meses
  Georgi Gerganov 85cc1ae998 context : print graph stats for memory-less contexts (#15586) hai 4 meses
  Georgi Gerganov 1d8d83deaa metal : improve `MUL_MAT_ID` (#15541) hai 4 meses
  tc-mb c4e9239064 model : support MiniCPM-V 4.5 (#15575) hai 4 meses
  Sigbjørn Skjæret 39842a7f73 gguf-py : remove erroneous FFN_GATE entry (#15583) hai 4 meses
  Sigbjørn Skjæret 0fd90db585 metal : remove contiguous assertion for src0 in IM2COL (#15577) hai 4 meses
  Yoshi_likes_e4 4c37636b3e Add a warning for special devices (#15563) hai 4 meses
  Jeff Bolz 34bdbbd7c2 vulkan: Remove splitting for mul_mat_id (#15568) hai 4 meses
  Qeeweew 74f52f77f2 CUDA: Accelerate MXFP4 table lookup using `__byte_perm` (#15451) hai 4 meses