1
0

Коммит түүх

Эзэн SHA1 Мессеж Огноо
  Djip007 0bb2919335 llama : change cpu_buft_list order: ACCEL -> GPU host -> CPU extra -> CPU (#12632) 9 сар өмнө
  Jay a69f846351 cmake : fix ccache conflict (#12522) 9 сар өмнө
  hipudding d07a0d7a79 CANN : remove clang-format in ggml-cann (#12607) 9 сар өмнө
  Sigbjørn Skjæret 3714c3ee1a llama : fix incorrect Qwen2Moe ffn_moe_out graph callback (#12631) 9 сар өмнө
  Georgi Gerganov b4ae50810e metal : improve FA + improve MoE (#12612) 9 сар өмнө
  Icenowy Zheng b86f600723 vulkan: fix coopmat shader generation when cross-compiling (#12272) 9 сар өмнө
  Johannes Gäßler dd373dd3bf llama: fix error on bad grammar (#12628) 9 сар өмнө
  Benson Wong 5d01670266 server : include speculative decoding stats when timings_per_token is enabled (#12603) 9 сар өмнө
  Radoslav Gerganov ef03229ff4 rpc : update README for cache usage (#12620) 9 сар өмнө
  amritahs-ibm 13731766db llamafile : ppc64le GEMV forwarding for FP32. (#12594) 9 сар өмнө
  Radoslav Gerganov ab6ab8f809 rpc : send hash when tensor data is above some fixed threshold (#12496) 9 сар өмнө
  Piotr 2099a9d5db server : Support listening on a unix socket (#12613) 9 сар өмнө
  Georgi Gerganov 2969019837 media : add SVG logo [no ci] (#12616) 9 сар өмнө
  lhez 5dec47dcd4 opencl: add multi and vision rope, `gelu_quick` and `im2col` (#12600) 9 сар өмнө
  Si1w f125b8dccf llama : add PLM GGUF Conversion & Inference Support (#12457) 9 сар өмнө
  HighDoping 953c2a62cf model : restore support for T5Encoder (#12590) 9 сар өмнө
  Csaba Kecskemeti d5c6309d91 convert : Support Qwen2_5_VLForConditionalGeneration (#12595) 9 сар өмнө
  Georgi Gerganov 029c693fdc sync : ggml 9 сар өмнө
  Georgi Gerganov 771d84371c scripts : update sync + fix cmake merge 9 сар өмнө
  Georgi Gerganov df0665a483 sync : ggml 9 сар өмнө
  Georgi Gerganov 0306aad1ca cmake : sync/merge PowerPC build commands (#0) 9 сар өмнө
  amritahs-ibm c7b43ab608 llamafile : ppc64le MMA implementation for Q4_0. (#12489) 9 сар өмнө
  xctan 24feaec057 ggml : riscv: add 128-bit RVV support (#12530) 9 сар өмнө
  Georgi Gerganov f28bc4c286 llama : make loras compatible with repacking (#12593) 9 сар өмнө
  Akarshan Biswas f17a3bb4e8 SYCL: implement memset ggml backend buffer interface (#12580) 9 сар өмнө
  Slobodan Josic bd40678df7 HIP: Add support for RDNA4 targets (#12372) 9 сар өмнө
  Georgi Gerganov b3298fa47a metal : refactor mat-vec code (#12569) 10 сар өмнө
  Michał Moskal 2447ad8a98 upgrade to llguidance 0.7.10 (#12576) 10 сар өмнө
  Ivy233 02082f1519 clip: Fix llama-llava-clip-quantize-cli quantization error under CUDA backend (#12566) 10 сар өмнө
  Georgi Gerganov df4d20cd53 convert : fix squeeze for ssm_conv tensors (#12573) 10 сар өмнө