커밋 기록

작성자 SHA1 메시지 날짜
  Sigbjørn Skjæret 144a4ce824 vendor : sync minja (#16500) 2 달 전
  Jeff Bolz f549b0007d vulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffer_copy (#16793) 2 달 전
  Aman Gupta 9a3ea685b9 CUDA: Fix bug in topk-moe for gpt-oss (#16821) 2 달 전
  YaelLogic 338074c383 sycl: add RMS_NORM_BACK operation support (#16808) 2 달 전
  YaelGitAccount 851553ea6b cuda: add SET operation support (#16804) 2 달 전
  Georgi Gerganov 85a7d8677b memory : remove KV cache size padding (#16812) 2 달 전
  Georgi Gerganov a8ca18b4b8 llama-bench : clarify benchmarked parts of the computation (#16823) 2 달 전
  l3utterfly 8284efc35c initialise buffer.device in ggml_hexagon_session (#16816) 2 달 전
  Sam Malayek 1c1409e131 embedding: add raw option for --embd-output-format (#16541) 2 달 전
  Johannes Gäßler 7a0e900e36 llama: consistent ctx <-> buf order for KV cache (#16746) 2 달 전
  Aldehir Rojas 280d97be96 grammar : support array references in json schema (#16792) 2 달 전
  Chenguang Li 3479efd112 CANN: Improve device ID handling and aclnnArange checks (#16752) 2 달 전
  Aman Gupta 463bbf20bf CUDA: add unused vars to mmvf and mmvq (#16807) 2 달 전
  tamarPal ad8d36beff sycl: add SSM_CONV operation support (#16800) 2 달 전
  Yuri Khrustalev c053e18a66 chat: Add LFM2 tool handling (#16763) 2 달 전
  Xuan-Son Nguyen e1ab084803 mtmd : fix idefics3 preprocessing (#16806) 2 달 전
  Diego Devesa 5a4ff43e7d llama : disable pipeline parallelism if compute buffer allocation fails (#16748) 2 달 전
  Acly 10640e31aa ggml : fix interpolate with align-corners and ne=1 (#16700) 2 달 전
  Johannes Gäßler 80d28f104c HIP: fix AMDGPU_TARGETS, update documentation (#16803) 2 달 전
  Xuan-Son Nguyen c55d53acec model : add LightOnOCR-1B model (#16764) 2 달 전
  Johannes Gäßler 945501f5ea llama: fix leaked buffers for mmap + split files (#16765) 2 달 전
  Aman Gupta 75cbdd3fce test-backend-ops: print failed tests at the end (#16785) 2 달 전
  tamarPal 2b9bd9bf4e sycl: add ROLL operation support (#16665) 2 달 전
  shani-f 59fc1ec8e8 sycl: add REPEAT_BACK operation support (#16734) 2 달 전
  Aman Gupta 75d33b9302 CUDA: support for weight clamp in top-k norm (#16702) 2 달 전
  Acly 3470a5c891 ggml-alloc : make gallocr prefer chunks that allow memory reuse (#16788) 2 달 전
  Sigbjørn Skjæret bd562fe4f7 cuda : use fast copy when src and dst are of different type and contiguous (#16789) 2 달 전
  leejet bbac6a26b2 ggml: fix cuda kernel launch configuration for k_compute_batched_ptrs to support large batch (#16744) 2 달 전
  Sigbjørn Skjæret 73a48c9790 convert : enable expert group selection for all models with it (#16691) 2 달 전
  Sigbjørn Skjæret f696428ce8 graph : add clamping to ffn_moe_weights_sum to avoid div-by-zero (#16655) 2 달 전