cturan/llama.cpp

작성자	SHA1 메시지	날짜
Sigbjørn Skjæret	144a4ce824 vendor : sync minja (#16500)	2 달 전
Jeff Bolz	f549b0007d vulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffer_copy (#16793)	2 달 전
Aman Gupta	9a3ea685b9 CUDA: Fix bug in topk-moe for gpt-oss (#16821)	2 달 전
YaelLogic	338074c383 sycl: add RMS_NORM_BACK operation support (#16808)	2 달 전
YaelGitAccount	851553ea6b cuda: add SET operation support (#16804)	2 달 전
Georgi Gerganov	85a7d8677b memory : remove KV cache size padding (#16812)	2 달 전
Georgi Gerganov	a8ca18b4b8 llama-bench : clarify benchmarked parts of the computation (#16823)	2 달 전
l3utterfly	8284efc35c initialise buffer.device in ggml_hexagon_session (#16816)	2 달 전
Sam Malayek	1c1409e131 embedding: add raw option for --embd-output-format (#16541)	2 달 전
Johannes Gäßler	7a0e900e36 llama: consistent ctx <-> buf order for KV cache (#16746)	2 달 전
Aldehir Rojas	280d97be96 grammar : support array references in json schema (#16792)	2 달 전
Chenguang Li	3479efd112 CANN: Improve device ID handling and aclnnArange checks (#16752)	2 달 전
Aman Gupta	463bbf20bf CUDA: add unused vars to mmvf and mmvq (#16807)	2 달 전
tamarPal	ad8d36beff sycl: add SSM_CONV operation support (#16800)	2 달 전
Yuri Khrustalev	c053e18a66 chat: Add LFM2 tool handling (#16763)	2 달 전
Xuan-Son Nguyen	e1ab084803 mtmd : fix idefics3 preprocessing (#16806)	2 달 전
Diego Devesa	5a4ff43e7d llama : disable pipeline parallelism if compute buffer allocation fails (#16748)	2 달 전
Acly	10640e31aa ggml : fix interpolate with align-corners and ne=1 (#16700)	2 달 전
Johannes Gäßler	80d28f104c HIP: fix AMDGPU_TARGETS, update documentation (#16803)	2 달 전
Xuan-Son Nguyen	c55d53acec model : add LightOnOCR-1B model (#16764)	2 달 전
Johannes Gäßler	945501f5ea llama: fix leaked buffers for mmap + split files (#16765)	2 달 전
Aman Gupta	75cbdd3fce test-backend-ops: print failed tests at the end (#16785)	2 달 전
tamarPal	2b9bd9bf4e sycl: add ROLL operation support (#16665)	2 달 전
shani-f	59fc1ec8e8 sycl: add REPEAT_BACK operation support (#16734)	2 달 전
Aman Gupta	75d33b9302 CUDA: support for weight clamp in top-k norm (#16702)	2 달 전
Acly	3470a5c891 ggml-alloc : make gallocr prefer chunks that allow memory reuse (#16788)	2 달 전
Sigbjørn Skjæret	bd562fe4f7 cuda : use fast copy when src and dst are of different type and contiguous (#16789)	2 달 전
leejet	bbac6a26b2 ggml: fix cuda kernel launch configuration for k_compute_batched_ptrs to support large batch (#16744)	2 달 전
Sigbjørn Skjæret	73a48c9790 convert : enable expert group selection for all models with it (#16691)	2 달 전
Sigbjørn Skjæret	f696428ce8 graph : add clamping to ffn_moe_weights_sum to avoid div-by-zero (#16655)	2 달 전

최신 이전

커밋 기록 찾기

커밋 기록