cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Jeff Bolz	879d673759 vulkan: Implement top-k (#17418)	há 1 mês atrás
Georgi Gerganov	583cb83416 ggml : add ggml_top_k (#17365)	há 1 mês atrás
Jeff Bolz	d414db02d3 vulkan: Use fewer rows for scalar FA when HS is not a multiple of 16 (#17455)	há 1 mês atrás
Sigbjørn Skjæret	96ac5a2329 cuda : support non-contiguous i32 to i32 copy (#17326)	há 1 mês atrás
Masato Nakasaka	3f3a4fb9c3 Revive MUL_MAT_ID to perf testing (#17397)	há 1 mês atrás
Giuseppe Scrivano	7d77f07325 vulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP, ROUND, CEIL, FLOOR, TRUNC (#17319)	há 2 meses atrás
Jeff Bolz	1fa4551af0 vulkan: support larger argsort (#17313)	há 2 meses atrás
Piotr Wilkin (ilintar)	6fd4f95367 Fix too relaxed check on CUDA "fast copy" (can_be_transposed) condition (#17332)	há 2 meses atrás
Georgi Gerganov	1a139644a8 metal : add cumsum (#17305)	há 2 meses atrás
Jeff Bolz	24dc769f1b vulkan: Fuse mul_mat_id+add_id+mul and mul_mat+add+add. (#17287)	há 2 meses atrás
Georgi Gerganov	45c6ef7307 metal : support argsort for ne00 > 1024 (#17247)	há 2 meses atrás
Piotr Wilkin (ilintar)	389ac78b26 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063)	há 2 meses atrás
Diego Devesa	879dec341a ggml-cpu : use template for argsort (#17222)	há 2 meses atrás
duduta	73460f6278 ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (#16805)	há 2 meses atrás
Acly	1032256ec9 cuda/vulkan : bicubic interpolation (#17022)	há 2 meses atrás
Ruben Ortlam	8a3519b708 vulkan: fix mmq out of bounds reads (#17108)	há 2 meses atrás
Jeff Bolz	80a6cf6347 vulkan: fuse mul_mat_id + mul (#17095)	há 2 meses atrás
Aman Gupta	64fe17fbb8 Revert "CUDA: add expert reduce kernel (#16857)" (#17100)	há 2 meses atrás
Aman Gupta	c1b187688d CUDA: skip fusion for repeating adds in bias (#17080)	há 2 meses atrás
Jeff Bolz	b4e335d8dc vulkan: fuse rms_norm + mul + rope (+ view + set_rows) (#16977)	há 2 meses atrás
bssrdf	299f5d782c CUDA: properly handle nb00=nb02 case for cpy (#17081)	há 2 meses atrás
Johannes Gäßler	aa374175c3 CUDA: fix crash on uneven context without FA (#16988)	há 2 meses atrás
bssrdf	230d1169e5 improve CUDA cpy memory bandwidth when copying transposed tensor (#16841)	há 2 meses atrás
Shagun Bera	a2054e3a8f test-backend-ops : fix segfault in moe-expert-reduce test in support mode and coverage (#16936)	há 2 meses atrás
Georgi Gerganov	2f966b8ed8 clip : use FA (#16837)	há 2 meses atrás
Aman Gupta	4146d6a1a6 CUDA: add expert reduce kernel (#16857)	há 2 meses atrás
Ruben Ortlam	d2a2673dd1 vulkan: fix shmem overrun in mmq id shader (#16873)	há 2 meses atrás
JJJYmmm	d261223d24 model: add support for qwen3vl series (#16780)	há 2 meses atrás
Sigbjørn Skjæret	229bf68628 cuda : fix argsort with 64k+ rows (#16849)	há 2 meses atrás
Jeff Bolz	b9ce940177 vulkan: Fuse rope+set_rows (#16769)	há 2 meses atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits