cturan/llama.cpp

نویسنده	SHA1 پیام	تاریخ
Aaron Teo	046d5fd44e llama: use host memory if device reports 0 memory (#18587)	3 هفته پیش
shaofeiqi	568371a726 opencl: add FILL op support (#18682)	3 هفته پیش
lhez	08566977a7 opencl: allow resizing transpose buffers (#18384)	1 ماه پیش
lhez	eb492bf43f opencl: unpack q4_0 for adreno in get_tensor (#18278)	1 ماه پیش
Phylliida Dev	09c7c50e64 ggml : add circular tiling support to pad, for Vulkan, CUDA, and CPU (used for making seamless textures) (#16985)	1 ماه پیش
Tarek Dakhran	2ba719519d model: LFM2-VL fixes (#17577)	2 ماه پیش
lhez	7cba58bbea opencl: add sqr, sqrt, mean and ssm_conv (#17476)	2 ماه پیش
lhez	8e9ddba610 opencl: refine condition for kqv mm (#17392)	2 ماه پیش
lhez	52e5d421f1 opencl: fix rms_norm_mul (#17250)	2 ماه پیش
shaofeiqi	4db5641210 opencl: add kernel to handle mat mul in attention to improve encoding speed (#17181)	2 ماه پیش
lhez	ece0f5c177 opencl: add fastdiv and use it in set_rows, ported from cuda (#17090)	2 ماه پیش
Acly	1032256ec9 cuda/vulkan : bicubic interpolation (#17022)	2 ماه پیش
lhez	c5023daf60 opencl: support imrope (#16914)	2 ماه پیش
Acly	10640e31aa ggml : fix interpolate with align-corners and ne=1 (#16700)	3 ماه پیش
lhez	6ea37f5739 opencl: fix warnings and clean up profiling (#16688)	3 ماه پیش
Shawn Gu	81387858f1 opencl: transposed gemm/gemv moe kernel with mxfp4,f32 (#16602)	3 ماه پیش
lhez	0cb7a0683b opencl: add q8_0 mm support (#16469)	3 ماه پیش
Aman Gupta	120bf7046d CUDA + openCL: fix bug in accessing rms_norm->src while doing fusion (#16577)	3 ماه پیش
lhez	5016b72862 opencl: fix build targeting CL 2 (#16554)	3 ماه پیش
lhez	7c156df414 opencl: support pad_ext (#15888)	4 ماه پیش
lhez	d1c84a662d opencl: support ne3 in get_rows (#15866)	4 ماه پیش
Sigbjørn Skjæret	3ecb2f671a ggml : implement set_rows with i32 index (#16159)	4 ماه پیش
lhez	51f5a45fbe opencl: fix concat crash on win arm64 with Adreno (#15944)	4 ماه پیش
lhez	c4510dc937 opencl: initial `q8_0` mv support (#15732)	4 ماه پیش
Shawn Gu	3edd87cd05 opencl: optimize mxfp4 kernels (#16037)	4 ماه پیش
Jeff Bolz	c0b45097c3 rename optimize_graph to graph_optimize (#16082)	4 ماه پیش
Jeff Bolz	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	4 ماه پیش
leejet	0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669)	4 ماه پیش
rmatif	820bc98531 opencl: add hs=40 to FA (#15758)	4 ماه پیش
rmatif	97669e4073 opencl: add attn sinks support for FA kernels (#15706)	5 ماه پیش

جدیدتر قدیمی‌تر

تاریخچه Commit ها یافتن

تاریخچه Commit ها