cturan/llama.cpp

Author	SHA1 Message	Date
Jeff Bolz	3e4bb29666 vulkan: Check maxStorageBufferRange in supports_op (#18709)	2 weeks ago
Jeff Bolz	8e2da778da vulkan: change memory_logger to be controlled by an env var (#18769)	2 weeks ago
Jeff Bolz	2bbe4c2cf8 vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) (#18678)	2 weeks ago
Ruben Ortlam	1051ecd289 vulkan: Disable large coopmat matmul configuration on proprietary AMD driver (#18763)	2 weeks ago
Ruben Ortlam	0e76501e1d Vulkan: Optimize Matmul parameters for AMD GPUs with Coopmat support (#18749)	2 weeks ago
Jeff Bolz	2524c26164 vulkan: fix push constant size for quantize_q8_1 (#18687)	3 weeks ago
Jeff Bolz	cb14b06995 vulkan: optimize ssm_scan (#18630)	3 weeks ago
Doctor Shotgun	9a5724dee2 ggml: add env var GGML_OP_OFFLOAD_MIN_BATCH (#18535)	3 weeks ago
Jeff Bolz	ca4a8370bc vulkan: reject ops when a tensor is too large to allocate (#18646)	3 weeks ago
virajwad	03023296cf vulkan: Warptile tuning for Intel Xe2/Xe3 (#18178)	3 weeks ago
Jeff Bolz	ea13cba850 vulkan: support buffer_from_host_ptr (#18467)	3 weeks ago
Jeff Bolz	b37124d2d2 vulkan: handle quantize_q8_1 overflowing the max workgroup count (#18515)	3 weeks ago
Jeff Bolz	18ddaea2ae vulkan: Optimize GGML_OP_CUMSUM (#18417)	3 weeks ago
Jeff Bolz	706e3f93a6 vulkan: Implement mmvq for iq1_s/iq1_m (#18450)	3 weeks ago
Jeff Bolz	be47fb9285 vulkan: extend topk_moe to handle sigmoid w/exp_probs_b for nemotron (#18295)	4 weeks ago
Jeff Bolz	c9ced4910b vulkan: preprocess mul_mat_id experts and discard workgroups more quickly (#18352)	1 month ago
Jeff Bolz	7ac8902133 vulkan: optimize decodeFuncB in coopmat2 mul_mat_id shader (#18349)	1 month ago
Jeff Bolz	9bf20d8ac3 vulkan: Use BK=32 for coopmat2 mul_mat_id (#18332)	1 month ago
Jeff Bolz	b96b82fc85 vulkan: Support UPSCALE w/antialias (#18327)	1 month ago
Jeff Bolz	10dc500bdb vulkan: handle rope with large number of rows (#18306)	1 month ago
Jeff Bolz	2a9ea2020c vulkan: fix command buffer corruption in ggml_backend_vk_event_wait (#18302)	1 month ago
Ruben Ortlam	7f459c98e7 vulkan: use fewer FA rows for small cache runs (#18280)	1 month ago
Jeff Bolz	e3b35ddf1c vulkan: Extend rope fusions to allow mrope (#18264)	1 month ago
Jeff Bolz	e1f15b454f vulkan: Implement set_tensor_async and the event interfaces (#18047)	1 month ago
Jeff Bolz	fd05c51cec vulkan: fix im2col overflowing maxworkgroupcount (#18180)	1 month ago
Jeff Bolz	b365c3ff01 vulkan/cuda: fix topk_moe with exp_probs_b (#18071)	1 month ago
Jeff Bolz	cb64222b0c vulkan: support GGML_UNARY_OP_XIELU (#18062)	1 month ago
Jeff Bolz	6eb7081860 vulkan: in graph_optimize, try to group ADD operations (#18060)	1 month ago
Jeff Bolz	cdbada8d10 vulkan: Add perf logger mode with concurrency (#17944)	1 month ago
Jeff Bolz	36255a2268 vulkan: support get_rows for i32 (#17941)	1 month ago

