cturan/llama.cpp

作者	SHA1 メッセージ	日付
rmatif	97669e4073 opencl: add attn sinks support for FA kernels (#15706)	4 ヶ月前
Chenguang Li	2f853687b3 CANN: Support eager execution mode under ACL graph compilation (#15712)	4 ヶ月前
hipudding	ef2af57ddf CANN: Support ext_factor in rope (#15710)	4 ヶ月前
Johannes Gäßler	5d804a4938 ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722)	4 ヶ月前
Gilad S.	d4d8dbe383 vulkan: use memory budget extension to read memory usage (#15545)	4 ヶ月前
Jeff Bolz	35a42edac8 vulkan: add missing clamps in new mul_mat_id paths (#15702)	4 ヶ月前
Ruben Ortlam	fec7911f8f vulkan: disable large mmv subgroups on older Nvidia GPUs (#15717)	4 ヶ月前
s-goto-11	078ce23ea7 ggml: SVE support for exponential functions (#15145)	4 ヶ月前
Prashant Vithule	a0c2b207c5 ggml: aarch64: Implement SVE F16 kernels for vector functions (#15115)	4 ヶ月前
Jie Fu (傅杰)	4b20d8b7e3 convert : remove redundant code (#15708)	4 ヶ月前
Ruben Ortlam	02c1813517 Vulkan: Add Integer Dot Product mul_mat_vec shader for legacy quants (#14903)	4 ヶ月前
Daniel Bevenius	77dee9de97 ggml : WebGPU add TRANSPOSE and RESHAPE to supported ops (#15695)	4 ヶ月前
Jie Fu (傅杰)	4795c91c32 docs : add Hunyuan to models section (#15707)	4 ヶ月前
Akarshan Biswas	b66df9d9c9 CUDA: fix build error from ambiguous __half conversions in conv2d (#15690)	4 ヶ月前
hipudding	b9382c3877 CANN: Optimize MUL_MAT_ID (#15658)	4 ヶ月前
hipudding	3dc7397a27 CANN: fix RoPE cache issue on multi-device (#15629)	4 ヶ月前
Georgi Gerganov	e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665)	4 ヶ月前
Georgi Gerganov	0d161f021a server : enable /slots by default and make it secure (#15630)	4 ヶ月前
Georgi Gerganov	4efd5a8316 metal : fix checks for available FA kernels (#15700)	4 ヶ月前
Diego Devesa	274966226f llama : fix fattn reserve call n_seqs parameter (#15699)	4 ヶ月前
Diego Devesa	9777032dcc llama : separate compute buffer reserve from fattn check (#15696)	4 ヶ月前
Sigbjørn Skjæret	7d3c9f2b21 ci : explicitly set fa off or on (#15692)	4 ヶ月前
Jeff Bolz	bbbf5ecccb vulkan: handle large sizes for get_rows (#15686)	4 ヶ月前
Jeff Bolz	c37052ab4d vulkan: mul_mat_id coopmat2 optimizations (#15546)	4 ヶ月前
Daniel Bevenius	5c16b9c87d vulkan : remove unused portability_enumeration_ext variable (#15679)	4 ヶ月前
Jeff Bolz	b97c9edc59 vulkan: Allow fallback to sysmem memory when vidmem is full (#15649)	4 ヶ月前
Jeff Bolz	94e82c7ead vulkan: clamp matmul and FA results to the max finite value (#15652)	4 ヶ月前
Charles Xu	4d74393bcc ggml: update kleidiai to v1.13.0 (#15663)	4 ヶ月前
Diego Devesa	dd892555b0 Update build.md to remove MSVC arm64 notes (#15684)	4 ヶ月前
Johannes Gäßler	e81b8e4b7f llama: use FA + max. GPU layers by default (#15434)	4 ヶ月前

新しい古い

コミット履歴 検索

コミット履歴