cturan/llama.cpp

Author	SHA1 Message	Date
Diego Devesa	c5b0f4b5d9 llama : refactor model loader with backend registry (#10026)	1 year ago
Changyeon Kim	8f275a7c45 ggml: Add POOL2D OP for GPU acceleration to the Vulkan backend in the MobileVLM model. (#9763)	1 year ago
Diego Devesa	f010b77a37 vulkan : add backend registry / device interfaces (#9721)	1 year ago
Diego Devesa	dca1d4b58a ggml : fix BLAS with unsupported types (#9775)	1 year ago
SRHMorris	b0915d5b51 vulkan : retry allocation with fallback flags (whisper/2451)	1 year ago
Diego Devesa	c83ad6d01e ggml-backend : add device and backend reg interfaces (#9707)	1 year ago
Radoslav Gerganov	00b7317e63 vulkan : do not use tensor->extra (#9407)	1 year ago
Salvatore Mesoraca	cb00020504 vulkan : mul_mat: fix UB with small warps (ggml/952)	1 year ago
Jeff Bolz	641002fba8 vulkan : multithread pipeline creation (ggml/963)	1 year ago
Jeff Bolz	0de8b203f1 vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961)	1 year ago
Markus Tavenrath	89f9944981 Enable use to the rebar feature to upload buffers to the device. (#9251)	1 year ago
Johannes Gäßler	424c5d00a9 ggml/examples: add backend support for numerical optimization (ggml/949)	1 year ago
Georgi Gerganov	d6a04f872d ggml : hide ggml_object, ggml_cgraph, ggml_hash_set (#9408)	1 year ago
Markus Tavenrath	daa9623ab0 Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early. (#9118)	1 year ago
Salvatore Mesoraca	406c1a32a1 vulkan: add dryrun support to sin and cos ops (ggml/947)	1 year ago
Salvatore Mesoraca	9cb9260861 vulkan: correctly report support for OP_CONT (ggml/946)	1 year ago
Changyeon Kim	409dc4f8bb ggml : fix build break for the vulkan-debug (#9265)	1 year ago
Georgi Gerganov	231cff5f6f sync : ggml	1 year ago
Changyeon Kim	2f3c1466ff llava: Add ACC OP for GPU acceleration to the Vulkan backend in the LLAVA CLIP model. (#8984)	1 year ago
0cc4m	5fd89a70ea Vulkan Optimizations and Fixes (#8959)	1 year ago
Daniel Bevenius	06943a69f6 ggml : move rope type enum to ggml.h (#8949)	1 year ago
Markus Tavenrath	7c5bfd57f8 Optimize Vulkan backend for better CPU performance and less GPU synchronization overhead. (#8943)	1 year ago
Matt Stephenson	70c0ea3560 whisper : use vulkan as gpu backend when available (whisper/2302)	1 year ago
0cc4m	a3738b2fa7 vulkan : implement Stable Diffusion operators (ggml/904)	1 year ago
Tony Wasserka	203b7f1531 vulkan : initialize vk_buffer_struct members to VK_NULL_HANDLE (ggml/893)	1 year ago
slaren	2b1f616b20 ggml : reduce hash table reset cost (#8698)	1 year ago
0cc4m	751fcfc6c3 Vulkan IQ4_NL Support (#8613)	1 year ago
0cc4m	bda62d7999 Vulkan MMQ Fix (#8479)	1 year ago
Georgi Gerganov	f3f65429c4 llama : reorganize source code + improve CMake (#8006)	1 year ago

Commit History Find

Commit History