cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Daniel Bevenius	37f10f955f make : remove make in favor of CMake (#15449)	hai 5 meses
xctan	f470bc36be ggml-cpu : split arch-specific implementations (#13892)	hai 7 meses
Georgi Gerganov	4773d7a02f examples : remove infill (#13283)	hai 8 meses
Xuan-Son Nguyen	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	hai 8 meses
Diego Devesa	1d36b3670b llama : move end-user examples to tools directory (#13249)	hai 8 meses
David Huang	84778e9770 CUDA/HIP: Share the same unified memory allocation logic. (#12934)	hai 9 meses
R0CKSTAR	251364549f musa: support new arch mp_31 and update doc (#12296)	hai 10 meses
Johannes Gäßler	a28e0d5eb1 CUDA: app option to compile without FlashAttention (#12025)	hai 10 meses
Bodhi	0b3863ff95 MUSA: support ARM64 and enable dp4a .etc (#11843)	hai 11 meses
Olivier Chafik	63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)	hai 11 meses
Georgi Gerganov	68ff663a04 repo : update links to new url (#11886)	hai 11 meses
Johannes Gäßler	864a0b67a6 CUDA: use mma PTX instructions for FlashAttention (#11583)	hai 11 meses
Olivier Chafik	8b576b6c55 Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639)	hai 11 meses
Olivier Chafik	6171c9d258 Add Jinja template support (#11016)	hai 1 ano
HimariO	ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361)	hai 1 ano
Djip007	19d8762ab6 ggml : refactor online repacking (#10446)	hai 1 ano
Xuan Son Nguyen	91c36c269b server : (web ui) Various improvements, now use vite as bundler (#10599)	hai 1 ano
Georgi Gerganov	8648c52101 make : deprecate (#10514)	hai 1 ano
Wang Qin	43957ef203 build: update Makefile comments for C++ version change (#10598)	hai 1 ano
Diego Devesa	7cc2d2c889 ggml : move AMX to the CPU backend (#10570)	hai 1 ano
Tristan Druyen	be0e350c8b Fix HIP flag inconsistency & build docs (#10524)	hai 1 ano
R0CKSTAR	249cd93da3 mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (#10516)	hai 1 ano
Eric Curtin	0cc63754b8 Introduce llama-run (#10291)	hai 1 ano
Diego Devesa	5931c1f233 ggml : add support for dynamic loading of backends (#10469)	hai 1 ano
Georgi Gerganov	d9d54e498d speculative : refactor and add a simpler example (#10362)	hai 1 ano
Anthony Van de Gejuchte	3952a221af Fix missing file renames in Makefile due to changes in commit ae8de6d50a (#10413)	hai 1 ano
Georgi Gerganov	cf32a9b93a metal : refactor kernel args into structs (#10238)	hai 1 ano
Johannes Gäßler	c3ea58aca4 CUDA: remove DMMV, consolidate F16 mult mat vec (#10318)	hai 1 ano
Georgi Gerganov	a4200cafad make : add ggml-opt (#0)	hai 1 ano
Georgi Gerganov	84274a10c3 tests : remove test-grad0	hai 1 ano

Posterior Anterior

Commit History Buscar

Commit History