cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Daniel Bevenius	37f10f955f make : remove make in favor of CMake (#15449)	há 5 meses atrás
xctan	f470bc36be ggml-cpu : split arch-specific implementations (#13892)	há 7 meses atrás
Georgi Gerganov	4773d7a02f examples : remove infill (#13283)	há 8 meses atrás
Xuan-Son Nguyen	9b61acf060 mtmd : rename llava directory to mtmd (#13311)	há 8 meses atrás
Diego Devesa	1d36b3670b llama : move end-user examples to tools directory (#13249)	há 8 meses atrás
David Huang	84778e9770 CUDA/HIP: Share the same unified memory allocation logic. (#12934)	há 9 meses atrás
R0CKSTAR	251364549f musa: support new arch mp_31 and update doc (#12296)	há 10 meses atrás
Johannes Gäßler	a28e0d5eb1 CUDA: app option to compile without FlashAttention (#12025)	há 10 meses atrás
Bodhi	0b3863ff95 MUSA: support ARM64 and enable dp4a .etc (#11843)	há 11 meses atrás
Olivier Chafik	63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)	há 11 meses atrás
Georgi Gerganov	68ff663a04 repo : update links to new url (#11886)	há 11 meses atrás
Johannes Gäßler	864a0b67a6 CUDA: use mma PTX instructions for FlashAttention (#11583)	há 11 meses atrás
Olivier Chafik	8b576b6c55 Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639)	há 11 meses atrás
Olivier Chafik	6171c9d258 Add Jinja template support (#11016)	há 1 ano atrás
HimariO	ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361)	há 1 ano atrás
Djip007	19d8762ab6 ggml : refactor online repacking (#10446)	há 1 ano atrás
Xuan Son Nguyen	91c36c269b server : (web ui) Various improvements, now use vite as bundler (#10599)	há 1 ano atrás
Georgi Gerganov	8648c52101 make : deprecate (#10514)	há 1 ano atrás
Wang Qin	43957ef203 build: update Makefile comments for C++ version change (#10598)	há 1 ano atrás
Diego Devesa	7cc2d2c889 ggml : move AMX to the CPU backend (#10570)	há 1 ano atrás
Tristan Druyen	be0e350c8b Fix HIP flag inconsistency & build docs (#10524)	há 1 ano atrás
R0CKSTAR	249cd93da3 mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (#10516)	há 1 ano atrás
Eric Curtin	0cc63754b8 Introduce llama-run (#10291)	há 1 ano atrás
Diego Devesa	5931c1f233 ggml : add support for dynamic loading of backends (#10469)	há 1 ano atrás
Georgi Gerganov	d9d54e498d speculative : refactor and add a simpler example (#10362)	há 1 ano atrás
Anthony Van de Gejuchte	3952a221af Fix missing file renames in Makefile due to changes in commit ae8de6d50a (#10413)	há 1 ano atrás
Georgi Gerganov	cf32a9b93a metal : refactor kernel args into structs (#10238)	há 1 ano atrás
Johannes Gäßler	c3ea58aca4 CUDA: remove DMMV, consolidate F16 mult mat vec (#10318)	há 1 ano atrás
Georgi Gerganov	a4200cafad make : add ggml-opt (#0)	há 1 ano atrás
Georgi Gerganov	84274a10c3 tests : remove test-grad0	há 1 ano atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits