Commit History

Author SHA1 Message Date
  Daniel Bevenius 37f10f955f make : remove make in favor of CMake (#15449) 5 months ago
  xctan f470bc36be ggml-cpu : split arch-specific implementations (#13892) 7 months ago
  Georgi Gerganov 4773d7a02f examples : remove infill (#13283) 8 months ago
  Xuan-Son Nguyen 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 months ago
  Diego Devesa 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago
  David Huang 84778e9770 CUDA/HIP: Share the same unified memory allocation logic. (#12934) 9 months ago
  R0CKSTAR 251364549f musa: support new arch mp_31 and update doc (#12296) 10 months ago
  Johannes Gäßler a28e0d5eb1 CUDA: app option to compile without FlashAttention (#12025) 10 months ago
  Bodhi 0b3863ff95 MUSA: support ARM64 and enable dp4a .etc (#11843) 11 months ago
  Olivier Chafik 63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900) 11 months ago
  Georgi Gerganov 68ff663a04 repo : update links to new url (#11886) 11 months ago
  Johannes Gäßler 864a0b67a6 CUDA: use mma PTX instructions for FlashAttention (#11583) 11 months ago
  Olivier Chafik 8b576b6c55 Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639) 11 months ago
  Olivier Chafik 6171c9d258 Add Jinja template support (#11016) 1 year ago
  HimariO ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361) 1 year ago
  Djip007 19d8762ab6 ggml : refactor online repacking (#10446) 1 year ago
  Xuan Son Nguyen 91c36c269b server : (web ui) Various improvements, now use vite as bundler (#10599) 1 year ago
  Georgi Gerganov 8648c52101 make : deprecate (#10514) 1 year ago
  Wang Qin 43957ef203 build: update Makefile comments for C++ version change (#10598) 1 year ago
  Diego Devesa 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) 1 year ago
  Tristan Druyen be0e350c8b Fix HIP flag inconsistency & build docs (#10524) 1 year ago
  R0CKSTAR 249cd93da3 mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (#10516) 1 year ago
  Eric Curtin 0cc63754b8 Introduce llama-run (#10291) 1 year ago
  Diego Devesa 5931c1f233 ggml : add support for dynamic loading of backends (#10469) 1 year ago
  Georgi Gerganov d9d54e498d speculative : refactor and add a simpler example (#10362) 1 year ago
  Anthony Van de Gejuchte 3952a221af Fix missing file renames in Makefile due to changes in commit ae8de6d50a (#10413) 1 year ago
  Georgi Gerganov cf32a9b93a metal : refactor kernel args into structs (#10238) 1 year ago
  Johannes Gäßler c3ea58aca4 CUDA: remove DMMV, consolidate F16 mult mat vec (#10318) 1 year ago
  Georgi Gerganov a4200cafad make : add ggml-opt (#0) 1 year ago
  Georgi Gerganov 84274a10c3 tests : remove test-grad0 1 year ago