Historique des commits

Auteur SHA1 Message Date
  Georgi Gerganov 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) il y a 2 ans
  slaren 097e121e2f llama : add benchmark example (#2626) il y a 2 ans
  drbh 7cf54e1f74 tests : adds simple llama grammar tests (#2618) il y a 2 ans
  Shouzheng Liu bf83bff674 metal : matrix-matrix multiplication kernel (#2615) il y a 2 ans
  drbh ee77efea2a test : add simple grammar parsing tests (#2594) il y a 2 ans
  byte-6174 b19edd54d5 Adding support for llama2.c models (#2559) il y a 2 ans
  Johannes Gäßler 25d43e0eb5 CUDA: tuned mul_mat_q kernels (#2546) il y a 2 ans
  Martin Krasser f5bfea0580 Allow passing grammar to completion endpoint (#2532) il y a 2 ans
  GiviMAD 34a14b28ff [Makefile] Move ARM CFLAGS before compilation (#2536) il y a 2 ans
  DannyDaemonic 3498588e0f Add --simple-io option for subprocesses and break out console.h and cpp (#1558) il y a 2 ans
  Eve 81844fbcfd tests : Fix compilation warnings (Linux/GCC) (#2451) il y a 2 ans
  Johannes Gäßler 49e7cb5bb1 CUDA: fixed LLAMA_FAST compilation option (#2473) il y a 2 ans
  Johannes Gäßler 0728c5a8b9 CUDA: mmq CLI option, fixed mmq build issues (#2453) il y a 2 ans
  slaren a113689571 ggml : add graph tensor allocator (#2411) il y a 2 ans
  Johannes Gäßler 11f3ca06b8 CUDA: Quantized matrix matrix multiplication (#2160) il y a 2 ans
  Cebtenzzre 6df1f5940f make : build with -Wmissing-prototypes (#2394) il y a 2 ans
  Aarni Koskela b3f138d058 Chat UI extras (#2366) il y a 2 ans
  Evan Jones 84e09a7d8b llama : add grammar-based sampling (#1773) il y a 2 ans
  Jose Maldonado 91171b8072 make : fix CLBLAST compile support in FreeBSD (#2331) il y a 2 ans
  Jose Maldonado 73643f5fb1 gitignore : changes for Poetry users + chat examples (#2284) il y a 2 ans
  Georgi Gerganov a814d04f81 make : fix indentation il y a 2 ans
  Sky Yan 42c7c2e2e9 make : support customized LLAMA_CUDA_NVCC and LLAMA_CUDA_CCBIN (#2275) il y a 2 ans
  Jiří Podivín 54e3bc76fe make : add new target for test binaries (#2244) il y a 2 ans
  Przemysław Pawełczyk 9cf022a188 make : fix embdinput library and server examples building on MSYS2 (#2235) il y a 2 ans
  wzy 7dabc66f3c make : use pkg-config for OpenBLAS (#2222) il y a 2 ans
  James Reynolds 229aab351c make : fix combination of LLAMA_METAL and LLAMA_MPI (#2208) il y a 2 ans
  Evan Miller 5656d10599 mpi : add support for distributed inference via MPI (#2099) il y a 2 ans
  dylan 84525e7962 docker : add support for CUDA in docker (#1461) il y a 2 ans
  Johannes Gäßler 924dd22fd3 Quantized dot products for CUDA mul mat vec (#2067) il y a 2 ans
  Henri Vasserman acc111caf9 Allow old Make to build server. (#2098) il y a 2 ans