Historique des commits

Auteur SHA1 Message Date
  cebtenzzre 2046eb4345 make : remove unnecessary dependency on build-info.h (#3842) il y a 2 ans
  Georgi Gerganov d69d777c02 ggml : quantization refactoring (#3833) il y a 2 ans
  Georgi Gerganov 2f9ec7e271 cuda : improve text-generation and batched decoding performance (#3776) il y a 2 ans
  Georgi Gerganov e3932593d4 Revert "make : add optional CUDA_NATIVE_ARCH (#2482)" il y a 2 ans
  Alex 96981f37b1 make : add optional CUDA_NATIVE_ARCH (#2482) il y a 2 ans
  Georgi Gerganov 438c2ca830 server : parallel decoding and multimodal (#3677) il y a 2 ans
  Georgi Gerganov d1031cf49c sampling : refactor init to use llama_sampling_params (#3696) il y a 2 ans
  Georgi Gerganov 0e89203b51 speculative : add tree-based sampling example (#3624) il y a 2 ans
  M. Yusuf Sarıgöz 370359e5ba examples: support LLaVA v1.5 (multimodal model) (#3436) il y a 2 ans
  Kerfuffle 70c29da118 common : fix mirostat state when using multiple sequences (#3543) il y a 2 ans
  Georgi Gerganov 8c70a5ff25 batched : add bench tool (#3545) il y a 2 ans
  Zane Shannon 24ba3d829e examples : add batched.swift + improve CI for swift (#3562) il y a 2 ans
  Georgi Gerganov db3abcc114 sync : ggml (ggml-backend) (#3548) il y a 2 ans
  goerch ff5a3f0c09 Work on the BPE tokenizer (#3252) il y a 2 ans
  vvhg1 c97f01c362 infill : add new example + extend server API (#3296) il y a 2 ans
  Cebtenzzre bc39553c90 build : enable more non-default compiler warnings (#3200) il y a 2 ans
  xaedes 0e76a8992c train : finetune LORA (#2632) il y a 2 ans
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) il y a 2 ans
  Jag Chadha 527e57cfd8 build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342) il y a 2 ans
  Cebtenzzre 8781013ef6 make : restore build-info.h dependency for several targets (#3205) il y a 2 ans
  Johannes Gäßler 111163e246 CUDA: enable peer access between devices (#2470) il y a 2 ans
  Vlad 5dbc2b3213 Enable build with CUDA 11.0 (make) (#3132) il y a 2 ans
  Cebtenzzre e6616cf0db examples : add compiler version and target to build info (#2998) il y a 2 ans
  Cebtenzzre 3aefaab9e5 check C++ code with -Wmissing-declarations (#3184) il y a 2 ans
  Cebtenzzre 4b8560e72a make : fix clang++ detection, move some definitions to CPPFLAGS (#3155) il y a 2 ans
  goerch 71ca2fad7d whisper : tokenizer fix + re-enable tokenizer test for LLaMa (#3096) il y a 2 ans
  Johannes Gäßler 0a5eebb45d CUDA: mul_mat_q RDNA2 tunings (#2910) il y a 2 ans
  Przemysław Pawełczyk cb6c44c5e0 build : do not use _GNU_SOURCE gratuitously (#2035) il y a 2 ans
  Cebtenzzre 00d62adb79 fix some warnings from gcc and clang-tidy (#3038) il y a 2 ans
  Cebtenzzre 4fa2cc1750 make : improve test target (#3031) il y a 2 ans