Commit History

Autor SHA1 Mensaxe Data
  Jared Van Bortel 70f806b821 build : detect host compiler and cuda compiler separately (#4414) %!s(int64=2) %!d(string=hai) anos
  slaren 799a1cb13b llama : add Mixtral support (#4406) %!s(int64=2) %!d(string=hai) anos
  Jared Van Bortel 6138963fb2 build : target Windows 8 for standard mingw-w64 (#4405) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov fe680e3d10 sync : ggml (new ops, tests, backend, etc.) (#4359) %!s(int64=2) %!d(string=hai) anos
  Jared Van Bortel 511f52c334 build : enable libstdc++ assertions for debug builds (#4275) %!s(int64=2) %!d(string=hai) anos
  WillCorticesAI d2809a3ba2 make : fix Apple clang determination bug (#4272) %!s(int64=2) %!d(string=hai) anos
  Jared Van Bortel 15f5d96037 build : fix build info generation and cleanup Makefile (#3920) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 922754a8d6 lookahead : add example for lookahead decoding (#4207) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 28a2e6e7d4 tokenize example: Respect normal add BOS token behavior (#4126) %!s(int64=2) %!d(string=hai) anos
  Roger Meier 8e9361089d build : support ppc64le build for make and CMake (#3963) %!s(int64=2) %!d(string=hai) anos
  Michael Potter 6bb4908a17 Fix MacOS Sonoma model quantization (#4052) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 413503d4b9 make : do not add linker flags when compiling static llava lib (#3977) %!s(int64=2) %!d(string=hai) anos
  Damian Stewart 381efbf480 llava : expose as a shared library for downstream projects (#3613) %!s(int64=2) %!d(string=hai) anos
  cebtenzzre b12fa0d1c1 build : link against build info instead of compiling against it (#3879) %!s(int64=2) %!d(string=hai) anos
  cebtenzzre 2046eb4345 make : remove unnecessary dependency on build-info.h (#3842) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov d69d777c02 ggml : quantization refactoring (#3833) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 2f9ec7e271 cuda : improve text-generation and batched decoding performance (#3776) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov e3932593d4 Revert "make : add optional CUDA_NATIVE_ARCH (#2482)" %!s(int64=2) %!d(string=hai) anos
  Alex 96981f37b1 make : add optional CUDA_NATIVE_ARCH (#2482) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 438c2ca830 server : parallel decoding and multimodal (#3677) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov d1031cf49c sampling : refactor init to use llama_sampling_params (#3696) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 0e89203b51 speculative : add tree-based sampling example (#3624) %!s(int64=2) %!d(string=hai) anos
  M. Yusuf Sarıgöz 370359e5ba examples: support LLaVA v1.5 (multimodal model) (#3436) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 70c29da118 common : fix mirostat state when using multiple sequences (#3543) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 8c70a5ff25 batched : add bench tool (#3545) %!s(int64=2) %!d(string=hai) anos
  Zane Shannon 24ba3d829e examples : add batched.swift + improve CI for swift (#3562) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov db3abcc114 sync : ggml (ggml-backend) (#3548) %!s(int64=2) %!d(string=hai) anos
  goerch ff5a3f0c09 Work on the BPE tokenizer (#3252) %!s(int64=2) %!d(string=hai) anos
  vvhg1 c97f01c362 infill : add new example + extend server API (#3296) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre bc39553c90 build : enable more non-default compiler warnings (#3200) %!s(int64=2) %!d(string=hai) anos