Commit History

Author SHA1 Message Date
  Cebtenzzre ecf90b1a51 gguf : make token scores and types optional (#3347) 2 years ago
  Georgi Gerganov 2619109ad5 ci : disable freeBSD builds due to lack of VMs (#3381) 2 years ago
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) 2 years ago
  Kevin Ji 45855b3f1c docs : mark code as Bash (#3375) 2 years ago
  Pierre Alexandre SCHEMBRI 4aea3b846e readme : add Mistral AI release 0.1 (#3362) 2 years ago
  slaren da0400344b ggml-cuda : perform cublas fp16 matrix multiplication as fp16 (#3370) 2 years ago
  Zhang Peiyuan e519621010 convert : remove bug in convert.py permute function (#3364) 2 years ago
  Richard Roberson ac43576124 make-ggml.py : compatibility with more models and GGUF (#3290) 2 years ago
  Cebtenzzre 20c7e1e804 gguf : fix a few general keys (#3341) 2 years ago
  Rickard Hallerbäck dc6897404e metal : reusing llama.cpp logging (#3152) 2 years ago
  Jag Chadha 527e57cfd8 build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342) 2 years ago
  BarfingLemurs ffe88a36a9 readme : add some recent perplexity and bpw measurements to READMES, link for k-quants (#3340) 2 years ago
  DAN™ 99115f3fa6 cmake : fix build-info.h on MSVC (#3309) 2 years ago
  2f38b454 1726f9626f docs: Fix typo CLBlast_DIR var. (#3330) 2 years ago
  Erik Scholz a98b1633d5 nix : add cuda, use a symlinked toolkit for cmake (#3202) 2 years ago
  slaren c091cdfb24 llama-bench : add README (#3317) 2 years ago
  Cebtenzzre 51a7cf5c6e examples : fix RoPE defaults to match PR #3240 (#3315) 2 years ago
  Kevin Ji bedb92b603 scripts : use `/usr/bin/env` in shebang (#3313) 2 years ago
  Lee Drake bc9d3e3971 Update README.md (#3289) 2 years ago
  shibe2 36b904e200 ggml-opencl.cpp: Make private functions static (#3300) 2 years ago
  Edward Taylor 324f3403d5 zig : fix for updated c lib (#3259) 2 years ago
  yuiseki f56c418ab0 embedding : update README.md (#3224) 2 years ago
  Johannes Gäßler 8185710a80 CUDA: use only 1 thread if fully offloaded (#2915) 2 years ago
  Georgi Gerganov 7eb41179ed readme : update hot topics 2 years ago
  Cebtenzzre a5661d7e71 llama : allow gguf RoPE keys to be overridden with defaults (#3240) 2 years ago
  Cebtenzzre 65c2c1c5ab benchmark-matmult : do not use integer abs() on a float (#3277) 2 years ago
  kang 80834daecf flake : Restore default package's buildInputs (#3262) 2 years ago
  Alon a40f2b656f CI: FreeBSD fix (#3258) 2 years ago
  Georgi Gerganov d119c04c15 examples : fix benchmark-matmult (#1554) 2 years ago
  Cebtenzzre 8781013ef6 make : restore build-info.h dependency for several targets (#3205) 2 years ago