Commit History

Autor SHA1 Mensaxe Data
  Cebtenzzre a5661d7e71 llama : allow gguf RoPE keys to be overridden with defaults (#3240) %!s(int64=2) %!d(string=hai) anos
  goerch b08e75baea Fixing the last deviations from sentencepiece indicated by test-tokenizer-1 (#3170) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre 3aefaab9e5 check C++ code with -Wmissing-declarations (#3184) %!s(int64=2) %!d(string=hai) anos
  Roland 2d770505a8 llama : remove mtest (#3177) %!s(int64=2) %!d(string=hai) anos
  FK 84e723653c speculative: add --n-gpu-layers-draft option (#3063) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre 00d62adb79 fix some warnings from gcc and clang-tidy (#3038) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov c4f496648c metal : fix kernel_norm (fixes Falcon on Metal) (#3057) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre de2fe892af examples : replace fprintf to stdout with printf (#3017) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 921772104b speculative : add grammar support (#2991) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov e36ecdccc8 build : on Mac OS enable Metal by default (#2901) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 47068e5170 speculative : PoC for speeding-up inference via speculative sampling (#2926) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 8f429fa511 perplexity : fix ETA by warming up the model with an empty run %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre ef15649972 build : fix most gcc and clang warnings (#2861) %!s(int64=2) %!d(string=hai) anos
  staviq 8341a25957 main : log file (#2748) %!s(int64=2) %!d(string=hai) anos
  xaedes 44c117f41e train : mem usage and other improvements (#2439) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 6b73ef1201 YAML result logging + preset script (#2657) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov edd4c14817 llama : more tokenizer fixes (#2810) %!s(int64=2) %!d(string=hai) anos
  Henri Vasserman 6bbc598a63 ROCm Port (#1087) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov cf658adc83 llm : add Falcon support (#2717) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 62959e740e Strided perplexity (#2714) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler c63bb1d16a CUDA: use mul_mat_q kernels by default (#2683) %!s(int64=2) %!d(string=hai) anos
  slaren 1123f7fbdf ggml-cuda : use graph allocator (#2684) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) %!s(int64=2) %!d(string=hai) anos