Commit History

Autor SHA1 Mensaxe Data
  Georgi Gerganov 611363ac79 scripts : add pipefail %!s(int64=2) %!d(string=hai) anos
  Marcus Dunn 95b6e5212f added `struct` to llama_dump_timing_info_yaml's `llama_context` (#2857) %!s(int64=2) %!d(string=hai) anos
  xaedes 44c117f41e train : mem usage and other improvements (#2439) %!s(int64=2) %!d(string=hai) anos
  slaren 43033b7bb4 llama-bench : set locale to utf8 (#2832) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 6b73ef1201 YAML result logging + preset script (#2657) %!s(int64=2) %!d(string=hai) anos
  alonfaraj 75fafcbccc make : fix tests build (#2855) %!s(int64=2) %!d(string=hai) anos
  grahameth be475f60af llama.cpp : fix wrong vsnprintf call in MS compiler (#2856) %!s(int64=2) %!d(string=hai) anos
  Ronny Brendel 3af6b86301 ggml : tiny ggml_vec_dot_q4_K_q8_K AVX2 improvement (#2819) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 35feac6560 ggml : sync (mem align to header + conv_transpose_2d fixes + ggml_alloc) (#2852) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 92b1bbd2ec CUDA: fix RoPE asserts, block sizes (#2833) %!s(int64=2) %!d(string=hai) anos
  igarnier dd0dc366da llama.h : add missing struct keyword for C compat in callback type (#2847) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov f55538c3cc metal : fix memory leak (#2762) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre ebcee207b6 quantize : make output filename optional again (#2823) %!s(int64=2) %!d(string=hai) anos
  JohnnyB 3e8ff47af6 devops : added systemd units and set versioning to use date. (#2835) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 103cfafc77 gguf : fix strings to not be null-terminated (#2839) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov c10704d01e llama : fix MPI threads (close #2827) %!s(int64=2) %!d(string=hai) anos
  Olivier Chafik 230d46c723 examples : update llama2.c converter to read vocab and write models in GGUF format (#2751) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 463173a6c0 llama : speedup tokenization (#2831) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov eaa13a48ff falcon : fix CUDA inference by making K and Q contiguous (#2830) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov da7455d046 readme : fix headings %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 25423e9185 scripts : helper convert script %!s(int64=2) %!d(string=hai) anos
  Kawrakow a6d1189fdd k_quants tuning for Falcon-7b (#2816) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov c48c5bb0b0 readme : update hot topics %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov d0cee0d36d gguf : add 64-bit support (GGUF v2) (#2821) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov edd4c14817 llama : more tokenizer fixes (#2810) %!s(int64=2) %!d(string=hai) anos
  Przemysław Pawełczyk 1591e2e590 ggml : detect SSSE3 (#2825) %!s(int64=2) %!d(string=hai) anos
  slaren 789c8c945a ci : add LoRA test to CI (#2650) %!s(int64=2) %!d(string=hai) anos
  Bruce MacDonald c1ac54b77a server : add `/detokenize` endpoint (#2802) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 730d9c681e convert.py : advanced option (#2753) %!s(int64=2) %!d(string=hai) anos
  Tim Miller c7d92e6dfe llama : use Unicode Escape Sequence to replace encoded characters (#2814) %!s(int64=2) %!d(string=hai) anos