Commit History

Autor SHA1 Mensaxe Data
  slaren 315a95a4d3 Add LoRA support (#820) %!s(int64=2) %!d(string=hai) anos
  Arik Poznanski efd05648c8 llama : well-defined static initialization of complex objects (#927) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov eb17a026fd quantize-stats : fix bug in --type argument %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 69b740289f ggml : avoid using ggml_fp16_to_fp32() and ggml_fp32_to_fp16() in ggml.c %!s(int64=2) %!d(string=hai) anos
  Ivan Komarov f266259ad9 Speedup the AVX-512 implementation of ggml_vec_dot_q4_0() (#933) %!s(int64=2) %!d(string=hai) anos
  slaren 47f61aaa5f Fix: do not close file on mmap (#1017) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 3173a62eb9 stdout : vertical align outputs for better readibility %!s(int64=2) %!d(string=hai) anos
  Pavol Rusnak 489537e6cf examples: add missing <ctime> include for time() (#1011) %!s(int64=2) %!d(string=hai) anos
  nanahi 2d3481c721 Fix msys2 build error and warnings (#1009) %!s(int64=2) %!d(string=hai) anos
  comex 74f5899df4 convert.py: Fix loading safetensors and ggml format on Windows (#991) %!s(int64=2) %!d(string=hai) anos
  Stephan Walter 2f7c8e014e Fix potential int8 overflow in non-SIMD vec_dot (#986) %!s(int64=2) %!d(string=hai) anos
  Stephan Walter 0ad964631f Refactor ggml.c for future tensor types (#1001) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov e95b6554b4 ggml : add Q8_0 quantization for intermediate results (#951) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov aa485cee33 ggml : use posix_memalign on non-Windows env %!s(int64=2) %!d(string=hai) anos
  Ivan Komarov c12b14b77f benchmark : fix result validation in benchmark-q4_0-matmult (#987) %!s(int64=2) %!d(string=hai) anos
  katsu560 106faaf297 cmake : add finding the OpenBLAS header file (#992) %!s(int64=2) %!d(string=hai) anos
  Pavol Rusnak c85e03d12e Revert "main : alternative instruct mode (Vicuna support, etc.) (#863)" (#982) %!s(int64=2) %!d(string=hai) anos
  Pavol Rusnak 489093548c py : bump sentencepiece to 0.1.98 to support Python 3.11 (#976) %!s(int64=2) %!d(string=hai) anos
  Stephan Walter 93265e988a make : fix dependencies, use auto variables (#983) %!s(int64=2) %!d(string=hai) anos
  Pavol Rusnak c56b715269 Expose type name from ggml (#970) %!s(int64=2) %!d(string=hai) anos
  Tomáš Pazdiora f4d277ae17 main : alternative instruct mode (Vicuna support, etc.) (#863) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle c9a59b70a5 ggml : add unary and binary map operations (#874) %!s(int64=2) %!d(string=hai) anos
  Pavol Rusnak a32f7acc9f py : cleanup dependencies (#962) %!s(int64=2) %!d(string=hai) anos
  Pavol Rusnak 43ffdefb74 py : fix flake8 and isort nitpicks (#960) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 1623a6e9b4 ggml : minor %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov c14e0d2f23 ggml : always allocate buffers with size multiple of GGML_MEM_ALIGN %!s(int64=2) %!d(string=hai) anos
  comex 723dac55fa py : new conversion script (#545) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 0f07cacb05 ggml : fix q4_1 dot product types %!s(int64=2) %!d(string=hai) anos
  Howard Su c5d70f5c9e ggml : optimize rope function to avoid call powf in the tight loop (#807) %!s(int64=2) %!d(string=hai) anos
  Gary Linscott be87b6ed20 perplexity : add support for batch size to `--perplexity` (#407) %!s(int64=2) %!d(string=hai) anos