Commit history

Author SHA1 Message Date
  Aarni Koskela e4386f417f server : add a subtle loading animation to the edit box (#2466) 2 years ago
  Jiahao Li 35195689cd 2x faster (rms) norm cuda kernels (3.7% e2e improvement) (#2985) 2 years ago
  slaren cf9b08485c ggml-alloc : use virtual memory for measurement (#2973) 2 years ago
  Georgi Gerganov 47068e5170 speculative : PoC for speeding-up inference via speculative sampling (#2926) 2 years ago
  Georgi Gerganov 8f429fa511 perplexity : fix ETA by warming up the model with an empty run 2 years ago
  Kerfuffle 6519e9c99c gguf(python): Fix special vocab handling when id < 0 (#2984) 2 years ago
  Georgi Gerganov b7f2aa9e51 metal : restore 363f0bf and fix reduce in F16_F32 kernels (#2986) 2 years ago
  Alon 73a12a6344 cov : disable comment in PRs (#2989) 2 years ago
  opparco 3730134776 llama : fix bpe tokenize from byte (#2889) 2 years ago
  Georgi Gerganov d9151e6f57 metal : revert 6af0bab until we fix it 2 years ago
  Alon afc43d5f82 cov : add Code Coverage and codecov.io integration (#2928) 2 years ago
  Wentai Zhang 6460f758db opencl : fix a bug in ggml_cl_pool_malloc() for ggml_cl_mul_mat_f32() (#2955) 2 years ago
  Kawrakow ca82cf7bac metal : more optimizations (#2959) 2 years ago
  kchro3 6a31a3bd98 swift : add support for k-quants (#2983) 2 years ago
  Kerfuffle cff7b0bf07 convert.py : BPE fixes (#2938) 2 years ago
  Ido S 340af42f09 docs : add `catai` to `README.md` (#2967) 2 years ago
  momonga c42f0ec6b3 examples : fix gpt-neox (#2943) 2 years ago
  kchro3 2753415afd swift : add missing c file to Package.swift (#2978) 2 years ago
  Cebtenzzre bc054af97a make : support overriding CFLAGS/CXXFLAGS/CPPFLAGS/LDFLAGS (#2886) 2 years ago
  Kerfuffle 3358c381f6 logging: Fix creating empty file even when disabled (#2966) 2 years ago
  bandoti 52315a4216 readme : update clblast instructions (#2903) 2 years ago
  Karsten Weiss 8b56b4f2c3 metal : show all Metal device instances in the system (#2952) 2 years ago
  Jhen-Jie Hong 21f3d1be86 k-quants : fix build on armv7 (android only) (#2920) 2 years ago
  Jhen-Jie Hong 571083f508 server : avoid aniprompt in probabilities of final response (#2849) 2 years ago
  Engininja2 f04d002844 cuda : vsubss4 for older versions of ROCm/clang (#2942) 2 years ago
  ZHAOKAI WANG 69fdbb9abc readme : quick start command fix (#2908) 2 years ago
  Kerfuffle 5d6f19f16b Allow quantize to only copy tensors, some other improvements (#2931) 2 years ago
  Georgi Gerganov 0d58936686 llama2c : rename function 2 years ago
  Cebtenzzre 6c9c23429b make : use unaligned vector moves on MinGW (#2945) 2 years ago
  m3ndax ee8654bcd0 minor : add const qualifiers (#2853) 2 years ago