Commit History

Autor SHA1 Mensaxe Data
  Johannes Gäßler 0bc2cdfc87 Better CUDA synchronization logic (#2057) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler befb3a3562 Test-based VRAM scratch size + context adjustment (#2056) %!s(int64=2) %!d(string=hai) anos
  Daniel Drake b213227067 cmake : don't force -mcpu=native on aarch64 (#2063) %!s(int64=2) %!d(string=hai) anos
  Aaron Miller 2f8cd979ec metal : release buffers when freeing metal context (#2062) %!s(int64=2) %!d(string=hai) anos
  Judd 471aab6e4c convert : add support of baichuan-7b (#2055) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 463f2f4c4f llama : fix return value of llama_load_session_file_internal (#2022) %!s(int64=2) %!d(string=hai) anos
  Rand Xie cb44dbc7de llama : catch llama_load_session_file_internal exceptions (#2022) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 79f634a19d embd-input : fix returning ptr to temporary %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 04606a1599 train : fix compile warning %!s(int64=2) %!d(string=hai) anos
  Qingyou Meng b1ca8f36a9 ggml : disable GGML_TASK_INIT and GGML_TASK_FINALIZE by default (#1995) %!s(int64=2) %!d(string=hai) anos
  Howard Su b8c8dda75f Use unsigned for random seed (#2006) %!s(int64=2) %!d(string=hai) anos
  LostRuins 96a712ca1b Porting the improved K-Quant CUDA kernels to OpenCL (#1966) %!s(int64=2) %!d(string=hai) anos
  m3ndax d3494bb86b llama : replacing auto &kv with const auto &kv (#2041) %!s(int64=2) %!d(string=hai) anos
  Salvador E. Tropea 5b351e94d0 cuda : remove nchannels_x argument from mul_mat_vec_nc_f16_f32 (#2028) %!s(int64=2) %!d(string=hai) anos
  Salvador E. Tropea 6432aabb6d cuda : fix missing const qualifier in casts (#2027) %!s(int64=2) %!d(string=hai) anos
  Howard Su b922bc351b llama : remove shards weight file support (#2000) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 7f9753fa12 CUDA GPU acceleration for LoRAs + f16 models (#1970) %!s(int64=2) %!d(string=hai) anos
  ningshanwutuobang cfa0750bc9 llama : support input embeddings directly (#1910) %!s(int64=2) %!d(string=hai) anos
  Erik Scholz 9d23589d63 fix pthreads setaffinity usage on android (#2020) %!s(int64=2) %!d(string=hai) anos
  Howard Su 0be54f75a6 baby-llama : fix build after ggml_rope change (#2016) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 181e8d9755 llama : fix rope usage after ChatGLM change %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov d9779021bd ggml : add support for ChatGLM RoPE %!s(int64=2) %!d(string=hai) anos
  Roman Parykin d38e451578 readme : add Scala 3 bindings repo (#2010) %!s(int64=2) %!d(string=hai) anos
  David Yang eaa6ca5a61 ggml : increase max tensor name + clean up compiler warnings in train-text (#1988) %!s(int64=2) %!d(string=hai) anos
  Gustavo Rocha Dias aa777abbb7 readme : LD_LIBRARY_PATH complement for some Android devices when building with CLBlast inside Termux (#2007) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov c824d2e368 ggml : avoid conv 2d kernel round up %!s(int64=2) %!d(string=hai) anos
  zrm b853d45601 ggml : add NUMA support (#1556) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 9225baef71 k-quants : fix indentation %!s(int64=2) %!d(string=hai) anos
  katsu560 a84ab1da8d tests : fix quantize perf (#1990) %!s(int64=2) %!d(string=hai) anos
  katsu560 5743ca8092 k-quants : add AVX support to dot functions (#1916) %!s(int64=2) %!d(string=hai) anos