Commit History

作者 SHA1 備註 提交日期
  Justine Tunney db49ff8ed7 server : replace sleep with condition variables (#4673) 2 年之前
  SakuraUmi 60f55e888c server : fix OpenAI server sampling w.r.t. penalty. (#4675) 2 年之前
  Karthik Sethuraman b93edd22f5 server : allow to generate multimodal embeddings (#4681) 2 年之前
  andrijdavid 82d6eab224 main-cmake-pkg : fix build issue (#4665) 2 年之前
  Peter Sugihara afd997ab60 llama.swiftui : fix infinite loop, ouput timings, buff UI (#4674) 2 年之前
  Georgi Gerganov c8255f8a6b scripts : print list of sync commits 2 年之前
  Tamotsu Takahashi 441f51dca0 ci : build with CLBlast + ggml-opencl use GGML_API (whisper/1576) 2 年之前
  Georgi Gerganov 38b3de4658 sync : ggml 2 年之前
  bssrdf afc8c19291 ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669) 2 年之前
  Georgi Gerganov ca38b8d334 scripts : do not sync commits from this repo 2 年之前
  Justine Tunney 65e5f6dadb Fix OpenAI server sampling w.r.t. temp and seed (#4668) 2 年之前
  manikbhandari ea5497df5d gpt2 : Add gpt2 architecture integration (#4555) 2 年之前
  Nam D. Tran f6793491b5 llama : add AWQ for llama, llama2, mpt, and mistral models (#4593) 2 年之前
  Daniel Bevenius 879b690a9e finetune : fix output formatting in print_params (#4653) 2 年之前
  Georgi Gerganov b47879b0dd scripts : add sync-ggml-am.sh 2 年之前
  Georgi Gerganov 951010fa53 ggml : fix dot product for ARM (#4630) 2 年之前
  wonjun Jang f56d6077d0 Add byte token type when tokenizer.model is not exists (#4641) 2 年之前
  slaren dc68f0054c cuda : fix vmm pool with multi GPU (#4620) 2 年之前
  WillCorticesAI de8e496437 Update comment for AdamW implementation reference. (#4604) 2 年之前
  FantasyGmm 77465dad48 Fix new CUDA10 compilation errors (#4635) 2 年之前
  Paul Tsochantaris a206137f92 Adding Emeltal reference to UI list (#4629) 2 年之前
  slaren b9f47952ff simplify bug issue template (#4623) 2 年之前
  Shintarou Okada 753be377b6 llama : add PLaMo model (#3557) 2 年之前
  slaren 5bf3953d7e cuda : improve cuda pool efficiency using virtual memory (#4606) 2 年之前
  slaren 708e179e85 fallback to CPU buffer if host buffer alloc fails (#4610) 2 年之前
  Samuel Maynard 925e5584a0 ci(docker): fix tags in "Build and push docker image (tagged)" (#4603) 2 年之前
  Alexey Parfenov 6123979952 server : allow to specify custom prompt for penalty calculation (#3727) 2 年之前
  kalomaze b9ec82d262 grammar : check the full vocab only if necessary (opt) (#4306) 2 年之前
  Johannes Gäßler e0a4002273 CUDA: fixed row rounding for 0 tensor splits (#4594) 2 年之前
  LeonEricsson 7082d24cec lookup : add prompt lookup decoding example (#4484) 2 年之前