Commit History

Author SHA1 Message Date
  Johannes Gäßler e141ce624a Fix FlashAttention debug test, FP32 assert (#7684) 1 year ago
  Yazan Agha-Schrader 2e666832e6 server : new UI (#7633) 1 year ago
  HanishKVC 2ac95c9d56 SimpleChat: Simple histogram/repeatMatching driven garbageTrimming, Settings UI, Streaming mode, OpenAi Compat (Model, Authorization Bearer), Save/Restore session, Auto Settings UI (#7548) 1 year ago
  Johannes Gäßler 750f60c03e CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (#7681) 1 year ago
  Johannes Gäßler 9b596417af CUDA: quantized KV support for FA vec (#7527) 1 year ago
  Georgi Gerganov a323ec60af server : update js (#7670) 1 year ago
  Galunid 0515ad93f4 convert-hf : Handle NotImplementedError in convert-hf-to-gguf (#7660) 1 year ago
  Johannes Gäßler c8047d538f scripts: update compare_llama_bench.py [no ci] (#7673) 1 year ago
  Daniele 30e238b246 Improve HIP compatibility (#7672) 1 year ago
  Georgi Gerganov 16926dff92 readme : link homebrew discussion 1 year ago
  Georgi Gerganov 0c27e6f62e ggml : fix loongson compile warnings (#7537) 1 year ago
  Galunid 2e32f874e6 Somehow '**' got lost (#7663) 1 year ago
  Galunid 1af511fc22 Add convert.py removal to hot topics (#7662) 1 year ago
  Sertaç Özercan 0541f06296 [no ci] docs: add aikit to readme (#7650) 1 year ago
  JohnnyB 9022c33646 Fixed painfully slow single process builds. (#7326) 1 year ago
  Georgi Gerganov 5921b8f089 llama : cache llama_token_to_piece (#7587) 1 year ago
  Martin Delille 5dcdf94676 Fix conan badge display [no ci] (#7645) 1 year ago
  Manuel 2e2340de17 Add brew installation instruction to README [no ci] (#7616) 1 year ago
  Martin Delille 7846540bd2 readme : add Conan badge (#7638) 1 year ago
  Brian e6157f94c8 github: add contact links to issues and convert question into research [no ci] (#7612) 1 year ago
  Galunid 9c4c9cc83f Move convert.py to examples/convert-legacy-llama.py (#7430) 1 year ago
  Chris Elrod 59b0d07766 faster avx512 exp implementation (#7551) 1 year ago
  junchao-loongson d5c05821f3 ggml : fix loongarch build (O2 issue) (#7636) 1 year ago
  Johannes Gäßler 972b555ab9 README: explain parallel build [no ci] (#7618) 1 year ago
  Meng, Hengyu 3854c9d07f [SYCL] fix intel docker (#7630) 1 year ago
  Galunid eb57fee51f gguf-py : Add tokenizer.ggml.pre to gguf-new-metadata.py (#7627) 1 year ago
  Georgi Gerganov 55d62262a9 metal : remove invalid asserts (#7617) 1 year ago
  Georgi Gerganov 975ec63ff2 metal : add missing asserts (#7617) 1 year ago
  Georgi Gerganov fb76ec31a9 ggml : fix YARN + add tests + add asserts (#7617) 1 year ago
  Georgi Gerganov cce3dcffc5 cuda : non-cont concat support (#7610) 1 year ago