Commit History

Author SHA1 Message Date
  Clint Herron dd07a123b7 Name Migration: Build the deprecation-warning 'main' binary every time (#8404) 1 year ago
  AidanBeltonS f4444d992c [SYCL] Use multi_ptr to clean up deprecated warnings (#8256) 1 year ago
  Georgi Gerganov 6b2a849d1f ggml : move sgemm sources to llamafile subfolder (#8394) 1 year ago
  Dibakar Gope 0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780) 1 year ago
  M. Yusuf Sarıgöz 83321c6958 gguf-py rel pipeline (#8410) 1 year ago
  Borislav Stanimirov cc61948b1f llama : C++20 compatibility for u8 strings (#8408) 1 year ago
  Borislav Stanimirov 7a80710d93 msvc : silence codecvt c++17 deprecation warnings (#8395) 1 year ago
  fairydreaming a8be1e6f59 llama : add assert about missing llama_encode() call (#8400) 1 year ago
  RunningLeon e4dd31ff89 py : fix converter for internlm2 (#8321) 1 year ago
  laik 8f0fad42b9 py : fix extra space in convert_hf_to_gguf.py (#8407) 1 year ago
  Clint Herron a59f8fdc85 Server: Enable setting default sampling parameters via command-line (#8402) 1 year ago
  Andy Salerno fd560fe680 Update README.md to fix broken link to docs (#8399) 1 year ago
  Clint Herron e500d6135a Deprecation warning to assist with migration to new binary names (#8283) 1 year ago
  Johannes Gäßler a03e8dd99d make/cmake: LLAMA_NO_CCACHE -> GGML_NO_CCACHE (#8392) 1 year ago
  Alberto Cabrera Pérez 5b0b8d8cfb sycl : Reenabled mmvq path for the SYCL Nvidia Backend (#8372) 1 year ago
  Borislav Stanimirov 9925ca4087 cmake : allow external ggml (#8370) 1 year ago
  daghanerdonmez 9beb2dda03 readme : fix typo [no ci] (#8389) 1 year ago
  compilade 7d0e23d72e gguf-py : do not use internal numpy types (#7472) 1 year ago
  Georgi Gerganov 7fdb6f73e3 flake.lock: Update (#8342) 1 year ago
  Alberto Cabrera Pérez a130eccef4 labeler : updated sycl to match docs and code refactor (#8373) 1 year ago
  b4b4o c4dd11d1d3 readme : fix web link error [no ci] (#8347) 1 year ago
  Alberto Cabrera Pérez 2ec846d558 sycl : fix powf call in device code (#8368) 1 year ago
  Georgi Gerganov 3f2d538b81 scripts : fix sync for sycl 1 year ago
  Georgi Gerganov 2ee44c9a18 sync : ggml 1 year ago
  Georgi Gerganov 6847d54c4f tests : fix whitespace (#0) 1 year ago
  John Balis fde13b3bb9 feat: cuda implementation for `ggml_conv_transpose_1d` (ggml/854) 1 year ago
  Kevin Wang 470939d483 common : preallocate sampling token data vector (#8363) 1 year ago
  Georgi Gerganov 6f0dbf6ab0 infill : assert prefix/suffix tokens + remove old space logic (#8351) 1 year ago
  Kevin Wang ffd00797d8 common : avoid unnecessary logits fetch (#8358) 1 year ago
  toyer 04ce3a8b19 readme : add supported glm models (#8360) 1 year ago