Commit History

Author SHA1 Message Date
  Georgi Gerganov e0085fdf7c Revert "server : change deps.sh xxd files to string literals (#5221)" 2 years ago
  Georgi Gerganov e6f291d158 server : fix context shift (#5195) 2 years ago
  JohnnyB 4003be0e5f server : change deps.sh xxd files to string literals (#5221) 2 years ago
  Kawrakow fea4fd4ba7 ggml : fix IQ3_XXS on Metal (#5219) 2 years ago
  Georgi Gerganov 8f8ddfcfad sync : ggml (#0) 2 years ago
  Georgi Gerganov 6fb50ebbf0 gguf : fix comparison (ggml/715) 2 years ago
  John Balis 625a699b54 `ggml_cuda_cpy` support for 4d tensors and float16->float32 upcasting (ggml/686) 2 years ago
  Georgi Gerganov a4b07c057a gguf : add input validation, prevent integer overflows (ggml/709) 2 years ago
  Georgi Gerganov 549a1e6cd5 ci : fix yolo URLs + fix metal capture (ggml/712) 2 years ago
  Jack Mousseau 5f14ee0b0c metal : add debug capture backend function (ggml/694) 2 years ago
  Kawrakow 8e14e3ddb3 Faster AVX2 dot product for IQ2_XS (#5187) 2 years ago
  Kawrakow f4d7e54974 SOTA 3-bit quants (#5196) 2 years ago
  0cc4m 2256f36b79 Vulkan Windows APU Memory Handling (#5199) 2 years ago
  Vladimir Malyutin 7359016c7c quantize : fix typo (#5211) 2 years ago
  divinity76 813416991a main : allow empty --prompt-cache file (#5176) 2 years ago
  Romain Neutron 5589921ef8 readme : minor (#5204) 2 years ago
  Georgi Gerganov 49f44b5c55 readme : update hot topics 2 years ago
  Wu Jian Ping 6685cc41c2 server : improve README (#5209) 2 years ago
  Paul Tsochantaris ceebbb5b21 ggml alloc: Fix for null dereference on alloc failure (#5200) 2 years ago
  Jared Van Bortel 6daa69ee81 kompute : fix fallback to CPU (#5201) 2 years ago
  Jared Van Bortel fbf1ddec69 Nomic Vulkan backend (#4456) 2 years ago
  divinity76 2aed77eb06 fix typo "RLIMIT_MLOCK" (#5175) 2 years ago
  Wu Jian Ping c82d18e863 server : embeddings compatibility for OpenAI (#5190) 2 years ago
  Georgi Gerganov 14fef85e2d py : fix except (#5194) 2 years ago
  Sang-Kil Park e76627bcce py : improve BPE tokenizer support (#5189) 2 years ago
  slaren fbe7dfa53c ggml : add max buffer sizes to opencl and metal backends (#5181) 2 years ago
  Eve 172ac82629 cmake : fix Vulkan build (#5182) 2 years ago
  Paul Tsochantaris d2f650cb5b metal : free metal objects (#5161) 2 years ago
  Georgi Gerganov 35dec26cc2 sync : ggml 2 years ago
  Georgi Gerganov d460510c72 ggml : minor type fix (int64_t -> size_t) 2 years ago