Commit History

Author              SHA1        Message  Date
Daniel Bevenius     6443ddd985  llama : use reserve/emplace_back in sampler_sample (#9534)  1 year ago
Vinesh Janarthanan  8a308354f6  server : match OAI structured output response (#9527)  1 year ago
Eric Zhang          f799155ab8  server : fix OpenSSL build (remove obsolete `LOG_INFO`) (#9529)  1 year ago
Neo Zhang Jianyu    faf67b3de4  [SYCL]set context default value to avoid memory issue, update guide (#9476)  1 year ago
Michael Podvitskiy  7be099fa81  llama-bench: correct argument parsing error message (#9524)  1 year ago
Bert Wagner         8b836ae731  arg : add env variable for parallel (#9513)  1 year ago
Michael Podvitskiy  8344ef58f8  llama : fix n_vocab init for 'no_vocab' case (#9511)  1 year ago
Max Krasnyansky     0226613853  threadpool : skip polling for unused threads (#9461)  1 year ago
Yuri Khrustalev     503147a9f9  unicode : add <algorithm> (#9508)  1 year ago
Gabe Goodhart       0d2ec43833  llama : support IBM Granite architecture (#9412)  1 year ago
Michael Podvitskiy  37f3a3810e  llama : add llama_n_head() (#9512)  1 year ago
slaren              23e0d70bac  ggml : move common CPU backend impl to new header (#9509)  1 year ago
Daniel Bevenius     acb2c32c33  llama : rename n_embed to n_embd in rwkv6_time_mix (#9504)  1 year ago
Michael Podvitskiy  a6a3a5c531  ggml : link MATH_LIBRARY not by its full path (#9339)  1 year ago
compilade           d54c21df7e  convert : identify missing model files (#9397)  1 year ago
Georgi Gerganov     19514d632e  cmake : do not hide GGML options + rename option (#9465)  1 year ago
Eve                 5c3d0f1824  ggml : IQ4_NL sgemm + Q4_0 AVX optimization (#9422)  1 year ago
Shane A             0aadac10c7  llama : support OLMoE (#9462)  1 year ago
CarryFun            95ca85168b  llama : support MiniCPM3 (#9322)  1 year ago
Vinesh Janarthanan  441b72b91f  main : option to disable context shift (#9484)  1 year ago
Georgi Gerganov     c4965a64f7  metal : handle zero-sized allocs (#9466)  1 year ago
Georgi Gerganov     90a2fff0e7  flake.lock: Update (#9488)  1 year ago
Georgi Gerganov     6262d13e0b  common : reimplement logging (#9418)  1 year ago
slaren              e6deac31f7  gguf-split : add basic checks (#9499)  1 year ago
Michael Podvitskiy  6988da94a2  cmake : correct order of sycl flags (#9497)  1 year ago
Csaba Kecskemeti    3c7989fd29  py : add "LLaMAForCausalLM" conversion support (#9485)  1 year ago
OSecret             d6b37c881f  readme : update tools list (#9475)  1 year ago
Michael Podvitskiy  7596487beb  cmake : try to fix sycl+intel build (#9487)  1 year ago
Yuri Khrustalev     822b6322de  ggml : ggml_type_name return "NONE" for invalid values (#9458)  1 year ago
VoidIsVoid          dcdcee3a74  server: add data: [DONE] to /chat/completions stream response (#9459)  1 year ago