Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Georgi Gerganov | ee1a0ec9cb | llama : add option for greedy sampling with probs (#3813) | 2 years ago |
| Henk Poley | 177461104b | common : print that one line of the syntax help *also* to standard output (#3823) | 2 years ago |
| Marcus Dunn | 5be6c803fa | llama : remove token functions with `context` args in favor of `model` (#3720) | 2 years ago |
| vvhg1 | d3956aea53 | main : escape prompt for cfg_negative_prompt and consecutive inputs in main with interactive (#3623) | 2 years ago |
| Georgi Gerganov | d1031cf49c | sampling : refactor init to use llama_sampling_params (#3696) | 2 years ago |
| Georgi Gerganov | 0e89203b51 | speculative : add tree-based sampling example (#3624) | 2 years ago |
| staviq | 1a159553f9 | tokenizer : special token handling (#3538) | 2 years ago |
| M. Yusuf Sarıgöz | 370359e5ba | examples: support LLaVA v1.5 (multimodal model) (#3436) | 2 years ago |
| Kerfuffle | 70c29da118 | common : fix mirostat state when using multiple sequences (#3543) | 2 years ago |
| Kerfuffle | a16e89cec8 | Fix trying to strip newline from empty prompt and cfg prompt file content (#3534) | 2 years ago |
| pudepiedj | a8777ad84e | parallel : add option to load external prompt file (#3416) | 2 years ago |
| Jhen-Jie Hong | 97af49fa39 | server : reuse llama_sample_token common util (#3494) | 2 years ago |
| Kenvix ⭐ | 45eba9369f | build : use std::make_tuple() for compatibility with older GCC versions (#3488) | 2 years ago |
| staviq | acec9eaaa9 | common : process escape sequences in reverse prompts (#3461) | 2 years ago |
| goerch | ff5a3f0c09 | Work on the BPE tokenizer (#3252) | 2 years ago |
| vvhg1 | c97f01c362 | infill : add new example + extend server API (#3296) | 2 years ago |
| Cebtenzzre | bc39553c90 | build : enable more non-default compiler warnings (#3200) | 2 years ago |
| slaren | 16bc66d947 | llama.cpp : split llama_context_params into model and context params (#3301) | 2 years ago |
| xaedes | 0e76a8992c | train : finetune LORA (#2632) | 2 years ago |
| Georgi Gerganov | ec893798b7 | llama : custom attention mask + parallel decoding + no context swaps (#3228) | 2 years ago |
| Cebtenzzre | a5661d7e71 | llama : allow gguf RoPE keys to be overridden with defaults (#3240) | 2 years ago |
| goerch | b08e75baea | Fixing the last deviations from sentencepiece indicated by test-tokenizer-1 (#3170) | 2 years ago |
| Cebtenzzre | 3aefaab9e5 | check C++ code with -Wmissing-declarations (#3184) | 2 years ago |
| Roland | 2d770505a8 | llama : remove mtest (#3177) | 2 years ago |
| FK | 84e723653c | speculative: add --n-gpu-layers-draft option (#3063) | 2 years ago |
| Cebtenzzre | 00d62adb79 | fix some warnings from gcc and clang-tidy (#3038) | 2 years ago |
| Georgi Gerganov | c4f496648c | metal : fix kernel_norm (fixes Falcon on Metal) (#3057) | 2 years ago |
| Cebtenzzre | de2fe892af | examples : replace fprintf to stdout with printf (#3017) | 2 years ago |
| Georgi Gerganov | 921772104b | speculative : add grammar support (#2991) | 2 years ago |
| Georgi Gerganov | e36ecdccc8 | build : on Mac OS enable Metal by default (#2901) | 2 years ago |