Commit History

Author SHA1 Message Date
  Georgi Gerganov e0dbec0bc6 llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181) 10 months ago
  JC bfd11a2344 Fix: Compile failure due to Microsoft STL breaking change (#11836) 11 months ago
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) 1 year ago
  Johannes Gäßler 4e54be0ec6 llama/ex: remove --logdir argument (#10339) 1 year ago
  Xuan Son Nguyen cda0e4b648 llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745) 1 year ago
  Diego Devesa 7eee341bee common : use common_ prefix for common library functions (#9805) 1 year ago
  Georgi Gerganov e7022064ab perplexity : fix integer overflow (#9783) 1 year ago
  Georgi Gerganov 37f8c7b4c9 perplexity : remove extra new lines after chunks (#9596) 1 year ago
  Sigbjørn Skjæret 722ec1eb51 perplexity : do not escape input data by default (#9548) 1 year ago
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) 1 year ago
  Georgi Gerganov 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) 1 year ago
  slaren 49006c67b4 llama : move random seed generation to the samplers (#9398) 1 year ago
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) 1 year ago
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) 1 year ago
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
  Zhenwei Jin 4af8420afb common : remove duplicate function llama_should_add_bos_token (#8778) 1 year ago
  Liu Jia 0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823) 1 year ago
  slaren 5f2d4e60e2 ppl : fix n_seq_max for perplexity (#8277) 1 year ago
  Olivier Chafik 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
  Georgi Gerganov 1442677f92 common : refactor cli arg parsing (#7675) 1 year ago
  Georgi Gerganov 6ff13987ad common : normalize naming style (#7462) 1 year ago
  strawberrymelonpanda ca57e0f35e perplexity : ndot progress and show stats with < 100 tasks (#7348) 1 year ago
  Johannes Gäßler a8f9b07631 perplexity: more statistics, added documentation (#6936) 1 year ago
  Georgi Gerganov 58227ffdeb perplexity : require positive --ctx-size arg (#6695) 1 year ago
  Jared Van Bortel 1b67731e18 BERT tokenizer fixes (#6498) 1 year ago
  compilade 557410b8f0 llama : greatly reduce output buffer memory usage (#6122) 1 year ago
  slaren f30ea47a87 llama : add pipeline parallelism support (#6017) 1 year ago
  Georgi Gerganov 05b06210c9 llama : more consistent names of count variables (#5994) 1 year ago
  slaren d894f352bf perplexity : support using multiple sequences to allow larger batch sizes (#5946) 1 year ago