Commit History

Autor SHA1 Mensaxe Data
  Georgi Gerganov afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) hai 1 ano
  Georgi Gerganov f66f582927 llama : refactor `src/llama.cpp` (#10902) hai 1 ano
  Johannes Gäßler 4e54be0ec6 llama/ex: remove --logdir argument (#10339) hai 1 ano
  Xuan Son Nguyen cda0e4b648 llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745) hai 1 ano
  Diego Devesa 7eee341bee common : use common_ prefix for common library functions (#9805) hai 1 ano
  Georgi Gerganov e7022064ab perplexity : fix integer overflow (#9783) hai 1 ano
  Georgi Gerganov 37f8c7b4c9 perplexity : remove extra new lines after chunks (#9596) hai 1 ano
  Sigbjørn Skjæret 722ec1eb51 perplexity : do not escape input data by default (#9548) hai 1 ano
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) hai 1 ano
  Georgi Gerganov 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) hai 1 ano
  slaren 49006c67b4 llama : move random seed generation to the samplers (#9398) hai 1 ano
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) hai 1 ano
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) hai 1 ano
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) hai 1 ano
  Zhenwei Jin 4af8420afb common : remove duplicate function llama_should_add_bos_token (#8778) hai 1 ano
  Liu Jia 0a4ce78681 common : Changed tuple to struct (TODO fix) (#8823) hai 1 ano
  slaren 5f2d4e60e2 ppl : fix n_seq_max for perplexity (#8277) hai 1 ano
  Olivier Chafik 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) hai 1 ano
  Georgi Gerganov 1442677f92 common : refactor cli arg parsing (#7675) hai 1 ano
  Georgi Gerganov 6ff13987ad common : normalize naming style (#7462) hai 1 ano
  strawberrymelonpanda ca57e0f35e perplexity : ndot progress and show stats with < 100 tasks (#7348) hai 1 ano
  Johannes Gäßler a8f9b07631 perplexity: more statistics, added documentation (#6936) hai 1 ano
  Georgi Gerganov 58227ffdeb perplexity : require positive --ctx-size arg (#6695) hai 1 ano
  Jared Van Bortel 1b67731e18 BERT tokenizer fixes (#6498) hai 1 ano
  compilade 557410b8f0 llama : greatly reduce output buffer memory usage (#6122) hai 1 ano
  slaren f30ea47a87 llama : add pipeline parallelism support (#6017) hai 1 ano
  Georgi Gerganov 05b06210c9 llama : more consistent names of count variables (#5994) hai 1 ano
  slaren d894f352bf perplexity : support using multiple sequences to allow larger batch sizes (#5946) hai 1 ano
  compilade c2101a2e90 llama : support Mamba Selective State Space Models (#5328) hai 1 ano
  Georgi Gerganov b1de96824b ci : fix wikitext url + compile warnings (#5569) hai 1 ano