Commit History

Autor SHA1 Mensaxe Data
  compilade 557410b8f0 llama : greatly reduce output buffer memory usage (#6122) hai 1 ano
  slaren f30ea47a87 llama : add pipeline parallelism support (#6017) hai 1 ano
  Georgi Gerganov 05b06210c9 llama : more consistent names of count variables (#5994) hai 1 ano
  slaren d894f352bf perplexity : support using multiple sequences to allow larger batch sizes (#5946) hai 1 ano
  compilade c2101a2e90 llama : support Mamba Selective State Space Models (#5328) hai 1 ano
  Georgi Gerganov b1de96824b ci : fix wikitext url + compile warnings (#5569) hai 1 ano
  Herman Semenov 5d3de51f97 ggml, common, examples, tests : fixed type arguments in printf (#5528) hai 1 ano
  bmwl f486f6e1e5 ggml : add numa options (#5377) hai 1 ano
  Michael Klimenko 52bb63c708 refactor : switch to emplace_back to avoid extra object (#5291) hai 1 ano
  kalomaze 191221178f perplexity : fix KL divergence calculations on Windows (#5273) hai 1 ano
  Kawrakow 44879ee885 Additional KL-divergence statistics (#5081) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 89758723c7 minor : clean-up some warnings and style (#5094) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 6f9939d119 KL-divergence (#5076) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 7dcbe39d36 Add ability to evauate multiple choice tasks (#5047) %!s(int64=2) %!d(string=hai) anos
  Jared Van Bortel 97c1549808 perplexity : fix MSVC build after #5020 (#5043) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 7051aacfac winogrande: evaluate log-probs in parallel (#5036) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 993fba8180 perplexity: avoid unnecessary alloocations and logit copies (#5035) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 8b20858e5e perplexity : faster Winogrande via batching (#5024) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov d391ae9b49 perplexity : fix winogrande N tasks option %!s(int64=2) %!d(string=hai) anos
  Kawrakow 3e945cc1e9 HellaSwag: speed up by parallelizing log-prob evaluation (#5020) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ad19812cda perplexity : faster HellaSwag via batching (#5017) %!s(int64=2) %!d(string=hai) anos
  Kawrakow 682986a08e Add Winogrande evaluation (#5015) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov 959ef0c0df perplexity : fix kv cache handling for hellaswag (#4981) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 91f6499393 Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040) %!s(int64=2) %!d(string=hai) anos
  cebtenzzre b12fa0d1c1 build : link against build info instead of compiling against it (#3879) %!s(int64=2) %!d(string=hai) anos
  Kerfuffle 6e08281e58 Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843) %!s(int64=2) %!d(string=hai) anos
  Marcus Dunn 5be6c803fa llama : remove token functions with `context` args in favor of `model` (#3720) %!s(int64=2) %!d(string=hai) anos
  slaren 16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301) %!s(int64=2) %!d(string=hai) anos
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) %!s(int64=2) %!d(string=hai) anos
  Cebtenzzre 8781013ef6 make : restore build-info.h dependency for several targets (#3205) %!s(int64=2) %!d(string=hai) anos