Commit History

Author SHA1 Message Date
  slaren 96d7f56d29 llama : fix mlock with no-mmap with Metal (#5025) 2 years ago
  Georgi Gerganov 2d5419d08a imatrix : fix assert for src0 non-cont check 2 years ago
  Georgi Gerganov d391ae9b49 perplexity : fix winogrande N tasks option 2 years ago
  Georgi Gerganov e9240cdfa0 scripts : add get-winogrande.sh 2 years ago
  David Sommers b46757735d convert.py : fix llama/llama2 conversion due to vocab_size=-1 (#5019) 2 years ago
  Kawrakow 3e945cc1e9 HellaSwag: speed up by parallelizing log-prob evaluation (#5020) 2 years ago
  Georgi Gerganov ad19812cda perplexity : faster HellaSwag via batching (#5017) 2 years ago
  Kawrakow 682986a08e Add Winogrande evaluation (#5015) 2 years ago
  Georgi Gerganov dcad445d0c scritps : add helper script to get hellaswag data in txt format 2 years ago
  Paul Tsochantaris 1e605f4102 metal : fix memory leak, dangling pointer and unused autorel (#5007) 2 years ago
  Georgi Gerganov 6b6916b215 sync : ggml 2 years ago
  Georgi Gerganov 38566680cd ggml : add IQ2 to test-backend-ops + refactoring (#4990) 2 years ago
  Georgi Gerganov ba69bbc84c imatrix : offload to GPU support (#4957) 2 years ago
  Georgi Gerganov 44a1a4a41a backend : add eval callback (#4935) 2 years ago
  Georgi Gerganov c918fe8dca metal : create autorelease pool during library build (#4970) 2 years ago
  Georgi Gerganov 0f83e727af py : fix whitespace 2 years ago
  Georgi Gerganov 4f4bf35f46 py : fix missing added_tokens_dict for SPM and BPE vocabs (#4971) 2 years ago
  Kawrakow 2b3a665d39 llama : use Q4_K for attn_v for Q2_K_S when n_gqa >= 4 (#4996) 2 years ago
  Paul Tsochantaris 7563293665 metal : remove unnecessary nil check (#4986) 2 years ago
  David Renshaw f46c0c1b0e llama : fix copy/paste error in llama_sampling_params comment (#4994) 2 years ago
  Georgi Gerganov 5c99960901 py : remove unnecessary hasattr (#4903) 2 years ago
  Philip Taron bee938da74 nix: remove nixConfig from flake.nix (#4984) 2 years ago
  Daniel Bevenius cec8a48470 finetune : add training data file to log message (#4979) 2 years ago
  Kawrakow 334a835a1c ggml : importance matrix support for legacy quants (#4969) 2 years ago
  Maximilian Winter 4feb4b33ee examples : add complete parallel function calling example (#4974) 2 years ago
  Georgi Gerganov 959ef0c0df perplexity : fix kv cache handling for hellaswag (#4981) 2 years ago
  Georgi Gerganov c37b3474e6 flake.lock: update flake-parts, flake-parts/nixpkgs-lib, and nixpkgs (#4920) 2 years ago
  Paul Tsochantaris 158f8c9e21 metal : localized logic in `ggml_metal_graph_compute` (#4924) 2 years ago
  Neuman Vong 862f5e41ab android : introduce starter project example (#4926) 2 years ago
  Alex Azarov 3a48d558a6 metal : replace loop of dispatch_async with dispatch_apply (#4934) 2 years ago