تاریخچه Commit ها

نویسنده SHA1 پیام تاریخ
  Georgi Gerganov f914544b16 batched-bench : add "separate text gen" mode (#17103) 2 ماه پیش
  Georgi Gerganov 7fd205a8e8 scripts : add script to bench models (#16894) 2 ماه پیش
  Georgi Gerganov a885dcff11 batched-bench : fix llama_synchronize usage during prompt processing (#15835) 4 ماه پیش
  Johannes Gäßler e81b8e4b7f llama: use FA + max. GPU layers by default (#15434) 4 ماه پیش
  Georgi Gerganov b3964c1e89 metal : optimize FA vec for large sequences and BS <= 8 (#15566) 4 ماه پیش
  Georgi Gerganov 6b64f74b55 batched-bench : fix unified KV cache handling + pp timing (#15562) 4 ماه پیش
  Georgi Gerganov f0d3c7405c batched-bench : use rand tokens (#15398) 5 ماه پیش
  Georgi Gerganov 225e7a1438 llama : add high-throughput mode (#14363) 6 ماه پیش
  Georgi Gerganov 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 ماه پیش
  Georgi Gerganov b89d605a91 batched-bench : fix pp batch contents (#13492) 8 ماه پیش
  Diego Devesa 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 ماه پیش