Commit History

Author           SHA1        Message                                                                       Date
Georgi Gerganov  dd665cc9d4  parallel : increase the variability of the prompt lengths (#13927)            7 months ago
Georgi Gerganov  518329b2d4  parallel : add option for non-shared and larger prompts (#13598)              8 months ago
Richard Kiss     532dd74e38  Fix some documentation typos/grammar mistakes (#4032)                         2 years ago
Georgi Gerganov  ec893798b7  llama : custom attention mask + parallel decoding + no context swaps (#3228)  2 years ago