Georgi Gerganov
|
dd665cc9d4
parallel : increase the variability of the prompt lengths (#13927)
|
há 7 meses atrás |
Georgi Gerganov
|
518329b2d4
parallel : add option for non-shared and larger prompts (#13598)
|
há 8 meses atrás |
Richard Kiss
|
532dd74e38
Fix some documentation typos/grammar mistakes (#4032)
|
há 2 anos atrás |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
há 2 anos atrás |