Georgi Gerganov
|
dd665cc9d4
parallel : increase the variability of the prompt lengths (#13927)
|
7 hónapja |
Georgi Gerganov
|
518329b2d4
parallel : add option for non-shared and larger prompts (#13598)
|
8 hónapja |
Richard Kiss
|
532dd74e38
Fix some documentation typos/grammar mistakes (#4032)
|
2 éve |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
2 éve |