cturan/llama.cpp

mirror of https://github.com/cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	dd665cc9d4 parallel : increase the variability of the prompt lengths (#13927)	7 months ago
Georgi Gerganov	518329b2d4 parallel : add option for non-shared and larger prompts (#13598)	8 months ago
Richard Kiss	532dd74e38 Fix some documentation typos/grammar mistakes (#4032)	2 years ago
Georgi Gerganov	ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228)	2 years ago