Georgi Gerganov | d1031cf49c | sampling : refactor init to use llama_sampling_params (#3696) | 2 years ago
Georgi Gerganov | 0e89203b51 | speculative : add tree-based sampling example (#3624) | 2 years ago
Kerfuffle | 70c29da118 | common : fix mirostat state when using multiple sequences (#3543) | 2 years ago
Georgi Gerganov | fcca0a7004 | refact : fix convert script + zero out KV cache to avoid nans (#3523) | 2 years ago
pudepiedj | a8777ad84e | parallel : add option to load external prompt file (#3416) | 2 years ago
Georgi Gerganov | ac2219fef3 | llama : fix session saving/loading (#3400) | 2 years ago
slaren | 16bc66d947 | llama.cpp : split llama_context_params into model and context params (#3301) | 2 years ago
Georgi Gerganov | ec893798b7 | llama : custom attention mask + parallel decoding + no context swaps (#3228) | 2 years ago