Diego Devesa | 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) | 1 year ago
Olivier Chafik | 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) | 1 year ago
cebtenzzre | b12fa0d1c1 build : link against build info instead of compiling against it (#3879) | 2 years ago
Georgi Gerganov | ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) | 2 years ago