Georgi Gerganov
|
745aa5319b
llama : deprecate llama_kv_self_ API (#14030)
|
vor 7 Monaten |
Xuan-Son Nguyen
|
267c1399f1
common : refactor downloading system, handle mmproj with -hf option (#12694)
|
vor 9 Monaten |
Georgi Gerganov
|
e0dbec0bc6
llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)
|
vor 10 Monaten |
Georgi Gerganov
|
afa8a9ec9b
llama : add `llama_vocab`, functions -> methods, naming (#11110)
|
vor 1 Jahr |
Georgi Gerganov
|
f66f582927
llama : refactor `src/llama.cpp` (#10902)
|
vor 1 Jahr |
Georgi Gerganov
|
ab96610b1e
cmake : enable warnings in llama (#10474)
|
vor 1 Jahr |
Georgi Gerganov
|
811872a59d
speculative : simplify the implementation (#10504)
|
vor 1 Jahr |
Diego Devesa
|
10bce0450f
llama : accept a list of devices to use to offload a model (#10497)
|
vor 1 Jahr |
Georgi Gerganov
|
d9d54e498d
speculative : refactor and add a simpler example (#10362)
|
vor 1 Jahr |