Piotr Wilkin (ilintar)
|
34fcc5a4ac
model : Apertus model implementation (#15852)
|
3 달 전 |
Gabe Goodhart
|
e8d99dd0b6
nvidia nemotron nano v2 (nemotronh) (#15507)
|
4 달 전 |
Georgi Gerganov
|
fd1234cb46
llama : add gpt-oss (#15091)
|
5 달 전 |
Sigbjørn Skjæret
|
d17a809ef0
llama : support multiple classifier outputs and labels (#13940)
|
7 달 전 |
Diego Devesa
|
c6a2c9e741
gguf : use ggml log system (#13571)
|
8 달 전 |
Diego Devesa
|
b7d2672082
llama : fix quantize with dl backends (#13539)
|
8 달 전 |
Johannes Gäßler
|
10d2af0eaa
llama/ggml: add LLM training support (#10544)
|
8 달 전 |
Diego Devesa
|
27ebfcacba
llama : do not crash if there is no CPU backend (#13395)
|
8 달 전 |
Georgi Gerganov
|
833e2b7409
model : print tensor size during load (#12711)
|
9 달 전 |
Diego Devesa
|
e0e912f49b
llama : add option to override model tensor buffers (#11397)
|
9 달 전 |
jklincn
|
e39e727e9a
llama : use LLM_KV_GENERAL_FILE_TYPE instead of gguf_find_key (#12672)
|
9 달 전 |
lexasub
|
a5203b4465
llama : minor fixes for up llama load model speed (#11448)
|
11 달 전 |
Xuan Son Nguyen
|
681149ced2
llama : add `llama_model_load_from_splits` (#11255)
|
1 년 전 |
Georgi Gerganov
|
afa8a9ec9b
llama : add `llama_vocab`, functions -> methods, naming (#11110)
|
1 년 전 |
Johannes Gäßler
|
53ff6b9b9f
GGUF: C++ refactor, backend support, misc fixes (#11030)
|
1 년 전 |
Georgi Gerganov
|
f66f582927
llama : refactor `src/llama.cpp` (#10902)
|
1 년 전 |