Eric Curtin
|
0cc63754b8
Introduce llama-run (#10291)
|
1 year ago |
Diego Devesa
|
5931c1f233
ggml : add support for dynamic loading of backends (#10469)
|
1 year ago |
Georgi Gerganov
|
d9d54e498d
speculative : refactor and add a simpler example (#10362)
|
1 year ago |
Diego Devesa
|
9f40989351
ggml : move CPU backend to a separate file (#10144)
|
1 year ago |
Diego Devesa
|
a6744e43e8
llama : add simple-chat example (#10124)
|
1 year ago |
Georgi Gerganov
|
148844fe97
examples : remove benchmark (#9704)
|
1 year ago |
Xuan Son Nguyen
|
be6d7c0791
examples : remove `finetune` and `train-text-from-scratch` (#8669)
|
1 year ago |
Brian
|
f7cab35ef9
gguf-hash: model wide and per tensor hashing using xxhash and sha1 (#8048)
|
1 year ago |
Georgi Gerganov
|
f3f65429c4
llama : reorganize source code + improve CMake (#8006)
|
1 year ago |
Xuan Son Nguyen
|
0c7b3595b9
Add `cvector-generator` example (#7514)
|
1 year ago |
Olivier Chafik
|
1c641e6aac
`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)
|
1 year ago |
Georgi Gerganov
|
0cd6bd3483
llama : remove beam search (#7736)
|
1 year ago |
Radoslav Gerganov
|
5e31828d3e
ggml : add RPC backend (#6829)
|
1 year ago |
Pierrick Hymbert
|
b804b1ef77
eval-callback: Example how to use eval callback for debugging (#6576)
|
1 year ago |
Minsoo Cheong
|
64e7b47c69
examples : add "retrieval" (#6193)
|
1 year ago |
Pierrick Hymbert
|
d0d5de42e5
gguf-split: split and merge gguf per batch of tensors (#6135)
|
1 year ago |
DAN™
|
bcebd7dbf6
llama : add support for GritLM (#5959)
|
1 year ago |
John
|
6c00a06692
gguf : add python reader example (#5216)
|
1 year ago |
Abhilash Majumder
|
0f648573dd
ggml : add unified SYCL backend for Intel GPUs (#2690)
|
1 year ago |
Georgi Gerganov
|
4be5ef556d
metal : remove old API (#4919)
|
2 years ago |
Kawrakow
|
326b418b59
Importance Matrix calculation (#4861)
|
2 years ago |
Georgi Gerganov
|
b0034d93ce
examples : add passkey test (#3856)
|
2 years ago |
LeonEricsson
|
7082d24cec
lookup : add prompt lookup decoding example (#4484)
|
2 years ago |
Georgi Gerganov
|
922754a8d6
lookahead : add example for lookahead decoding (#4207)
|
2 years ago |
zakkor
|
2fa02b4b3d
examples : add tokenize (#4039)
|
2 years ago |
Georgi Gerganov
|
d1031cf49c
sampling : refactor init to use llama_sampling_params (#3696)
|
2 years ago |
M. Yusuf Sarıgöz
|
370359e5ba
examples: support LLaVA v1.5 (multimodal model) (#3436)
|
2 years ago |
Georgi Gerganov
|
8c70a5ff25
batched : add bench tool (#3545)
|
2 years ago |
xaedes
|
0e76a8992c
train : finetune LORA (#2632)
|
2 years ago |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
2 years ago |