Georgi Gerganov
|
d1031cf49c
sampling : refactor init to use llama_sampling_params (#3696)
|
2 лет назад |
M. Yusuf Sarıgöz
|
370359e5ba
examples: support LLaVA v1.5 (multimodal model) (#3436)
|
2 лет назад |
Georgi Gerganov
|
8c70a5ff25
batched : add bench tool (#3545)
|
2 лет назад |
xaedes
|
0e76a8992c
train : finetune LORA (#2632)
|
2 лет назад |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
2 лет назад |
Georgi Gerganov
|
47068e5170
speculative : PoC for speeding-up inference via speculative sampling (#2926)
|
2 лет назад |
Georgi Gerganov
|
c90d135eb4
examples : fix underscore in beam-search + .gitignore (close #2900)
|
2 лет назад |
Matt Pulver
|
c82742ac9c
llama : add llama_beam_search() (#2267)
|
2 лет назад |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 лет назад |
slaren
|
097e121e2f
llama : add benchmark example (#2626)
|
2 лет назад |
byte-6174
|
b19edd54d5
Adding support for llama2.c models (#2559)
|
2 лет назад |
DannyDaemonic
|
3498588e0f
Add --simple-io option for subprocesses and break out console.h and cpp (#1558)
|
2 лет назад |
Evan Jones
|
84e09a7d8b
llama : add grammar-based sampling (#1773)
|
2 лет назад |
ningshanwutuobang
|
cfa0750bc9
llama : support input embeddings directly (#1910)
|
2 лет назад |
Georgi Gerganov
|
051e1b0e6a
llama : fix kv_cache `n` init (close #1903)
|
2 лет назад |
xaedes
|
e32089b2c2
train : improved training-from-scratch example (#1652)
|
2 лет назад |
Georgi Gerganov
|
ecb217db4f
llama : Metal inference (#1642)
|
2 лет назад |
Steward Garcia
|
7e4ea5beff
examples : add server example with REST API (#1443)
|
2 лет назад |
xaedes
|
f954edda93
ggml : implement backward pass for llama + small training-llama-from-scratch example (#1360)
|
2 лет назад |
Stephan Walter
|
f0d70f147d
Various fixes to mat_mul benchmark (#1253)
|
2 лет назад |
xaedes
|
0c5692345d
examples : add save_load_state example (#1150)
|
2 лет назад |
unbounded
|
62cfc54f77
Add quantize-stats command for testing quantization (#728)
|
2 лет назад |
Georgi Gerganov
|
a316a425d0
Overhaul the examples structure
|
2 лет назад |