Georgi Gerganov
|
a3a2a0eda8
ggml : add GGML_DEFAULT_N_THREADS
|
пре 2 година |
Stephan Walter
|
3e6e70d8e8
Add enum llama_ftype, sync ggml_type to model files (#709)
|
пре 2 година |
Georgi Gerganov
|
c3ac702e5e
ggml : add ggml_cont() + optimize ggml_cpy() for contiguous dst
|
пре 2 година |
comex
|
f963b63afa
Rewrite loading code to try to satisfy everyone:
|
пре 2 година |
unbounded
|
62cfc54f77
Add quantize-stats command for testing quantization (#728)
|
пре 2 година |
Georgi Gerganov
|
986b6ce9f9
ggml, llama : avoid heavy V transpose + improvements (#775)
|
пре 2 година |
Marian Cepok
|
c0bb1d3ce2
ggml : change ne to int64_t (#626)
|
пре 2 година |
Justine Tunney
|
6f23ba5ee2
Ensure --mlock works properly with mmap() support
|
пре 2 година |
Slaren
|
c03ae8dca1
Add mmap support for model files
|
пре 2 година |
Stephan Walter
|
c1f885067c
ggml : introduce structs for the q4 data blocks (#356)
|
пре 2 година |
comex
|
563cdc391d
Support calling mlock() on loaded model data on Linux and macOS (#453)
|
пре 2 година |
Stephan Walter
|
69c92298a9
Deduplicate q4 quantization functions (#383)
|
пре 2 година |
Georgi Gerganov
|
f5a77a629b
Introduce C-style API (#370)
|
пре 2 година |
hoangmit
|
6eac39ba95
Add RMS norm and use it (#187)
|
пре 2 година |
Georgi Gerganov
|
26c0846629
Initial release
|
пре 2 година |