Georgi Gerganov
|
611363ac79
scripts : add pipefail
|
%!s(int64=2) %!d(string=hai) anos |
Marcus Dunn
|
95b6e5212f
added `struct` to llama_dump_timing_info_yaml's `llama_context` (#2857)
|
%!s(int64=2) %!d(string=hai) anos |
xaedes
|
44c117f41e
train : mem usage and other improvements (#2439)
|
%!s(int64=2) %!d(string=hai) anos |
slaren
|
43033b7bb4
llama-bench : set locale to utf8 (#2832)
|
%!s(int64=2) %!d(string=hai) anos |
Johannes Gäßler
|
6b73ef1201
YAML result logging + preset script (#2657)
|
%!s(int64=2) %!d(string=hai) anos |
alonfaraj
|
75fafcbccc
make : fix tests build (#2855)
|
%!s(int64=2) %!d(string=hai) anos |
grahameth
|
be475f60af
llama.cpp : fix wrong vsnprintf call in MS compiler (#2856)
|
%!s(int64=2) %!d(string=hai) anos |
Ronny Brendel
|
3af6b86301
ggml : tiny ggml_vec_dot_q4_K_q8_K AVX2 improvement (#2819)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
35feac6560
ggml : sync (mem align to header + conv_transpose_2d fixes + ggml_alloc) (#2852)
|
%!s(int64=2) %!d(string=hai) anos |
Johannes Gäßler
|
92b1bbd2ec
CUDA: fix RoPE asserts, block sizes (#2833)
|
%!s(int64=2) %!d(string=hai) anos |
igarnier
|
dd0dc366da
llama.h : add missing struct keyword for C compat in callback type (#2847)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
f55538c3cc
metal : fix memory leak (#2762)
|
%!s(int64=2) %!d(string=hai) anos |
Cebtenzzre
|
ebcee207b6
quantize : make output filename optional again (#2823)
|
%!s(int64=2) %!d(string=hai) anos |
JohnnyB
|
3e8ff47af6
devops : added systemd units and set versioning to use date. (#2835)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
103cfafc77
gguf : fix strings to not be null-terminated (#2839)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
c10704d01e
llama : fix MPI threads (close #2827)
|
%!s(int64=2) %!d(string=hai) anos |
Olivier Chafik
|
230d46c723
examples : update llama2.c converter to read vocab and write models in GGUF format (#2751)
|
%!s(int64=2) %!d(string=hai) anos |
Kawrakow
|
463173a6c0
llama : speedup tokenization (#2831)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
eaa13a48ff
falcon : fix CUDA inference by making K and Q contiguous (#2830)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
da7455d046
readme : fix headings
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
25423e9185
scripts : helper convert script
|
%!s(int64=2) %!d(string=hai) anos |
Kawrakow
|
a6d1189fdd
k_quants tuning for Falcon-7b (#2816)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
c48c5bb0b0
readme : update hot topics
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
d0cee0d36d
gguf : add 64-bit support (GGUF v2) (#2821)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
edd4c14817
llama : more tokenizer fixes (#2810)
|
%!s(int64=2) %!d(string=hai) anos |
Przemysław Pawełczyk
|
1591e2e590
ggml : detect SSSE3 (#2825)
|
%!s(int64=2) %!d(string=hai) anos |
slaren
|
789c8c945a
ci : add LoRA test to CI (#2650)
|
%!s(int64=2) %!d(string=hai) anos |
Bruce MacDonald
|
c1ac54b77a
server : add `/detokenize` endpoint (#2802)
|
%!s(int64=2) %!d(string=hai) anos |
Kerfuffle
|
730d9c681e
convert.py : advanced option (#2753)
|
%!s(int64=2) %!d(string=hai) anos |
Tim Miller
|
c7d92e6dfe
llama : use Unicode Escape Sequence to replace encoded characters (#2814)
|
%!s(int64=2) %!d(string=hai) anos |