cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Georgi Gerganov	611363ac79 scripts : add pipefail	%!s(int64=2) %!d(string=hai) anos
Marcus Dunn	95b6e5212f added `struct` to llama_dump_timing_info_yaml's `llama_context` (#2857)	%!s(int64=2) %!d(string=hai) anos
xaedes	44c117f41e train : mem usage and other improvements (#2439)	%!s(int64=2) %!d(string=hai) anos
slaren	43033b7bb4 llama-bench : set locale to utf8 (#2832)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	6b73ef1201 YAML result logging + preset script (#2657)	%!s(int64=2) %!d(string=hai) anos
alonfaraj	75fafcbccc make : fix tests build (#2855)	%!s(int64=2) %!d(string=hai) anos
grahameth	be475f60af llama.cpp : fix wrong vsnprintf call in MS compiler (#2856)	%!s(int64=2) %!d(string=hai) anos
Ronny Brendel	3af6b86301 ggml : tiny ggml_vec_dot_q4_K_q8_K AVX2 improvement (#2819)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	35feac6560 ggml : sync (mem align to header + conv_transpose_2d fixes + ggml_alloc) (#2852)	%!s(int64=2) %!d(string=hai) anos
Johannes Gäßler	92b1bbd2ec CUDA: fix RoPE asserts, block sizes (#2833)	%!s(int64=2) %!d(string=hai) anos
igarnier	dd0dc366da llama.h : add missing struct keyword for C compat in callback type (#2847)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	f55538c3cc metal : fix memory leak (#2762)	%!s(int64=2) %!d(string=hai) anos
Cebtenzzre	ebcee207b6 quantize : make output filename optional again (#2823)	%!s(int64=2) %!d(string=hai) anos
JohnnyB	3e8ff47af6 devops : added systemd units and set versioning to use date. (#2835)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	103cfafc77 gguf : fix strings to not be null-terminated (#2839)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	c10704d01e llama : fix MPI threads (close #2827)	%!s(int64=2) %!d(string=hai) anos
Olivier Chafik	230d46c723 examples : update llama2.c converter to read vocab and write models in GGUF format (#2751)	%!s(int64=2) %!d(string=hai) anos
Kawrakow	463173a6c0 llama : speedup tokenization (#2831)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	eaa13a48ff falcon : fix CUDA inference by making K and Q contiguous (#2830)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	da7455d046 readme : fix headings	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	25423e9185 scripts : helper convert script	%!s(int64=2) %!d(string=hai) anos
Kawrakow	a6d1189fdd k_quants tuning for Falcon-7b (#2816)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	c48c5bb0b0 readme : update hot topics	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	d0cee0d36d gguf : add 64-bit support (GGUF v2) (#2821)	%!s(int64=2) %!d(string=hai) anos
Georgi Gerganov	edd4c14817 llama : more tokenizer fixes (#2810)	%!s(int64=2) %!d(string=hai) anos
Przemysław Pawełczyk	1591e2e590 ggml : detect SSSE3 (#2825)	%!s(int64=2) %!d(string=hai) anos
slaren	789c8c945a ci : add LoRA test to CI (#2650)	%!s(int64=2) %!d(string=hai) anos
Bruce MacDonald	c1ac54b77a server : add `/detokenize` endpoint (#2802)	%!s(int64=2) %!d(string=hai) anos
Kerfuffle	730d9c681e convert.py : advanced option (#2753)	%!s(int64=2) %!d(string=hai) anos
Tim Miller	c7d92e6dfe llama : use Unicode Escape Sequence to replace encoded characters (#2814)	%!s(int64=2) %!d(string=hai) anos

Posterior Anterior

Commit History Buscar

Commit History