cturan/llama.cpp

Autor	SHA1 Nachricht	Datum
Stephan Walter	436e561931 all : be more strict about converting float to double (#458)	vor 2 Jahren
Stephan Walter	c1f885067c ggml : introduce structs for the q4 data blocks (#356)	vor 2 Jahren
slaren	a6bdc47cba Fix usage of F16C intrinsics in AVX code (#563)	vor 2 Jahren
Stephan Walter	939ad2d3a5 Fix undefined variables in debug build, remove unused variables (#531)	vor 2 Jahren
slaren	459e93cce0 Add AVX2 implementation of dequantize_row_q4_1 (#505)	vor 2 Jahren
Georgi Gerganov	a316a425d0 Overhaul the examples structure	vor 2 Jahren
Georgi Gerganov	ecbe466a36 Retire the ggml_mul_mat() branch for transposed src0 (#500)	vor 2 Jahren
slaren	09aecbf628 Add AVX2 implementation of dequantize_row_q4_0 (#467)	vor 2 Jahren
Georgi Gerganov	6b6dbc8910 Remove obsolete assert and fix compiler warning	vor 2 Jahren
Georgi Gerganov	2a2e63ce05 Fix nasty bug in ggml_compute_forward_mul_mat_f32() and reenable BLAS	vor 2 Jahren
Georgi Gerganov	8520fc310e Disable BLAS altogether - the bug is not just for qunatized mat mul	vor 2 Jahren
Georgi Gerganov	b3f460e941 Disable BLAS branch in mul_mat - seems there is a bug	vor 2 Jahren
Georgi Gerganov	7a9b6c3a8b Reduce memory usage and allocate enough memory for largest context (#473)	vor 2 Jahren
Cameron Kaiser	481044d50c additional optimizations for POWER9 (#454)	vor 2 Jahren
comex	563cdc391d Support calling mlock() on loaded model data on Linux and macOS (#453)	vor 2 Jahren
Stephan Walter	69c92298a9 Deduplicate q4 quantization functions (#383)	vor 2 Jahren
Valentyn Bezshapkin	97940520e8 fix: add POSIX functionality for Linux compilation (#51)	vor 2 Jahren
Georgi Gerganov	f5a77a629b Introduce C-style API (#370)	vor 2 Jahren
Kevin Lo	715d292ee0 Add OpenBSD support (#314)	vor 2 Jahren
Casey Primozic	2e664f1ff4 Add initial AVX512 support for dot product on Linux (#320)	vor 2 Jahren
Georgi Gerganov	22213a17b5 Change RMSNorm eps to 1e-6 (#173)	vor 2 Jahren
Stephan Walter	367946c668 Don't tell users to use a bad number of threads (#243)	vor 2 Jahren
Matvey Soloviev	904d2a8d6a Q4_1 quantization (#193)	vor 2 Jahren
Nebula	9b4a15b17d Fix RMS norm in GGML (#191)	vor 2 Jahren
hoangmit	6eac39ba95 Add RMS norm and use it (#187)	vor 2 Jahren
hoangmit	113e685d18 inline -> static inline for "bytesFromNibbles" (#161)	vor 2 Jahren
Ronsor	47857e564c Don't use vdotq_s32 if it's not available (#139)	vor 2 Jahren
Thomas Klausner	41be0a3b3d Add NetBSD support. (#90)	vor 2 Jahren
Georgi Gerganov	84d9015c4a Use vdotq_s32 to improve performance (#67)	vor 2 Jahren
Georgi Gerganov	c80e2a8f2a Revert "10% performance boost on ARM"	vor 2 Jahren

Neuer Älter

Commit Verlauf Finden

Commit Verlauf