cturan/llama.cpp

Autor	SHA1 Nachricht	Datum
Stephan Walter	3525899277 Enable -std= for cmake builds, fix warnings (#598)	vor 2 Jahren
slaren	1d08882afa Optimize AVX2 ggml_vec_dot_q4_0 (#642)	vor 2 Jahren
perserk	02c5b27e91 Add AVX acceleration (#617)	vor 2 Jahren
Justine Tunney	6f23ba5ee2 Ensure --mlock works properly with mmap() support	vor 2 Jahren
Slaren	c03ae8dca1 Add mmap support for model files	vor 2 Jahren
Casey Primozic	a4755cf288 Remove unused variable (#607)	vor 2 Jahren
Georgi Gerganov	77efdf5a50 ggml : fix NEON signs (close #620, #622)	vor 2 Jahren
slaren	ed3c680bcd Fix GGML_F32Cx8_STORE in AVX without F16C path (#619)	vor 2 Jahren
Georgi Gerganov	b51c717d5c ggml : init time on first ggml_init() call	vor 2 Jahren
Georgi Gerganov	cea1c85948 ggml : add ARM_NEON dequantize_row_q4_1()	vor 2 Jahren
Georgi Gerganov	f202ada131 ggml : add ARM_NEON quantize_row_q4_1()	vor 2 Jahren
Georgi Gerganov	3b44d30d9b ggml : add ARM_NEON ggml_vec_dot_q4_1()	vor 2 Jahren
anzz1	83df5639eb Fix GCC warning about binary literal (#595)	vor 2 Jahren
anzz1	5a5f8b1501 Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375)	vor 2 Jahren
slaren	2a98bc18ea ggml : add AVX2 implementation of quantize_row_q4_1 (#515)	vor 2 Jahren
Stephan Walter	99c5b27654 ggml : refactor quantized processing functions (#509)	vor 2 Jahren
Stephan Walter	436e561931 all : be more strict about converting float to double (#458)	vor 2 Jahren
Stephan Walter	c1f885067c ggml : introduce structs for the q4 data blocks (#356)	vor 2 Jahren
slaren	a6bdc47cba Fix usage of F16C intrinsics in AVX code (#563)	vor 2 Jahren
Stephan Walter	939ad2d3a5 Fix undefined variables in debug build, remove unused variables (#531)	vor 2 Jahren
slaren	459e93cce0 Add AVX2 implementation of dequantize_row_q4_1 (#505)	vor 2 Jahren
Georgi Gerganov	a316a425d0 Overhaul the examples structure	vor 2 Jahren
Georgi Gerganov	ecbe466a36 Retire the ggml_mul_mat() branch for transposed src0 (#500)	vor 2 Jahren
slaren	09aecbf628 Add AVX2 implementation of dequantize_row_q4_0 (#467)	vor 2 Jahren
Georgi Gerganov	6b6dbc8910 Remove obsolete assert and fix compiler warning	vor 2 Jahren
Georgi Gerganov	2a2e63ce05 Fix nasty bug in ggml_compute_forward_mul_mat_f32() and reenable BLAS	vor 2 Jahren
Georgi Gerganov	8520fc310e Disable BLAS altogether - the bug is not just for qunatized mat mul	vor 2 Jahren
Georgi Gerganov	b3f460e941 Disable BLAS branch in mul_mat - seems there is a bug	vor 2 Jahren
Georgi Gerganov	7a9b6c3a8b Reduce memory usage and allocate enough memory for largest context (#473)	vor 2 Jahren
Cameron Kaiser	481044d50c additional optimizations for POWER9 (#454)	vor 2 Jahren

Neuer Älter

Commit Verlauf Finden

Commit Verlauf