cturan/llama.cpp

Author	SHA1 Message	Date
Kawrakow	99009e72f8 ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)	2 years ago
Georgi Gerganov	ecb217db4f llama : Metal inference (#1642)	2 years ago
Henri Vasserman	0ecb1bbbeb [CI] Fix openblas (#1613)	2 years ago
Johannes Gäßler	1fcdcc28b1 cuda : performance optimizations (#1530)	2 years ago
0cc4m	2e6cd4b025 OpenCL Token Generation Acceleration (#1459)	2 years ago
Steward Garcia	7e4ea5beff examples : add server example with REST API (#1443)	2 years ago
Zenix	b8ee340abe feature : support blis and other blas implementation (#1536)	2 years ago
Georgi Gerganov	ea600071cb Revert "feature : add blis and other BLAS implementation support (#1502)"	2 years ago
Zenix	07e9ace0f9 feature : add blis and other BLAS implementation support (#1502)	2 years ago
kuvaus	9daff419f6 fix build-info.h for git submodules (#1289)	2 years ago
Marvin Gießing	cc0bb7235c ggml : fix ppc64le build error and make cmake detect Power processors (#1284)	2 years ago
DannyDaemonic	f4cef87edf Add git-based build information for better issue tracking (#1232)	2 years ago
Pavol Rusnak	6f79699286 build: add armv{6,7,8} support to cmake (#1251)	2 years ago
Georgi Gerganov	305eb5afd5 build : fix reference to old llama_util.h	2 years ago
0cc4m	7296c961d9 ggml : add CLBlast support (#1164)	2 years ago
Georgi Gerganov	0e018fe008 ggml : fix Q4_3 cuBLAS	2 years ago
Howard Su	7e312f165c cmake : fix build under Windows when enable BUILD_SHARED_LIBS (#1100)	2 years ago
源文雨	018f2279f5 cmake : link threads publicly to ggml (#1042)	2 years ago
slaren	02d6988121 Improve cuBLAS performance by dequantizing on the GPU (#1065)	2 years ago
Stephan Walter	f3d4edf504 ggml : Q4 cleanup - remove 4-bit dot product code (#1061)	2 years ago
slaren	8944a13296 Add NVIDIA cuBLAS support (#1044)	2 years ago
Kawrakow	5ecff35151 Adding a simple program to measure speed of dot products (#1041)	2 years ago
Ivan Komarov	f266259ad9 Speedup the AVX-512 implementation of ggml_vec_dot_q4_0() (#933)	2 years ago
katsu560	106faaf297 cmake : add finding the OpenBLAS header file (#992)	2 years ago
Georgi Gerganov	9190e8eac8 llama : merge llama_internal.h into llama.h	2 years ago
anzz1	585d91a156 cmake : add explicit F16C option (x86) (#576)	2 years ago
comex	f963b63afa Rewrite loading code to try to satisfy everyone:	2 years ago
eiery	f2d1c47294 cmake should link openblas properly with -lopenblas like how it's done in the makefile (#839)	2 years ago
Stephan Walter	3525899277 Enable -std= for cmake builds, fix warnings (#598)	2 years ago
Stephan Walter	3bcc129ba8 cmake : properly invoke CTest (#629)	2 years ago
