cturan/llama.cpp

Author	SHA1 Message	Date
sandyiscool	2a5ee023ad Add alternate include path for openblas (#1476)	2 years ago
Georgi Gerganov	bda4d7c215 make : fix PERF build with cuBLAS	2 years ago
DaniAndTheWeb	173d0e6419 makefile: automatic Arch Linux detection (#1332)	2 years ago
Ionoclast Laboratories	2d13786e91 Fix for OpenCL / clbast builds on macOS. (#1329)	2 years ago
DannyDaemonic	55bc5f0900 Call sh on build-info.sh (#1294)	2 years ago
DannyDaemonic	f4cef87edf Add git-based build information for better issue tracking (#1232)	2 years ago
Pavol Rusnak	6f79699286 build: add armv{6,7,8} support to cmake (#1251)	2 years ago
Stephan Walter	f0d70f147d Various fixes to mat_mul benchmark (#1253)	2 years ago
Georgi Gerganov	214b6a3570 ggml : adjust mul_mat_f16 work memory (#1226)	2 years ago
Georgi Gerganov	305eb5afd5 build : fix reference to old llama_util.h	2 years ago
slaren	7fc50c051a cuBLAS: use host pinned memory and dequantize while copying (#1207)	2 years ago
0cc4m	7296c961d9 ggml : add CLBlast support (#1164)	2 years ago
Johannes Gäßler	92a6e13a31 Add Manjaro CUDA include and lib dirs to Makefile (#1212)	2 years ago
slaren	e4cf982e0d Fix cuda compilation (#1128)	2 years ago
Georgi Gerganov	e4422e299c ggml : better PERF prints + support "LLAMA_PERF=1 make"	2 years ago
Georgi Gerganov	872c365a91 ggml : fix AVX build + update to new Q8_0 format	2 years ago
slaren	50cb666b8a Improve cuBLAS performance by using a memory pool (#1094)	2 years ago
slaren	2005469ea1 Add Q4_3 support to cuBLAS (#1086)	2 years ago
源文雨	5addcb120c fix: LLAMA_CUBLAS=1 undefined reference 'shm_open' (#1080)	2 years ago
slaren	02d6988121 Improve cuBLAS performance by dequantizing on the GPU (#1065)	2 years ago
Stephan Walter	f3d4edf504 ggml : Q4 cleanup - remove 4-bit dot product code (#1061)	2 years ago
slaren	8944a13296 Add NVIDIA cuBLAS support (#1044)	2 years ago
Kawrakow	5ecff35151 Adding a simple program to measure speed of dot products (#1041)	2 years ago
Georgi Gerganov	e95b6554b4 ggml : add Q8_0 quantization for intermediate results (#951)	2 years ago
Stephan Walter	93265e988a make : fix dependencies, use auto variables (#983)	2 years ago
Georgi Gerganov	9190e8eac8 llama : merge llama_internal.h into llama.h	2 years ago
CRD716	8cda5c981d fix whitespace (#944)	2 years ago
SebastianApel	95ea26f6e9 benchmark : add tool for timing q4_0 matrix multiplication (#653)	2 years ago
comex	f963b63afa Rewrite loading code to try to satisfy everyone:	2 years ago
unbounded	62cfc54f77 Add quantize-stats command for testing quantization (#728)	2 years ago

Commit History