Kerfuffle
|
74d4cfa343
Allow "quantizing" to f16 and f32 (#1787)
|
2 years ago |
rankaiyx
|
555275a693
make : add SSSE3 compilation use case (#1659)
|
2 years ago |
Georgi Gerganov
|
5c64a0952e
k-quants : allow to optionally disable at compile time (#1734)
|
2 years ago |
Georgi Gerganov
|
2d43387daf
ggml : fix builds, add ggml-quants-k.o (close #1712, close #1710)
|
2 years ago |
Kawrakow
|
99009e72f8
ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)
|
2 years ago |
Georgi Gerganov
|
ecb217db4f
llama : Metal inference (#1642)
|
2 years ago |
Johannes Gäßler
|
3b126f654f
LLAMA_DEBUG adds debug symbols (#1617)
|
2 years ago |
Kerfuffle
|
0df7d63e5b
Include server in releases + other build system cleanups (#1610)
|
2 years ago |
Johannes Gäßler
|
1fcdcc28b1
cuda : performance optimizations (#1530)
|
2 years ago |
0cc4m
|
2e6cd4b025
OpenCL Token Generation Acceleration (#1459)
|
2 years ago |
Stefan Sydow
|
7780e4f479
make : .PHONY clean (#1553)
|
2 years ago |
Zenix
|
b8ee340abe
feature : support blis and other blas implementation (#1536)
|
2 years ago |
Georgi Gerganov
|
ea600071cb
Revert "feature : add blis and other BLAS implementation support (#1502)"
|
2 years ago |
Zenix
|
07e9ace0f9
feature : add blis and other BLAS implementation support (#1502)
|
2 years ago |
sandyiscool
|
2a5ee023ad
Add alternate include path for openblas (#1476)
|
2 years ago |
Georgi Gerganov
|
bda4d7c215
make : fix PERF build with cuBLAS
|
2 years ago |
DaniAndTheWeb
|
173d0e6419
makefile: automatic Arch Linux detection (#1332)
|
2 years ago |
Ionoclast Laboratories
|
2d13786e91
Fix for OpenCL / clbast builds on macOS. (#1329)
|
2 years ago |
DannyDaemonic
|
55bc5f0900
Call sh on build-info.sh (#1294)
|
2 years ago |
DannyDaemonic
|
f4cef87edf
Add git-based build information for better issue tracking (#1232)
|
2 years ago |
Pavol Rusnak
|
6f79699286
build: add armv{6,7,8} support to cmake (#1251)
|
2 years ago |
Stephan Walter
|
f0d70f147d
Various fixes to mat_mul benchmark (#1253)
|
2 years ago |
Georgi Gerganov
|
214b6a3570
ggml : adjust mul_mat_f16 work memory (#1226)
|
2 years ago |
Georgi Gerganov
|
305eb5afd5
build : fix reference to old llama_util.h
|
2 years ago |
slaren
|
7fc50c051a
cuBLAS: use host pinned memory and dequantize while copying (#1207)
|
2 years ago |
0cc4m
|
7296c961d9
ggml : add CLBlast support (#1164)
|
2 years ago |
Johannes Gäßler
|
92a6e13a31
Add Manjaro CUDA include and lib dirs to Makefile (#1212)
|
2 years ago |
slaren
|
e4cf982e0d
Fix cuda compilation (#1128)
|
2 years ago |
Georgi Gerganov
|
e4422e299c
ggml : better PERF prints + support "LLAMA_PERF=1 make"
|
2 years ago |
Georgi Gerganov
|
872c365a91
ggml : fix AVX build + update to new Q8_0 format
|
2 years ago |