wzy
|
78a3d13424
flake : remove intel mkl from flake.nix due to missing files (#2277)
|
пре 2 година |
wzy
|
45a1b07e9b
flake : update flake.nix (#2270)
|
пре 2 година |
wzy
|
b1f4290953
cmake : install targets (#2256)
|
пре 2 година |
Howard Su
|
4e7464ef88
FP16 is supported in CM=6.0 (#2177)
|
пре 2 година |
Evan Miller
|
5656d10599
mpi : add support for distributed inference via MPI (#2099)
|
пре 2 година |
clyang
|
3bbc1a11f0
ggml : fix buidling with Intel MKL but ask for "cblas.h" issue (#2104) (#2115)
|
пре 2 година |
Johannes Gäßler
|
924dd22fd3
Quantized dot products for CUDA mul mat vec (#2067)
|
пре 2 година |
Tobias Lütke
|
7ee76e45af
Simple webchat for server (#1998)
|
пре 2 година |
Daniel Drake
|
b213227067
cmake : don't force -mcpu=native on aarch64 (#2063)
|
пре 2 година |
Kawrakow
|
6769e944c7
k-quants : support for super-block size of 64 (#2001)
|
пре 2 година |
Johannes Gäßler
|
bbca06e269
cmake: revert CUDA arch default to 52, 61 if f16 (#1959)
|
пре 2 година |
Georgi Gerganov
|
23fc5c219a
cmake : fix trailing whitespaces
|
пре 2 година |
Howard Su
|
1e3abfcef0
cmake : fix build shared ggml when CUDA is enabled (#1929)
|
пре 2 година |
Johannes Gäßler
|
16b9cd1939
Convert vector to f16 for dequantize mul mat vec (#1913)
|
пре 2 година |
Howard Su
|
57cd69460f
cmake : add CUDA_ARCHITECTURES to new target ggml_static (#1917)
|
пре 2 година |
Kerfuffle
|
b4c6f46f17
Allow cmake to build ggml as a library (#1896)
|
пре 2 година |
Zenix
|
13fe9d2d84
cmake : add auto detection of BLAS_INCLUDE_DIRS (#1886)
|
пре 2 година |
Kawrakow
|
3d01122610
CUDA : faster k-quant dot kernels (#1862)
|
пре 2 година |
Georgi Gerganov
|
bed9275617
cmake : remove whitespaces
|
пре 2 година |
Igor Okulist
|
3559433fec
cmake : set include path for OpenBlas (#1830)
|
пре 2 година |
Georgi Gerganov
|
4de0334f5c
cmake : fix Metal build (close #1791)
|
пре 2 година |
Andrei
|
303f5809f1
metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782)
|
пре 2 година |
johnson442
|
0035858273
k-quants : add missing compile definition to CMakeLists (#1748)
|
пре 2 година |
Georgi Gerganov
|
5c64a0952e
k-quants : allow to optionally disable at compile time (#1734)
|
пре 2 година |
Kawrakow
|
99009e72f8
ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)
|
пре 2 година |
Georgi Gerganov
|
ecb217db4f
llama : Metal inference (#1642)
|
пре 2 година |
Henri Vasserman
|
0ecb1bbbeb
[CI] Fix openblas (#1613)
|
пре 2 година |
Johannes Gäßler
|
1fcdcc28b1
cuda : performance optimizations (#1530)
|
пре 2 година |
0cc4m
|
2e6cd4b025
OpenCL Token Generation Acceleration (#1459)
|
пре 2 година |
Steward Garcia
|
7e4ea5beff
examples : add server example with REST API (#1443)
|
пре 2 година |