Cebtenzzre
|
ef15649972
build : fix most gcc and clang warnings (#2861)
|
2 лет назад |
Cebtenzzre
|
849408957c
tests : add a C compliance test (#2848)
|
2 лет назад |
Georgi Gerganov
|
3a007648f2
metal : add option to disable debug logs (close #2764)
|
2 лет назад |
Henri Vasserman
|
6bbc598a63
ROCm Port (#1087)
|
2 лет назад |
Georgi Gerganov
|
6381d4e110
gguf : new file format with flexible meta data (beta) (#2398)
|
2 лет назад |
Kolen Cheung
|
0919a0f73d
cmake : install ggml-meta.metal if LLAMA_METAL (#2449)
|
2 лет назад |
Shouzheng Liu
|
bf83bff674
metal : matrix-matrix multiplication kernel (#2615)
|
2 лет назад |
Johannes Gäßler
|
f64d44a9b9
CUDA: Fixed OpenLLaMA 3b mmq, reduced compile time (#2590)
|
2 лет назад |
Johannes Gäßler
|
4f6b60c776
CUDA: Fix models with output size != 32000 (#2480)
|
2 лет назад |
Johannes Gäßler
|
b772bba42e
CUDA: fixed cmake F16 option (#2471)
|
2 лет назад |
Johannes Gäßler
|
0728c5a8b9
CUDA: mmq CLI option, fixed mmq build issues (#2453)
|
2 лет назад |
slaren
|
a113689571
ggml : add graph tensor allocator (#2411)
|
2 лет назад |
Johannes Gäßler
|
11f3ca06b8
CUDA: Quantized matrix matrix multiplication (#2160)
|
2 лет назад |
Cebtenzzre
|
6df1f5940f
make : build with -Wmissing-prototypes (#2394)
|
2 лет назад |
wzy
|
78a3d13424
flake : remove intel mkl from flake.nix due to missing files (#2277)
|
2 лет назад |
wzy
|
45a1b07e9b
flake : update flake.nix (#2270)
|
2 лет назад |
wzy
|
b1f4290953
cmake : install targets (#2256)
|
2 лет назад |
Howard Su
|
4e7464ef88
FP16 is supported in CM=6.0 (#2177)
|
2 лет назад |
Evan Miller
|
5656d10599
mpi : add support for distributed inference via MPI (#2099)
|
2 лет назад |
clyang
|
3bbc1a11f0
ggml : fix buidling with Intel MKL but ask for "cblas.h" issue (#2104) (#2115)
|
2 лет назад |
Johannes Gäßler
|
924dd22fd3
Quantized dot products for CUDA mul mat vec (#2067)
|
2 лет назад |
Tobias Lütke
|
7ee76e45af
Simple webchat for server (#1998)
|
2 лет назад |
Daniel Drake
|
b213227067
cmake : don't force -mcpu=native on aarch64 (#2063)
|
2 лет назад |
Kawrakow
|
6769e944c7
k-quants : support for super-block size of 64 (#2001)
|
2 лет назад |
Johannes Gäßler
|
bbca06e269
cmake: revert CUDA arch default to 52, 61 if f16 (#1959)
|
2 лет назад |
Georgi Gerganov
|
23fc5c219a
cmake : fix trailing whitespaces
|
2 лет назад |
Howard Su
|
1e3abfcef0
cmake : fix build shared ggml when CUDA is enabled (#1929)
|
2 лет назад |
Johannes Gäßler
|
16b9cd1939
Convert vector to f16 for dequantize mul mat vec (#1913)
|
2 лет назад |
Howard Su
|
57cd69460f
cmake : add CUDA_ARCHITECTURES to new target ggml_static (#1917)
|
2 лет назад |
Kerfuffle
|
b4c6f46f17
Allow cmake to build ggml as a library (#1896)
|
2 лет назад |