cturan/llama.cpp

Auteur	SHA1 Message	Date
Georgi Gerganov	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	il y a 2 ans
slaren	097e121e2f llama : add benchmark example (#2626)	il y a 2 ans
drbh	7cf54e1f74 tests : adds simple llama grammar tests (#2618)	il y a 2 ans
Shouzheng Liu	bf83bff674 metal : matrix-matrix multiplication kernel (#2615)	il y a 2 ans
drbh	ee77efea2a test : add simple grammar parsing tests (#2594)	il y a 2 ans
byte-6174	b19edd54d5 Adding support for llama2.c models (#2559)	il y a 2 ans
Johannes Gäßler	25d43e0eb5 CUDA: tuned mul_mat_q kernels (#2546)	il y a 2 ans
Martin Krasser	f5bfea0580 Allow passing grammar to completion endpoint (#2532)	il y a 2 ans
GiviMAD	34a14b28ff [Makefile] Move ARM CFLAGS before compilation (#2536)	il y a 2 ans
DannyDaemonic	3498588e0f Add --simple-io option for subprocesses and break out console.h and cpp (#1558)	il y a 2 ans
Eve	81844fbcfd tests : Fix compilation warnings (Linux/GCC) (#2451)	il y a 2 ans
Johannes Gäßler	49e7cb5bb1 CUDA: fixed LLAMA_FAST compilation option (#2473)	il y a 2 ans
Johannes Gäßler	0728c5a8b9 CUDA: mmq CLI option, fixed mmq build issues (#2453)	il y a 2 ans
slaren	a113689571 ggml : add graph tensor allocator (#2411)	il y a 2 ans
Johannes Gäßler	11f3ca06b8 CUDA: Quantized matrix matrix multiplication (#2160)	il y a 2 ans
Cebtenzzre	6df1f5940f make : build with -Wmissing-prototypes (#2394)	il y a 2 ans
Aarni Koskela	b3f138d058 Chat UI extras (#2366)	il y a 2 ans
Evan Jones	84e09a7d8b llama : add grammar-based sampling (#1773)	il y a 2 ans
Jose Maldonado	91171b8072 make : fix CLBLAST compile support in FreeBSD (#2331)	il y a 2 ans
Jose Maldonado	73643f5fb1 gitignore : changes for Poetry users + chat examples (#2284)	il y a 2 ans
Georgi Gerganov	a814d04f81 make : fix indentation	il y a 2 ans
Sky Yan	42c7c2e2e9 make : support customized LLAMA_CUDA_NVCC and LLAMA_CUDA_CCBIN (#2275)	il y a 2 ans
Jiří Podivín	54e3bc76fe make : add new target for test binaries (#2244)	il y a 2 ans
Przemysław Pawełczyk	9cf022a188 make : fix embdinput library and server examples building on MSYS2 (#2235)	il y a 2 ans
wzy	7dabc66f3c make : use pkg-config for OpenBLAS (#2222)	il y a 2 ans
James Reynolds	229aab351c make : fix combination of LLAMA_METAL and LLAMA_MPI (#2208)	il y a 2 ans
Evan Miller	5656d10599 mpi : add support for distributed inference via MPI (#2099)	il y a 2 ans
dylan	84525e7962 docker : add support for CUDA in docker (#1461)	il y a 2 ans
Johannes Gäßler	924dd22fd3 Quantized dot products for CUDA mul mat vec (#2067)	il y a 2 ans
Henri Vasserman	acc111caf9 Allow old Make to build server. (#2098)	il y a 2 ans

Récemment Précédemment

Historique des commits Trouver

Historique des commits