cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	7eb41179ed readme : update hot topics	2 years ago
Cebtenzzre	a5661d7e71 llama : allow gguf RoPE keys to be overridden with defaults (#3240)	2 years ago
Cebtenzzre	65c2c1c5ab benchmark-matmult : do not use integer abs() on a float (#3277)	2 years ago
kang	80834daecf flake : Restore default package's buildInputs (#3262)	2 years ago
Alon	a40f2b656f CI: FreeBSD fix (#3258)	2 years ago
Georgi Gerganov	d119c04c15 examples : fix benchmark-matmult (#1554)	2 years ago
Cebtenzzre	8781013ef6 make : restore build-info.h dependency for several targets (#3205)	2 years ago
Erik Scholz	7ddf185537 ci : switch cudatoolkit install on windows to networked (#3236)	2 years ago
Johannes Gäßler	ee66942d7e CUDA: fix peer access logic (#3231)	2 years ago
Johannes Gäßler	111163e246 CUDA: enable peer access between devices (#2470)	2 years ago
slaren	8b428c9bc8 llama.cpp : show model size and BPW on load (#3223)	2 years ago
Johannes Gäßler	578d8c8f5c CUDA: fix scratch malloced on non-main device (#3220)	2 years ago
IsaacDynamo	b541b4f0b1 Enable BUILD_SHARED_LIBS=ON on all Windows builds (#3215)	2 years ago
Vlad	5dbc2b3213 Enable build with CUDA 11.0 (make) (#3132)	2 years ago
goerch	b08e75baea Fixing the last deviations from sentencepiece indicated by test-tokenizer-1 (#3170)	2 years ago
Cebtenzzre	e6616cf0db examples : add compiler version and target to build info (#2998)	2 years ago
Cebtenzzre	3aefaab9e5 check C++ code with -Wmissing-declarations (#3184)	2 years ago
Cebtenzzre	69eb67e282 fix build numbers by setting fetch-depth=0 (#3197)	2 years ago
Meng Zhang	4fe09dfe66 llama : add support for StarCoder model architectures (#3187)	2 years ago
Cebtenzzre	80291a1d02 common : do not use GNU zero-length __VA_ARGS__ extension (#3195)	2 years ago
Georgi Gerganov	c6f1491da0 metal : fix bug in soft_max kernels (out-of-bounds access) (#3194)	2 years ago
Cebtenzzre	e3d87a6c36 convert : make ftype optional in simple scripts (#3185)	2 years ago
Georgi Gerganov	8c00b7a6ff sync : ggml (Metal F32 support + reduce ggml-alloc size) (#3192)	2 years ago
Engininja2	7e50d34be6 cmake : fix building shared libs for clang (rocm) on windows (#3176)	2 years ago
Evgeny Kurnevsky	235f7c193b flake : use pkg-config instead of pkgconfig (#3188)	2 years ago
Georgi Gerganov	a51b687657 metal : relax conditions on fast matrix multiplication kernel (#3168)	2 years ago
Andrei	76164fe2e6 cmake : fix llama.h location when built outside of root directory (#3179)	2 years ago
Ali Tariq	c2ab6fe661 ci : Cloud-V for RISC-V builds (#3160)	2 years ago
Roland	2d770505a8 llama : remove mtest (#3177)	2 years ago
Cebtenzzre	98311c4277 llama : make quantize example up to 2.7x faster (#3115)	2 years ago

