cturan/llama.cpp

Аутор	SHA1 Порука	Датум
Meng Zhang	4fe09dfe66 llama : add support for StarCoder model architectures (#3187)	пре 2 година
Cebtenzzre	80291a1d02 common : do not use GNU zero-length __VA_ARGS__ extension (#3195)	пре 2 година
Georgi Gerganov	c6f1491da0 metal : fix bug in soft_max kernels (out-of-bounds access) (#3194)	пре 2 година
Cebtenzzre	e3d87a6c36 convert : make ftype optional in simple scripts (#3185)	пре 2 година
Georgi Gerganov	8c00b7a6ff sync : ggml (Metal F32 support + reduce ggml-alloc size) (#3192)	пре 2 година
Engininja2	7e50d34be6 cmake : fix building shared libs for clang (rocm) on windows (#3176)	пре 2 година
Evgeny Kurnevsky	235f7c193b flake : use pkg-config instead of pkgconfig (#3188)	пре 2 година
Georgi Gerganov	a51b687657 metal : relax conditions on fast matrix multiplication kernel (#3168)	пре 2 година
Andrei	76164fe2e6 cmake : fix llama.h location when built outside of root directory (#3179)	пре 2 година
Ali Tariq	c2ab6fe661 ci : Cloud-V for RISC-V builds (#3160)	пре 2 година
Roland	2d770505a8 llama : remove mtest (#3177)	пре 2 година
Cebtenzzre	98311c4277 llama : make quantize example up to 2.7x faster (#3115)	пре 2 година
jneem	feea179e9f flake : allow $out/include to already exist (#3175)	пре 2 година
Andrei	769266a543 cmake : compile ggml-rocm with -fpic when building shared library (#3158)	пре 2 година
Asbjørn Olling	cf8238e7f4 flake : include llama.h in nix output (#3159)	пре 2 година
Cebtenzzre	4b8560e72a make : fix clang++ detection, move some definitions to CPPFLAGS (#3155)	пре 2 година
Alon	83a53b753a CI: add FreeBSD & simplify CUDA windows (#3053)	пре 2 година
akawrykow	5c872dbca2 falcon : use stated vocab size (#2914)	пре 2 година
bandoti	990a5e226a cmake : add relocatable Llama package (#2960)	пре 2 година
dylan	980ab41afb docker : add gpu image CI builds (#3103)	пре 2 година
Kerfuffle	e394084166 gguf-py : support identity operation in TensorNameMap (#3095)	пре 2 година
jameswu2014	4c8643dd6e feature : support Baichuan serial models (#3009)	пре 2 година
Leng Yue	35f73049af speculative : add heuristic algorithm (#3006)	пре 2 година
goerch	71ca2fad7d whisper : tokenizer fix + re-enable tokenizer test for LLaMa (#3096)	пре 2 година
Tristan Ross	1b6c650d16 cmake : add a compiler flag check for FP16 format (#3086)	пре 2 година
Johannes Gäßler	0a5eebb45d CUDA: mul_mat_q RDNA2 tunings (#2910)	пре 2 година
FK	84e723653c speculative: add --n-gpu-layers-draft option (#3063)	пре 2 година
Eric Sommerlade	b52b29ab9d arm64 support for windows (#3007)	пре 2 година
Johannes Gäßler	4f7cd6ba9c CUDA: fix LoRAs (#3130)	пре 2 година
Johannes Gäßler	89e89599fd CUDA: fix mul_mat_q not used for output tensor (#3127)	пре 2 година

Новије Старије

Историја ревизија Пронађи

Историја ревизија