cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Johannes Gäßler	d54a4027a6 CUDA: lower GPU latency + fix Windows performance (#3110)	há 2 anos atrás
Jhen-Jie Hong	1b0d09259e cmake : support build for iOS/tvOS (#3116)	há 2 anos atrás
Johannes Gäßler	8a4ca9af56 CUDA: add device number to error messages (#3112)	há 2 anos atrás
Kawrakow	f31b6f4e2d metal : PP speedup (#3084)	há 2 anos atrás
Erik Scholz	6eeb4d9083 convert: remove most of the n_mult usage in convert.py (#3098)	há 2 anos atrás
kchro3	21ac3a1503 metal : support for Swift (#3078)	há 2 anos atrás
Jhen-Jie Hong	4fd5477955 metal : support build for iOS/tvOS (#3089)	há 2 anos atrás
takov751	ec2a24fedf flake : add train-text-from-scratch to flake.nix (#3042)	há 2 anos atrás
Ikko Eltociear Ashimine	7d99aca759 readme : fix typo (#3043)	há 2 anos atrás
Kawrakow	ba7ffbb251 metal : Q3_K speedup (#2995)	há 2 anos atrás
Cebtenzzre	e64f5b5578 examples : make n_ctx warning work again (#3066)	há 2 anos atrás
Georgi Gerganov	94f10b91ed readme : update hot tpoics	há 2 anos atrás
Georgi Gerganov	b3e9852e47 sync : ggml (CUDA GLM RoPE + POSIX) (#3082)	há 2 anos atrás
Przemysław Pawełczyk	cb6c44c5e0 build : do not use _GNU_SOURCE gratuitously (#2035)	há 2 anos atrás
hongbo.mo	a21baeb122 docker : add git to full-cuda.Dockerfile main-cuda.Dockerfile (#3044)	há 2 anos atrás
Yui	6ff712a6d1 Update deprecated GGML TheBloke links to GGUF (#3079)	há 2 anos atrás
slaren	ebc96086af ggml-alloc : correctly check mmap return value for errors (#3075)	há 2 anos atrás
Kunshang Ji	7f412dab9c enable CPU HBM (#2603)	há 2 anos atrás
Cebtenzzre	6336d834ec convert : fix F32 ftype not being saved (#3048)	há 2 anos atrás
Cebtenzzre	00d62adb79 fix some warnings from gcc and clang-tidy (#3038)	há 2 anos atrás
Cebtenzzre	4fa2cc1750 make : improve test target (#3031)	há 2 anos atrás
Cebtenzzre	5ffab089a5 make : fix CPPFLAGS (#3035)	há 2 anos atrás
slaren	15b67a66c2 llama-bench : use two tokens in the warmup run for prompt evals (#3059)	há 2 anos atrás
Kawrakow	be8c9c245b metal : parallel RoPE on Metal (#3024)	há 2 anos atrás
Kawrakow	be6beeb8d7 metal : correct fix of kernel_norm (#3060)	há 2 anos atrás
Georgi Gerganov	c4f496648c metal : fix kernel_norm (fixes Falcon on Metal) (#3057)	há 2 anos atrás
Przemysław Pawełczyk	fec2fb19e4 ggml : posixify madvise and pagesize (#3037)	há 2 anos atrás
Georgi Gerganov	178b1850eb k-quants : fix zero-weight guard in Q6_K (ref #3040)	há 2 anos atrás
Kerfuffle	ea2c85d5d2 convert-llama-ggml-to-gguf: Try to handle files older than GGJTv3 (#3023)	há 2 anos atrás
Cebtenzzre	9912b9efc8 build : add LLAMA_METAL_NDEBUG flag (#3033)	há 2 anos atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits