cturan/llama.cpp

Author	SHA1 Message	Date
a-n-n-a-l-e-e	eec22a1c63 cmake : check for openblas64 (#4134)	2 years ago
Georgi Gerganov	91d38876df metal : switch back to default.metallib (ggml/681)	2 years ago
Georgi Gerganov	58ba655af0 metal : enable shader debugging (cmake option) (#4705)	2 years ago
slaren	5bf3953d7e cuda : improve cuda pool efficiency using virtual memory (#4606)	2 years ago
Erik Garrison	0f630fbc92 cuda : ROCm AMD Unified Memory Architecture (UMA) handling (#4449)	2 years ago
Bach Le	5daa5f54fd Link to cublas dynamically on Windows even with LLAMA_STATIC (#4506)	2 years ago
Jared Van Bortel	70f806b821 build : detect host compiler and cuda compiler separately (#4414)	2 years ago
Jared Van Bortel	6138963fb2 build : target Windows 8 for standard mingw-w64 (#4405)	2 years ago
Georgi Gerganov	fe680e3d10 sync : ggml (new ops, tests, backend, etc.) (#4359)	2 years ago
Jared Van Bortel	511f52c334 build : enable libstdc++ assertions for debug builds (#4275)	2 years ago
Li Tan	f7f9e06212 cmake : fix the metal file foder path (#4217)	2 years ago
bandoti	b38a16dfcf cmake : fix issue with version info not getting baked into LlamaConfig.cmake (#3970)	2 years ago
Roger Meier	8e9361089d build : support ppc64le build for make and CMake (#3963)	2 years ago
Michael Potter	6bb4908a17 Fix MacOS Sonoma model quantization (#4052)	2 years ago
Eve	c41ea36eaa cmake : MSVC instruction detection (fixed up #809) (#3923)	2 years ago
slaren	21958bb393 cmake : disable LLAMA_NATIVE by default (#3906)	2 years ago
cebtenzzre	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	2 years ago
Georgi Gerganov	d69d777c02 ggml : quantization refactoring (#3833)	2 years ago
Georgi Gerganov	2f9ec7e271 cuda : improve text-generation and batched decoding performance (#3776)	2 years ago
Georgi Gerganov	2b4ea35e56 cuda : add batched cuBLAS GEMM for faster attention (#3749)	2 years ago
Georgi Gerganov	d28e572c02 cmake : fix add_compile_options on macOS	2 years ago
Georgi Gerganov	db3abcc114 sync : ggml (ggml-backend) (#3548)	2 years ago
Eve	017efe899d cmake : make LLAMA_NATIVE flag actually use the instructions supported by the processor (#3273)	2 years ago
cebtenzzre	e78f0b0d05 cmake : increase minimum version for add_link_options (#3444)	2 years ago
cebtenzzre	9476b01226 cmake : make CUDA flags more similar to the Makefile (#3420)	2 years ago
bandoti	095231dfd3 cmake : fix transient definitions in find pkg (#3411)	2 years ago
Cebtenzzre	bc39553c90 build : enable more non-default compiler warnings (#3200)	2 years ago
Jag Chadha	527e57cfd8 build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342)	2 years ago
DAN™	99115f3fa6 cmake : fix build-info.h on MSVC (#3309)	2 years ago
Johannes Gäßler	111163e246 CUDA: enable peer access between devices (#2470)	2 years ago

Newer Older

Commit History Find

Commit History