Commit History

Author SHA1 Message Date
  a-n-n-a-l-e-e eec22a1c63 cmake : check for openblas64 (#4134) 2 years ago
  Georgi Gerganov 91d38876df metal : switch back to default.metallib (ggml/681) 2 years ago
  Georgi Gerganov 58ba655af0 metal : enable shader debugging (cmake option) (#4705) 2 years ago
  slaren 5bf3953d7e cuda : improve cuda pool efficiency using virtual memory (#4606) 2 years ago
  Erik Garrison 0f630fbc92 cuda : ROCm AMD Unified Memory Architecture (UMA) handling (#4449) 2 years ago
  Bach Le 5daa5f54fd Link to cublas dynamically on Windows even with LLAMA_STATIC (#4506) 2 years ago
  Jared Van Bortel 70f806b821 build : detect host compiler and cuda compiler separately (#4414) 2 years ago
  Jared Van Bortel 6138963fb2 build : target Windows 8 for standard mingw-w64 (#4405) 2 years ago
  Georgi Gerganov fe680e3d10 sync : ggml (new ops, tests, backend, etc.) (#4359) 2 years ago
  Jared Van Bortel 511f52c334 build : enable libstdc++ assertions for debug builds (#4275) 2 years ago
  Li Tan f7f9e06212 cmake : fix the metal file foder path (#4217) 2 years ago
  bandoti b38a16dfcf cmake : fix issue with version info not getting baked into LlamaConfig.cmake (#3970) 2 years ago
  Roger Meier 8e9361089d build : support ppc64le build for make and CMake (#3963) 2 years ago
  Michael Potter 6bb4908a17 Fix MacOS Sonoma model quantization (#4052) 2 years ago
  Eve c41ea36eaa cmake : MSVC instruction detection (fixed up #809) (#3923) 2 years ago
  slaren 21958bb393 cmake : disable LLAMA_NATIVE by default (#3906) 2 years ago
  cebtenzzre b12fa0d1c1 build : link against build info instead of compiling against it (#3879) 2 years ago
  Georgi Gerganov d69d777c02 ggml : quantization refactoring (#3833) 2 years ago
  Georgi Gerganov 2f9ec7e271 cuda : improve text-generation and batched decoding performance (#3776) 2 years ago
  Georgi Gerganov 2b4ea35e56 cuda : add batched cuBLAS GEMM for faster attention (#3749) 2 years ago
  Georgi Gerganov d28e572c02 cmake : fix add_compile_options on macOS 2 years ago
  Georgi Gerganov db3abcc114 sync : ggml (ggml-backend) (#3548) 2 years ago
  Eve 017efe899d cmake : make LLAMA_NATIVE flag actually use the instructions supported by the processor (#3273) 2 years ago
  cebtenzzre e78f0b0d05 cmake : increase minimum version for add_link_options (#3444) 2 years ago
  cebtenzzre 9476b01226 cmake : make CUDA flags more similar to the Makefile (#3420) 2 years ago
  bandoti 095231dfd3 cmake : fix transient definitions in find pkg (#3411) 2 years ago
  Cebtenzzre bc39553c90 build : enable more non-default compiler warnings (#3200) 2 years ago
  Jag Chadha 527e57cfd8 build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342) 2 years ago
  DAN™ 99115f3fa6 cmake : fix build-info.h on MSVC (#3309) 2 years ago
  Johannes Gäßler 111163e246 CUDA: enable peer access between devices (#2470) 2 years ago