Andreas Kieslinger 39509fb082 cuda : CUDA Graph Compute Function Refactor (precursor for performance improvements) (#11042) 1 year ago
..
include ee7136c6d1 llama: add support for QRWKV6 model architecture (#11001) 1 year ago
src 39509fb082 cuda : CUDA Graph Compute Function Refactor (precursor for performance improvements) (#11042) 1 year ago
.gitignore 17eb6aa8a9 vulkan : cmake integration (#8119) 1 year ago
CMakeLists.txt 53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030) 1 year ago