cturan/llama.cpp

同期ミラー https://github.com/cturan/llama.cpp

作者	SHA1 メッセージ	日付
Henri Vasserman	6bbc598a63 ROCm Port (#1087)	2 年前
Georgi Gerganov	cf658adc83 llm : add Falcon support (#2717)	2 年前
Kawrakow	62959e740e Strided perplexity (#2714)	2 年前
Johannes Gäßler	c63bb1d16a CUDA: use mul_mat_q kernels by default (#2683)	2 年前
slaren	1123f7fbdf ggml-cuda : use graph allocator (#2684)	2 年前
Georgi Gerganov	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	2 年前