Commit History

Автор SHA1 Съобщение Дата
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) преди 6 месеца
  xctan f470bc36be ggml-cpu : split arch-specific implementations (#13892) преди 8 месеца
  R0CKSTAR 492d7f1ff7 musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNINGS=ON` in ci and update doc (#12611) преди 10 месеца
  Johannes Gäßler b9ab0a4d0b CUDA: use arch list for compatibility check (#11775) преди 11 месеца
  Andreas Kieslinger 750cb3e246 CUDA: rename macros to avoid conflicts with WinAPI (#10736) преди 1 година
  Djip007 19d8762ab6 ggml : refactor online repacking (#10446) преди 1 година
  Shupei Fan c202cef168 ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541) преди 1 година
  compilade 9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) преди 1 година
  R0CKSTAR e54c35e4fb feat: Support Moore Threads GPU (#8383) преди 1 година
  Dibakar Gope 0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780) преди 1 година
  Johannes Gäßler cb5fad4c6c CUDA: refactor and optimize IQ MMVQ (#8215) преди 1 година
  Georgi Gerganov f3f65429c4 llama : reorganize source code + improve CMake (#8006) преди 1 година