Commit History

Author SHA1 Message Date
  0cc4m 5fd89a70ea Vulkan Optimizations and Fixes (#8959) 1 year ago
  Georgi Gerganov 272e3bd95e make : fix llava obj file race (#8946) 1 year ago
  tc-mb 3071c0a5f2 llava : support MiniCPM-V-2.5 (#7599) 1 year ago
  Pablo Duboue ebd541a570 make : clean llamafile objects (#8923) 1 year ago
  slaren 15fa07a5c5 make : use C compiler to build metal embed object (#8899) 1 year ago
  Clint Herron ed9d2854c9 Build: Fix potential race condition (#8781) 1 year ago
  R0CKSTAR e54c35e4fb feat: Support Moore Threads GPU (#8383) 1 year ago
  slaren 2b1f616b20 ggml : reduce hash table reset cost (#8698) 1 year ago
  Xuan Son Nguyen be6d7c0791 examples : remove `finetune` and `train-text-from-scratch` (#8669) 1 year ago
  Xuan Son Nguyen de280085e7 examples : Fix `llama-export-lora` example (#8607) 1 year ago
  Georgi Gerganov 938943cdbf llama : move vocab, grammar and sampling into separate files (#8508) 1 year ago
  Johannes Gäßler 5e116e8dd5 make/cmake: add missing force MMQ/cuBLAS for HIP (#8515) 1 year ago
  bandoti 17eb6aa8a9 vulkan : cmake integration (#8119) 1 year ago
  Nicholai Tukanov 368645698a ggml : add NVPL BLAS support (#8329) (#8425) 1 year ago
  Clint Herron dd07a123b7 Name Migration: Build the deprecation-warning 'main' binary every time (#8404) 1 year ago
  Georgi Gerganov 6b2a849d1f ggml : move sgemm sources to llamafile subfolder (#8394) 1 year ago
  Dibakar Gope 0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780) 1 year ago
  Clint Herron e500d6135a Deprecation warning to assist with migration to new binary names (#8283) 1 year ago
  Johannes Gäßler a03e8dd99d make/cmake: LLAMA_NO_CCACHE -> GGML_NO_CCACHE (#8392) 1 year ago
  Brian f7cab35ef9 gguf-hash: model wide and per tensor hashing using xxhash and sha1 (#8048) 1 year ago
  Clint Herron 3e2618bc7b Adding step to `clean` target to remove legacy binary names to reduce upgrade / migration confusion arising from #7809. (#8257) 1 year ago
  Xuan Son Nguyen a27aa50ab7 Add missing items in makefile (#8177) 1 year ago
  slaren c7ab7b612c make : fix missing -O3 (#8143) 1 year ago
  Georgi Gerganov f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
  Johannes Gäßler a818f3028d CUDA: use MMQ instead of cuBLAS by default (#8075) 1 year ago
  slaren 95f57bb5d5 ggml : remove ggml_task_type and GGML_PERF (#8017) 1 year ago
  Clint Herron c5a8d4b749 JSON Schema to GBNF integration tests (#7790) 1 year ago
  Ulrich Drepper 61665277af Allow compiling with CUDA without CUDA runtime installed (#7989) 1 year ago
  0cc4m 7c7836d9d4 Vulkan Shader Refactor, Memory Debugging Option (#7947) 1 year ago
  Xuan Son Nguyen 0c7b3595b9 Add `cvector-generator` example (#7514) 1 year ago