Commit History

Author SHA1 Message Date
  slaren 883d206fbd ggml : fix some build issues 1 year ago
  Charles Xu 1607a5e5b0 backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921) 1 year ago
  Diego Devesa ae8de6d50a ggml : build backends as libraries (#10256) 1 year ago
  Georgi Gerganov ec450d3bbf metal : opt-in compile flag for BF16 (#10218) 1 year ago
  Xuan Son Nguyen a71d81cf8c server : revamp chat UI with vuejs and daisyui (#10175) 1 year ago
  Diego Devesa 9f40989351 ggml : move CPU backend to a separate file (#10144) 1 year ago
  Diego Devesa a6744e43e8 llama : add simple-chat example (#10124) 1 year ago
  Ma Mingfei 60ce97c9d8 add amx kernel for gemm (#8998) 1 year ago
  Diego Devesa c83ad6d01e ggml-backend : add device and backend reg interfaces (#9707) 1 year ago
  Georgi Gerganov 148844fe97 examples : remove benchmark (#9704) 1 year ago
  R0CKSTAR c35e586ea5 musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526) 1 year ago
  Georgi Gerganov 19514d632e cmake : do not hide GGML options + rename option (#9465) 1 year ago
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) 1 year ago
  Xuan Son Nguyen feff4aa846 server : add loading html page while model is loading (#9468) 1 year ago
  Ahmad Tameem 2b00fa7997 riscv : modify Makefile and add a RISCV_VECT to print log info (#9442) 1 year ago
  slaren fb3f249815 make : do not run llama-gen-docs when building (#9399) 1 year ago
  Xuan Son Nguyen bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388) 1 year ago
  Xuan Son Nguyen 1b9ae5189c common : refactor arg parser (#9308) 1 year ago
  Georgi Gerganov df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
  0cc4m 5fd89a70ea Vulkan Optimizations and Fixes (#8959) 1 year ago
  Georgi Gerganov 272e3bd95e make : fix llava obj file race (#8946) 1 year ago
  tc-mb 3071c0a5f2 llava : support MiniCPM-V-2.5 (#7599) 1 year ago
  Pablo Duboue ebd541a570 make : clean llamafile objects (#8923) 1 year ago
  slaren 15fa07a5c5 make : use C compiler to build metal embed object (#8899) 1 year ago
  Clint Herron ed9d2854c9 Build: Fix potential race condition (#8781) 1 year ago
  R0CKSTAR e54c35e4fb feat: Support Moore Threads GPU (#8383) 1 year ago
  slaren 2b1f616b20 ggml : reduce hash table reset cost (#8698) 1 year ago
  Xuan Son Nguyen be6d7c0791 examples : remove `finetune` and `train-text-from-scratch` (#8669) 1 year ago
  Xuan Son Nguyen de280085e7 examples : Fix `llama-export-lora` example (#8607) 1 year ago
  Georgi Gerganov 938943cdbf llama : move vocab, grammar and sampling into separate files (#8508) 1 year ago