cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
slaren	883d206fbd ggml : fix some build issues	hai 1 ano
Charles Xu	1607a5e5b0 backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921)	hai 1 ano
Diego Devesa	ae8de6d50a ggml : build backends as libraries (#10256)	hai 1 ano
Georgi Gerganov	ec450d3bbf metal : opt-in compile flag for BF16 (#10218)	hai 1 ano
Xuan Son Nguyen	a71d81cf8c server : revamp chat UI with vuejs and daisyui (#10175)	hai 1 ano
Diego Devesa	9f40989351 ggml : move CPU backend to a separate file (#10144)	hai 1 ano
Diego Devesa	a6744e43e8 llama : add simple-chat example (#10124)	hai 1 ano
Ma Mingfei	60ce97c9d8 add amx kernel for gemm (#8998)	hai 1 ano
Diego Devesa	c83ad6d01e ggml-backend : add device and backend reg interfaces (#9707)	hai 1 ano
Georgi Gerganov	148844fe97 examples : remove benchmark (#9704)	hai 1 ano
R0CKSTAR	c35e586ea5 musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (#9526)	hai 1 ano
Georgi Gerganov	19514d632e cmake : do not hide GGML options + rename option (#9465)	hai 1 ano
Georgi Gerganov	6262d13e0b common : reimplement logging (#9418)	hai 1 ano
Xuan Son Nguyen	feff4aa846 server : add loading html page while model is loading (#9468)	hai 1 ano
Ahmad Tameem	2b00fa7997 riscv : modify Makefile and add a RISCV_VECT to print log info (#9442)	hai 1 ano
slaren	fb3f249815 make : do not run llama-gen-docs when building (#9399)	hai 1 ano
Xuan Son Nguyen	bfe76d4a17 common : move arg parser code to `arg.cpp` (#9388)	hai 1 ano
Xuan Son Nguyen	1b9ae5189c common : refactor arg parser (#9308)	hai 1 ano
Georgi Gerganov	df270ef745 llama : refactor sampling v2 (#9294)	hai 1 ano
0cc4m	5fd89a70ea Vulkan Optimizations and Fixes (#8959)	hai 1 ano
Georgi Gerganov	272e3bd95e make : fix llava obj file race (#8946)	hai 1 ano
tc-mb	3071c0a5f2 llava : support MiniCPM-V-2.5 (#7599)	hai 1 ano
Pablo Duboue	ebd541a570 make : clean llamafile objects (#8923)	hai 1 ano
slaren	15fa07a5c5 make : use C compiler to build metal embed object (#8899)	hai 1 ano
Clint Herron	ed9d2854c9 Build: Fix potential race condition (#8781)	hai 1 ano
R0CKSTAR	e54c35e4fb feat: Support Moore Threads GPU (#8383)	hai 1 ano
slaren	2b1f616b20 ggml : reduce hash table reset cost (#8698)	hai 1 ano
Xuan Son Nguyen	be6d7c0791 examples : remove `finetune` and `train-text-from-scratch` (#8669)	hai 1 ano
Xuan Son Nguyen	de280085e7 examples : Fix `llama-export-lora` example (#8607)	hai 1 ano
Georgi Gerganov	938943cdbf llama : move vocab, grammar and sampling into separate files (#8508)	hai 1 ano

Posterior Anterior

Commit History Buscar

Commit History