cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Diego Devesa	ae8de6d50a ggml : build backends as libraries (#10256)	há 1 ano atrás
Georgi Gerganov	841f27abdb metal : optimize FA kernels (#10171)	há 1 ano atrás
Diego Devesa	c5b0f4b5d9 llama : refactor model loader with backend registry (#10026)	há 1 ano atrás
Xuan Son Nguyen	cda0e4b648 llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)	há 1 ano atrás
Ouadie EL FAROUKI	87421a23e8 [SYCL] Add SYCL Backend registry, device and Event Interfaces (#9705)	há 1 ano atrás
Diego Devesa	0e9f760eb1 rpc : add backend registry / device interfaces (#9812)	há 1 ano atrás
Michael Podvitskiy	7be099fa81 llama-bench: correct argument parsing error message (#9524)	há 1 ano atrás
Georgi Gerganov	0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355)	há 1 ano atrás
Georgi Gerganov	df270ef745 llama : refactor sampling v2 (#9294)	há 1 ano atrás
Aarni Koskela	134bc38ecf llama-bench : log benchmark progress (#9287)	há 1 ano atrás
slaren	bdf314f38a llama-bench : fix NUL terminators in CPU name (#9313)	há 1 ano atrás
Radoslav Gerganov	82e3b03c11 rpc : make RPC servers come first in the device list (#9296)	há 1 ano atrás
Aarni Koskela	8962422b1c llama-bench : add JSONL (NDJSON) output mode (#9288)	há 1 ano atrás
Faisal Zaghloul	42c76d1358 Threadpool: take 2 (#8672)	há 1 ano atrás
Zhenwei Jin	506122d854 llama-bench : add support for getting cpu info on Windows (#8824)	há 1 ano atrás
slaren	2b1f616b20 ggml : reduce hash table reset cost (#8698)	há 1 ano atrás
hipudding	1bdd8ae19f [CANN] Add Ascend NPU backend (#6035)	há 1 ano atrás
Radoslav Gerganov	e65bbf606c llama-bench : fix RPC indication (#7936)	há 1 ano atrás
slaren	f578b86b21 move BLAS to a separate backend (#6210)	há 1 ano atrás
Johannes Gäßler	148995e5e5 llama-bench: more compact markdown tables (#7879)	há 1 ano atrás
Georgi Gerganov	1442677f92 common : refactor cli arg parsing (#7675)	há 1 ano atrás
Georgi Gerganov	554c247caf ggml : remove OpenCL (#7735)	há 1 ano atrás
slaren	adc9ff3841 llama-bench : allow using a different printer for stderr with -oe (#7722)	há 1 ano atrás
Radoslav Gerganov	210d99173d llama-bench : add support for the RPC backend (#7435)	há 1 ano atrás
Georgi Gerganov	6ff13987ad common : normalize naming style (#7462)	há 1 ano atrás
slaren	b18532a4ef phi3 : duplicate rope factors in each layer (#7447)	há 1 ano atrás
slaren	e849648888 llama-bench : add pp+tg test type (#7199)	há 1 ano atrás
kunnis	628b299106 Adding support for the --numa argument for llama-bench. (#7080)	há 1 ano atrás
Georgi Gerganov	9c67c2773d ggml : add Flash Attention (#5021)	há 1 ano atrás
Justine Tunney	8cc91dc63c ggml : add llamafile sgemm (#6414)	há 1 ano atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits