cturan/llama.cpp

Author	SHA1 Message	Date
Xuan Son Nguyen	c421ac072d lora : warn user if new token is added in the adapter (#9948)	1 year ago
Molly Sophia	4ff7fe1fb3 llama : add chat template for RWKV-World + fix EOT (#9968)	1 year ago
leo-pony	6b8447352d [CANN] Adapt to dynamically loadable backends mechanism (#9970)	1 year ago
Daniel Bevenius	674804a996 arg : fix typo in embeddings argument help [no ci] (#9994)	1 year ago
Georgi Gerganov	e94a138d64 llama.vim : fix info text display [no ci] (#9787)	1 year ago
Georgi Gerganov	e01c67affe llama.vim : move info to the right of screen [no ci] (#9787)	1 year ago
Asghar Ghorbani	994cfb1acb readme : update UI list (#9972)	1 year ago
Daniel Bevenius	94008cc760 arg : fix attention non-causal arg value hint (#9985)	1 year ago
Georgi Gerganov	dbd5f2f573 llama.vim : plugin for Neovim (#9787)	1 year ago
Georgi Gerganov	f594bc80ba ggml : add asserts for type conversion in fattn kernels (#9971)	1 year ago
Radoslav Gerganov	d5ebd79c76 rpc : pack only RPC structs (#9959)	1 year ago
Georgi Gerganov	55e47786e3 llama : default sampling changes + greedy update (#9897)	1 year ago
Georgi Gerganov	bc21975084 speculative : fix handling of some input params (#9963)	1 year ago
Neo Zhang Jianyu	1db8c84fc6 fix mul_mat_vec_q and *_vec_q error (#9939)	1 year ago
Loïc Carrère	45f097645e readme : update bindings list (#9951)	1 year ago
icppWorld	7cab2083c7 readme : update infra list (#9942)	1 year ago
Xuan Son Nguyen	cda0e4b648 llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745)	1 year ago
Radoslav Gerganov	afd9909a64 rpc : backend refactoring (#9912)	1 year ago
Ouadie EL FAROUKI	87421a23e8 [SYCL] Add SYCL Backend registry, device and Event Interfaces (#9705)	1 year ago
Ma Mingfei	60ce97c9d8 add amx kernel for gemm (#8998)	1 year ago
Georgi Gerganov	8901755ba3 server : add n_indent parameter for line indentation requirement (#9929)	1 year ago
Daniel Bevenius	6f55bccbb8 llama : rename batch_all to batch (#8881)	1 year ago
Georgi Gerganov	17bb928080 readme : remove --memory-f32 references (#9925)	1 year ago
Georgi Gerganov	9f45fc1e99 llama : change warning to debug log	1 year ago
Georgi Gerganov	99bd4ac28c llama : infill sampling handle very long tokens (#9924)	1 year ago
Tim Wang	3752217ed5 readme : update bindings list (#9918)	1 year ago
Diego Devesa	f010b77a37 vulkan : add backend registry / device interfaces (#9721)	1 year ago
Gilad S.	2194200278 fix: allocating CPU buffer with size `0` (#9917)	1 year ago
Gilad S.	73afe681aa fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875)	1 year ago
Daniel Bevenius	9e04102448 llama : suppress conversion from 'size_t' to 'int' (#9046)	1 year ago

Newer Older

Commit History Find

Commit History