cturan/llama.cpp

Author	SHA1 Message	Date
Brian	f7cab35ef9 gguf-hash: model wide and per tensor hashing using xxhash and sha1 (#8048)	1 year ago
Georgi Gerganov	f3f65429c4 llama : reorganize source code + improve CMake (#8006)	1 year ago
Xuan Son Nguyen	0c7b3595b9 Add `cvector-generator` example (#7514)	1 year ago
Olivier Chafik	1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809)	1 year ago
Georgi Gerganov	0cd6bd3483 llama : remove beam search (#7736)	1 year ago
Radoslav Gerganov	5e31828d3e ggml : add RPC backend (#6829)	1 year ago
Pierrick Hymbert	b804b1ef77 eval-callback: Example how to use eval callback for debugging (#6576)	1 year ago
Minsoo Cheong	64e7b47c69 examples : add "retrieval" (#6193)	1 year ago
Pierrick Hymbert	d0d5de42e5 gguf-split: split and merge gguf per batch of tensors (#6135)	1 year ago
DAN™	bcebd7dbf6 llama : add support for GritLM (#5959)	1 year ago
John	6c00a06692 gguf : add python reader example (#5216)	1 year ago
Abhilash Majumder	0f648573dd ggml : add unified SYCL backend for Intel GPUs (#2690)	2 years ago
Georgi Gerganov	4be5ef556d metal : remove old API (#4919)	2 years ago
Kawrakow	326b418b59 Importance Matrix calculation (#4861)	2 years ago
Georgi Gerganov	b0034d93ce examples : add passkey test (#3856)	2 years ago
LeonEricsson	7082d24cec lookup : add prompt lookup decoding example (#4484)	2 years ago
Georgi Gerganov	922754a8d6 lookahead : add example for lookahead decoding (#4207)	2 years ago
zakkor	2fa02b4b3d examples : add tokenize (#4039)	2 years ago
Georgi Gerganov	d1031cf49c sampling : refactor init to use llama_sampling_params (#3696)	2 years ago
M. Yusuf Sarıgöz	370359e5ba examples: support LLaVA v1.5 (multimodal model) (#3436)	2 years ago
Georgi Gerganov	8c70a5ff25 batched : add bench tool (#3545)	2 years ago
xaedes	0e76a8992c train : finetune LORA (#2632)	2 years ago
Georgi Gerganov	ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228)	2 years ago
Georgi Gerganov	47068e5170 speculative : PoC for speeding-up inference via speculative sampling (#2926)	2 years ago
Georgi Gerganov	c90d135eb4 examples : fix underscore in beam-search + .gitignore (close #2900)	2 years ago
Matt Pulver	c82742ac9c llama : add llama_beam_search() (#2267)	2 years ago
Georgi Gerganov	6381d4e110 gguf : new file format with flexible meta data (beta) (#2398)	2 years ago
slaren	097e121e2f llama : add benchmark example (#2626)	2 years ago
byte-6174	b19edd54d5 Adding support for llama2.c models (#2559)	2 years ago
DannyDaemonic	3498588e0f Add --simple-io option for subprocesses and break out console.h and cpp (#1558)	2 years ago
