cturan/llama.cpp

Автор	SHA1 Сообщение	Дата
WillCorticesAI	d2809a3ba2 make : fix Apple clang determination bug (#4272)	2 лет назад
Jared Van Bortel	15f5d96037 build : fix build info generation and cleanup Makefile (#3920)	2 лет назад
Georgi Gerganov	922754a8d6 lookahead : add example for lookahead decoding (#4207)	2 лет назад
Kerfuffle	28a2e6e7d4 tokenize example: Respect normal add BOS token behavior (#4126)	2 лет назад
Roger Meier	8e9361089d build : support ppc64le build for make and CMake (#3963)	2 лет назад
Michael Potter	6bb4908a17 Fix MacOS Sonoma model quantization (#4052)	2 лет назад
Georgi Gerganov	413503d4b9 make : do not add linker flags when compiling static llava lib (#3977)	2 лет назад
Damian Stewart	381efbf480 llava : expose as a shared library for downstream projects (#3613)	2 лет назад
cebtenzzre	b12fa0d1c1 build : link against build info instead of compiling against it (#3879)	2 лет назад
cebtenzzre	2046eb4345 make : remove unnecessary dependency on build-info.h (#3842)	2 лет назад
Georgi Gerganov	d69d777c02 ggml : quantization refactoring (#3833)	2 лет назад
Georgi Gerganov	2f9ec7e271 cuda : improve text-generation and batched decoding performance (#3776)	2 лет назад
Georgi Gerganov	e3932593d4 Revert "make : add optional CUDA_NATIVE_ARCH (#2482)"	2 лет назад
Alex	96981f37b1 make : add optional CUDA_NATIVE_ARCH (#2482)	2 лет назад
Georgi Gerganov	438c2ca830 server : parallel decoding and multimodal (#3677)	2 лет назад
Georgi Gerganov	d1031cf49c sampling : refactor init to use llama_sampling_params (#3696)	2 лет назад
Georgi Gerganov	0e89203b51 speculative : add tree-based sampling example (#3624)	2 лет назад
M. Yusuf Sarıgöz	370359e5ba examples: support LLaVA v1.5 (multimodal model) (#3436)	2 лет назад
Kerfuffle	70c29da118 common : fix mirostat state when using multiple sequences (#3543)	2 лет назад
Georgi Gerganov	8c70a5ff25 batched : add bench tool (#3545)	2 лет назад
Zane Shannon	24ba3d829e examples : add batched.swift + improve CI for swift (#3562)	2 лет назад
Georgi Gerganov	db3abcc114 sync : ggml (ggml-backend) (#3548)	2 лет назад
goerch	ff5a3f0c09 Work on the BPE tokenizer (#3252)	2 лет назад
vvhg1	c97f01c362 infill : add new example + extend server API (#3296)	2 лет назад
Cebtenzzre	bc39553c90 build : enable more non-default compiler warnings (#3200)	2 лет назад
xaedes	0e76a8992c train : finetune LORA (#2632)	2 лет назад
Georgi Gerganov	ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228)	2 лет назад
Jag Chadha	527e57cfd8 build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342)	2 лет назад
Cebtenzzre	8781013ef6 make : restore build-info.h dependency for several targets (#3205)	2 лет назад
Johannes Gäßler	111163e246 CUDA: enable peer access between devices (#2470)	2 лет назад

Новее Раньше

История коммитов Найти

История коммитов