cturan/llama.cpp

작성자	SHA1 메시지	날짜
Jeff Bolz	9810cb8247 ops.md: update vulkan support (#17661)	1 개월 전
Xuan-Son Nguyen	ecf74a8417 mtmd: add mtmd_context_params::warmup option (#17652)	1 개월 전
Gilad S.	00c361fe53 fix: llama arch implementation (#17665)	1 개월 전
Xuan-Son Nguyen	ec18edfcba server: introduce API for serving / loading / unloading multiple models (#17470)	1 개월 전
Xuan-Son Nguyen	7733409734 common: improve verbosity level definitions (#17630)	1 개월 전
Xuan-Son Nguyen	cd3c118908 model: support Ministral3 (#17644)	1 개월 전
Georgi Gerganov	649495c9d9 metal : add FA head size 48 (#17619)	1 개월 전
Georgi Gerganov	90c72a614a ggml : extend the GGML_SCHED_NO_REALLOC debug logic of the scheduler (#17617)	1 개월 전
Aman Gupta	6eea666912 llama-graph: avoid expand_forward for fusion (#17633)	1 개월 전
Xuan-Son Nguyen	ff90508d68 contributing: update guidelines for AI-generated code (#17625)	1 개월 전
Adrien Gallouët	0a4aeb927d cmake : add option to build and link LibreSSL (#17552)	1 개월 전
Tarek Dakhran	2ba719519d model: LFM2-VL fixes (#17577)	1 개월 전
Xuan-Son Nguyen	7f8ef50cce clip: fix nb calculation for qwen3-vl (#17594)	1 개월 전
Xuan-Son Nguyen	3c136b21a3 cli: add migration warning (#17620)	1 개월 전
Adrien Gallouët	beb1f0c503 common : throttle download progress output to reduce IO flush (#17427)	1 개월 전
Aaron Teo	def5404f26 common: add LLAMA_LOG_FILE env var (#17609)	1 개월 전
Gilad S.	fa0465954f ggml: fix: macOS build with `-DGGML_BACKEND_DL=ON` (#17581)	1 개월 전
ddh0	5a6241feb0 common: update env var name (#17588)	1 개월 전
Aman Gupta	c7af376c29 CUDA: add stream-based concurrency (#16991)	1 개월 전
Mahekk Shaikh	00425e2ed1 cuda : add error checking for cudaMemcpyAsync in argsort (#17599)	1 개월 전
Acly	385c3da5e6 vulkan : fix FA mask load with bounds check (coopmat2) (#17606)	1 개월 전
Xuan-Son Nguyen	ab49f094d2 server: move server-context to its own cpp\|h (#17595)	1 개월 전
Haiyue Wang	8c32d9d96d server: explicitly set the function name in lambda (#17538)	1 개월 전
Igor Smirnov	0874693b44 common : fix json schema with '\' in literals (#17307)	1 개월 전
Neo Zhang	7d2add51d8 sycl : support to malloc memory on device more than 4GB, update the doc and script (#17566)	1 개월 전
ixgbe	f698a79c63 ggml: replace hwcap with riscv_hwprobe for RVV detection (#17567)	1 개월 전
Ruben Ortlam	47a268ea50 Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (#16900)	1 개월 전
Jeff Bolz	59d8d4e963 vulkan: improve topk perf for large k, fix overflow in unit tests (#17582)	1 개월 전
Aleksei Nikiforov	d82b7a7c1d gguf-py : fix passing non-native endian tensors (editor-gui and new-metadata) (#17553)	1 개월 전
DAN™	03914c7ef8 common : move all common_chat_parse_* to chat-parser.cpp. (#17481)	1 개월 전

최신 이전

커밋 기록 찾기

커밋 기록