cturan/llama.cpp

Author	SHA1 Message	Date
Gabe Goodhart	5fac79cbc7 Thinking model disabled assistant prefill (#15404)	4 months ago
Eric Curtin	408ff524b4 Implement --log-colors with always/never/auto (#15792)	4 months ago
Johannes Gäßler	5143fa895e CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (#15802)	4 months ago
Daniel Bevenius	3a550b5ca4 tests : add --list-ops and --show-coverage options (#15745)	4 months ago
Erik Scholz	a81283820a gguf: gguf_writer refactor (#15691)	4 months ago
Georgi Gerganov	c610b6c11b kv-cache : fix SWA checks + disable cacheless iSWA (#15811)	4 months ago
Daniel Bevenius	5d6688de08 model-conversion : add --embeddings flag to modelcard.template [no ci] (#15801)	4 months ago
ExtReMLapin	4fd1242bef chat : fixed crash when Hermes 2 <tool_call> had a newline before it (#15639)	4 months ago
Piotr Wilkin (ilintar)	b2426e469e chat : nemotron thinking & toolcalling support (#15676)	4 months ago
Piotr Wilkin (ilintar)	9e2b1e83c6 scripts : add Jinja tester PySide6 simple app (#15756)	4 months ago
Daniel Bevenius	fb15d649ed llama : add support for EmbeddingGemma 300m (#15798)	4 months ago
Gabe Goodhart	856ed0947f metal : Add template specialization for mul_mm_id w/ ne20 == 10 (#15799)	4 months ago
Daniel Bevenius	d1e2adba65 llama : set n_outputs to 1 to avoid 0 outputs mean-pooling (#15791)	4 months ago
Chenguang Li	c1c354e44c CANN: Refactor ND to NZ workspace to be per-device (#15763)	4 months ago
Xuan-Son Nguyen	a68d914426 server: add exceed_context_size_error type (#15780)	4 months ago
Eric Curtin	badb80cadb Document the new max GPU layers default in help (#15771)	4 months ago
leejet	0a1b3982cd ggml: add ops for WAN video model (cuda && cpu) (#15669)	4 months ago
hipudding	5421f63ab0 CANN: Fix precision issue on 310I DUO multi-devices (#15784)	4 months ago
rmatif	820bc98531 opencl: add hs=40 to FA (#15758)	4 months ago
Chenguang Li	239b60e898 CANN: fix acl_rstd allocation size in ggml_cann_rms_norm (#15760)	4 months ago
Ruben Ortlam	dff7551bfd vulkan: fix mmv subgroup16 selection (#15775)	4 months ago
Jeff Bolz	0fce7a1248 vulkan: don't use std::string in load_shaders, to improve compile time (#15724)	4 months ago
Daniel Bevenius	8227695d7a vulkan : update ggml_vk_instance_validation_ext_available (#15666)	4 months ago
Shin-myoung-serp	0014fb4add ggml vulkan: add hardsigmoid and hardswish operations (#15762)	4 months ago
Oliver Simons	661ae31c9c CUDA: Optimize `rms_norm_f32` kernel and its fused variants, giving 1-6% perf E2E (#15715)	4 months ago
Daniel Bevenius	407c23786d model-conversion : fix pyright errors (#15770)	4 months ago
Georgi Gerganov	cdedb70a99 sampling : optimize dist sampler (#15704)	4 months ago
Daniel Bevenius	2c8dac72eb llama : fix incorrect model type for Gemma 270M (#15764)	4 months ago
Daniel Bevenius	40a751ea9a model-conversion : remove hardcoded /bin/bash shebangs [no ci] (#15765)	4 months ago
hipudding	5eae934883 CANN: Add RoPE contiguous check for 310I DUP device (#15735)	4 months ago
