cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Aldehir Rojas	7057faf64b json : support `enum` values within `allOf` (#15830)	há 4 meses atrás
j-k	fe1c92cd7b media : add llama1 icon (#15878)	há 4 meses atrás
Jeff Bolz	e68aa10d8f vulkan: sort graph to allow more parallel execution (#15850)	há 4 meses atrás
Aman Gupta	0a16bf52e6 CUDA: generate_cu_files.py - add missing mxfp4 (#15880)	há 4 meses atrás
Jesse	88021565f0 chat : Deepseek V3.1 reasoning and tool calling support (OpenAI Style) (#15533)	há 4 meses atrás
Xuan-Son Nguyen	56920f5665 server : bring back timings_per_token (#15879)	há 4 meses atrás
Georgi Gerganov	b0d52998b9 cuda : fix supports_op condition for get_rows when number of blocks is too large (#15868)	há 4 meses atrás
Georgi Gerganov	f28d4f4ac9 metal : refactor + optimize (#15857)	há 4 meses atrás
Xuan-Son Nguyen	9fcb29f22f ggml: allow casting between f32 and i32 (#15783)	há 4 meses atrás
Sigbjørn Skjæret	5ef22d281d CUDA: non-contiguous src0 not supported for PAD (#15869)	há 4 meses atrás
Daniel Bevenius	233d773d02 convert : force setting sliding_window from original config (#15867)	há 4 meses atrás
Georgi Gerganov	a885dcff11 batched-bench : fix llama_synchronize usage during prompt processing (#15835)	há 4 meses atrás
Georgi Gerganov	663027fd54 context : fix n_outputs during reserve (#15858)	há 4 meses atrás
Georgi Gerganov	cf0e3ba150 model : avoid ggml_cont_3d for fused QKV weights (#15662)	há 4 meses atrás
Jeff Bolz	d413dca003 tests: large sizes for get_rows (#15687)	há 4 meses atrás
Chenguang Li	85ca66a746 CANN: Stream sync between devices for acl_graph (#15809)	há 4 meses atrás
Jeff Bolz	3976dfbe00 vulkan: support im2col_3d (#15795)	há 4 meses atrás
Aaron Teo	d36e61c580 ggml-cpu: clean up s390x SIMD (#15855)	há 4 meses atrás
Jeff Bolz	c97b5e5854 vulkan: Support pad_ext (#15794)	há 4 meses atrás
Jeff Bolz	267e99867f vulkan: Use larger loads in scalar/coopmat1 matmul (#15729)	há 4 meses atrás
Daniel Bevenius	3b15924d71 ggml WebGPU: remove userdata from request adapter callback (#15527)	há 4 meses atrás
Johannes Gäßler	79bc429262 CUDA: faster tile FA (Pascal/AMD), headsize 256 (#15769)	há 4 meses atrás
Charles Xu	c4df49a42d kleidiai: generalize compute_forward_kv_cache to compute_forward_fp16 (#15817)	há 4 meses atrás
Xuan-Son Nguyen	3c3635d2f2 server : speed up tests (#15836)	há 4 meses atrás
Xuan-Son Nguyen	61bdfd5298 server : implement prompt processing progress report in stream mode (#15827)	há 4 meses atrás
Johannes Gäßler	01806e7771 ggml-cpu: document use of "free" memory [no ci] (#15834)	há 4 meses atrás
Aaron Teo	186415d595 ggml-cpu: drop support for nnpa intrinsics (#15821)	há 4 meses atrás
Gabe Goodhart	fd621880f3 aLoRA Support (#15327)	há 4 meses atrás
Sigbjørn Skjæret	4281c7b315 ci : exempt correct research label (#15825)	há 4 meses atrás
Gabe Goodhart	5fac79cbc7 Thinking model disabled assistant prefill (#15404)	há 4 meses atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits