cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Georgi Gerganov	66d8eccd42 server : do context shift only while generating (#17000)	há 2 meses atrás
Georgi Gerganov	afd353246d readme : update hot topics (#17002)	há 2 meses atrás
Acly	cc98f8d349 ggml-cpu : bicubic interpolation (#16891)	há 2 meses atrás
Sigbjørn Skjæret	d945834366 ci : apply model label to models (#16994)	há 2 meses atrás
Sigbjørn Skjæret	b164259bba chore : fix models indent after refactor (#16992)	há 2 meses atrás
Noah	1f5accb8d0 Fix garbled output with REPACK at high thread counts (#16956)	há 2 meses atrás
Aman Gupta	2759ccdb4a CUDA: avoid mul + bias fusion when doing fusion (#16935)	há 2 meses atrás
lhez	c5023daf60 opencl: support imrope (#16914)	há 2 meses atrás
Aleksander Grygier	e7da30b584 fix: Viewing multiple PDF attachments (#16974)	há 2 meses atrás
Daniel Bevenius	ed8aa63320 model-conversion : pass config to from_pretrained (#16963)	há 2 meses atrás
Georgi Gerganov	48bd26501b server : add props.model_alias (#16943)	há 2 meses atrás
theo77186	622cd010ff ggml: CUDA: add head size 72 for flash-attn (#16962)	há 2 meses atrás
Xuan-Son Nguyen	070ff4d535 mtmd: add --image-min/max-tokens (#16921)	há 2 meses atrás
Xuan-Son Nguyen	bf7b0c9725 mtmd: pad mask for qwen2.5vl (#16954)	há 2 meses atrás
Jinyang He	fcfce040e8 ggml : LoongArch fixes (#16958)	há 2 meses atrás
Olivier Chafik	ee3a5a10ad sync: minja (glm 4.6 & minmax m2 templates) (#16949)	há 2 meses atrás
shani-f	7e994168b1 SYCL: optimized repeat_back kernel (3× fewer asm instructions, 2× faster)Feature/sycl repeat back opt (#16869)	há 2 meses atrás
Sascha Rogmann	bcfa87622a feat(webui): improve LaTeX rendering with currency detection (#16508)	há 2 meses atrás
Shagun Bera	a2054e3a8f test-backend-ops : fix segfault in moe-expert-reduce test in support mode and coverage (#16936)	há 2 meses atrás
Sigbjørn Skjæret	dd52868050 ci : disable failing riscv cross build (#16952)	há 2 meses atrás
Zhiyong Wang	6b9a52422b model: add Janus Pro for image understanding (#16906)	há 2 meses atrás
Georgi Gerganov	2f966b8ed8 clip : use FA (#16837)	há 2 meses atrás
Georgi Gerganov	cd5e3b5754 server : support unified cache across slots (#16736)	há 2 meses atrás
Aldehir Rojas	87c9efc3b2 common : move gpt-oss reasoning processing to init params (#16937)	há 2 meses atrás
Adrian Lundberg	76af40aaaa docs: remove llama_sampler_accept reference in sampling sample usage (#16920)	há 2 meses atrás
mnehete32	7db35a7958 CUDA: add FLOOR, CEIL, ROUND, TRUNC unary ops (#16917)	há 2 meses atrás
Aaron Teo	a864132ba5 devops: fix failing s390x docker build (#16918)	há 2 meses atrás
Aaron Teo	d38d9f0877 ggml: add s390x cpu-feats (#16774)	há 2 meses atrás
Georgi Gerganov	7fd205a8e8 scripts : add script to bench models (#16894)	há 2 meses atrás
Pascal	2f68ce7cfd webui: auto-refresh /props on inference start to resync model metadata (#16784)	há 2 meses atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits