cturan/llama.cpp

Autore	SHA1 Messaggio	Data
Piotr Wilkin	890fa2c1e3 WE HAVE OUTPUT!	4 mesi fa
Piotr Wilkin	e590a75905 Cleanup complete, now for the recurrent memory management...	4 mesi fa
Piotr Wilkin	2b0673c315 Cleanup ggml_delta_net	4 mesi fa
Piotr Wilkin (ilintar)	72c98b0c7d Merge pull request #1 from ggml-org/xsn/qwen3next_experiment	4 mesi fa
Xuan Son Nguyen	e83ef74733 one less magic number	4 mesi fa
Xuan Son Nguyen	f643b957f4 refactor softplus fn	4 mesi fa
Xuan Son Nguyen	46110e0630 split q_proj/gate	4 mesi fa
Piotr Wilkin	9832f2934a Remove comments as half of them are wrong anyways	4 mesi fa
Piotr Wilkin	8152df60f3 Getting closer (graph builds for bs=1 but tensor shaping is still wrong for bigger sizes)	4 mesi fa
Piotr Wilkin	e0c5dff2a7 Rewrite to tensor ops	4 mesi fa
Piotr Wilkin	178230ee21 Getting to decode stage...	4 mesi fa
Piotr Wilkin (ilintar)	c78f9fce68 Merge branch 'ggml-org:master' into qwen3_next	4 mesi fa
Radoslav Gerganov	2b6b55a59f server : include usage statistics only when user request them (#16052)	4 mesi fa
Georgi Gerganov	e58174cecb llama : bump max seq limit from 64 to 256 (#15916)	4 mesi fa
Georgi Gerganov	b213fce89b metal : improve F32, F16 and BF16 mat-vec multiplication (#16057)	4 mesi fa
Jhen-Jie Hong	e00f3fd8ff metal : avoid call free for non-owned buffer (#16067)	4 mesi fa
Georgi Gerganov	f2f28380ea metal : handle nil cv during pipeline creation (#16065)	4 mesi fa
Chenguang Li	62c3b645c5 CANN: Remove print (#16044)	4 mesi fa
Piotr Wilkin	344331c2b6 First draft	4 mesi fa
Reese Levine	d304f459d8 GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS operators (#16018)	4 mesi fa
Georgi Gerganov	0320ac5264 metal : refactor + optimize v2 (#15995)	4 mesi fa
Aleksander Grygier	a7a98e0fff SvelteKit-based WebUI (#14839)	4 mesi fa
Xuan-Son Nguyen	8f8f2274ee convert : add Llama4ForCausalLM (#16042)	4 mesi fa
Johannes Gäßler	c959b676be CUDA: fix FA occupancy, optimize tile kernel (#15982)	4 mesi fa
David Ribeiro Alves	cd08fc3ecc common : Fix corrupted memory error on json grammar initialization (#16038)	4 mesi fa
Eve	cb5bb6cc05 vulkan: automatically remove unsupported devices (#15976)	4 mesi fa
Daniel Bevenius	a91d035b90 ci : revert back to macos-13 for macOS-latest-cmake-x64 (#16040)	4 mesi fa
Jie Fu (傅杰)	745cbcf2fe llama-quant : fix the verification of attention layers for encoder-decoder models (#16023)	4 mesi fa
Jie Fu (傅杰)	1cbd80f8cf examples : support encoder-decoder models in the simple example (#16002)	4 mesi fa
Shane A	85286f3548 model : add OLMo3 support (#16015)	4 mesi fa

Più recente Più vecchio

Cronologia Commit Cerca

Cronologia Commit