cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Georgi Gerganov	b1dd4d08e8 sync : ggml	há 8 meses atrás
Daniel Bevenius	99881f77d8 whisper : add check that target name exists (whisper/3103)	há 8 meses atrás
Daniel Bevenius	b5769d92b4 ggml : suppress Windows compiler warnings (whisper/3075)	há 9 meses atrás
Xuan-Son Nguyen	8936784f7a mtmd : add vision support for Mistral Small 3.1 (#13231)	há 8 meses atrás
Xuan-Son Nguyen	13c9a3319b arg : remove CURLINFO_EFFECTIVE_METHOD (#13228)	há 8 meses atrás
Jared Van Bortel	a70183eb00 llama-model : fix the reported size class for nomic-embed-text-v2-moe (#13223)	há 8 meses atrás
Georgi Gerganov	8d33d740c3 sync : ggml	há 8 meses atrás
Diego Devesa	4254bb4951 ggml : fix ggml_gallocr_ptr type (ggml/1205)	há 8 meses atrás
Georgi Gerganov	9998540149 cuda : fix unused variable compile warning (whisper/0)	há 9 meses atrás
Johannes Gäßler	e1e8e0991f CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199)	há 8 meses atrás
Xuan-Son Nguyen	6f67cf1f48 arg : -hf do not fail if url mismatch (#13219)	há 8 meses atrás
ddh0	16a457facd fix typo: `n_ctx_pre_seq` -> `n_ctx_per_seq` (#13221)	há 8 meses atrás
Xuan-Son Nguyen	3e168bede4 convert : improve model arch handling (#13122)	há 8 meses atrás
Tatsuya Tanaka	ceda28ef8e llava : remove duplicate include (#13207)	há 8 meses atrás
Olivier Chafik	3b127c7385 common : add -jf / --json-schema-file flag (#12011)	há 8 meses atrás
Jeff Bolz	e5007a5edf vulkan: use uint array index to avoid glslang bug (#13193)	há 8 meses atrás
shalinib-ibm	416313773b ggml : fix ppc64le build (#13176)	há 8 meses atrás
Xuan-Son Nguyen	07c2e2f76c convert : correct typo image_mean --> image_std (#13208)	há 8 meses atrás
Aaron Teo	44cd8d91ff feat(ggml-cpu): enable z17 compile (#13182)	há 8 meses atrás
Xuan-Son Nguyen	5933e6fdc9 arg : allow using -hf offline (#13202)	há 8 meses atrás
Xuan-Son Nguyen	da84c04d8f docker : do not build tests (#13204)	há 8 meses atrás
xiaofei	a0f7016d17 rpc : fix cache directory initialization (#13188)	há 8 meses atrás
Johannes Gäßler	19e899ce21 scripts: n_depth for compare-llama-bench [no ci] (#13201)	há 9 meses atrás
matteo	e2e1ddb93a server : Prefilling assistant message in openai compatible API (#13174)	há 9 meses atrás
Georgi Gerganov	d9d398f84f sampling : when top-k <= 0 -> noop (#13173)	há 9 meses atrás
Alberto Cabrera Pérez	5a63980117 llama-bench: fixed size of fields to correctly map to values (#13183)	há 9 meses atrás
Johannes Gäßler	cdf76586b2 CUDA: fix non-cont. inputs for batched mat mul (#13155)	há 9 meses atrás
Sigbjørn Skjæret	7d3af70b08 llama : llm_type order by size (#13177)	há 9 meses atrás
Xuan-Son Nguyen	00e3e5a194 mtmd : add qwen2vl and qwen2.5vl (#13141)	há 9 meses atrás
Sigbjørn Skjæret	e98b3692be llama : set qwen3 model type sizes (#13175)	há 9 meses atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits