cturan/llama.cpp

Auteur	SHA1 Message	Date
Douglas Hanley	b5bd037832 llama : add support for qwen3 reranker (#15824)	il y a 3 mois
Sigbjørn Skjæret	b8e09f08b9 model : add grok-2 support (#15539)	il y a 4 mois
Sigbjørn Skjæret	6ab397e12b graph : support non-contiguous Q in build_attn_mha (#15908)	il y a 4 mois
Georgi Gerganov	663027fd54 context : fix n_outputs during reserve (#15858)	il y a 4 mois
Georgi Gerganov	c610b6c11b kv-cache : fix SWA checks + disable cacheless iSWA (#15811)	il y a 4 mois
Daniel Bevenius	fb15d649ed llama : add support for EmbeddingGemma 300m (#15798)	il y a 4 mois
Johannes Gäßler	e81b8e4b7f llama: use FA + max. GPU layers by default (#15434)	il y a 4 mois
Georgi Gerganov	8a4280ce43 kv-cache : remove LLAMA_SET_ROWS checks (#15505)	il y a 4 mois
Georgi Gerganov	0373486dbc graph : fix assert in memory-less build_attn (#15590)	il y a 4 mois
Georgi Gerganov	3f196be84b graph : remove build_attn_with_sinks overload (#15469)	il y a 4 mois
Georgi Gerganov	715a6db02c kv-cache : drop the "unified" prefix (#15467)	il y a 5 mois
Georgi Gerganov	fd1234cb46 llama : add gpt-oss (#15091)	il y a 5 mois
Sam	ef0144c087 model: support GLM 4.5 family of models (#14939)	il y a 5 mois
Dongliang Wei	c1dacaa99b llama : merge build_moe_ffn_from_probs function into build_moe_ffn (#14968)	il y a 5 mois
compilade	66625a59a5 graph : reduce splits for recurrent and hybrid models (#14825)	il y a 5 mois
Douglas Hanley	a118d80233 embeddings: fix extraction of CLS pooling results (#14927)	il y a 5 mois
Dongliang Wei	6c6e397aff model : add support for SmallThinker series (#14898)	il y a 5 mois
Georgi Gerganov	bf9087f59a metal : fuse add, mul + add tests (#14596)	il y a 6 mois
Georgi Gerganov	9fb1042ce6 graph : fix graph reuse reset of params (#14760)	il y a 6 mois
Georgi Gerganov	d498af3d5a graph : avoid huge warm-up graphs for MoE models (#14753)	il y a 6 mois
Georgi Gerganov	8f974bc1e9 graph : refactor context to not pass gf explicitly (#14629)	il y a 6 mois
Nexes the Elder	09651d09ff graph : Pass the graph placeholder message in debug mode (#14748)	il y a 6 mois
Georgi Gerganov	01612b7409 llama : reuse compute graphs (#14482)	il y a 6 mois
Georgi Gerganov	225e7a1438 llama : add high-throughput mode (#14363)	il y a 6 mois
Xuan-Son Nguyen	cb9178f885 llama : remove llm_graph_input_one (#14603)	il y a 6 mois
compilade	4a5686da22 llama : support Jamba hybrid Transformer-Mamba models (#7531)	il y a 6 mois
Georgi Gerganov	7b50f7c025 graph : prepare for 4D mask (#14515)	il y a 6 mois
Georgi Gerganov	a70c8a0c4b kv-cache : use ggml_set_rows (#14285)	il y a 6 mois
compilade	5d46babdc2 llama : initial Mamba-2 support (#9126)	il y a 6 mois
Sigbjørn Skjæret	a0535ffa0d ggml : implement REGLU/GEGLU/SWIGLU ops (#14158)	il y a 6 mois

Récemment Précédemment

Historique des commits Trouver

Historique des commits