cturan/llama.cpp

Author	SHA1 Message	Date
Georgi Gerganov	ad57d3edd2 batch : fix uninitialized has_cpl flag (#14733)	6 months ago
Sigbjørn Skjæret	1ba45d4982 ci : disable failing vulkan crossbuilds (#14723)	6 months ago
Sigbjørn Skjæret	19e5943d9e convert : make hf token optional (#14717)	6 months ago
Diner Burger	496957e1cb llama : fix parameter order for hybrid memory initialization (#14725)	6 months ago
Reese Levine	21c021745d ggml: Add initial WebGPU backend (#14521)	6 months ago
tempstudio	b0f0ecc3dc model : support output bias for qwen2 (#14711)	6 months ago
Georgi Gerganov	225e7a1438 llama : add high-throughput mode (#14363)	6 months ago
Aman Gupta	ab14019821 Support diffusion models: Add Dream 7B (#14644)	6 months ago
Georgi Gerganov	64978340b0 ggml : add asserts (#14720)	6 months ago
Georgi Gerganov	6ffd4e9c44 server : pre-calculate EOG logit biases (#14721)	6 months ago
Shunta Saito	e4841d24d3 llama : fix parallel processing for plamo2 (#14716)	6 months ago
Georgi Gerganov	538cc77f7f server : fix handling of the ignore_eos flag (#14710)	6 months ago
Johannes Gäßler	5cae766541 scripts: synthetic prompt mode for server-bench.py (#14695)	6 months ago
Sigbjørn Skjæret	4b91d6f71f convert : only check for tokenizer folder if we need it (#14704)	6 months ago
Sigbjørn Skjæret	cf91f217f1 convert : add pre-computed hashes first to prevent order mishaps (#14701)	6 months ago
Min-Hua	79e0b68c17 llama: add LLAMA_API to deprecated llama_kv_self_seq_div (#14708)	6 months ago
Ed Addario	c81f4192f9 gguf-py : dump bpw per layer and model in markdown mode (#14703)	6 months ago
Gabriel Larson	4a4f426944 model : add Kimi-K2 support (#14654)	6 months ago
Jeff Bolz	ba1ceb3456 vulkan: fix noncontig check for mat_mul_id splitting (#14683)	6 months ago
Jeff Bolz	10a0351a97 vulkan: add RTE variants for glu/add/sub/mul/div (#14653)	6 months ago
Shunta Saito	68e37a61a7 model : add PLaMo-2 support (#14560)	6 months ago
R0CKSTAR	cbc68be51d cuda: fix build warnings in set-rows.cu (unused variable) (#14687)	6 months ago
Anton Mitkov	bdca38376f sycl: Hotfix for non dnnl codepath (#14677)	6 months ago
shalinib-ibm	55c509daf5 ggml : refactor llamafile_sgemm PPC code (#14673)	6 months ago
Aman Gupta	9c9e4fc635 llama-context: add ability to get logits (#14672)	6 months ago
Johannes Gäßler	494c5899cb scripts: benchmark for HTTP server throughput (#14668)	6 months ago
Akarshan Biswas	0f4c6ec0f1 SYCL: use 1D kernel for set_rows (#14618)	6 months ago
Anton Mitkov	65a3ebb0aa sycl: Batched mulmat rework for oneDNN dispatch (#14617)	6 months ago
Molly Sophia	0d9226763c llama : add jinja template for rwkv-world (#14665)	6 months ago
Ed Addario	982e347255 quantize : fix minor logic flaw in --tensor-type (#14572)	6 months ago
