cturan/llama.cpp

Author	SHA1 Message	Date
Anton Mitkov	bdca38376f sycl: Hotfix for non dnnl codepath (#14677)	6 months ago
shalinib-ibm	55c509daf5 ggml : refactor llamafile_sgemm PPC code (#14673)	6 months ago
Aman Gupta	9c9e4fc635 llama-context: add ability to get logits (#14672)	6 months ago
Johannes Gäßler	494c5899cb scripts: benchmark for HTTP server throughput (#14668)	6 months ago
Akarshan Biswas	0f4c6ec0f1 SYCL: use 1D kernel for set_rows (#14618)	6 months ago
Anton Mitkov	65a3ebb0aa sycl: Batched mulmat rework for oneDNN dispatch (#14617)	6 months ago
Molly Sophia	0d9226763c llama : add jinja template for rwkv-world (#14665)	6 months ago
Ed Addario	982e347255 quantize : fix minor logic flaw in --tensor-type (#14572)	6 months ago
Sigbjørn Skjæret	923e3ea2e3 cuda : add set rows for bf16 (#14664)	6 months ago
Yavor Ivanov	e743cddb60 cuda : add ELU support (#14657)	6 months ago
Georgi Gerganov	05fec5bd29 ggml : add build-time message to remind about ggml_set_rows (#14661)	6 months ago
Yavor Ivanov	dcf7f2ea3c metal : Add missing unary ops Metal support (#14660)	6 months ago
Yavor Ivanov	84b396e051 cmake : Add CMake presets for Linux and GCC (#14656)	6 months ago
Tarek Dakhran	c31e60647d tests : cover lfm2 cases in test_ssm_conv (#14651)	6 months ago
Tarek Dakhran	67eade1bf9 docs : add LFM2 to models section (#14650)	6 months ago
Aman Gupta	7de5c7cab6 CUDA: add set rows for f32 and f16 (#14551)	6 months ago
Georgi Gerganov	8eff95544e sync : ggml	6 months ago
Georgi Gerganov	3120413ccd vulkan : remove unused vars (#0)	6 months ago
Georgi Gerganov	215535701d sync : ggml	6 months ago
Acly	74bb294591 vulkan : implement bilinear interpolation (ggml/1291)	6 months ago
Acly	3e303b1107 vulkan : implement ggml_roll (ggml/1290)	6 months ago
Douglas Hanley	0c1df14b5f server : fix pooled embedding output (#14645)	6 months ago
Jeff Bolz	b3ad3a0191 vulkan: support SET_ROWS (#14587)	6 months ago
Jeff Bolz	98197e5c98 vulkan: optimizations for deepseek prompt processing (#14555)	6 months ago
Tarek Dakhran	f5e96b368f model : support LiquidAI LFM2 hybrid family (#14620)	6 months ago
Slobodan Josic	756aa1020a HIP : Add HIP 7.0+ compatibility for hipBLAS compute types (#14634)	6 months ago
Georgi Gerganov	aaa088d87f readme : add hot PRs (#14636)	6 months ago
Georgi Gerganov	0d5375d54b llama : move enum llama_vocab_pre_type to implementation (#14631)	6 months ago
Dowon	576c82eda2 vocab : add midm-2.0 model pre-tokenizer (#14626)	6 months ago
Gabe Goodhart	0aedae00e6 model : Granite Four (#13550)	6 months ago

Newer Older

Commit History Find

Commit History