cturan/llama.cpp

Author	SHA1 Message	Date
Piotr Wilkin (ilintar)	389ac78b26 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063)	2 months ago
Jeff Bolz	10fcc41290 vulkan: Update topk_moe fusion to handle gpt's late softmax (#16656)	3 months ago
Aman Gupta	4926419c4d ggml: add ggml_can_fuse_subgraph (#16662)	3 months ago
Jeff Bolz	e56abd2098 vulkan: Implement topk_moe fused shader, ported from CUDA (#16641)	3 months ago
Piotr Wilkin (ilintar)	34fcc5a4ac model : Apertus model implementation (#15852)	4 months ago
Acly	f2a789e334 ggml : split graph allocations according to backend max buffer size (#15815)	4 months ago
Georgi Gerganov	4f324a556c ggml : extend ggml_can_fuse to work with non-sequential nodes (#16123)	4 months ago
Georgi Gerganov	a71ae3ba7a ggml : add ggml_op_is_empty (#16122)	4 months ago
Georgi Gerganov	fd1234cb46 llama : add gpt-oss (#15091)	6 months ago
Georgi Gerganov	bf9087f59a metal : fuse add, mul + add tests (#14596)	6 months ago
Jeff Bolz	bd9c981d72 vulkan: Add fusion support for RMS_NORM+MUL (#14366)	7 months ago
Aaron Teo	60ef23d6c1 ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317)	7 months ago
Daniel Tang	fedf034a98 ggml : Print backtrace on uncaught C++ exceptions (ggml/1232)	8 months ago
xctan	05f6ac6283 ggml : riscv: add xtheadvector support (#13720)	8 months ago
cmdr2	cb79c2e7fa ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187)	10 months ago
Diego Devesa	459895c326 ggml : add more generic custom op, remove deprecated custom ops (ggml/1183)	10 months ago
Piotr Kubaj	2391506ace ggml-impl.h: fix build on POWER9 (#12855)	10 months ago
Georgi Gerganov	ff067dbcb9 ggml : simplify Arm fp16 CPU logic (ggml/1177)	10 months ago
xctan	24feaec057 ggml : riscv: add 128-bit RVV support (#12530)	10 months ago
Bodhi	0b3863ff95 MUSA: support ARM64 and enable dp4a .etc (#11843)	11 months ago
Johannes Gäßler	53ff6b9b9f GGUF: C++ refactor, backend support, misc fixes (#11030)	1 year ago
Johannes Gäßler	081b29bd2a tests: add tests for GGUF (#10830)	1 year ago
Diego Devesa	cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797)	1 year ago
Frankie Robertson	cd2f37b304 Avoid using __fp16 on ARM with old nvcc (#10616)	1 year ago
Diego Devesa	7cc2d2c889 ggml : move AMX to the CPU backend (#10570)	1 year ago
Frankie Robertson	9150f8fef9 Do not include arm_neon.h when compiling CUDA code (ggml/1028)	1 year ago
Johannes Gäßler	02e4eaf22f ggml-opt: fix data corruption (ggml/1022)	1 year ago
Johannes Gäßler	8a43e940ab ggml: new optimization interface (ggml/988)	1 year ago
Diego Devesa	ae8de6d50a ggml : build backends as libraries (#10256)	1 year ago
Diego Devesa	9f40989351 ggml : move CPU backend to a separate file (#10144)	1 year ago

Newer Older

Commit History Find

Commit History