Piotr Wilkin (ilintar) | 389ac78b26 | ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (#17063) | 2 months ago
Jeff Bolz | 10fcc41290 | vulkan: Update topk_moe fusion to handle gpt's late softmax (#16656) | 3 months ago
Aman Gupta | 4926419c4d | ggml: add ggml_can_fuse_subgraph (#16662) | 3 months ago
Jeff Bolz | e56abd2098 | vulkan: Implement topk_moe fused shader, ported from CUDA (#16641) | 3 months ago
Piotr Wilkin (ilintar) | 34fcc5a4ac | model : Apertus model implementation (#15852) | 4 months ago
Acly | f2a789e334 | ggml : split graph allocations according to backend max buffer size (#15815) | 4 months ago
Georgi Gerganov | 4f324a556c | ggml : extend ggml_can_fuse to work with non-sequential nodes (#16123) | 4 months ago
Georgi Gerganov | a71ae3ba7a | ggml : add ggml_op_is_empty (#16122) | 4 months ago
Georgi Gerganov | fd1234cb46 | llama : add gpt-oss (#15091) | 6 months ago
Georgi Gerganov | bf9087f59a | metal : fuse add, mul + add tests (#14596) | 6 months ago
Jeff Bolz | bd9c981d72 | vulkan: Add fusion support for RMS_NORM+MUL (#14366) | 7 months ago
Aaron Teo | 60ef23d6c1 | ggml-cpu: enable IBM NNPA Vector Intrinsics (#14317) | 7 months ago
Daniel Tang | fedf034a98 | ggml : Print backtrace on uncaught C++ exceptions (ggml/1232) | 8 months ago
xctan | 05f6ac6283 | ggml : riscv: add xtheadvector support (#13720) | 8 months ago
cmdr2 | cb79c2e7fa | ggml: don't include arm_neon.h when using CUDA 12 with ARM Neon (ggml/1187) | 10 months ago
Diego Devesa | 459895c326 | ggml : add more generic custom op, remove deprecated custom ops (ggml/1183) | 10 months ago
Piotr Kubaj | 2391506ace | ggml-impl.h: fix build on POWER9 (#12855) | 10 months ago
Georgi Gerganov | ff067dbcb9 | ggml : simplify Arm fp16 CPU logic (ggml/1177) | 10 months ago
xctan | 24feaec057 | ggml : riscv: add 128-bit RVV support (#12530) | 10 months ago
Bodhi | 0b3863ff95 | MUSA: support ARM64 and enable dp4a .etc (#11843) | 11 months ago
Johannes Gäßler | 53ff6b9b9f | GGUF: C++ refactor, backend support, misc fixes (#11030) | 1 year ago
Johannes Gäßler | 081b29bd2a | tests: add tests for GGUF (#10830) | 1 year ago
Diego Devesa | cb13ef85a4 | remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) | 1 year ago
Frankie Robertson | cd2f37b304 | Avoid using __fp16 on ARM with old nvcc (#10616) | 1 year ago
Diego Devesa | 7cc2d2c889 | ggml : move AMX to the CPU backend (#10570) | 1 year ago
Frankie Robertson | 9150f8fef9 | Do not include arm_neon.h when compiling CUDA code (ggml/1028) | 1 year ago
Johannes Gäßler | 02e4eaf22f | ggml-opt: fix data corruption (ggml/1022) | 1 year ago
Johannes Gäßler | 8a43e940ab | ggml: new optimization interface (ggml/988) | 1 year ago
Diego Devesa | ae8de6d50a | ggml : build backends as libraries (#10256) | 1 year ago
Diego Devesa | 9f40989351 | ggml : move CPU backend to a separate file (#10144) | 1 year ago