
ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217)

* ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels

* added fallback mechanism when the offline re-quantized model is not
optimized for the underlying target.

* fix for build errors

* remove prints from the low-level code

* Rebase to the latest upstream
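
The fallback described in the commit message can be illustrated with a minimal sketch. It is not the code from `ggml/src/ggml-aarch64.c`: the names `cpu_has_fast_path`, `dot_i8_fast`, `dot_i8_generic`, and `dot_i8` are hypothetical, and a plain int8 dot product stands in for the Q4 GEMV/GEMM kernels. The point is only the control flow: rather than asserting that the model's layout matches the CPU's features, the dispatcher uses the optimized path when it is available and otherwise falls through to a portable reference path.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical feature flag. A real build would probe the CPU at runtime
 * (for example for NEON, i8mm, or SVE support) instead of using a global. */
static bool cpu_has_fast_path = false;

/* Hypothetical optimized kernel, standing in for hand-written assembly.
 * It is unrolled by four here only to make it distinct from the generic path. */
static int32_t dot_i8_fast(const int8_t *a, const int8_t *b, size_t n) {
    int32_t acc = 0;
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        acc += a[i] * b[i] + a[i + 1] * b[i + 1]
             + a[i + 2] * b[i + 2] + a[i + 3] * b[i + 3];
    }
    for (; i < n; i++) {
        acc += a[i] * b[i];  /* leftover elements */
    }
    return acc;
}

/* Portable reference path: slower, but correct on any target. */
static int32_t dot_i8_generic(const int8_t *a, const int8_t *b, size_t n) {
    int32_t acc = 0;
    for (size_t i = 0; i < n; i++) {
        acc += a[i] * b[i];
    }
    return acc;
}

/* Dispatcher: no assert on the target. If the fast path is unavailable,
 * execution falls back to the generic implementation instead of aborting. */
int32_t dot_i8(const int8_t *a, const int8_t *b, size_t n) {
    if (cpu_has_fast_path) {
        return dot_i8_fast(a, b, n);
    }
    return dot_i8_generic(a, b, n);
}
```

Both paths return the same result, so a model re-quantized for a different target still runs, only without the speedup.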
Charles Xu, 1 year ago
Parent
Current commit
1e43630218
1 file changed, with 648 insertions and 670 deletions
  1. ggml/src/ggml-aarch64.c (+648, -670)

ggml/src/ggml-aarch64.c (+648, -670): file diff suppressed because it is too large

