Commit History

Autor SHA1 Mensaxe Data
  slaren 9c77ec1d74 ggml : synchronize threads using barriers (#7993) hai 1 ano
  slaren f578b86b21 move BLAS to a separate backend (#6210) hai 1 ano
  Georgi Gerganov a9cae48003 tests : add non-cont unary tests (#7857) hai 1 ano
  Georgi Gerganov bfaa676b08 ggml : improve ggml_is_contiguous logic (#7856) hai 1 ano
  Georgi Gerganov 2b3389677a ggml : refactor rope norm/neox (#7634) hai 1 ano
  Georgi Gerganov 554c247caf ggml : remove OpenCL (#7735) hai 1 ano
  Georgi Gerganov 6d1616944d ggml : prevent builds with -ffinite-math-only (#7726) hai 1 ano
  Masaya, Kato a5735e4426 ggml : use OpenMP as a thread pool (#7606) hai 1 ano
  Georgi Gerganov 0c27e6f62e ggml : fix loongson compile warnings (#7537) hai 1 ano
  Chris Elrod 59b0d07766 faster avx512 exp implementation (#7551) hai 1 ano
  junchao-loongson d5c05821f3 ggml : fix loongarch build (O2 issue) (#7636) hai 1 ano
  Georgi Gerganov fb76ec31a9 ggml : fix YARN + add tests + add asserts (#7617) hai 1 ano
  Radoslav Gerganov 210d99173d llama-bench : add support for the RPC backend (#7435) hai 1 ano
  slaren 87bdf2a199 ggml : use atomic_flag for critical section (#7598) hai 1 ano
  Georgi Gerganov 72de268bec ggml : restore ggml_rope_xpos_inplace (ggml/0) hai 1 ano
  zhouwg 504f0c340f ggml : fix typo in ggml.c (#7603) hai 1 ano
  Georgi Gerganov 0548a4187f ggml : generalize GGML_OP_CONCAT (#7563) hai 1 ano
  Masaya, Kato faa0e6979a ggml: aarch64: SVE kernels for q8_0_q8_0, q4_0_q8_0 vector dot (#7433) hai 1 ano
  Georgi Gerganov d48c88cbd5 ggml : remove ggml_flash_attn and ggml_flash_ff (#7463) hai 1 ano
  Georgi Gerganov e84b71c2c6 ggml : drop support for QK_K=64 (#7473) hai 1 ano
  Georgi Gerganov 3e5faa8503 cuda : fix rope + add tests (#7452) hai 1 ano
  liuwei-git 201cc11afa llama : add phi3 128K model support (#7225) hai 1 ano
  junchao-loongson 65c58207ec ggml : add loongarch lsx and lasx support (#6454) hai 1 ano
  Srihari-mcw 33c8d50acc Add provisions for windows support for BF16 code including CMake provision for enabling AVX512_BF16 (#7258) hai 1 ano
  Johannes Gäßler 5ca49cbecd ggml: implement quantized KV cache for FA (#7372) hai 1 ano
  Georgi Gerganov 511182eabb android : use "ci-android" branch for CI (#7341) hai 1 ano
  Justine Tunney 934266c0e0 ggml : rewrite silu and softmax for cpu (#7154) hai 1 ano
  kunnis e1b40ac3b9 ggml : use dynamic thread scheduling for matrix multiplication (#6915) hai 1 ano
  slaren 344f9126cc ggml : tag ggml_tensor::backend as deprecated (#7290) hai 1 ano
  John Balis 48aa8fd1f2 ggml : add `ggml_upscale_ext` (ggml/814) hai 1 ano