Georgi Gerganov
|
d5ed2b929d
metal : remove abort (skip) (ggml/0)
|
1 year ago |
Georgi Gerganov
|
1bb8a64ebf
sync : ggml
|
1 year ago |
Johannes Gäßler
|
fabdc3bda3
ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980)
|
1 year ago |
Johannes Gäßler
|
eee39bdc96
ggml: refactor cross entropy loss CPU impl. (ggml/976)
|
1 year ago |
Jack Mousseau
|
5d5ab1e5cc
metal : fix compute pass descriptor autorelease crash (#9718)
|
1 year ago |
Diego Devesa
|
a7ad553513
ggml-backend : add device description to CPU backend (#9720)
|
1 year ago |
bandoti
|
d6fe7abf04
ggml: unify backend logging mechanism (#9709)
|
1 year ago |
compilade
|
e3c355ba65
convert : handle tokenizer merges format from transformers 4.45 (#9696)
|
1 year ago |
Radoslav Gerganov
|
841713e1e4
rpc : enable vulkan (#9714)
|
1 year ago |
Ouadie EL FAROUKI
|
5639971466
Fixed dequant precision issues in Q4_1 and Q5_1 (#9711)
|
1 year ago |
Diego Devesa
|
c83ad6d01e
ggml-backend : add device and backend reg interfaces (#9707)
|
1 year ago |
Xuan Son Nguyen
|
a39ab216aa
llama : reduce compile time and binary size (#9712)
|
1 year ago |
Alberto Cabrera Pérez
|
f536f4c439
[SYCL] Initial cmake support of SYCL for AMD GPUs (#9658)
|
1 year ago |
Radoslav Gerganov
|
00b7317e63
vulkan : do not use tensor->extra (#9407)
|
1 year ago |
Zhenwei Jin
|
76b37d1541
gguf-split : improve --split and --merge logic (#9619)
|
1 year ago |
Georgi Gerganov
|
148844fe97
examples : remove benchmark (#9704)
|
1 year ago |
Paweł Wodnicki
|
3f1ae2e32c
Update README.md (#9591)
|
1 year ago |
Georgi Gerganov
|
f1b8c42711
sync : ggml
|
1 year ago |
Johannes Gäßler
|
e98c1c188e
test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)
|
1 year ago |
Salvatore Mesoraca
|
cb00020504
vulkan : mul_mat: fix UB with small warps (ggml/952)
|
1 year ago |
Borislav Stanimirov
|
6c5322481a
ggml : fix ggml_cast (ggml/973)
|
1 year ago |
Johannes Gäßler
|
7254cdf7e8
ggml: fix gradient allocation logic (ggml/966)
|
1 year ago |
Georgi Gerganov
|
cad341d889
metal : reduce command encoding overhead (#9698)
|
1 year ago |
Georgi Gerganov
|
a90484c6d9
llama : print correct model type for Llama 3.2 1B and 3B
|
1 year ago |
compilade
|
1927378bcc
convert : refactor rope_freqs generation (#9396)
|
1 year ago |
serhii-nakon
|
6f1d9d71f4
Fix Docker ROCM builds, use AMDGPU_TARGETS instead of GPU_TARGETS (#9641)
|
1 year ago |
compilade
|
511636df0c
ci : reduce severity of unused Pyright ignore comments (#9697)
|
1 year ago |
vb
|
08a43d05b6
py : update transfomers version (#9694)
|
1 year ago |
Georgi Gerganov
|
ace4f4be37
flake.lock: Update (#9680)
|
1 year ago |
Ruchira Hasaranga
|
8277a817f1
console : utf-8 fix for windows stdin (#9690)
|
1 year ago |