| Author | Commit | Message | Age |
|---|---|---|---|
| Paweł Wodnicki | 3f1ae2e32c | Update README.md (#9591) | 1 year ago |
| Georgi Gerganov | f1b8c42711 | sync : ggml | 1 year ago |
| Johannes Gäßler | e98c1c188e | test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974) | 1 year ago |
| Salvatore Mesoraca | cb00020504 | vulkan : mul_mat: fix UB with small warps (ggml/952) | 1 year ago |
| Borislav Stanimirov | 6c5322481a | ggml : fix ggml_cast (ggml/973) | 1 year ago |
| Johannes Gäßler | 7254cdf7e8 | ggml: fix gradient allocation logic (ggml/966) | 1 year ago |
| Georgi Gerganov | cad341d889 | metal : reduce command encoding overhead (#9698) | 1 year ago |
| Georgi Gerganov | a90484c6d9 | llama : print correct model type for Llama 3.2 1B and 3B | 1 year ago |
| compilade | 1927378bcc | convert : refactor rope_freqs generation (#9396) | 1 year ago |
| serhii-nakon | 6f1d9d71f4 | Fix Docker ROCM builds, use AMDGPU_TARGETS instead of GPU_TARGETS (#9641) | 1 year ago |
| compilade | 511636df0c | ci : reduce severity of unused Pyright ignore comments (#9697) | 1 year ago |
| vb | 08a43d05b6 | py : update transfomers version (#9694) | 1 year ago |
| Georgi Gerganov | ace4f4be37 | flake.lock: Update (#9680) | 1 year ago |
| Ruchira Hasaranga | 8277a817f1 | console : utf-8 fix for windows stdin (#9690) | 1 year ago |
| Georgi Gerganov | c919d5db39 | ggml : define missing HWCAP flags (#9684) | 1 year ago |
| Georgi Gerganov | d0b1d663e4 | sync : ggml | 1 year ago |
| Johannes Gäßler | aaa4099925 | CUDA: remove bad assert (ggml/972) | 1 year ago |
| Jeff Bolz | 641002fba8 | vulkan : multithread pipeline creation (ggml/963) | 1 year ago |
| Jeff Bolz | 0de8b203f1 | vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961) | 1 year ago |
| Salvatore Mesoraca | 544f409b4b | vulkan : argsort barriers must be under uniform control flow (ggml/951) | 1 year ago |
| Georgi Gerganov | 6084bfb261 | ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969) | 1 year ago |
| matiaslin | faac0bae26 | common : ensure llama_batch size does not exceed max size (#9668) | 1 year ago |
| nopperl | f99d3f8367 | py : add model class for Chameleon conversion (#9683) | 1 year ago |
| Georgi Gerganov | 589b48d41e | contrib : add Resources section (#9675) | 1 year ago |
| Georgi Gerganov | f4d2b8846a | llama : add reranking support (#9510) | 1 year ago |
| slaren | 1b2f992cd2 | test-backend-ops : use flops for some performance tests (#9657) | 1 year ago |
| Georgi Gerganov | 739842703e | llama : add comment about thread-safety [no ci] (#9449) | 1 year ago |
| Zhenwei Jin | 6102037bbb | vocab : refactor tokenizer to reduce init overhead (#9449) | 1 year ago |
| nopperl | 9a913110cf | llama : add support for Chameleon (#8543) | 1 year ago |
| Aarni Koskela | 43bcdd9703 | readme : add tool (#9655) | 1 year ago |