Someone Serge
|
a5c088d8c6
flake.nix: rocm not yet supported on aarch64, so hide the output
|
2 years ago |
Someone Serge
|
1e3900ebac
flake.nix: expose full scope in legacyPackages
|
2 years ago |
Georgi Gerganov
|
e39106c055
ggml : add ggml_vdotq_s32 alias (#4715)
|
2 years ago |
Georgi Gerganov
|
9fbda719de
clip : refactor + bug fixes (#4696)
|
2 years ago |
Johannes Gäßler
|
39d8bc71ed
CUDA: fixed tensor cores not being used on RDNA3 (#4697)
|
2 years ago |
automaticcat
|
24a447e20a
ggml : add ggml_cpu_has_avx_vnni() (#4589)
|
2 years ago |
Johannes Gäßler
|
a20f3c7465
CUDA: fix tensor core logic for Pascal and HIP (#4682)
|
2 years ago |
Georgi Gerganov
|
0235b9b571
clip : use ggml_backend_buffer_is_host (#4205)
|
2 years ago |
Steward Garcia
|
ce18d727a4
clip : enable gpu backend (#4205)
|
2 years ago |
hydai
|
91bb39cec7
cuda: fix vmm oom issue on NVIDIA AGX Orin (#4687)
|
2 years ago |
crasm
|
04ac0607e9
python : add check-requirements.sh and GitHub workflow (#4585)
|
2 years ago |
Philip Taron
|
68eccbdc5b
flake.nix : rewrite (#4605)
|
2 years ago |
Cuong Trinh Manh
|
97bbca6e85
cmake : fix ld warning duplicate libraries libllama.a (#4671)
|
2 years ago |
Justine Tunney
|
4af4801566
llava-cli : refactor to use sampling library (#4669)
|
2 years ago |
Justine Tunney
|
db49ff8ed7
server : replace sleep with condition variables (#4673)
|
2 years ago |
SakuraUmi
|
60f55e888c
server : fix OpenAI server sampling w.r.t. penalty. (#4675)
|
2 years ago |
Karthik Sethuraman
|
b93edd22f5
server : allow to generate multimodal embeddings (#4681)
|
2 years ago |
andrijdavid
|
82d6eab224
main-cmake-pkg : fix build issue (#4665)
|
2 years ago |
Peter Sugihara
|
afd997ab60
llama.swiftui : fix infinite loop, ouput timings, buff UI (#4674)
|
2 years ago |
Georgi Gerganov
|
c8255f8a6b
scripts : print list of sync commits
|
2 years ago |
Tamotsu Takahashi
|
441f51dca0
ci : build with CLBlast + ggml-opencl use GGML_API (whisper/1576)
|
2 years ago |
Georgi Gerganov
|
38b3de4658
sync : ggml
|
2 years ago |
bssrdf
|
afc8c19291
ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669)
|
2 years ago |
Georgi Gerganov
|
ca38b8d334
scripts : do not sync commits from this repo
|
2 years ago |
Justine Tunney
|
65e5f6dadb
Fix OpenAI server sampling w.r.t. temp and seed (#4668)
|
2 years ago |
manikbhandari
|
ea5497df5d
gpt2 : Add gpt2 architecture integration (#4555)
|
2 years ago |
Nam D. Tran
|
f6793491b5
llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)
|
2 years ago |
Daniel Bevenius
|
879b690a9e
finetune : fix output formatting in print_params (#4653)
|
2 years ago |
Georgi Gerganov
|
b47879b0dd
scripts : add sync-ggml-am.sh
|
2 years ago |
Georgi Gerganov
|
951010fa53
ggml : fix dot product for ARM (#4630)
|
2 years ago |