Daniel Bevenius
|
8f7080bf48
readme : remove stray double quote (#7310)
|
1 年之前 |
kunnis
|
e1b40ac3b9
ggml : use dynamic thread scheduling for matrix multiplication (#6915)
|
1 年之前 |
agray3
|
dc020985b8
Avoid unnecessarily disabling CUDA graphs (#7302)
|
1 年之前 |
slaren
|
344f9126cc
ggml : tag ggml_tensor::backend as deprecated (#7290)
|
1 年之前 |
AidanBeltonS
|
9a17ab914b
Add missing " (#7303)
|
1 年之前 |
dm4
|
ea3b0590ee
embedding : free the batch after execution (#7297)
|
1 年之前 |
Georgi Gerganov
|
29499bb593
sync : ggml
|
1 年之前 |
John Balis
|
48aa8fd1f2
ggml : add `ggml_upscale_ext` (ggml/814)
|
1 年之前 |
Johannes Gäßler
|
583fd6b000
server bench: fix bench not waiting for model load (#7284)
|
1 年之前 |
Georgi Gerganov
|
9f773486ab
script : sync ggml-rpc
|
1 年之前 |
Georgi Gerganov
|
e8a7fd4fb0
metal : support FA without mask + add asserts (#7278)
|
1 年之前 |
Georgi Gerganov
|
a5e3fde857
sync : ggml
|
1 年之前 |
Georgi Gerganov
|
f308ea7059
metal : tune soft_max number of threads (whisper/0)
|
1 年之前 |
Georgi Gerganov
|
c3c88f296a
ggml : try fix ppc64 (whisper/0)
|
1 年之前 |
Przemysław Pawełczyk
|
182adefcf3
ggml : expose SSE3 and SSSE3 for MSVC when AVX is available (whisper/2128)
|
1 年之前 |
Hong Bo PENG
|
0d26d8ccd8
ggml : optimize for ppc64le using VSX intrinsics (ggml/784)
|
1 年之前 |
Steve Grubb
|
4f0263633b
server: free sampling contexts on exit (#7264)
|
1 年之前 |
Brian
|
1265c670fd
Revert "move ndk code to a new library (#6951)" (#7282)
|
1 年之前 |
Radoslav Gerganov
|
5e31828d3e
ggml : add RPC backend (#6829)
|
1 年之前 |
slaren
|
541600201e
llama : disable pipeline parallelism with nkvo (#7265)
|
1 年之前 |
Elton Kola
|
efc8f767c8
move ndk code to a new library (#6951)
|
1 年之前 |
Haggai Nuchi
|
e0f556186b
Add left recursion check: quit early instead of going into an infinite loop (#7083)
|
1 年之前 |
Ryuei
|
27f65d6267
docs: Fix typo and update description for --embeddings flag (#7026)
|
1 年之前 |
compilade
|
ee52225067
convert-hf : support direct Q8_0 conversion (#7234)
|
1 年之前 |
Georgi Gerganov
|
614d3b914e
llama : less KV padding when FA is off (#7257)
|
1 年之前 |
k.h.lai
|
30e70334f7
llava-cli: fix base64 prompt (#7248)
|
1 年之前 |
Johannes Gäßler
|
1c570d8bee
perplexity: add BF16 vs. FP16 results (#7150)
|
1 年之前 |
Neo Zhang
|
948f4ec7c5
[SYCL] rm wait() (#7233)
|
1 年之前 |
Joan Fontanals
|
9aa672490c
llama : rename jina tokenizers to v2 (#7249)
|
1 年之前 |
Brian
|
b1f8af1886
convert.py: Outfile default name change and additional metadata support (#4858)
|
1 年之前 |