Georgi Gerganov
|
d061bf9405
ggml : fix q2_k bpw in comments (ggml/680)
|
пре 2 година |
Finn Voorhees
|
1bf681f90e
ggml : add error handling to graph_compute (whisper/1714)
|
пре 2 година |
Georgi Gerganov
|
c1d7cb28d3
ggml : do not sched_yield when calling BLAS (#4761)
|
пре 2 година |
Georgi Gerganov
|
3681f22443
examples : add few-shot translation example (#4783)
|
пре 2 година |
Daniel Bevenius
|
b3a7c20b5c
finetune : remove unused includes (#4756)
|
пре 2 година |
Georgi Gerganov
|
012cf349ae
server : send token probs for "stream == false" (#4714)
|
пре 2 година |
Johannes Gäßler
|
a91928014f
Print backend name on test-backend-ops failure (#4751)
|
пре 2 година |
singularity
|
3c0b585561
llama.swiftui : support loading custom model from file picker (#4767)
|
пре 2 година |
Michael Coppola
|
e5804313a1
server : fix options in README.md (#4765)
|
пре 2 година |
Georgi Gerganov
|
dc891b7f7a
ggml : include stdlib.h before intrin.h (#4736)
|
пре 2 година |
singularity
|
46cea79e1f
llama.swiftui : fix build of ggml.metallib (#4754)
|
пре 2 година |
Daniel Bevenius
|
cb1e2818e0
train : fix typo in overlapping-samples help msg (#4758)
|
пре 2 година |
Ashraful Islam
|
ece9a45e8f
swift : update Package.swift to use ggml as dependency (#4691)
|
пре 2 година |
Georgi Gerganov
|
7bed7eba35
cuda : simplify expression
|
пре 2 година |
Georgi Gerganov
|
d55356d3ba
cuda : mark I16 and I32 ops as unsupported
|
пре 2 година |
Georgi Gerganov
|
75e3fd8581
sync : ggml
|
пре 2 година |
Georgi Gerganov
|
289313716f
metal : add kernel_get_rows_i32
|
пре 2 година |
Georgi Gerganov
|
ab62fc3e55
scripts : fix sync order + metal sed
|
пре 2 година |
Guillaume Wenzek
|
5f66ebca9c
ggml : extend ggml_get_rows, ggml_repeat, ggml_concat (ggml/639)
|
пре 2 година |
Justin Parker
|
f2eb19bd8b
server : throw an error when `slot unavailable` (#4741)
|
пре 2 година |
Georgi Gerganov
|
f3f62f0d83
metal : optimize ggml_mul_mat_id (faster Mixtral PP) (#4725)
|
пре 2 година |
Phil H
|
0ef3ca2ac6
server : add token counts to html footer (#4738)
|
пре 2 година |
Georgi Gerganov
|
540938f890
llama : llama_model_desc print number of experts
|
пре 2 година |
Marcus Dunn
|
0040d42eeb
llama : replace all API facing `int`'s with `int32_t` (#4577)
|
пре 2 година |
postmasters
|
83e633c27e
llama : differentiate the KV dims in the attention (#4657)
|
пре 2 година |
Georgi Gerganov
|
32866c5edd
editorconfig : fix whitespace and indentation #4710
|
пре 2 година |
minarchist
|
5d7002d437
server : add --override-kv parameter (#4710)
|
пре 2 година |
Nam D. Tran
|
26f3071d71
py : re-enable mmap in convert hf (#4732)
|
пре 2 година |
Daniel Bevenius
|
775ac8712a
finetune: fix typo in README.md (#4733)
|
пре 2 година |
Georgi Gerganov
|
58ba655af0
metal : enable shader debugging (cmake option) (#4705)
|
пре 2 година |