Justine Tunney
|
db49ff8ed7
server : replace sleep with condition variables (#4673)
|
2 years ago |
SakuraUmi
|
60f55e888c
server : fix OpenAI server sampling w.r.t. penalty. (#4675)
|
2 years ago |
Karthik Sethuraman
|
b93edd22f5
server : allow to generate multimodal embeddings (#4681)
|
2 years ago |
andrijdavid
|
82d6eab224
main-cmake-pkg : fix build issue (#4665)
|
2 years ago |
Peter Sugihara
|
afd997ab60
llama.swiftui : fix infinite loop, ouput timings, buff UI (#4674)
|
2 years ago |
Georgi Gerganov
|
c8255f8a6b
scripts : print list of sync commits
|
2 years ago |
Tamotsu Takahashi
|
441f51dca0
ci : build with CLBlast + ggml-opencl use GGML_API (whisper/1576)
|
2 years ago |
Georgi Gerganov
|
38b3de4658
sync : ggml
|
2 years ago |
bssrdf
|
afc8c19291
ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669)
|
2 years ago |
Georgi Gerganov
|
ca38b8d334
scripts : do not sync commits from this repo
|
2 years ago |
Justine Tunney
|
65e5f6dadb
Fix OpenAI server sampling w.r.t. temp and seed (#4668)
|
2 years ago |
manikbhandari
|
ea5497df5d
gpt2 : Add gpt2 architecture integration (#4555)
|
2 years ago |
Nam D. Tran
|
f6793491b5
llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)
|
2 years ago |
Daniel Bevenius
|
879b690a9e
finetune : fix output formatting in print_params (#4653)
|
2 years ago |
Georgi Gerganov
|
b47879b0dd
scripts : add sync-ggml-am.sh
|
2 years ago |
Georgi Gerganov
|
951010fa53
ggml : fix dot product for ARM (#4630)
|
2 years ago |
wonjun Jang
|
f56d6077d0
Add byte token type when tokenizer.model is not exists (#4641)
|
2 years ago |
slaren
|
dc68f0054c
cuda : fix vmm pool with multi GPU (#4620)
|
2 years ago |
WillCorticesAI
|
de8e496437
Update comment for AdamW implementation reference. (#4604)
|
2 years ago |
FantasyGmm
|
77465dad48
Fix new CUDA10 compilation errors (#4635)
|
2 years ago |
Paul Tsochantaris
|
a206137f92
Adding Emeltal reference to UI list (#4629)
|
2 years ago |
slaren
|
b9f47952ff
simplify bug issue template (#4623)
|
2 years ago |
Shintarou Okada
|
753be377b6
llama : add PLaMo model (#3557)
|
2 years ago |
slaren
|
5bf3953d7e
cuda : improve cuda pool efficiency using virtual memory (#4606)
|
2 years ago |
slaren
|
708e179e85
fallback to CPU buffer if host buffer alloc fails (#4610)
|
2 years ago |
Samuel Maynard
|
925e5584a0
ci(docker): fix tags in "Build and push docker image (tagged)" (#4603)
|
2 years ago |
Alexey Parfenov
|
6123979952
server : allow to specify custom prompt for penalty calculation (#3727)
|
2 years ago |
kalomaze
|
b9ec82d262
grammar : check the full vocab only if necessary (opt) (#4306)
|
2 years ago |
Johannes Gäßler
|
e0a4002273
CUDA: fixed row rounding for 0 tensor splits (#4594)
|
2 years ago |
LeonEricsson
|
7082d24cec
lookup : add prompt lookup decoding example (#4484)
|
2 years ago |