cturan/llama.cpp

Auteur	SHA1 Message	Date
Georgi Gerganov	c8255f8a6b scripts : print list of sync commits	il y a 2 ans
Tamotsu Takahashi	441f51dca0 ci : build with CLBlast + ggml-opencl use GGML_API (whisper/1576)	il y a 2 ans
Georgi Gerganov	38b3de4658 sync : ggml	il y a 2 ans
bssrdf	afc8c19291 ggml : fix some mul mat cases + add tests for src1 F16 (ggml/669)	il y a 2 ans
Georgi Gerganov	ca38b8d334 scripts : do not sync commits from this repo	il y a 2 ans
Justine Tunney	65e5f6dadb Fix OpenAI server sampling w.r.t. temp and seed (#4668)	il y a 2 ans
manikbhandari	ea5497df5d gpt2 : Add gpt2 architecture integration (#4555)	il y a 2 ans
Nam D. Tran	f6793491b5 llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)	il y a 2 ans
Daniel Bevenius	879b690a9e finetune : fix output formatting in print_params (#4653)	il y a 2 ans
Georgi Gerganov	b47879b0dd scripts : add sync-ggml-am.sh	il y a 2 ans
Georgi Gerganov	951010fa53 ggml : fix dot product for ARM (#4630)	il y a 2 ans
wonjun Jang	f56d6077d0 Add byte token type when tokenizer.model is not exists (#4641)	il y a 2 ans
slaren	dc68f0054c cuda : fix vmm pool with multi GPU (#4620)	il y a 2 ans
WillCorticesAI	de8e496437 Update comment for AdamW implementation reference. (#4604)	il y a 2 ans
FantasyGmm	77465dad48 Fix new CUDA10 compilation errors (#4635)	il y a 2 ans
Paul Tsochantaris	a206137f92 Adding Emeltal reference to UI list (#4629)	il y a 2 ans
slaren	b9f47952ff simplify bug issue template (#4623)	il y a 2 ans
Shintarou Okada	753be377b6 llama : add PLaMo model (#3557)	il y a 2 ans
slaren	5bf3953d7e cuda : improve cuda pool efficiency using virtual memory (#4606)	il y a 2 ans
slaren	708e179e85 fallback to CPU buffer if host buffer alloc fails (#4610)	il y a 2 ans
Samuel Maynard	925e5584a0 ci(docker): fix tags in "Build and push docker image (tagged)" (#4603)	il y a 2 ans
Alexey Parfenov	6123979952 server : allow to specify custom prompt for penalty calculation (#3727)	il y a 2 ans
kalomaze	b9ec82d262 grammar : check the full vocab only if necessary (opt) (#4306)	il y a 2 ans
Johannes Gäßler	e0a4002273 CUDA: fixed row rounding for 0 tensor splits (#4594)	il y a 2 ans
LeonEricsson	7082d24cec lookup : add prompt lookup decoding example (#4484)	il y a 2 ans
Georgi Gerganov	ba66175132 sync : ggml (fix im2col) (#4591)	il y a 2 ans
FantasyGmm	a55876955b cuda : fix jetson compile error (#4560)	il y a 2 ans
Henrik Forstén	6724ef1657 Fix CudaMemcpy direction (#4599)	il y a 2 ans
slaren	48b7ff193e llama : fix platforms without mmap (#4578)	il y a 2 ans
Herman Semenov	48b24b170e ggml : add comment about backward GGML_OP_DIAG_MASK_INF (#4203)	il y a 2 ans

Récemment Précédemment

Historique des commits Trouver

Historique des commits