cturan/llama.cpp

Author	SHA1 Message	Date
Romain D	3a6efdd03c convert : use f32 outtype for bf16 tensors (#6106)	1 year ago
Pierrick Hymbert	d01b3c4c32 common: llama_load_model_from_url using --model-url (#6098)	1 year ago
Georgi Gerganov	cd776c37c9 ci : close all stale issues at once (#6115)	1 year ago
GainLee	dc0f612548 ggml:fix finding transfer queue family index error (#6094)	1 year ago
AmirAli Mirian	c47cf414ef ggml : add AVX512F SIMD (#6088)	1 year ago
Daniel Bevenius	b5f4ae09c3 gritlm : add initial README.md (#6086)	1 year ago
Xuan Son Nguyen	dfbfdd60f9 readme : add wllama as a wasm binding (#6100)	1 year ago
DAN™	15961ec04d common : refactor nested if causing error C1061 on MSVC (#6101)	1 year ago
Pierrick Hymbert	a56d09a440 ci : close inactive issue with workflow (#6053)	1 year ago
slaren	d84c48505f llama : fix Baichuan2 13B (#6092)	1 year ago
Theia Vogel	877b4d0c62 llama : add support for control vectors (#5970)	1 year ago
Andrew Canis	12247f4c69 llama : add Command-R support (#6033)	1 year ago
Ting Lou	4e9a7f7f7f llava : change API to pure C style for Rust FFI bindgen (#6079)	1 year ago
slaren	3020327f6c cuda : disable unused cudaLaunchHostFunc code (#6078)	1 year ago
Neo Zhang Jianyu	46acb36767 fix set main gpu error (#6073)	1 year ago
Georgi Gerganov	131b058409 make : ggml-metal.o depends on ggml.h	1 year ago
AidanBeltonS	753e36f650 [SYCL] Fix non-intel device selection (#6042)	1 year ago
Ondřej Čertík	7ce2c77f88 gguf : add support for I64 and F64 arrays (#6062)	1 year ago
Xuan Son Nguyen	aab606a11f llama : add Orion chat template (#6066)	1 year ago
slaren	b0bc9f4a9d llama-bench : use random tokens to improve accuracy with mixtral (#6069)	1 year ago
Georgi Gerganov	4755afd1cb llama : fix integer overflow during quantization (#6063)	1 year ago
Steve Grubb	6e0438da3c gguf : fix resource leaks (#6061)	1 year ago
Ondřej Čertík	727107707a gguf-py : bump version to 0.8.0 (#6060)	1 year ago
Michael Podvitskiy	69ff61397d llama : support models without vocabulary (#5798)	1 year ago
Georgi Gerganov	044ec4b2a5 embedding : add EOS token if not present (#899)	1 year ago
Georgi Gerganov	77178eedc8 gguf-py : fix dtype check (#6045)	1 year ago
Jian Liao	15a333260a readme : improve readme for Llava-1.6 example (#6044)	1 year ago
Pierrick Hymbert	43241adf22 server: disable debug release type sanitizer, simplify trigger (#6047)	1 year ago
Georgi Gerganov	a44bc969e4 llama : fix typo	1 year ago
Michael Podvitskiy	2c4fb69246 llama : optimize defrag moves + fix fragmentation calculation (#6037)	1 year ago
