cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Yuri Khrustalev	284e5b0275 cmake : make it possible linking ggml as external lib (ggml/1003)	há 1 ano atrás
Plamen Minev	e2292aaa17 metal : fix minor string leaks (ggml/1004)	há 1 ano atrás
Diego Devesa	9f40989351 ggml : move CPU backend to a separate file (#10144)	há 1 ano atrás
Georgi Gerganov	08828a6d7d metal : minor fixup in FA kernel (#10143)	há 1 ano atrás
Georgi Gerganov	1839f69130 flake.lock: Update (#10146)	há 1 ano atrás
Christian Köhnenkamp	9830b6923b Add apple arm to presets (#10134)	há 1 ano atrás
sasha0552	42cadc74bd server : fix slot selection by lru (#10126)	há 1 ano atrás
Georgi Gerganov	45950415ed server : fix endpoint checks (#10135)	há 1 ano atrás
Georgi Gerganov	1926d6e39d llama : adjust default context size + print warnings (#10136)	há 1 ano atrás
Diego Devesa	b634f8a26f simple-chat : only add bos on first prompt (#10129)	há 1 ano atrás
Xuan Son Nguyen	7554aa4655 convert-lora : make `--base` optional (#10110)	há 1 ano atrás
Diego Devesa	a6744e43e8 llama : add simple-chat example (#10124)	há 1 ano atrás
Diego Devesa	e991e3127f llama : use smart pointers for ggml resources (#10117)	há 1 ano atrás
Shupei Fan	418f5eef26 vulkan : improve ggml_vk_create_buffer error handling (#9898)	há 1 ano atrás
Georgi Gerganov	ba6f62eb79 readme : update hot topics	há 1 ano atrás
sasha0552	d865d1478c server : fix smart selection of available slot (#10120)	há 1 ano atrás
Georgi Gerganov	1804adb0cf ggml : remove ggml_scratch (#10121)	há 1 ano atrás
Georgi Gerganov	815fe72adc sync : ggml	há 1 ano atrás
Georgi Gerganov	f221d56220 ggml : alloc ggml_contexts on the heap (whisper/2525)	há 1 ano atrás
Zhenwei Jin	e597e50794 build: fix build error in Windows env with OneAPI setup (#10107)	há 1 ano atrás
Diego Devesa	85679d37f3 llama : improve output buffer type selection (#10098)	há 1 ano atrás
Diego Devesa	1e9f94994e quantize : fix --keep-split (#10114)	há 1 ano atrás
Diego Devesa	c02e5ab2a6 llama : fix buffer checks for mamba and rwk (#10111)	há 1 ano atrás
Zhenwei Jin	ab3d71f97f loader: refactor tensor weights storage (#9935)	há 1 ano atrás
Kevin Gibbons	0a683e8088 server : include scheme when printing URL (#10106)	há 1 ano atrás
Diego Devesa	dea5e86051 ggml : check tensor name lengths in gguf files (#10100)	há 1 ano atrás
Sergio López	1329c0a75e kompute: add mul_mat_q4_k shader (#10097)	há 1 ano atrás
Sergio López	61408e7fad kompute: add backend registry / device interfaces (#10045)	há 1 ano atrás
Diego Devesa	b9e02e8184 ggml : fix memory leaks when loading invalid gguf files (#10094)	há 1 ano atrás
Rich Dougherty	6763f713bb readme : more lora detail in main example readme (#10064)	há 1 ano atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits