cturan/llama.cpp

Author	SHA1 Message	Date
Paweł Wodnicki	3f1ae2e32c Update README.md (#9591)	1 year ago
Georgi Gerganov	f1b8c42711 sync : ggml	1 year ago
Johannes Gäßler	e98c1c188e test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)	1 year ago
Salvatore Mesoraca	cb00020504 vulkan : mul_mat: fix UB with small warps (ggml/952)	1 year ago
Borislav Stanimirov	6c5322481a ggml : fix ggml_cast (ggml/973)	1 year ago
Johannes Gäßler	7254cdf7e8 ggml: fix gradient allocation logic (ggml/966)	1 year ago
Georgi Gerganov	cad341d889 metal : reduce command encoding overhead (#9698)	1 year ago
Georgi Gerganov	a90484c6d9 llama : print correct model type for Llama 3.2 1B and 3B	1 year ago
compilade	1927378bcc convert : refactor rope_freqs generation (#9396)	1 year ago
serhii-nakon	6f1d9d71f4 Fix Docker ROCM builds, use AMDGPU_TARGETS instead of GPU_TARGETS (#9641)	1 year ago
compilade	511636df0c ci : reduce severity of unused Pyright ignore comments (#9697)	1 year ago
vb	08a43d05b6 py : update transfomers version (#9694)	1 year ago
Georgi Gerganov	ace4f4be37 flake.lock: Update (#9680)	1 year ago
Ruchira Hasaranga	8277a817f1 console : utf-8 fix for windows stdin (#9690)	1 year ago
Georgi Gerganov	c919d5db39 ggml : define missing HWCAP flags (#9684)	1 year ago
Georgi Gerganov	d0b1d663e4 sync : ggml	1 year ago
Johannes Gäßler	aaa4099925 CUDA: remove bad assert (ggml/972)	1 year ago
Jeff Bolz	641002fba8 vulkan : multithread pipeline creation (ggml/963)	1 year ago
Jeff Bolz	0de8b203f1 vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961)	1 year ago
Salvatore Mesoraca	544f409b4b vulkan : argsort barriers must be under uniform control flow (ggml/951)	1 year ago
Georgi Gerganov	6084bfb261 ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969)	1 year ago
matiaslin	faac0bae26 common : ensure llama_batch size does not exceed max size (#9668)	1 year ago
nopperl	f99d3f8367 py : add model class for Chameleon conversion (#9683)	1 year ago
Georgi Gerganov	589b48d41e contrib : add Resources section (#9675)	1 year ago
Georgi Gerganov	f4d2b8846a llama : add reranking support (#9510)	1 year ago
slaren	1b2f992cd2 test-backend-ops : use flops for some performance tests (#9657)	1 year ago
Georgi Gerganov	739842703e llama : add comment about thread-safety [no ci] (#9449)	1 year ago
Zhenwei Jin	6102037bbb vocab : refactor tokenizer to reduce init overhead (#9449)	1 year ago
nopperl	9a913110cf llama : add support for Chameleon (#8543)	1 year ago
Aarni Koskela	43bcdd9703 readme : add tool (#9655)	1 year ago

Commit History