cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Kawrakow	5ed26e1fc9 Adding some imatrix tools (#5302)	hai 1 ano
Welby Seely	277fad30c6 cmake : use set() for LLAMA_WIN_VER (#5298)	hai 1 ano
Johannes Gäßler	3c0d25c475 make: add nvcc info print (#5310)	hai 1 ano
Johannes Gäßler	3cc5ed353c make: fix nvcc optimization flags for host code (#5309)	hai 1 ano
Martin Schwaighofer	60ecf099ed add Vulkan support to Nix flake	%!s(int64=2) %!d(string=hai) anos
0cc4m	e920ed393d Vulkan Intel Fixes, Optimizations and Debugging Flags (#5301)	hai 1 ano
Michael Klimenko	52bb63c708 refactor : switch to emplace_back to avoid extra object (#5291)	hai 1 ano
Jared Van Bortel	1ec3332ade YaRN : store rope scaling type as int32_t in memory (#5285)	hai 1 ano
BADR	6a66c5071a readme : add tenere in the ui tools list (#5284)	hai 1 ano
AidanBeltonS	a305dba8ff Fix im2col with 32fp (#5286)	hai 1 ano
kalomaze	191221178f perplexity : fix KL divergence calculations on Windows (#5273)	hai 1 ano
Georgi Gerganov	e437b37fd0 scripts : parse wtype in server-llm.sh (#5167)	hai 1 ano
Mirror Azure	2d40085c26 py : add check for '.attn.masked_bias' layers to GPT2model (#5281)	hai 1 ano
AidanBeltonS	b05102fe8c Tidy ggml-sycl (#5261)	hai 1 ano
Xuan Son Nguyen	6b91b1e0a9 docker : add build for SYCL, Vulkan + update readme (#5228)	hai 1 ano
Meng, Hengyu	e805f0fa99 [SYCL] get MAX_MEM_ALLOC from device property (#5270)	hai 1 ano
Neo Zhang Jianyu	af3ba5d946 [SYCL] update guide of SYCL backend (#5254)	hai 1 ano
Ian Bull	e1e721094d llama : fix memory leak in llama_batch_free (#5252)	hai 1 ano
Neo Zhang Jianyu	128dcbd3c9 add --no-mmap in llama-bench (#5257)	hai 1 ano
0cc4m	4d0924a890 Vulkan Phi Fix for AMD Proprietary Drivers (#5260)	hai 1 ano
slaren	8ca511cade cuda : fix LLAMA_CUDA_F16 (#5262)	hai 1 ano
Ali Nehzat	d71ac90985 make : generate .a library for static linking (#5205)	hai 1 ano
Guoteng	ce32060198 llama : support InternLM2 (#5184)	hai 1 ano
Eve	1cfb5372cf Fix broken Vulkan Cmake (properly) (#5230)	hai 1 ano
Georgi Gerganov	d3bac7d584 llama : reorder build_orion() at correct place (#5118)	hai 1 ano
Georgi Gerganov	5cb04dbc16 llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240)	hai 1 ano
Georgi Gerganov	efb7bdbbd0 metal : add im2col F32 dst support (#5132)	hai 1 ano
JidongZhang-THU	15606309a0 llava : add MobileVLM support (#5132)	hai 1 ano
Neo Zhang Jianyu	b2b9f025e7 format license text, restore apache license by legal suggestion (#5233)	hai 1 ano
slaren	dabcc5b471 ggml : limit n_threads to the max n_tasks (#5238)	hai 1 ano

Posterior Anterior

Commit History Buscar

Commit History