Kawrakow
|
5ed26e1fc9
Adding some imatrix tools (#5302)
|
hai 1 ano |
Welby Seely
|
277fad30c6
cmake : use set() for LLAMA_WIN_VER (#5298)
|
hai 1 ano |
Johannes Gäßler
|
3c0d25c475
make: add nvcc info print (#5310)
|
hai 1 ano |
Johannes Gäßler
|
3cc5ed353c
make: fix nvcc optimization flags for host code (#5309)
|
hai 1 ano |
Martin Schwaighofer
|
60ecf099ed
add Vulkan support to Nix flake
|
%!s(int64=2) %!d(string=hai) anos |
0cc4m
|
e920ed393d
Vulkan Intel Fixes, Optimizations and Debugging Flags (#5301)
|
hai 1 ano |
Michael Klimenko
|
52bb63c708
refactor : switch to emplace_back to avoid extra object (#5291)
|
hai 1 ano |
Jared Van Bortel
|
1ec3332ade
YaRN : store rope scaling type as int32_t in memory (#5285)
|
hai 1 ano |
BADR
|
6a66c5071a
readme : add tenere in the ui tools list (#5284)
|
hai 1 ano |
AidanBeltonS
|
a305dba8ff
Fix im2col with 32fp (#5286)
|
hai 1 ano |
kalomaze
|
191221178f
perplexity : fix KL divergence calculations on Windows (#5273)
|
hai 1 ano |
Georgi Gerganov
|
e437b37fd0
scripts : parse wtype in server-llm.sh (#5167)
|
hai 1 ano |
Mirror Azure
|
2d40085c26
py : add check for '.attn.masked_bias' layers to GPT2model (#5281)
|
hai 1 ano |
AidanBeltonS
|
b05102fe8c
Tidy ggml-sycl (#5261)
|
hai 1 ano |
Xuan Son Nguyen
|
6b91b1e0a9
docker : add build for SYCL, Vulkan + update readme (#5228)
|
hai 1 ano |
Meng, Hengyu
|
e805f0fa99
[SYCL] get MAX_MEM_ALLOC from device property (#5270)
|
hai 1 ano |
Neo Zhang Jianyu
|
af3ba5d946
[SYCL] update guide of SYCL backend (#5254)
|
hai 1 ano |
Ian Bull
|
e1e721094d
llama : fix memory leak in llama_batch_free (#5252)
|
hai 1 ano |
Neo Zhang Jianyu
|
128dcbd3c9
add --no-mmap in llama-bench (#5257)
|
hai 1 ano |
0cc4m
|
4d0924a890
Vulkan Phi Fix for AMD Proprietary Drivers (#5260)
|
hai 1 ano |
slaren
|
8ca511cade
cuda : fix LLAMA_CUDA_F16 (#5262)
|
hai 1 ano |
Ali Nehzat
|
d71ac90985
make : generate .a library for static linking (#5205)
|
hai 1 ano |
Guoteng
|
ce32060198
llama : support InternLM2 (#5184)
|
hai 1 ano |
Eve
|
1cfb5372cf
Fix broken Vulkan Cmake (properly) (#5230)
|
hai 1 ano |
Georgi Gerganov
|
d3bac7d584
llama : reorder build_orion() at correct place (#5118)
|
hai 1 ano |
Georgi Gerganov
|
5cb04dbc16
llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240)
|
hai 1 ano |
Georgi Gerganov
|
efb7bdbbd0
metal : add im2col F32 dst support (#5132)
|
hai 1 ano |
JidongZhang-THU
|
15606309a0
llava : add MobileVLM support (#5132)
|
hai 1 ano |
Neo Zhang Jianyu
|
b2b9f025e7
format license text, restore apache license by legal suggestion (#5233)
|
hai 1 ano |
slaren
|
dabcc5b471
ggml : limit n_threads to the max n_tasks (#5238)
|
hai 1 ano |