Ananta Bastola
|
6e4e973b26
ci : add an option to fail on compile warning (#3952)
|
1 year ago |
Johannes Gäßler
|
ad014bba97
make: add error message for bad CUDA version (#5444)
|
1 year ago |
Johannes Gäßler
|
098f6d737b
make: Use ccache for faster compilation (#5318)
|
1 year ago |
Johannes Gäßler
|
3c0d25c475
make: add nvcc info print (#5310)
|
1 year ago |
Johannes Gäßler
|
3cc5ed353c
make: fix nvcc optimization flags for host code (#5309)
|
1 year ago |
0cc4m
|
e920ed393d
Vulkan Intel Fixes, Optimizations and Debugging Flags (#5301)
|
1 year ago |
Ali Nehzat
|
d71ac90985
make : generate .a library for static linking (#5205)
|
1 year ago |
0cc4m
|
2307523d32
ggml : add Vulkan backend (#2059)
|
2 years ago |
Xuan Son Nguyen
|
48c857aa10
server : refactored the task processing logic (#5065)
|
2 years ago |
crasm
|
413e7b0559
ci : add model tests + script wrapper (#4586)
|
2 years ago |
Georgi Gerganov
|
c918fe8dca
metal : create autorelease pool during library build (#4970)
|
2 years ago |
Georgi Gerganov
|
4be5ef556d
metal : remove old API (#4919)
|
2 years ago |
Kawrakow
|
326b418b59
Importance Matrix calculation (#4861)
|
2 years ago |
Georgi Gerganov
|
b0034d93ce
examples : add passkey test (#3856)
|
2 years ago |
slaren
|
5bf3953d7e
cuda : improve cuda pool efficiency using virtual memory (#4606)
|
2 years ago |
LeonEricsson
|
7082d24cec
lookup : add prompt lookup decoding example (#4484)
|
2 years ago |
FantasyGmm
|
a55876955b
cuda : fix jetson compile error (#4560)
|
2 years ago |
Michael Kesper
|
28cb35a0ec
make : add LLAMA_HIP_UMA option (#4587)
|
2 years ago |
Georgi Gerganov
|
32259b2dad
gguf : simplify example dependencies
|
2 years ago |
slaren
|
d232aca5a7
llama : initial ggml-backend integration (#4520)
|
2 years ago |
Matheus Gabriel Alves Silva
|
919c40660f
build : Check the ROCm installation location (#4485)
|
2 years ago |
Jared Van Bortel
|
70f806b821
build : detect host compiler and cuda compiler separately (#4414)
|
2 years ago |
slaren
|
799a1cb13b
llama : add Mixtral support (#4406)
|
2 years ago |
Jared Van Bortel
|
6138963fb2
build : target Windows 8 for standard mingw-w64 (#4405)
|
2 years ago |
Georgi Gerganov
|
fe680e3d10
sync : ggml (new ops, tests, backend, etc.) (#4359)
|
2 years ago |
Jared Van Bortel
|
511f52c334
build : enable libstdc++ assertions for debug builds (#4275)
|
2 years ago |
WillCorticesAI
|
d2809a3ba2
make : fix Apple clang determination bug (#4272)
|
2 years ago |
Jared Van Bortel
|
15f5d96037
build : fix build info generation and cleanup Makefile (#3920)
|
2 years ago |
Georgi Gerganov
|
922754a8d6
lookahead : add example for lookahead decoding (#4207)
|
2 years ago |
Kerfuffle
|
28a2e6e7d4
tokenize example: Respect normal add BOS token behavior (#4126)
|
2 years ago |