Cebtenzzre
|
ecf90b1a51
gguf : make token scores and types optional (#3347)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
2619109ad5
ci : disable freeBSD builds due to lack of VMs (#3381)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
ec893798b7
llama : custom attention mask + parallel decoding + no context swaps (#3228)
|
%!s(int64=2) %!d(string=hai) anos |
Kevin Ji
|
45855b3f1c
docs : mark code as Bash (#3375)
|
%!s(int64=2) %!d(string=hai) anos |
Pierre Alexandre SCHEMBRI
|
4aea3b846e
readme : add Mistral AI release 0.1 (#3362)
|
%!s(int64=2) %!d(string=hai) anos |
slaren
|
da0400344b
ggml-cuda : perform cublas fp16 matrix multiplication as fp16 (#3370)
|
%!s(int64=2) %!d(string=hai) anos |
Zhang Peiyuan
|
e519621010
convert : remove bug in convert.py permute function (#3364)
|
%!s(int64=2) %!d(string=hai) anos |
Richard Roberson
|
ac43576124
make-ggml.py : compatibility with more models and GGUF (#3290)
|
%!s(int64=2) %!d(string=hai) anos |
Cebtenzzre
|
20c7e1e804
gguf : fix a few general keys (#3341)
|
%!s(int64=2) %!d(string=hai) anos |
Rickard Hallerbäck
|
dc6897404e
metal : reusing llama.cpp logging (#3152)
|
%!s(int64=2) %!d(string=hai) anos |
Jag Chadha
|
527e57cfd8
build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342)
|
%!s(int64=2) %!d(string=hai) anos |
BarfingLemurs
|
ffe88a36a9
readme : add some recent perplexity and bpw measurements to READMES, link for k-quants (#3340)
|
%!s(int64=2) %!d(string=hai) anos |
DAN™
|
99115f3fa6
cmake : fix build-info.h on MSVC (#3309)
|
%!s(int64=2) %!d(string=hai) anos |
2f38b454
|
1726f9626f
docs: Fix typo CLBlast_DIR var. (#3330)
|
%!s(int64=2) %!d(string=hai) anos |
Erik Scholz
|
a98b1633d5
nix : add cuda, use a symlinked toolkit for cmake (#3202)
|
%!s(int64=2) %!d(string=hai) anos |
slaren
|
c091cdfb24
llama-bench : add README (#3317)
|
%!s(int64=2) %!d(string=hai) anos |
Cebtenzzre
|
51a7cf5c6e
examples : fix RoPE defaults to match PR #3240 (#3315)
|
%!s(int64=2) %!d(string=hai) anos |
Kevin Ji
|
bedb92b603
scripts : use `/usr/bin/env` in shebang (#3313)
|
%!s(int64=2) %!d(string=hai) anos |
Lee Drake
|
bc9d3e3971
Update README.md (#3289)
|
%!s(int64=2) %!d(string=hai) anos |
shibe2
|
36b904e200
ggml-opencl.cpp: Make private functions static (#3300)
|
%!s(int64=2) %!d(string=hai) anos |
Edward Taylor
|
324f3403d5
zig : fix for updated c lib (#3259)
|
%!s(int64=2) %!d(string=hai) anos |
yuiseki
|
f56c418ab0
embedding : update README.md (#3224)
|
%!s(int64=2) %!d(string=hai) anos |
Johannes Gäßler
|
8185710a80
CUDA: use only 1 thread if fully offloaded (#2915)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
7eb41179ed
readme : update hot topics
|
%!s(int64=2) %!d(string=hai) anos |
Cebtenzzre
|
a5661d7e71
llama : allow gguf RoPE keys to be overridden with defaults (#3240)
|
%!s(int64=2) %!d(string=hai) anos |
Cebtenzzre
|
65c2c1c5ab
benchmark-matmult : do not use integer abs() on a float (#3277)
|
%!s(int64=2) %!d(string=hai) anos |
kang
|
80834daecf
flake : Restore default package's buildInputs (#3262)
|
%!s(int64=2) %!d(string=hai) anos |
Alon
|
a40f2b656f
CI: FreeBSD fix (#3258)
|
%!s(int64=2) %!d(string=hai) anos |
Georgi Gerganov
|
d119c04c15
examples : fix benchmark-matmult (#1554)
|
%!s(int64=2) %!d(string=hai) anos |
Cebtenzzre
|
8781013ef6
make : restore build-info.h dependency for several targets (#3205)
|
%!s(int64=2) %!d(string=hai) anos |