l3utterfly
|
5eaf9964fc
llama : dynamic temperature sampling (#4972)
|
2 سال پیش |
Jared Van Bortel
|
d292f4f204
examples : make pydantic scripts pass mypy and support py3.8 (#5099)
|
2 سال پیش |
Valentin Konovalov
|
256d1bb0dd
android : use release cmake build type by default (#5123)
|
2 سال پیش |
Kawrakow
|
faa3526a1e
Fix Q3_K_XS for MoE models (#5113)
|
2 سال پیش |
Georgi Gerganov
|
ddc5a5033f
metal : show compile log messages
|
2 سال پیش |
Engininja2
|
cd4fddb29f
cuda : fix 2-bit quants on amd hip (#5105)
|
2 سال پیش |
Michael Hueschen
|
c9b316c78f
nix-shell: use addToSearchPath
|
2 سال پیش |
Michael Hueschen
|
bf63d695b8
nix: add cc to devShell LD_LIBRARY_PATH
|
2 سال پیش |
slaren
|
1387ea2117
llama : pre-allocate input tensors in a separate buffer (#5100)
|
2 سال پیش |
Georgi Gerganov
|
26d607608d
metal : disable support for MUL_MAT F32 x F16
|
2 سال پیش |
Kawrakow
|
44879ee885
Additional KL-divergence statistics (#5081)
|
2 سال پیش |
Johannes Gäßler
|
9ecdd12e95
CUDA: more info when no device code (#5088)
|
2 سال پیش |
Georgi Gerganov
|
89758723c7
minor : clean-up some warnings and style (#5094)
|
2 سال پیش |
Xuan Son Nguyen
|
2bed4aa3f3
devops : add intel oneapi dockerfile (#5068)
|
2 سال پیش |
Michael Coppola
|
125d03a503
llama.vim : added api key support (#5090)
|
2 سال پیش |
slaren
|
011e8ec577
llama : fix not enough space in buffer with Qwen (#5086)
|
2 سال پیش |
Kawrakow
|
6f9939d119
KL-divergence (#5076)
|
2 سال پیش |
Reinforce-II
|
780e24a22e
ggml : parallelize FP32 conversion when using BLAS (#5045)
|
2 سال پیش |
XiaotaoChen
|
3ce7e8f8e7
llava : MobileVLM support (#4954)
|
2 سال پیش |
Someone Serge
|
b2d80e105a
flake.nix: add a comment about flakes vs nix
|
2 سال پیش |
Someone Serge
|
28603cd283
nix: add a comment on the many nixpkgs-with-cuda instances
|
2 سال پیش |
Someone Serge
|
5e97ec91ae
nix: add a comment about makeScope
|
2 سال پیش |
Someone Serge
|
7251870780
nix: refactor the cleanSource rules
|
2 سال پیش |
Someone Serge
|
fe8b3c0d4b
workflows: nix-ci: drop the redundant "paths" filter
|
2 سال پیش |
Someone Serge
|
f4dd059259
workflows: nix-build-aarch64: rate limit
|
2 سال پیش |
Someone Serge
|
f7276f7500
workflows: nix-ci: rebuild on flake.lock updates
|
2 سال پیش |
Kawrakow
|
15bceec2d7
imatrix : keep intermediate imatrix results (#5077)
|
2 سال پیش |
compilade
|
d6bd4d46dd
llama : support StableLM 2 1.6B (#5052)
|
2 سال پیش |
Daniel Bevenius
|
152d9d05e0
finetune : print sample-start/include-sample-start (#5072)
|
2 سال پیش |
Kawrakow
|
66d575c45c
llama : add Q3_K_XS (#5060)
|
2 سال پیش |