Eric Zhang
|
6902cb7f2e
server : stop gracefully on SIGTERM (#6348)
|
1 year ago |
hutli
|
d2d8f38996
nix: removed unnecessary indentation
|
1 year ago |
hutli
|
d39b308eaf
nix: moved blas availability check to package inputs so it is still overridable
|
1 year ago |
hutli
|
c873976649
using blas.meta.available to check host platform
|
1 year ago |
hutli
|
dbb03e2b9c
only using explicit blas if hostPlatform is allowed
|
1 year ago |
Someone Serge
|
e9f17dc3bf
nix: .#windows: proper cross-compilation set-up
|
1 year ago |
Someone Serge
|
22a462cc1f
nix: package: don't introduce the dependency on python
|
1 year ago |
hutli
|
f6a0f5c642
nix: .#widnows: init
|
1 year ago |
Ziang Wu
|
d0e2f6416b
doc: fix typo in MobileVLM-README.md (#6181)
|
1 year ago |
Neo Zhang Jianyu
|
25f4a613c4
[SYCL] fix set main gpu crash (#6339)
|
1 year ago |
Pierrick Hymbert
|
a016026a3a
server: continuous performance monitoring and PR comment (#6283)
|
1 year ago |
Someone Serge
|
53c7ec53d5
nix: ci: don't test cuda and rocm (for now)
|
1 year ago |
slaren
|
e5b89a441a
ggml : fix bounds checking of zero size views (#6347)
|
1 year ago |
Georgi Gerganov
|
3a0345970e
make : whitespace
|
1 year ago |
howlger
|
1e13987fba
embedding : show full embedding for single prompt (#6342)
|
1 year ago |
AidanBeltonS
|
e82f9e2b83
[SYCL] Fix batched impl for NVidia GPU (#6164)
|
1 year ago |
Kawrakow
|
cbc8343619
Make IQ1_M work for QK_K = 64 (#6327)
|
1 year ago |
Sigbjørn Skjæret
|
e562b9714b
common : change --no-penalize-nl to --penalize-nl (#6334)
|
1 year ago |
Georgi Gerganov
|
2ab4f00d25
llama2c : open file as binary (#6332)
|
1 year ago |
Mateusz Charytoniuk
|
1740d6dd4e
readme : add php api bindings (#6326)
|
1 year ago |
Eric Zhang
|
0642b22cd1
server: public: use relative routes for static files (#6325)
|
1 year ago |
Neo Zhang Jianyu
|
a4f569e8a3
[SYCL] fix no file in win rel (#6314)
|
1 year ago |
Jared Van Bortel
|
32c8486e1f
wpm : portable unicode tolower (#6305)
|
1 year ago |
compilade
|
557410b8f0
llama : greatly reduce output buffer memory usage (#6122)
|
1 year ago |
Kawrakow
|
55c1b2a3bb
IQ1_M: 1.75 bpw quantization (#6302)
|
1 year ago |
Pedro Cuenca
|
e097633f63
convert-hf : fix exception in sentencepiece with added tokens (#6320)
|
1 year ago |
Kawrakow
|
d25b1c31b0
quantize : be able to override metadata by key (#6321)
|
1 year ago |
Minsoo Cheong
|
deb7240100
embedding : adjust `n_ubatch` value (#6296)
|
1 year ago |
Jan Boon
|
3d032ece8e
server : add `n_discard` parameter (#6300)
|
1 year ago |
Joseph Stahl
|
e190f1fca6
nix: make `xcrun` visible in Nix sandbox for precompiling Metal shaders (#6118)
|
1 year ago |