Ed Lepedus | 0a1d889e27 | server: add cURL support to server Dockerfiles (#6474) | 1 year ago
Minsoo Cheong | 7dda1b727e | ci: exempt master branch workflows from getting cancelled (#6486) | 1 year ago
Ewout ter Hoeven | c666ba26c3 | build CI: Name artifacts (#6482) | 1 year ago
Shakhar Dasgupta | 2e66913e5f | server: allow penalizing repetition of newlines on server webpage (#6431) | 1 year ago
Pierrick Hymbert | 8120efee1d | ci: bench fix concurrency for workflow trigger dispatch with sha1 (#6478) | 1 year ago
limitedAtonement | a74401f0e5 | Correct README link (#6458) | 1 year ago
Pierrick Hymbert | 7a2c92637a | ci: bench: add more ftype, fix triggers and bot comment (#6466) | 1 year ago
Daniel Bevenius | 4bcd6b959c | common: remove duplicate check for curl (#6471) | 1 year ago
Clint Herron | 9b84ae1806 | examples : add GBNF validator program (#5948) | 1 year ago
Georgi Gerganov | 4399f13fb9 | server : remove obsolete --memory-f32 option | 1 year ago
Xiao-Yong Jin | 1a43c7254e | server : add option to disable KV offload (#6468) | 1 year ago
Clint Herron | 72d73af651 | convert : fix for lint error complaining of bare except (#6470) | 1 year ago
Fattire | 5fb1574c81 | A few small fixes to server's README docs (#6428) | 1 year ago
JH23X | 60cdf40cc3 | server : handle exception on wrong type in request (#6452) | 1 year ago
bryanSwk | bb43cf7e9d | llama : add SEA-LION support (#6448) | 1 year ago
Ewout ter Hoeven | 9f62c0173d | ci : update checkout, setup-python and upload-artifact to latest (#6456) | 1 year ago
Ed Lepedus | 5d4f12e462 | server: add cURL support to `server.Dockerfile` (#6461) | 1 year ago
Francisco Melo | 154d4ee39c | readme : add feature-rich rust bindings (#6465) | 1 year ago
Joyce | e69945d953 | security : create policy (#6354) | 1 year ago
Abhishek Gopinath K | db214fa578 | Missing tokenizer.model error during gguf conversion (#6443) | 1 year ago
kaizau | 1ff4d9f3d6 | Add OpenChat, Alpaca, Vicuna chat templates (#6397) | 1 year ago
Georgi Gerganov | 076b08649e | readme : update hot topics | 1 year ago
slaren | 08a0c02060 | ggml : mul_mat_id use the same tensor for all the experts (#6387) | 1 year ago
Meng, Hengyu | 52604860f9 | [SYCL] Disable iqx on windows as WA (#6435) | 1 year ago
Georgi Gerganov | f87f7b8986 | flake.lock: Update (#6402) | 1 year ago
Johannes Gäßler | 33a5244806 | compare-llama-bench.py: fix long hexsha args (#6424) | 1 year ago
Pierrick Hymbert | 226e819371 | ci: server: verify deps are coherent with the commit (#6409) | 1 year ago
Georgi Gerganov | c50a82ce0f | readme : update hot topics | 1 year ago
Pierrick Hymbert | 37e7854c10 | ci: bench: fix Resource not accessible by integration on PR event (#6393) | 1 year ago
Mohammadreza Hendiani | c342d070c6 | Fedora build update (#6388) | 1 year ago