Eve
|
ed0bf32290
readme : modernize (#5379)
|
1 gadu atpakaļ |
Ben Williams
|
9a697d842b
readme : update ui list (#5354)
|
1 gadu atpakaļ |
runfuture
|
316c7faf77
llama : add MiniCPM support (#5346)
|
1 gadu atpakaļ |
Justin Parker
|
f3e2b4fa3f
server : update `/props` with "total_slots" value (#5373)
|
1 gadu atpakaļ |
Sang-Kil Park
|
f68664ac24
convert : fix TypeError on GPT-2 vocab.json (#5288)
|
1 gadu atpakaļ |
Alexey Parfenov
|
213d1439fa
server : remove model.json endpoint (#5371)
|
1 gadu atpakaļ |
Johannes Gäßler
|
17c97fb062
CUDA: mul_mat_vec_q max. batch size 8 -> 4 (#5370)
|
1 gadu atpakaļ |
Kawrakow
|
b08f22c882
Update README.md (#5366)
|
1 gadu atpakaļ |
Kawrakow
|
f57fadc009
Slight quantization improvement for Q4_K and Q5_K (#5361)
|
1 gadu atpakaļ |
BarfingLemurs
|
2e9c0bd6b3
readme : add phi, orion 14b, internlm2, and yi-VL to readme (#5362)
|
1 gadu atpakaļ |
Johannes Gäßler
|
2c516611f1
CUDA: mul_mat_vec_q for batch sizes > 1 (#5351)
|
1 gadu atpakaļ |
Justin Parker
|
8a79c591de
server : include total "num_slots" in props endpoint (#5349)
|
1 gadu atpakaļ |
Michael Coppola
|
31e7903221
server : add `dynatemp_range` and `dynatemp_exponent` (#5352)
|
1 gadu atpakaļ |
Niall Coates
|
4ffc7a17d4
server : various fixes for the prompt field in /completion (#5300)
|
1 gadu atpakaļ |
Georgi Gerganov
|
906cff55c2
py : handle byte tokens in `get_token_type` (#5341)
|
1 gadu atpakaļ |
Johannes Gäßler
|
098f6d737b
make: Use ccache for faster compilation (#5318)
|
1 gadu atpakaļ |
Johannes Gäßler
|
78b00dda6c
README: updated introduction (#5343)
|
1 gadu atpakaļ |
Kawrakow
|
c6b395535a
ggml : make use of ggml-quants.h possible in C++ code (#5338)
|
1 gadu atpakaļ |
Dr. Tom Murphy VII Ph.D
|
abb61944a5
ggml : avoid duplicating function calls using MIN/MAX macros (#5325)
|
1 gadu atpakaļ |
Kawrakow
|
89503dcb5f
iq3_xxs: quards for the no-imatrix situation (#5334)
|
1 gadu atpakaļ |
Guoteng
|
7e1ae372f3
py : fix internlm2-hf convert to gguf (#5305)
|
1 gadu atpakaļ |
Kawrakow
|
6fdfa2ecc6
iq2_xxs: tune quantization (#5320)
|
1 gadu atpakaļ |
Alexey Parfenov
|
a2d60c9158
server : allow to get default generation settings for completion (#5307)
|
1 gadu atpakaļ |
l3utterfly
|
e6f8177532
common : add dynamic temperature parameters to main example cli (#5295)
|
1 gadu atpakaļ |
Georgi Gerganov
|
30679d438d
scripts : fix typos, cleanup (#5303)
|
1 gadu atpakaļ |
Нияз Гарифзянов
|
4be04c8965
scripts : add non-interactive server-llm.sh (#5303)
|
1 gadu atpakaļ |
chiranko
|
5d55b0cd82
readme : add CodeShell models to the supported models list (#5330)
|
1 gadu atpakaļ |
AidanBeltonS
|
4833ac209d
[SYCL] Fix cpy with dims of 3 (#5289)
|
1 gadu atpakaļ |
github-actions[bot]
|
9392ebd49e
flake.lock: Update
|
1 gadu atpakaļ |
Kawrakow
|
5ed26e1fc9
Adding some imatrix tools (#5302)
|
1 gadu atpakaļ |