Johannes Gäßler
|
af0a5b6163
server: fix incorrectly reported token probabilities (#7125)
|
пре 1 година |
David Renshaw
|
3f167476b1
sampling : use std::random_device{}() for default random seed (#6962)
|
пре 1 година |
Johannes Gäßler
|
28103f4832
Server: fix seed for multiple slots (#6835)
|
пре 1 година |
Minsoo Cheong
|
586e7bc561
sampling : deduplicated code for probability distribution access (#6240)
|
пре 1 година |
Clint Herron
|
463628372d
grammar : handle missing "root" node (#6004)
|
пре 1 година |
Minsoo Cheong
|
6d341ab6c5
speculative : implement stochastic speculative sampling (#5625)
|
пре 1 година |
Pierrick Hymbert
|
e3965cf35a
server: tests - slow inference causes timeout on the CI (#5715)
|
пре 1 година |
Robey Holderith
|
5ee99c32f5
common, server : surface min_keep as its own parameter (#5567)
|
пре 1 година |
Georgi Gerganov
|
689a091bbe
sampling : do not set min_keep to n_probs (#5564)
|
пре 1 година |
Alexey Parfenov
|
6dcc02d244
server : add "samplers" param to control the samplers order (#5494)
|
пре 1 година |
Alexey Parfenov
|
a803333a4e
common : use enums for sampler types (#5418)
|
пре 1 година |
Georgi Gerganov
|
139b62a839
common : fix compile warning
|
пре 1 година |
Johannes Gäßler
|
26d4efd11e
sampling: fix top_k <= 0 (#5388)
|
пре 1 година |
Michael Klimenko
|
35a2ee9143
Remove unused data and add fixes (#5154)
|
пре 2 година |
l3utterfly
|
5eaf9964fc
llama : dynamic temperature sampling (#4972)
|
пре 2 година |
David Friehs
|
4483396751
llama : apply classifier-free guidance to logits directly (#4951)
|
пре 2 година |
Alexey Parfenov
|
6123979952
server : allow to specify custom prompt for penalty calculation (#3727)
|
пре 2 година |
kalomaze
|
b9ec82d262
grammar : check the full vocab only if necessary (opt) (#4306)
|
пре 2 година |
Georgi Gerganov
|
caa9249217
common : fix compile warning
|
пре 2 година |
MaggotHATE
|
52c8bc3cf3
sampling : custom samplers order (#4285)
|
пре 2 година |
l3utterfly
|
e75dfdd31b
sampling : null grammar field after reset (#3885)
|
пре 2 година |
kalomaze
|
238657db23
samplers : Min-P sampler implementation [alternative to Top P/Top K] (#3841)
|
пре 2 година |
Georgi Gerganov
|
ee1a0ec9cb
llama : add option for greedy sampling with probs (#3813)
|
пре 2 година |
Marcus Dunn
|
5be6c803fa
llama : remove token functions with `context` args in favor of `model` (#3720)
|
пре 2 година |
Georgi Gerganov
|
d1031cf49c
sampling : refactor init to use llama_sampling_params (#3696)
|
пре 2 година |
Georgi Gerganov
|
0e89203b51
speculative : add tree-based sampling example (#3624)
|
пре 2 година |
Kerfuffle
|
70c29da118
common : fix mirostat state when using multiple sequences (#3543)
|
пре 2 година |