Olivier Chafik
|
c64d2becb1
`minja`: sync at https://github.com/google/minja/commit/0f5f7f2b3770eb682fbc11763266d45204173686 (#11352)
|
1 year ago |
Jiří Podivín
|
96f4053934
Adding logprobs to /v1/completions (#11344)
|
1 year ago |
Olivier Chafik
|
a94f3b2727
`common`: utils to split / join / repeat strings (from json converter) (#11342)
|
1 year ago |
tc-mb
|
3e3357fd77
llava : support Minicpm-omni (#11289)
|
1 year ago |
Olivier Chafik
|
6171c9d258
Add Jinja template support (#11016)
|
1 year ago |
Xuan Son Nguyen
|
e28245f35f
export-lora : fix tok_embd tensor (#11330)
|
1 year ago |
Radoslav Gerganov
|
6da5bec81c
rpc : better caching of the base buffer pointer (#11331)
|
1 year ago |
Eric Curtin
|
2e2f8f093c
linenoise.cpp refactoring (#11301)
|
1 year ago |
Georgi Gerganov
|
2139667ec4
metal : fix out-of-bounds write (#11314)
|
1 year ago |
Georgi Gerganov
|
80d0d6b4b7
common : add -hfd option for the draft model (#11318)
|
1 year ago |
Jeff Bolz
|
aea8ddd516
vulkan: fix coopmat2 validation failures (#11284)
|
1 year ago |
Georgi Gerganov
|
9f7add1cde
examples : fix add_special conditions (#11311)
|
1 year ago |
Christopher Nielsen
|
90d987b105
mmap: add include for cerrno (#11296)
|
1 year ago |
Michael Podvitskiy
|
a4251edd6f
cmake: fix shell command quoting in build-info script (#11309)
|
1 year ago |
Xuan Son Nguyen
|
ec7f3ac9ab
llama : add support for Deepseek-R1-Qwen distill model (#11310)
|
1 year ago |
Georgi Gerganov
|
ef6dada60c
cont : fix whitespaces (#11305)
|
1 year ago |
Kyle Bruene
|
ae3c1db2f9
llama : re-add LLM_ARCH_PHIMOE (#11305)
|
1 year ago |
Georgi Gerganov
|
92bc493917
tests : increase timeout when sanitizers are enabled (#11300)
|
1 year ago |
Georgi Gerganov
|
b9daaffe02
simple-chat : fix BOS being added to each message (#11278)
|
1 year ago |
Nicolò Scipione
|
99487b57d4
SYCL: Introducing memory host pool (#11251)
|
1 year ago |
Eric Curtin
|
a1649cc13f
Adding linenoise.cpp to llama-run (#11252)
|
1 year ago |
Georgi Gerganov
|
4dd34ff831
cmake : add sanitizer flags for llama.cpp (#11279)
|
1 year ago |
Xuan Son Nguyen
|
f30f099228
server : implement cancellable request (#11285)
|
1 year ago |
Georgi Gerganov
|
f26c874179
scripts : restore hf.sh (#11288)
|
1 year ago |
LostRuins Concedo
|
6390a998bf
tts : add guide tokens support (#11186)
|
1 year ago |
Jeff Bolz
|
44e18ef939
vulkan: fix coopmat2 flash attention for non-contiguous inputs (#11281)
|
1 year ago |
codezjx
|
3edfa7d375
llama.android: add field formatChat to control whether to parse special tokens when send message (#11270)
|
1 year ago |
Radoslav Gerganov
|
667d72846c
rpc : early register backend devices (#11262)
|
1 year ago |
Georgi Gerganov
|
a133566d34
vocab : fix double-eos check (#11273)
|
1 year ago |
David Renshaw
|
960ec65273
llama : fix deprecation message: vocabable -> vocab (#11269)
|
1 year ago |