| .. |
|
CMakeLists.txt
|
f32ca51bfe
server: add presets (config) when using multiple models (#17859)
|
1 месяц назад |
|
arg.cpp
|
2bc94e7928
add llama-completion to completion-bash executables (#17976)
|
1 месяц назад |
|
arg.h
|
380b4c984e
common: support negated args (#17919)
|
1 месяц назад |
|
base64.hpp
|
381efbf480
llava : expose as a shared library for downstream projects (#3613)
|
2 лет назад |
|
build-info.cpp.in
|
cc8d081879
cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167)
|
7 месяцев назад |
|
chat-parser-xml-toolcall.cpp
|
636fc17a37
Fix Kimi-K2 tool-call parsing issues (#17376)
|
1 месяц назад |
|
chat-parser-xml-toolcall.h
|
636fc17a37
Fix Kimi-K2 tool-call parsing issues (#17376)
|
1 месяц назад |
|
chat-parser.cpp
|
636fc17a37
Fix Kimi-K2 tool-call parsing issues (#17376)
|
1 месяц назад |
|
chat-parser.h
|
1920345c3b
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2 месяцев назад |
|
chat-peg-parser.cpp
|
0a8026e768
common : introduce composable PEG parser combinators for chat parsing (#17136)
|
1 месяц назад |
|
chat-peg-parser.h
|
0a8026e768
common : introduce composable PEG parser combinators for chat parsing (#17136)
|
1 месяц назад |
|
chat.cpp
|
2fbe3b7bb7
common : add parser for ministral/mistral large 3/devstral 2 (#17713)
|
1 месяц назад |
|
chat.h
|
190c4838bd
chat : reserve memory in compute_diffs and improve naming (#17729)
|
1 месяц назад |
|
common.cpp
|
22577583a3
common : change --color to accept on/off/auto, default to auto (#17827)
|
1 месяц назад |
|
common.h
|
34a6d86982
cli: enable jinja by default (#17911)
|
1 месяц назад |
|
console.cpp
|
6c2131773c
cli: new CLI experience (#17824)
|
1 месяц назад |
|
console.h
|
6c2131773c
cli: new CLI experience (#17824)
|
1 месяц назад |
|
download.cpp
|
b8ee22cfde
common : add minimalist multi-thread progress bar (#17602)
|
1 месяц назад |
|
download.h
|
ec18edfcba
server: introduce API for serving / loading / unloading multiple models (#17470)
|
1 месяц назад |
|
http.h
|
4201deae9c
common: introduce http.h for httplib-based client (#16373)
|
3 месяцев назад |
|
json-partial.cpp
|
1920345c3b
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2 месяцев назад |
|
json-partial.h
|
53f925074d
sync : vendor (#13901)
|
7 месяцев назад |
|
json-schema-to-grammar.cpp
|
c4357dcc35
Server: Change Invalid Schema from Server Error (500) to User Error (400) (#17572)
|
1 месяц назад |
|
json-schema-to-grammar.h
|
1920345c3b
common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932)
|
2 месяцев назад |
|
llguidance.cpp
|
43dfd741a5
llguidance : set tokenizer slices to default (#13424)
|
8 месяцев назад |
|
log.cpp
|
6c2131773c
cli: new CLI experience (#17824)
|
1 месяц назад |
|
log.h
|
6c2131773c
cli: new CLI experience (#17824)
|
1 месяц назад |
|
ngram-cache.cpp
|
5bbe6a9fe9
ggml : portability fixes for VS 2017 (#12150)
|
10 месяцев назад |
|
ngram-cache.h
|
727368c60f
llama : use LLAMA_TOKEN_NULL (#11062)
|
1 год назад |
|
peg-parser.cpp
|
0a8026e768
common : introduce composable PEG parser combinators for chat parsing (#17136)
|
1 месяц назад |
|
peg-parser.h
|
0a8026e768
common : introduce composable PEG parser combinators for chat parsing (#17136)
|
1 месяц назад |
|
preset.cpp
|
380b4c984e
common: support negated args (#17919)
|
1 месяц назад |
|
preset.h
|
f32ca51bfe
server: add presets (config) when using multiple models (#17859)
|
1 месяц назад |
|
regex-partial.cpp
|
3198405e98
`common`: add partial regex support (#12808)
|
8 месяцев назад |
|
regex-partial.h
|
3198405e98
`common`: add partial regex support (#12808)
|
8 месяцев назад |
|
sampling.cpp
|
196f5083ef
common : more accurate sampling timing (#17382)
|
1 месяц назад |
|
sampling.h
|
e92d53b29e
sampling : optimize samplers by reusing bucket sort (#15665)
|
4 месяцев назад |
|
speculative.cpp
|
e92d53b29e
sampling : optimize samplers by reusing bucket sort (#15665)
|
4 месяцев назад |
|
speculative.h
|
94933c8c2e
server : implement universal assisted decoding (#12635)
|
5 месяцев назад |
|
unicode.cpp
|
0a8026e768
common : introduce composable PEG parser combinators for chat parsing (#17136)
|
1 месяц назад |
|
unicode.h
|
0a8026e768
common : introduce composable PEG parser combinators for chat parsing (#17136)
|
1 месяц назад |