Xuan-Son Nguyen 9e39a1e6a9 server: support load model on startup, support preset-only options (#18206) 4 weeks ago
CMakeLists.txt f32ca51bfe server: add presets (config) when using multiple models (#17859) 1 month ago
arg.cpp 9e39a1e6a9 server: support load model on startup, support preset-only options (#18206) 4 weeks ago
arg.h 9e39a1e6a9 server: support load model on startup, support preset-only options (#18206) 4 weeks ago
base64.hpp 381efbf480 llava : expose as a shared library for downstream projects (#3613) 2 years ago
build-info.cpp.in cc8d081879 cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167) 7 months ago
chat-parser-xml-toolcall.cpp 636fc17a37 Fix Kimi-K2 tool-call parsing issues (#17376) 1 month ago
chat-parser-xml-toolcall.h 636fc17a37 Fix Kimi-K2 tool-call parsing issues (#17376) 1 month ago
chat-parser.cpp 636fc17a37 Fix Kimi-K2 tool-call parsing issues (#17376) 1 month ago
chat-parser.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
chat-peg-parser.cpp c05aa69f32 common : add nemotron 3 parsing (#18077) 1 month ago
chat-peg-parser.h 0a8026e768 common : introduce composable PEG parser combinators for chat parsing (#17136) 1 month ago
chat.cpp c05aa69f32 common : add nemotron 3 parsing (#18077) 1 month ago
chat.h 190c4838bd chat : reserve memory in compute_diffs and improve naming (#17729) 1 month ago
common.cpp a2c199e479 common: clarify instructions for bug reports (#18134) 1 month ago
common.h 6ce3d85796 server: (webui) add --webui-config (#18028) 1 month ago
console.cpp 6c2131773c cli: new CLI experience (#17824) 1 month ago
console.h 6c2131773c cli: new CLI experience (#17824) 1 month ago
download.cpp b8ee22cfde common : add minimalist multi-thread progress bar (#17602) 1 month ago
download.h ec18edfcba server: introduce API for serving / loading / unloading multiple models (#17470) 1 month ago
http.h 4201deae9c common: introduce http.h for httplib-based client (#16373) 3 months ago
json-partial.cpp 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
json-partial.h 53f925074d sync : vendor (#13901) 7 months ago
json-schema-to-grammar.cpp c05aa69f32 common : add nemotron 3 parsing (#18077) 1 month ago
json-schema-to-grammar.h c05aa69f32 common : add nemotron 3 parsing (#18077) 1 month ago
llguidance.cpp 43dfd741a5 llguidance : set tokenizer slices to default (#13424) 8 months ago
log.cpp 6c2131773c cli: new CLI experience (#17824) 1 month ago
log.h 6c2131773c cli: new CLI experience (#17824) 1 month ago
ngram-cache.cpp 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
ngram-cache.h 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) 1 year ago
peg-parser.cpp c05aa69f32 common : add nemotron 3 parsing (#18077) 1 month ago
peg-parser.h 0a8026e768 common : introduce composable PEG parser combinators for chat parsing (#17136) 1 month ago
preset.cpp 9e39a1e6a9 server: support load model on startup, support preset-only options (#18206) 4 weeks ago
preset.h 98c1c7a7bf presets: refactor, allow cascade presets from different sources, add global section (#18169) 4 weeks ago
regex-partial.cpp 3198405e98 `common`: add partial regex support (#12808) 8 months ago
regex-partial.h 3198405e98 `common`: add partial regex support (#12808) 8 months ago
sampling.cpp 4301e27319 common : restore grammar-based rejection sampling (#18137) 1 month ago
sampling.h 4301e27319 common : restore grammar-based rejection sampling (#18137) 1 month ago
speculative.cpp 4301e27319 common : restore grammar-based rejection sampling (#18137) 1 month ago
speculative.h 94933c8c2e server : implement universal assisted decoding (#12635) 5 months ago
unicode.cpp 0a8026e768 common : introduce composable PEG parser combinators for chat parsing (#17136) 1 month ago
unicode.h 0a8026e768 common : introduce composable PEG parser combinators for chat parsing (#17136) 1 month ago