matteo caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) 6 months ago
CMakeLists.txt 09cf2c7c65 cmake : Improve build-info.cpp generation (#14156) 7 months ago
arg.cpp caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) 6 months ago
arg.h 2d451c8059 common : add common_remote_get_content (#13123) 8 months ago
base64.hpp 381efbf480 llava : expose as a shared library for downstream projects (#3613) 2 years ago
build-info.cpp.in cc8d081879 cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167) 7 months ago
chat-parser.cpp 3cb203c89f llama-chat : Do not throw when tool parsing fails (#14012) 7 months ago
chat-parser.h 3cb203c89f llama-chat : Do not throw when tool parsing fails (#14012) 7 months ago
chat.cpp caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) 6 months ago
chat.h caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) 6 months ago
common.cpp dd6e6d0b6a vocab : prevent tokenizer overflow (#14301) 7 months ago
common.h caf5681fcb server : support jinja extra template kwargs (Qwen3 enable_thinking feature), from command line and from client (#13196) 6 months ago
console.cpp 8277a817f1 console : utf-8 fix for windows stdin (#9690) 1 year ago
console.h 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) 2 years ago
json-partial.cpp 53f925074d sync : vendor (#13901) 7 months ago
json-partial.h 53f925074d sync : vendor (#13901) 7 months ago
json-schema-to-grammar.cpp 40bfa04c95 common : use std::string_view now that we target c++17 (#14319) 6 months ago
json-schema-to-grammar.h 53f925074d sync : vendor (#13901) 7 months ago
llguidance.cpp 43dfd741a5 llguidance : set tokenizer slices to default (#13424) 8 months ago
log.cpp bfd11a2344 Fix: Compile failure due to Microsoft STL breaking change (#11836) 11 months ago
log.h fef0cbeadf cleanup: fix compile warnings associated with gnu_printf (#11811) 11 months ago
ngram-cache.cpp 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
ngram-cache.h 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) 1 year ago
regex-partial.cpp 3198405e98 `common`: add partial regex support (#12808) 8 months ago
regex-partial.h 3198405e98 `common`: add partial regex support (#12808) 8 months ago
sampling.cpp f5cd27b71d `server`: streaming of tool calls and thoughts when `--jinja` is on (#12379) 7 months ago
sampling.h ff227703d6 sampling : support for llguidance grammars (#10224) 11 months ago
speculative.cpp 745aa5319b llama : deprecate llama_kv_self_ API (#14030) 7 months ago
speculative.h abd4d0bc4f speculative : update default params (#11954) 11 months ago