Çetin b1f48d449e Add inline tool injection system (Sauron Protocol) with 85+ tools 1 month ago
CMakeLists.txt 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
arg.cpp f914544b16 batched-bench : add "separate text gen" mode (#17103) 2 months ago
arg.h 5c9a18e674 common: move download functions to download.(cpp|h) (#17059) 2 months ago
base64.hpp 381efbf480 llava : expose as a shared library for downstream projects (#3613) 2 years ago
build-info.cpp.in cc8d081879 cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167) 7 months ago
chat-parser-xml-toolcall.cpp 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
chat-parser-xml-toolcall.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
chat-parser.cpp 2c301e91ab common : handle unicode during partial json parsing (#16526) 3 months ago
chat-parser.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
chat.cpp 10e9780154 chat: fix int overflow, prevent size calculation in float/double (#17357) 1 month ago
chat.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
common.cpp 196f5083ef common : more accurate sampling timing (#17382) 1 month ago
common.h 196f5083ef common : more accurate sampling timing (#17382) 1 month ago
console.cpp 8277a817f1 console : utf-8 fix for windows stdin (#9690) 1 year ago
console.h 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) 2 years ago
download.cpp 78010a0d52 cmake : move OpenSSL linking to vendor/cpp-httplib (#17177) 2 months ago
download.h aa3b7a90b4 arg: add --cache-list argument to list cached models (#17073) 2 months ago
http.h 4201deae9c common: introduce http.h for httplib-based client (#16373) 3 months ago
inline-tools.h b1f48d449e Add inline tool injection system (Sauron Protocol) with 85+ tools 1 month ago
json-partial.cpp 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
json-partial.h 53f925074d sync : vendor (#13901) 7 months ago
json-schema-to-grammar.cpp 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
json-schema-to-grammar.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) 1 month ago
llguidance.cpp 43dfd741a5 llguidance : set tokenizer slices to default (#13424) 8 months ago
log.cpp 9b17d74ab7 mtmd: add mtmd_log_set (#17268) 2 months ago
log.h 9b17d74ab7 mtmd: add mtmd_log_set (#17268) 2 months ago
ngram-cache.cpp 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
ngram-cache.h 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) 1 year ago
regex-partial.cpp 3198405e98 `common`: add partial regex support (#12808) 8 months ago
regex-partial.h 3198405e98 `common`: add partial regex support (#12808) 8 months ago
sampling.cpp 196f5083ef common : more accurate sampling timing (#17382) 1 month ago
sampling.h e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 4 months ago
speculative.cpp e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 4 months ago
speculative.h 94933c8c2e server : implement universal assisted decoding (#12635) 5 months ago