Reese Levine 7ca5991d2b ggml webgpu: add support for emscripten builds (#17184) hace 1 mes
..
CMakeLists.txt 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) hace 1 mes
arg.cpp 7ca5991d2b ggml webgpu: add support for emscripten builds (#17184) hace 1 mes
arg.h 5c9a18e674 common: move download functions to download.(cpp|h) (#17059) hace 2 meses
base64.hpp 381efbf480 llava : expose as a shared library for downstream projects (#3613) hace 2 años
build-info.cpp.in cc8d081879 cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167) hace 7 meses
chat-parser-xml-toolcall.cpp 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) hace 1 mes
chat-parser-xml-toolcall.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) hace 1 mes
chat-parser.cpp 03914c7ef8 common : move all common_chat_parse_* to chat-parser.cpp. (#17481) hace 1 mes
chat-parser.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) hace 1 mes
chat.cpp c4357dcc35 Server: Change Invalid Schema from Server Error (500) to User Error (400) (#17572) hace 1 mes
chat.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) hace 1 mes
common.cpp 7ca5991d2b ggml webgpu: add support for emscripten builds (#17184) hace 1 mes
common.h 13628d8bdb server: add --media-path for local media files (#17697) hace 1 mes
console.cpp 8277a817f1 console : utf-8 fix for windows stdin (#9690) hace 1 año
console.h 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) hace 2 años
download.cpp 7ca5991d2b ggml webgpu: add support for emscripten builds (#17184) hace 1 mes
download.h ec18edfcba server: introduce API for serving / loading / unloading multiple models (#17470) hace 1 mes
http.h 4201deae9c common: introduce http.h for httplib-based client (#16373) hace 3 meses
json-partial.cpp 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) hace 1 mes
json-partial.h 53f925074d sync : vendor (#13901) hace 7 meses
json-schema-to-grammar.cpp c4357dcc35 Server: Change Invalid Schema from Server Error (500) to User Error (400) (#17572) hace 1 mes
json-schema-to-grammar.h 1920345c3b common : Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS + Kimi-K2 + Qwen3-Coder + Apriel-1.5 + Xiaomi-MiMo) (#16932) hace 1 mes
llguidance.cpp 43dfd741a5 llguidance : set tokenizer slices to default (#13424) hace 8 meses
log.cpp 7733409734 common: improve verbosity level definitions (#17630) hace 1 mes
log.h 7733409734 common: improve verbosity level definitions (#17630) hace 1 mes
ngram-cache.cpp 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) hace 10 meses
ngram-cache.h 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) hace 1 año
regex-partial.cpp 3198405e98 `common`: add partial regex support (#12808) hace 8 meses
regex-partial.h 3198405e98 `common`: add partial regex support (#12808) hace 8 meses
sampling.cpp 196f5083ef common : more accurate sampling timing (#17382) hace 1 mes
sampling.h e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) hace 4 meses
speculative.cpp e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) hace 4 meses
speculative.h 94933c8c2e server : implement universal assisted decoding (#12635) hace 5 meses