Georgi Gerganov f914544b16 batched-bench : add "separate text gen" mode (#17103) 2 months ago
..
CMakeLists.txt 5c9a18e674 common: move download functions to download.(cpp|h) (#17059) 2 months ago
arg.cpp f914544b16 batched-bench : add "separate text gen" mode (#17103) 2 months ago
arg.h 5c9a18e674 common: move download functions to download.(cpp|h) (#17059) 2 months ago
base64.hpp 381efbf480 llava : expose as a shared library for downstream projects (#3613) 2 years ago
build-info.cpp.in cc8d081879 cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167) 7 months ago
chat-parser.cpp 2c301e91ab common : handle unicode during partial json parsing (#16526) 3 months ago
chat-parser.h 34fcc5a4ac model : Apertus model implementation (#15852) 3 months ago
chat.cpp 87c9efc3b2 common : move gpt-oss reasoning processing to init params (#16937) 2 months ago
chat.h c053e18a66 chat: Add LFM2 tool handling (#16763) 2 months ago
common.cpp aa3b7a90b4 arg: add --cache-list argument to list cached models (#17073) 2 months ago
common.h f914544b16 batched-bench : add "separate text gen" mode (#17103) 2 months ago
console.cpp 8277a817f1 console : utf-8 fix for windows stdin (#9690) 1 year ago
console.h 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) 2 years ago
download.cpp aa3b7a90b4 arg: add --cache-list argument to list cached models (#17073) 2 months ago
download.h aa3b7a90b4 arg: add --cache-list argument to list cached models (#17073) 2 months ago
http.h 4201deae9c common: introduce http.h for httplib-based client (#16373) 3 months ago
json-partial.cpp 2c301e91ab common : handle unicode during partial json parsing (#16526) 3 months ago
json-partial.h 53f925074d sync : vendor (#13901) 7 months ago
json-schema-to-grammar.cpp 280d97be96 grammar : support array references in json schema (#16792) 2 months ago
json-schema-to-grammar.h 53f925074d sync : vendor (#13901) 7 months ago
llguidance.cpp 43dfd741a5 llguidance : set tokenizer slices to default (#13424) 8 months ago
log.cpp 408ff524b4 Implement --log-colors with always/never/auto (#15792) 4 months ago
log.h 408ff524b4 Implement --log-colors with always/never/auto (#15792) 4 months ago
ngram-cache.cpp 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) 10 months ago
ngram-cache.h 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) 1 year ago
regex-partial.cpp 3198405e98 `common`: add partial regex support (#12808) 8 months ago
regex-partial.h 3198405e98 `common`: add partial regex support (#12808) 8 months ago
sampling.cpp e789095502 llama: print memory breakdown on exit (#15860) 3 months ago
sampling.h e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 4 months ago
speculative.cpp e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) 4 months ago
speculative.h 94933c8c2e server : implement universal assisted decoding (#12635) 5 months ago