Olivier Chafik 79967ec596 grammar : use int64_t to avoid int overflows in int schema to grammar conversion logic (#16626) il y a 3 mois
..
CMakeLists.txt 4201deae9c common: introduce http.h for httplib-based client (#16373) il y a 3 mois
arg.cpp 6f5d924637 common : Update the docs on -t --threads (#16236) il y a 3 mois
arg.h 364a7a6d4a common : remove common_has_curl() (#16351) il y a 3 mois
base64.hpp 381efbf480 llava : expose as a shared library for downstream projects (#3613) il y a 2 ans
build-info.cpp.in cc8d081879 cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167) il y a 7 mois
chat-parser.cpp 2c301e91ab common : handle unicode during partial json parsing (#16526) il y a 3 mois
chat-parser.h 34fcc5a4ac model : Apertus model implementation (#15852) il y a 3 mois
chat.cpp 12bbc3fa50 refactor: centralize CoT parsing in backend for streaming mode (#16394) il y a 3 mois
chat.h d00cbea63c server : host-memory prompt caching (#16391) il y a 3 mois
common.cpp 3df2244df4 llama : add --no-host to disable host buffers (#16310) il y a 3 mois
common.h 4b2dae383d common : update presets (#16504) il y a 3 mois
console.cpp 8277a817f1 console : utf-8 fix for windows stdin (#9690) il y a 1 an
console.h 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) il y a 2 ans
http.h 4201deae9c common: introduce http.h for httplib-based client (#16373) il y a 3 mois
json-partial.cpp 2c301e91ab common : handle unicode during partial json parsing (#16526) il y a 3 mois
json-partial.h 53f925074d sync : vendor (#13901) il y a 7 mois
json-schema-to-grammar.cpp 79967ec596 grammar : use int64_t to avoid int overflows in int schema to grammar conversion logic (#16626) il y a 3 mois
json-schema-to-grammar.h 53f925074d sync : vendor (#13901) il y a 7 mois
llguidance.cpp 43dfd741a5 llguidance : set tokenizer slices to default (#13424) il y a 8 mois
log.cpp 408ff524b4 Implement --log-colors with always/never/auto (#15792) il y a 4 mois
log.h 408ff524b4 Implement --log-colors with always/never/auto (#15792) il y a 4 mois
ngram-cache.cpp 5bbe6a9fe9 ggml : portability fixes for VS 2017 (#12150) il y a 10 mois
ngram-cache.h 727368c60f llama : use LLAMA_TOKEN_NULL (#11062) il y a 1 an
regex-partial.cpp 3198405e98 `common`: add partial regex support (#12808) il y a 8 mois
regex-partial.h 3198405e98 `common`: add partial regex support (#12808) il y a 8 mois
sampling.cpp e789095502 llama: print memory breakdown on exit (#15860) il y a 3 mois
sampling.h e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) il y a 4 mois
speculative.cpp e92d53b29e sampling : optimize samplers by reusing bucket sort (#15665) il y a 4 mois
speculative.h 94933c8c2e server : implement universal assisted decoding (#12635) il y a 5 mois