Diego Devesa dca1d4b58a ggml : fix BLAS with unsupported types (#9775) 1 year ago
..
CMakeLists.txt 938943cdbf llama : move vocab, grammar and sampling into separate files (#8508) 1 year ago
llama-grammar.cpp df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
llama-grammar.h df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
llama-impl.h cea1486ecf log : add CONT level for continuing previous log entry (#9610) 1 year ago
llama-sampling.cpp b0f27361f3 sampling : avoid expensive softmax during greedy sampling (#9605) 1 year ago
llama-sampling.h 19f4a7b296 llama : refactor samplers internal implementation (#9370) 1 year ago
llama-vocab.cpp f4d2b8846a llama : add reranking support (#9510) 1 year ago
llama-vocab.h 8c475b97b8 rerank : use [SEP] token instead of [BOS] (#9737) 1 year ago
llama.cpp dca1d4b58a ggml : fix BLAS with unsupported types (#9775) 1 year ago
unicode-data.cpp 458367a906 server : better security control for public deployments (#9776) 1 year ago
unicode-data.h a39ab216aa llama : reduce compile time and binary size (#9712) 1 year ago
unicode.cpp a39ab216aa llama : reduce compile time and binary size (#9712) 1 year ago
unicode.h 938943cdbf llama : move vocab, grammar and sampling into separate files (#8508) 1 year ago