cturan/llama.cpp @ f536f4c4391bec74c432a924625c04e8c484d3ee

Georgi Gerganov cad341d889 metal : reduce command encoding overhead (#9698)		před 1 rokem
..
CMakeLists.txt	938943cdbf llama : move vocab, grammar and sampling into separate files (#8508)	před 1 rokem
llama-grammar.cpp	df270ef745 llama : refactor sampling v2 (#9294)	před 1 rokem
llama-grammar.h	df270ef745 llama : refactor sampling v2 (#9294)	před 1 rokem
llama-impl.h	cea1486ecf log : add CONT level for continuing previous log entry (#9610)	před 1 rokem
llama-sampling.cpp	b0f27361f3 sampling : avoid expensive softmax during greedy sampling (#9605)	před 1 rokem
llama-sampling.h	19f4a7b296 llama : refactor samplers internal implementation (#9370)	před 1 rokem
llama-vocab.cpp	f4d2b8846a llama : add reranking support (#9510)	před 1 rokem
llama-vocab.h	6102037bbb vocab : refactor tokenizer to reduce init overhead (#9449)	před 1 rokem
llama.cpp	cad341d889 metal : reduce command encoding overhead (#9698)	před 1 rokem
unicode-data.cpp	07a3fc0608 Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (#8258)	před 1 rokem
unicode-data.h	f3f65429c4 llama : reorganize source code + improve CMake (#8006)	před 1 rokem
unicode.cpp	503147a9f9 unicode : add <algorithm> (#9508)	před 1 rokem
unicode.h	938943cdbf llama : move vocab, grammar and sampling into separate files (#8508)	před 1 rokem