cturan/llama.cpp

Author	SHA1 Message	Date
Maël Kerbiriou	41318d708e llama : use the same threshold for OpenBLAS and ggml thread limiting (#577)	2 years ago
Tobias Lütke	a6956b25a1 add example of re-act pattern (#583)	2 years ago
anzz1	83df5639eb Fix GCC warning about binary literal (#595)	2 years ago
anzz1	a5c42c4b13 Fix typo in llama.h (#593)	2 years ago
anzz1	5a5f8b1501 Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375)	2 years ago
anzz1	f1217055ea CI: fix subdirectory path globbing (#546)	2 years ago
anzz1	7f4c5c6651 llama : fix linkage with mingw (#551)	2 years ago
slaren	2a98bc18ea ggml : add AVX2 implementation of quantize_row_q4_1 (#515)	2 years ago
thement	d0aaff571c py : add temporary script to convert old ggml files to newer version (#539)	2 years ago
Tai Duc Nguyen	d0330fd783 py : add capabiliy to convert from ggml back to torch or hf format for further consumption/training/finetuning (#403)	2 years ago
Stephan Walter	99c5b27654 ggml : refactor quantized processing functions (#509)	2 years ago
DooWoong Lee (David)	692ce3164e py : removed unused `model` variable and verified that the code functions correctly with `vocab_only` setting. Also confirmed that the code works as expected after running with reduced memory usage due to deletion of no-longer-needed variable. (#547)	2 years ago
Georgi Gerganov	96f9c0506f ci : make ctest verbose, hopefully we see what is wrong with the sanitizer	2 years ago
Georgi Gerganov	d502bc7c9d tests : free llama context at the end of the test	2 years ago
Stephan Walter	436e561931 all : be more strict about converting float to double (#458)	2 years ago
Jed Fox	20e1e84884 deploy : add a Package.swift for SwiftPM support (#393)	2 years ago
Stephan Walter	c1f885067c ggml : introduce structs for the q4 data blocks (#356)	2 years ago
Georgi Gerganov	e0670260fb gitignore : add "embedding"	2 years ago
dotpy314	28ba975aea Check the existence of f16_model_path_base in quantize.py (#574)	2 years ago
slaren	a6bdc47cba Fix usage of F16C intrinsics in AVX code (#563)	2 years ago
anzz1	7b8dbcb78b main.cpp fixes, refactoring (#571)	2 years ago
RJ Adriaansen	4b8efff0e3 Add embedding example to Makefile (#540)	2 years ago
Marco Matthies	7e5395575a Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542)	2 years ago
Erik Scholz	34c1072e49 ci: add debug build to sanitizer build matrix (#527)	2 years ago
Stephan Walter	939ad2d3a5 Fix undefined variables in debug build, remove unused variables (#531)	2 years ago
Juan Calderon-Perez	8c2ec5e21d Add support for linux/arm64 platform during Docker Builds (#514)	2 years ago
Stephan Walter	b391579db9 Update README and comments for standalone perplexity tool (#525)	2 years ago
anzz1	7a87d31f4f [main] fix infinite generation (-n == -1) (#523)	2 years ago
Georgi Gerganov	348d6926ee Add logo to README.md	2 years ago
Harald Fernengel	33e35b8fe8 Exit from interactive mode if input stream is bad (#491)	2 years ago
