Commit History

Author SHA1 Message Date
  Maël Kerbiriou 41318d708e llama : use the same threshold for OpenBLAS and ggml thread limiting (#577) 2 years ago
  Tobias Lütke a6956b25a1 add example of re-act pattern (#583) 2 years ago
  anzz1 83df5639eb Fix GCC warning about binary literal (#595) 2 years ago
  anzz1 a5c42c4b13 Fix typo in llama.h (#593) 2 years ago
  anzz1 5a5f8b1501 Enable Fused-Multiply-Add (FMA) and F16C/CVT16 vector extensions on MSVC (#375) 2 years ago
  anzz1 f1217055ea CI: fix subdirectory path globbing (#546) 2 years ago
  anzz1 7f4c5c6651 llama : fix linkage with mingw (#551) 2 years ago
  slaren 2a98bc18ea ggml : add AVX2 implementation of quantize_row_q4_1 (#515) 2 years ago
  thement d0aaff571c py : add temporary script to convert old ggml files to newer version (#539) 2 years ago
  Tai Duc Nguyen d0330fd783 py : add capability to convert from ggml back to torch or hf format for further consumption/training/finetuning (#403) 2 years ago
  Stephan Walter 99c5b27654 ggml : refactor quantized processing functions (#509) 2 years ago
  DooWoong Lee (David) 692ce3164e py : removed unused `model` variable and verified that the code functions correctly with `vocab_only` setting. Also confirmed that the code works as expected after running with reduced memory usage due to deletion of no-longer-needed variable. (#547) 2 years ago
  Georgi Gerganov 96f9c0506f ci : make ctest verbose, hopefully we see what is wrong with the sanitizer 2 years ago
  Georgi Gerganov d502bc7c9d tests : free llama context at the end of the test 2 years ago
  Stephan Walter 436e561931 all : be more strict about converting float to double (#458) 2 years ago
  Jed Fox 20e1e84884 deploy : add a Package.swift for SwiftPM support (#393) 2 years ago
  Stephan Walter c1f885067c ggml : introduce structs for the q4 data blocks (#356) 2 years ago
  Georgi Gerganov e0670260fb gitignore : add "embedding" 2 years ago
  dotpy314 28ba975aea Check the existence of f16_model_path_base in quantize.py (#574) 2 years ago
  slaren a6bdc47cba Fix usage of F16C intrinsics in AVX code (#563) 2 years ago
  anzz1 7b8dbcb78b main.cpp fixes, refactoring (#571) 2 years ago
  RJ Adriaansen 4b8efff0e3 Add embedding example to Makefile (#540) 2 years ago
  Marco Matthies 7e5395575a Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542) 2 years ago
  Erik Scholz 34c1072e49 ci: add debug build to sanitizer build matrix (#527) 2 years ago
  Stephan Walter 939ad2d3a5 Fix undefined variables in debug build, remove unused variables (#531) 2 years ago
  Juan Calderon-Perez 8c2ec5e21d Add support for linux/arm64 platform during Docker Builds (#514) 2 years ago
  Stephan Walter b391579db9 Update README and comments for standalone perplexity tool (#525) 2 years ago
  anzz1 7a87d31f4f [main] fix infinite generation (-n == -1) (#523) 2 years ago
  Georgi Gerganov 348d6926ee Add logo to README.md 2 years ago
  Harald Fernengel 33e35b8fe8 Exit from interactive mode if input stream is bad (#491) 2 years ago