Commit History

Author SHA1 Message Date
  slaren 2a98bc18ea ggml : add AVX2 implementation of quantize_row_q4_1 (#515) 2 years ago
  thement d0aaff571c py : add temporary script to convert old ggml files to newer version (#539) 2 years ago
  Tai Duc Nguyen d0330fd783 py : add capabiliy to convert from ggml back to torch or hf format for further consumption/training/finetuning (#403) 2 years ago
  Stephan Walter 99c5b27654 ggml : refactor quantized processing functions (#509) 2 years ago
  DooWoong Lee (David) 692ce3164e py : removed unused `model` variable and verified that the code functions correctly with `vocab_only` setting. Also confirmed that the code works as expected after running with reduced memory usage due to deletion of no-longer-needed variable. (#547) 2 years ago
  Georgi Gerganov 96f9c0506f ci : make ctest verbose, hopefully we see what is wrong with the sanitizer 2 years ago
  Georgi Gerganov d502bc7c9d tests : free llama context at the end of the test 2 years ago
  Stephan Walter 436e561931 all : be more strict about converting float to double (#458) 2 years ago
  Jed Fox 20e1e84884 deploy : add a Package.swift for SwiftPM support (#393) 2 years ago
  Stephan Walter c1f885067c ggml : introduce structs for the q4 data blocks (#356) 2 years ago
  Georgi Gerganov e0670260fb gitignore : add "embedding" 2 years ago
  dotpy314 28ba975aea Check the existence of f16_model_path_base in quantize.py (#574) 2 years ago
  slaren a6bdc47cba Fix usage of F16C intrinsics in AVX code (#563) 2 years ago
  anzz1 7b8dbcb78b main.cpp fixes, refactoring (#571) 2 years ago
  RJ Adriaansen 4b8efff0e3 Add embedding example to Makefile (#540) 2 years ago
  Marco Matthies 7e5395575a Fix missing ggml link in cmake for examples/* on w64-mingw32 (#542) 2 years ago
  Erik Scholz 34c1072e49 ci: add debug build to sanitizer build matrix (#527) 2 years ago
  Stephan Walter 939ad2d3a5 Fix undefined variables in debug build, remove unused variables (#531) 2 years ago
  Juan Calderon-Perez 8c2ec5e21d Add support for linux/arm64 platform during Docker Builds (#514) 2 years ago
  Stephan Walter b391579db9 Update README and comments for standalone perplexity tool (#525) 2 years ago
  anzz1 7a87d31f4f [main] fix infinite generation (-n == -1) (#523) 2 years ago
  Georgi Gerganov 348d6926ee Add logo to README.md 2 years ago
  Harald Fernengel 33e35b8fe8 Exit from interactive mode if input stream is bad (#491) 2 years ago
  anzz1 19726169b3 CI: Run other sanitizer builds even if one fails (#511) 2 years ago
  jp-x-g f732695cd5 Clarify console output in convert-pth-to-ggml.py (#512) 2 years ago
  anzz1 2f7bf7dd7c CMake / CI additions (#497) 2 years ago
  anzz1 34ab526843 (Windows) Set console to UTF-8 on init (#420) 2 years ago
  Georgi Gerganov c2b25b6912 Fix colors enabling on WIN32 2 years ago
  Georgi Gerganov 79b2b266db If n_predict == -1, generate forever 2 years ago
  Georgi Gerganov e2d490dafd Inifinite generation via context swapping (#71) 2 years ago