Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Andrew Duffy | 58c438cf7d | Add Accelerate/BLAS when using Swift (#765) | 2 years ago |
| mgroeber9110 | 53dbba7695 | Windows: reactive sigint handler after each Ctrl-C (#736) | 2 years ago |
| SebastianApel | 437e77855a | 10+% performance improvement of ggml_vec_dot_q4_0 on AVX2 (#654) | 2 years ago |
| Ivan Stepanov | cd7fa95690 | Define non-positive temperature behavior (#720) | 2 years ago |
| bsilvereagle | a0c0516416 | Remove torch GPU dependencies from the Docker.full image (#665) | 2 years ago |
| Thatcher Chamberlin | d8d4e865cd | Add a missing step to the gpt4all instructions (#690) | 2 years ago |
| Christian Falch | e986f94829 | Added api for getting/setting the kv_cache (#685) | 2 years ago |
| Marian Cepok | c0bb1d3ce2 | ggml : change ne to int64_t (#626) | 2 years ago |
| Leonardo Neumann | 6e7801d08d | examples : add gpt4all script (#658) | 2 years ago |
| Stephan Walter | 81040f10aa | llama : do not allocate KV cache for "vocab_only == true" (#682) | 2 years ago |
| Fabian | c4f89d8d73 | make : use -march=native -mtune=native on x86 (#609) | 2 years ago |
| Murilo Santana | 5b70e7de4c | fix default params for examples/main (#697) | 2 years ago |
| Ikko Eltociear Ashimine | a717cba844 | py: huggingface -> Hugging Face (#686) | 2 years ago |
| rimoliga | d0a7f742e7 | readme: replace termux links with homepage, play store is deprecated (#680) | 2 years ago |
| Slaren | 0d054e292e | Show error message when -f fails | 2 years ago |
| Stephan Walter | 3525899277 | Enable -std= for cmake builds, fix warnings (#598) | 2 years ago |
| slaren | 1d08882afa | Optimize AVX2 ggml_vec_dot_q4_0 (#642) | 2 years ago |
| perserk | 02c5b27e91 | Add AVX acceleration (#617) | 2 years ago |
| Pavol Rusnak | cbef542879 | py : cleanup the code | 2 years ago |
| Pavol Rusnak | 9733104be5 | drop quantize.py (now that models are using a single file) | 2 years ago |
| Georgi Gerganov | 3df890aef4 | readme : update supported models | 2 years ago |
| Justine Tunney | ee0c40dd6d | Introduce GGML migration tool for new file format | 2 years ago |
| Justine Tunney | 6f23ba5ee2 | Ensure --mlock works properly with mmap() support | 2 years ago |
| Justine Tunney | 78ca9838ee | Make loading weights 10-100x faster | 2 years ago |
| Slaren | a017390358 | Initial windows support (untested) | 2 years ago |
| Slaren | ac184d5147 | Always initialize mm_addr and mm_length in llama_model | 2 years ago |
| Slaren | 276e5b7811 | Unmap the file in llama_free | 2 years ago |
| Slaren | d68c5dc435 | Make mmap_file static | 2 years ago |
| Slaren | 64bde3ffd4 | Fix ggml_init_params in quantize | 2 years ago |
| Slaren | c03ae8dca1 | Add mmap support for model files | 2 years ago |