Commit History

Author SHA1 Message Date
  Hua Jiang 0ccfc62a96 ggml_tensor: update the structure comments. (#3283) 2 years ago
  Qu Zongfu 7f1a0fe709 ggml : release the requested thread pool resource (#3292) 2 years ago
  slaren 16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301) 2 years ago
  Eve 0512d66670 ci : multithreaded builds (#3311) 2 years ago
  xaedes 0e76a8992c train : finetune LORA (#2632) 2 years ago
  Cebtenzzre 2db94d98ed gguf : basic type checking in gguf_get_* (#3346) 2 years ago
  Cebtenzzre ecf90b1a51 gguf : make token scores and types optional (#3347) 2 years ago
  Georgi Gerganov 2619109ad5 ci : disable freeBSD builds due to lack of VMs (#3381) 2 years ago
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) 2 years ago
  Kevin Ji 45855b3f1c docs : mark code as Bash (#3375) 2 years ago
  Pierre Alexandre SCHEMBRI 4aea3b846e readme : add Mistral AI release 0.1 (#3362) 2 years ago
  slaren da0400344b ggml-cuda : perform cublas fp16 matrix multiplication as fp16 (#3370) 2 years ago
  Zhang Peiyuan e519621010 convert : remove bug in convert.py permute function (#3364) 2 years ago
  Richard Roberson ac43576124 make-ggml.py : compatibility with more models and GGUF (#3290) 2 years ago
  Cebtenzzre 20c7e1e804 gguf : fix a few general keys (#3341) 2 years ago
  Rickard Hallerbäck dc6897404e metal : reusing llama.cpp logging (#3152) 2 years ago
  Jag Chadha 527e57cfd8 build : add ACCELERATE_NEW_LAPACK to fix warning on macOS Sonoma (#3342) 2 years ago
  BarfingLemurs ffe88a36a9 readme : add some recent perplexity and bpw measurements to READMES, link for k-quants (#3340) 2 years ago
  DAN™ 99115f3fa6 cmake : fix build-info.h on MSVC (#3309) 2 years ago
  2f38b454 1726f9626f docs: Fix typo CLBlast_DIR var. (#3330) 2 years ago
  Erik Scholz a98b1633d5 nix : add cuda, use a symlinked toolkit for cmake (#3202) 2 years ago
  slaren c091cdfb24 llama-bench : add README (#3317) 2 years ago
  Cebtenzzre 51a7cf5c6e examples : fix RoPE defaults to match PR #3240 (#3315) 2 years ago
  Kevin Ji bedb92b603 scripts : use `/usr/bin/env` in shebang (#3313) 2 years ago
  Lee Drake bc9d3e3971 Update README.md (#3289) 2 years ago
  shibe2 36b904e200 ggml-opencl.cpp: Make private functions static (#3300) 2 years ago
  Edward Taylor 324f3403d5 zig : fix for updated c lib (#3259) 2 years ago
  yuiseki f56c418ab0 embedding : update README.md (#3224) 2 years ago
  Johannes Gäßler 8185710a80 CUDA: use only 1 thread if fully offloaded (#2915) 2 years ago
  Georgi Gerganov 7eb41179ed readme : update hot topics 2 years ago