Commit History

Author SHA1 Message Date
  Brian f7cab35ef9 gguf-hash: model wide and per tensor hashing using xxhash and sha1 (#8048) 1 year ago
  Georgi Gerganov f3f65429c4 llama : reorganize source code + improve CMake (#8006) 1 year ago
  Xuan Son Nguyen 0c7b3595b9 Add `cvector-generator` example (#7514) 1 year ago
  Olivier Chafik 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 year ago
  Georgi Gerganov 0cd6bd3483 llama : remove beam search (#7736) 1 year ago
  Radoslav Gerganov 5e31828d3e ggml : add RPC backend (#6829) 1 year ago
  Pierrick Hymbert b804b1ef77 eval-callback: Example how to use eval callback for debugging (#6576) 1 year ago
  Minsoo Cheong 64e7b47c69 examples : add "retrieval" (#6193) 1 year ago
  Pierrick Hymbert d0d5de42e5 gguf-split: split and merge gguf per batch of tensors (#6135) 1 year ago
  DAN™ bcebd7dbf6 llama : add support for GritLM (#5959) 1 year ago
  John 6c00a06692 gguf : add python reader example (#5216) 1 year ago
  Abhilash Majumder 0f648573dd ggml : add unified SYCL backend for Intel GPUs (#2690) 2 years ago
  Georgi Gerganov 4be5ef556d metal : remove old API (#4919) 2 years ago
  Kawrakow 326b418b59 Importance Matrix calculation (#4861) 2 years ago
  Georgi Gerganov b0034d93ce examples : add passkey test (#3856) 2 years ago
  LeonEricsson 7082d24cec lookup : add prompt lookup decoding example (#4484) 2 years ago
  Georgi Gerganov 922754a8d6 lookahead : add example for lookahead decoding (#4207) 2 years ago
  zakkor 2fa02b4b3d examples : add tokenize (#4039) 2 years ago
  Georgi Gerganov d1031cf49c sampling : refactor init to use llama_sampling_params (#3696) 2 years ago
  M. Yusuf Sarıgöz 370359e5ba examples: support LLaVA v1.5 (multimodal model) (#3436) 2 years ago
  Georgi Gerganov 8c70a5ff25 batched : add bench tool (#3545) 2 years ago
  xaedes 0e76a8992c train : finetune LORA (#2632) 2 years ago
  Georgi Gerganov ec893798b7 llama : custom attention mask + parallel decoding + no context swaps (#3228) 2 years ago
  Georgi Gerganov 47068e5170 speculative : PoC for speeding-up inference via speculative sampling (#2926) 2 years ago
  Georgi Gerganov c90d135eb4 examples : fix underscore in beam-search + .gitignore (close #2900) 2 years ago
  Matt Pulver c82742ac9c llama : add llama_beam_search() (#2267) 2 years ago
  Georgi Gerganov 6381d4e110 gguf : new file format with flexible meta data (beta) (#2398) 2 years ago
  slaren 097e121e2f llama : add benchmark example (#2626) 2 years ago
  byte-6174 b19edd54d5 Adding support for llama2.c models (#2559) 2 years ago
  DannyDaemonic 3498588e0f Add --simple-io option for subprocesses and break out console.h and cpp (#1558) 2 years ago