Commit History

Autor SHA1 Mensaxe Data
  MaggotHATE fbc98b748e sampling : add XTC sampler (#9742) hai 1 ano
  Georgi Gerganov f4b2dcdf49 readme : fix typo [no ci] hai 1 ano
  Vinesh Janarthanan 441b72b91f main : option to disable context shift (#9484) hai 1 ano
  Denis Spasyuk a8db2a9ce6 Update llama-cli documentation (#8315) hai 1 ano
  Olivier Chafik 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) hai 1 ano
  arch-btw 9973e81c5c readme : remove -ins (#7759) hai 1 ano
  Georgi Gerganov 1442677f92 common : refactor cli arg parsing (#7675) hai 1 ano
  Amir 11474e756d examples: cache hf model when --model not provided (#7353) hai 1 ano
  omahs 04976db7a8 docs: fix typos (#7124) hai 1 ano
  Olivier Chafik 8843a98c2b Improve usability of --model-url & related flags (#6930) hai 1 ano
  Olivier Chafik 7593639ce3 `main`: add --json-schema / -j flag (#6659) hai 1 ano
  Rene Leonhardt 5c4d767ac0 chore: Fix markdown warnings (#6625) hai 1 ano
  Ting Sun cfc4d75df6 doc: fix outdated default value of batch size (#6336) hai 1 ano
  slaren 280345968d cuda : rename build flag to LLAMA_CUDA (#6299) hai 1 ano
  Pierrick Hymbert d01b3c4c32 common: llama_load_model_from_url using --model-url (#6098) hai 1 ano
  bmwl f486f6e1e5 ggml : add numa options (#5377) hai 1 ano
  Richard Kiss 532dd74e38 Fix some documentation typos/grammar mistakes (#4032) %!s(int64=2) %!d(string=hai) anos
  kalomaze 238657db23 samplers : Min-P sampler implementation [alternative to Top P/Top K] (#3841) %!s(int64=2) %!d(string=hai) anos
  slaren 16bc66d947 llama.cpp : split llama_context_params into model and context params (#3301) %!s(int64=2) %!d(string=hai) anos
  Roland 2d770505a8 llama : remove mtest (#3177) %!s(int64=2) %!d(string=hai) anos
  ZHAOKAI WANG 69fdbb9abc readme : quick start command fix (#2908) %!s(int64=2) %!d(string=hai) anos
  Evan Jones f5fe98d11b docs : add grammar docs (#2701) %!s(int64=2) %!d(string=hai) anos
  Christian Demsar e59fcb2bc1 Add --n-predict -2 for stopping generation on full context (#2565) %!s(int64=2) %!d(string=hai) anos
  klosax f3c3b4b167 Add --rope-scale parameter (#2544) %!s(int64=2) %!d(string=hai) anos
  Weird Constructor d91f3f0c55 readme : fix the description of the Tail free sampling (TFS) method (#2431) %!s(int64=2) %!d(string=hai) anos
  Howard Su 32c5411631 Revert "Support using mmap when applying LoRA (#2095)" (#2206) %!s(int64=2) %!d(string=hai) anos
  Howard Su 2347463201 Support using mmap when applying LoRA (#2095) %!s(int64=2) %!d(string=hai) anos
  Howard Su b8c8dda75f Use unsigned for random seed (#2006) %!s(int64=2) %!d(string=hai) anos
  zrm b853d45601 ggml : add NUMA support (#1556) %!s(int64=2) %!d(string=hai) anos
  Johannes Gäßler 254a7a7a5f CUDA full GPU acceleration, KV cache in VRAM (#1827) %!s(int64=2) %!d(string=hai) anos