cturan/llama.cpp

Auteur	SHA1 Message	Date
Brian	672a6f1018 convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499)	il y a 1 an
RunningLeon	3807c3de04 server : respect `--special` cli arg (#8553)	il y a 1 an
Johannes Gäßler	e02b597be3 lookup: fibonacci hashing, fix crashes (#8548)	il y a 1 an
Al Mochkin	b3283448ce build : Fix docker build warnings (#8535) (#8537)	il y a 1 an
Brian	30f80ca0bc CONTRIBUTING.md : remove mention of noci (#8541)	il y a 1 an
hipudding	1bdd8ae19f [CANN] Add Ascend NPU backend (#6035)	il y a 1 an
Masaya, Kato	da3913d8f9 batched: fix n_predict parameter (#8527)	il y a 1 an
Georgi Gerganov	d65a8361fe llama : disable context-shift for DeepSeek v2 (#8501)	il y a 1 an
Johannes Gäßler	5e116e8dd5 make/cmake: add missing force MMQ/cuBLAS for HIP (#8515)	il y a 1 an
Brian	1666f92dcd gguf-hash : update clib.json to point to original xxhash repo (#8491)	il y a 1 an
Steve Bonds	37b12f92ab export-lora : handle help argument (#8497)	il y a 1 an
Georgi Gerganov	0efec57787 llama : valign + remove unused ftype (#8502)	il y a 1 an
compilade	7acfd4e8d5 convert_hf : faster lazy safetensors (#8482)	il y a 1 an
Xuan Son Nguyen	97bdd26eee Refactor lora adapter support (#8332)	il y a 1 an
Xuan Son Nguyen	4db8f60fe7 fix ci (#8494)	il y a 1 an
Daniel Bevenius	8fac431b06 ggml : suppress unknown pragma 'GCC' on windows (#8460)	il y a 1 an
M-A	f17f39ff9c server: update README.md with llama-server --help output [no ci] (#8472)	il y a 1 an
Georgi Gerganov	9104bc20ed common : add --no-cont-batching arg (#6358)	il y a 1 an
NikolaiLyssogor	fc690b018e docs: fix links in development docs [no ci] (#8481)	il y a 1 an
Meng, Hengyu	16bdfa42ac [SYCL] add concat through dim 1/2 (#8483)	il y a 1 an
Georgi Gerganov	3dfda05956 llama : de-duplicate deepseek2 norm	il y a 1 an
0cc4m	bda62d7999 Vulkan MMQ Fix (#8479)	il y a 1 an
compilade	090fca7a07 pydantic : replace uses of __annotations__ with get_type_hints (#8474)	il y a 1 an
Georgi Gerganov	aaab2419ea flake.lock: Update (#8475)	il y a 1 an
Georgi Gerganov	73cf442e7b llama : fix Gemma-2 Query scaling factors (#8473)	il y a 1 an
Brian	e236528e76 gguf_hash.py: Add sha256 (#8470)	il y a 1 an
compilade	fa79495bb4 llama : fix pre-tokenization of non-special added tokens (#8228)	il y a 1 an
bandoti	17eb6aa8a9 vulkan : cmake integration (#8119)	il y a 1 an
Georgi Gerganov	c917b67f06 metal : template-ify some of the kernels (#8447)	il y a 1 an
Georgi Gerganov	4e24cffd8c server : handle content array in chat API (#8449)	il y a 1 an

Récemment Précédemment

Historique des commits Trouver

Historique des commits