cturan/llama.cpp

Author	SHA1 Message	Date
compilade	c69c63039c convert_hf : fix Gemma v1 conversion (#8597)	1 year ago
Johannes Gäßler	69c487f4ed CUDA: MMQ code deduplication + iquant support (#8495)	1 year ago
Georgi Gerganov	07283b1a90 gguf : handle null name during init (#8587)	1 year ago
Michael Coppola	940362224d llama : add support for Tekken pre-tokenizer (#8579)	1 year ago
Huifeng Ou	69b9945b44 llama.swiftui: fix end of generation bug (#8268)	1 year ago
Brian	c3776cacab gguf_dump.py: fix markddown kv array print (#8588)	1 year ago
slaren	87e397d00b ggml : fix quant dot product with odd number of blocks (#8549)	1 year ago
Brian	57b1d4f9eb convert-*.py: remove add_name from ChatGLMModel class (#8590)	1 year ago
Georgi Gerganov	d197545530 llama : bump max layers from 256 to 512 (#8530)	1 year ago
Georgi Gerganov	be0cfb4175 readme : fix server badge	1 year ago
Clint Herron	b57eb9ca4f ggml : add friendlier error message to fopen errors (#8575)	1 year ago
Frank Mai	f299aa98ec fix: typo of chatglm4 chat tmpl (#8586)	1 year ago
Brian	3d0e4367d9 convert-*.py: add general.name kv override (#8571)	1 year ago
Johannes Gäßler	a15ef8f8a0 CUDA: fix partial offloading for ne0 % 256 != 0 (#8572)	1 year ago
65a	705b7ecf60 cmake : install all ggml public headers (#8480)	1 year ago
Eric Zhang	0d2c7321e9 server: use relative routes for static files in new UI (#8552)	1 year ago
Brian	672a6f1018 convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499)	1 year ago
RunningLeon	3807c3de04 server : respect `--special` cli arg (#8553)	1 year ago
Johannes Gäßler	e02b597be3 lookup: fibonacci hashing, fix crashes (#8548)	1 year ago
Al Mochkin	b3283448ce build : Fix docker build warnings (#8535) (#8537)	1 year ago
Brian	30f80ca0bc CONTRIBUTING.md : remove mention of noci (#8541)	1 year ago
hipudding	1bdd8ae19f [CANN] Add Ascend NPU backend (#6035)	1 year ago
Masaya, Kato	da3913d8f9 batched: fix n_predict parameter (#8527)	1 year ago
Georgi Gerganov	d65a8361fe llama : disable context-shift for DeepSeek v2 (#8501)	1 year ago
Johannes Gäßler	5e116e8dd5 make/cmake: add missing force MMQ/cuBLAS for HIP (#8515)	1 year ago
Brian	1666f92dcd gguf-hash : update clib.json to point to original xxhash repo (#8491)	1 year ago
Steve Bonds	37b12f92ab export-lora : handle help argument (#8497)	1 year ago
Georgi Gerganov	0efec57787 llama : valign + remove unused ftype (#8502)	1 year ago
compilade	7acfd4e8d5 convert_hf : faster lazy safetensors (#8482)	1 year ago
Xuan Son Nguyen	97bdd26eee Refactor lora adapter support (#8332)	1 year ago
