cturan/llama.cpp

Autor	SHA1 Mensaxe	Data
Clint Herron	a59f8fdc85 Server: Enable setting default sampling parameters via command-line (#8402)	hai 1 ano
Andy Salerno	fd560fe680 Update README.md to fix broken link to docs (#8399)	hai 1 ano
Clint Herron	e500d6135a Deprecation warning to assist with migration to new binary names (#8283)	hai 1 ano
Johannes Gäßler	a03e8dd99d make/cmake: LLAMA_NO_CCACHE -> GGML_NO_CCACHE (#8392)	hai 1 ano
Alberto Cabrera Pérez	5b0b8d8cfb sycl : Reenabled mmvq path for the SYCL Nvidia Backend (#8372)	hai 1 ano
Borislav Stanimirov	9925ca4087 cmake : allow external ggml (#8370)	hai 1 ano
daghanerdonmez	9beb2dda03 readme : fix typo [no ci] (#8389)	hai 1 ano
compilade	7d0e23d72e gguf-py : do not use internal numpy types (#7472)	hai 1 ano
Georgi Gerganov	7fdb6f73e3 flake.lock: Update (#8342)	hai 1 ano
Alberto Cabrera Pérez	a130eccef4 labeler : updated sycl to match docs and code refactor (#8373)	hai 1 ano
b4b4o	c4dd11d1d3 readme : fix web link error [no ci] (#8347)	hai 1 ano
Alberto Cabrera Pérez	2ec846d558 sycl : fix powf call in device code (#8368)	hai 1 ano
Georgi Gerganov	3f2d538b81 scripts : fix sync for sycl	hai 1 ano
Georgi Gerganov	2ee44c9a18 sync : ggml	hai 1 ano
Georgi Gerganov	6847d54c4f tests : fix whitespace (#0)	hai 1 ano
John Balis	fde13b3bb9 feat: cuda implementation for `ggml_conv_transpose_1d` (ggml/854)	hai 1 ano
Kevin Wang	470939d483 common : preallocate sampling token data vector (#8363)	hai 1 ano
Georgi Gerganov	6f0dbf6ab0 infill : assert prefix/suffix tokens + remove old space logic (#8351)	hai 1 ano
Kevin Wang	ffd00797d8 common : avoid unnecessary logits fetch (#8358)	hai 1 ano
toyer	04ce3a8b19 readme : add supported glm models (#8360)	hai 1 ano
compilade	3fd62a6b1c py : type-check all Python scripts with Pyright (#8341)	hai 1 ano
Denis Spasyuk	a8db2a9ce6 Update llama-cli documentation (#8315)	hai 1 ano
Alex Tuddenham	4090ea5501 ci : add checks for cmake,make and ctest in ci/run.sh (#8200)	hai 1 ano
Andy Tai	f1948f1e10 readme : update bindings list (#8222)	hai 1 ano
Brian	f7cab35ef9 gguf-hash: model wide and per tensor hashing using xxhash and sha1 (#8048)	hai 1 ano
toyer	905942abdb llama : support glm3 and glm4 (#8031)	hai 1 ano
Georgi Gerganov	b5040086d4 llama : fix n_rot default (#8348)	hai 1 ano
compilade	d39130a398 py : use cpu-only torch in requirements.txt (#8335)	hai 1 ano
standby24x7	b81ba1f96b finetune: Rename command name in README.md (#8343)	hai 1 ano
standby24x7	210eb9ed0a finetune: Rename an old command name in finetune.sh (#8344)	hai 1 ano

Posterior Anterior

Commit History Buscar

Commit History