cturan/llama.cpp

Author	SHA1 Message	Date
Xuan Son Nguyen	b115105f05 add llama_lora_adapter_clear (#8653)	1 year ago
Georgi Gerganov	938943cdbf llama : move vocab, grammar and sampling into separate files (#8508)	1 year ago
Keke Han	081fe431aa llama : fix codeshell support (#8599)	1 year ago
Jason Stillerman	d94c6e0ccb llama : add support for SmolLm pre-tokenizer (#8609)	1 year ago
Michael Coppola	940362224d llama : add support for Tekken pre-tokenizer (#8579)	1 year ago
Georgi Gerganov	d197545530 llama : bump max layers from 256 to 512 (#8530)	1 year ago
Georgi Gerganov	0efec57787 llama : valign + remove unused ftype (#8502)	1 year ago
Xuan Son Nguyen	97bdd26eee Refactor lora adapter support (#8332)	1 year ago
Dibakar Gope	0f1a39f343 ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (#5780)	1 year ago
toyer	905942abdb llama : support glm3 and glm4 (#8031)	1 year ago
jaime-m-p	213701b51a Detokenizer fixes (#8039)	1 year ago
Douglas Hanley	d12f781074 llama : streamline embeddings from "non-embedding" models (#8087)	1 year ago
fairydreaming	807b0c49ff Inference support for T5 and FLAN-T5 model families (#5763)	1 year ago
Faisal Zaghloul	968967376d Add `JAIS` model(s) (#8118)	1 year ago
kustaaya	f675b20a3b Added support for Viking pre-tokenizer (#8135)	1 year ago
Georgi Gerganov	f3f65429c4 llama : reorganize source code + improve CMake (#8006)	1 year ago