cturan/llama.cpp

Autor	SHA1 Nachricht	Datum
Georgi Gerganov	3a0345970e make : whitespace	vor 1 Jahr
Jared Van Bortel	32c8486e1f wpm : portable unicode tolower (#6305)	vor 1 Jahr
slaren	280345968d cuda : rename build flag to LLAMA_CUDA (#6299)	vor 1 Jahr
slaren	ae1f211ce2 cuda : refactor into multiple files (#6269)	vor 1 Jahr
Minsoo Cheong	64e7b47c69 examples : add "retrieval" (#6193)	vor 1 Jahr
Pierrick Hymbert	21cad01b6e split: add gguf-split in the make build target (#6262)	vor 1 Jahr
Johannes Gäßler	50ccaf5eac lookup: complement data from context with general text statistics (#5479)	vor 1 Jahr
slaren	2f0e81e053 cuda : add LLAMA_CUDA_NO_PEER_COPY to workaround broken ROCm p2p copy (#6208)	vor 1 Jahr
Olivier Chafik	5b7b0ac8df json-schema-to-grammar improvements (+ added to server) (#5978)	vor 1 Jahr
Pierrick Hymbert	d0d5de42e5 gguf-split: split and merge gguf per batch of tensors (#6135)	vor 1 Jahr
Pierrick Hymbert	d01b3c4c32 common: llama_load_model_from_url using --model-url (#6098)	vor 1 Jahr
Georgi Gerganov	131b058409 make : ggml-metal.o depends on ggml.h	vor 1 Jahr
Georgi Gerganov	381da2d9f0 metal : build metallib + fix embed path (#6015)	vor 1 Jahr
slaren	f30ea47a87 llama : add pipeline parallelism support (#6017)	vor 1 Jahr
Georgi Gerganov	83796e62bc llama : refactor unicode stuff (#5992)	vor 1 Jahr
DAN™	bcebd7dbf6 llama : add support for GritLM (#5959)	vor 1 Jahr
Georgi Gerganov	8a3012a4ad ggml : add ggml-common.h to deduplicate shared code (#5940)	vor 1 Jahr
Gabe Goodhart	e1fa9569ba server : add SSL support (#5926)	vor 1 Jahr
Georgi Gerganov	2002bc96bf server : refactor (#5882)	vor 1 Jahr
le.chang	cbbd1efa06 Makefile: use variables for cublas (#5689)	vor 1 Jahr
kwin1412	f1a98c5254 make : fix nvcc version is empty (#5713)	vor 1 Jahr
CJ Pais	6560bed3f0 server : support llava 1.6 (#5553)	vor 1 Jahr
slaren	06bf2cf8c4 make : fix debug build with CUDA (#5616)	vor 1 Jahr
Haoxiang Fei	8dbbd75754 metal : add build system support for embedded metal library (#5604)	vor 1 Jahr
Jared Van Bortel	f24ed14ee0 make : pass CPPFLAGS directly to nvcc, not via -Xcompiler (#5598)	vor 1 Jahr
Georgi Gerganov	d0e3ce51f4 ci : enable -Werror for CUDA builds (#5579)	vor 1 Jahr
Georgi Gerganov	68a6b98b3c make : fix CUDA build (#5580)	vor 1 Jahr
Xuan Son Nguyen	11b12de39b llama : add llama_chat_apply_template() (#5538)	vor 1 Jahr
Jared Van Bortel	a0c2dad9d4 build : pass all warning flags to nvcc via -Xcompiler (#5570)	vor 1 Jahr
Ananta Bastola	6e4e973b26 ci : add an option to fail on compile warning (#3952)	vor 1 Jahr

Neuer Älter

Commit Verlauf Finden

Commit Verlauf