cturan/llama.cpp

Author	SHA1 Message	Date
HanishKVC	f89fe2732c Main+: optionally allow special tokens from user in interactive mode (#7097)	1 year ago
Andrei	d11afd6652 llava : fix moondream support (#7163)	1 year ago
Ouadie EL FAROUKI	8c570c9496 Minor arithmetic improvement to mmvq wrapper kernel (#7172)	1 year ago
slaren	eaf4bd8b39 eval-callback : fix conversion to float (#7184)	1 year ago
0cc4m	befddd0f15 Vulkan Bugfixes and Improvements (#7084)	1 year ago
Georgi Gerganov	d46dbc76f8 readme : add scheduled server workflow status badge	1 year ago
l3utterfly	0961d86604 readme : add app (#6371)	1 year ago
jaime-m-p	43248e5594 llama3 custom regex split (#6965)	1 year ago
Johannes Gäßler	a743d76a01 CUDA: generalize FP16 fattn vec kernel (#7061)	1 year ago
Galunid	f31ec120bc Add warning if token is invalid (#7173)	1 year ago
Daniel Bevenius	fd9f92b154 llama : update llama_timings.n_p_eval setting (#7160)	1 year ago
Sigbjørn Skjæret	22842164bc gguf-py : add special token modification capability (#7166)	1 year ago
Albert Jin	4734524882 opencl : alignment size converted from bits to bytes (#7090)	1 year ago
Ahmet Zeer	07cd41d096 TypoFix (#7162)	1 year ago
Jared Van Bortel	4426e2987b cmake : fix typo (#7151)	1 year ago
compilade	f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)	1 year ago
agray3	bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)	1 year ago
Johannes Gäßler	c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)	1 year ago
Georgi Gerganov	9da243b36a Revert "llava : add support for moondream vision language model (#6899)"	1 year ago
JohnnyB	bd1871fa2b server : add themes + favicon (#6848)	1 year ago
Gilad S	26458af1d6 metal : use `vm_allocate` instead of `posix_memalign` on macOS (#7078)	1 year ago
Dawid Potocki	83330d8cd6 main : add --conversation / -cnv flag (#7108)	1 year ago
Eve	465263d0cf sgemm : AVX Q4_0 and Q8_0 (#6891)	1 year ago
Johan	911b3900dd server : add_special option for tokenize endpoint (#7059)	1 year ago
20kdc	ad211edef5 convert.py : --vocab-only generates false but valid params (#7027)	1 year ago
Ren Xuancheng	229ffff872 llama : add BPE pre-tokenization for Qwen2 (#7114)	1 year ago
Xuan Son Nguyen	1fd9c1741d clean up json_value & server_log (#7142)	1 year ago
DAN™	4cd621c26d convert : add BPE pre-tokenization for DBRX (#7132)	1 year ago
Georgi Gerganov	7e0b6a7b3b py : also print the normalizers	1 year ago
Brian	acdce3cdef compare-llama-bench.py: add missing basicConfig (#7138)	1 year ago
