cturan/llama.cpp

Autor	SHA1 Mensagem	Data
Johannes Gäßler	a743d76a01 CUDA: generalize FP16 fattn vec kernel (#7061)	há 1 ano atrás
Galunid	f31ec120bc Add warning if token is invalid (#7173)	há 1 ano atrás
Daniel Bevenius	fd9f92b154 llama : update llama_timings.n_p_eval setting (#7160)	há 1 ano atrás
Sigbjørn Skjæret	22842164bc gguf-py : add special token modification capability (#7166)	há 1 ano atrás
Albert Jin	4734524882 opencl : alignment size converted from bits to bytes (#7090)	há 1 ano atrás
Ahmet Zeer	07cd41d096 TypoFix (#7162)	há 1 ano atrás
Jared Van Bortel	4426e2987b cmake : fix typo (#7151)	há 1 ano atrás
compilade	f98eb31c51 convert-hf : save memory with lazy evaluation (#7075)	há 1 ano atrás
agray3	bc4bba364f Introduction of CUDA Graphs to LLama.cpp (#6766)	há 1 ano atrás
Johannes Gäßler	c12452c7ae JSON: [key] -> .at(key), assert() -> GGML_ASSERT (#7143)	há 1 ano atrás
Georgi Gerganov	9da243b36a Revert "llava : add support for moondream vision language model (#6899)"	há 1 ano atrás
JohnnyB	bd1871fa2b server : add themes + favicon (#6848)	há 1 ano atrás
Gilad S	26458af1d6 metal : use `vm_allocate` instead of `posix_memalign` on macOS (#7078)	há 1 ano atrás
Dawid Potocki	83330d8cd6 main : add --conversation / -cnv flag (#7108)	há 1 ano atrás
Eve	465263d0cf sgemm : AVX Q4_0 and Q8_0 (#6891)	há 1 ano atrás
Johan	911b3900dd server : add_special option for tokenize endpoint (#7059)	há 1 ano atrás
20kdc	ad211edef5 convert.py : --vocab-only generates false but valid params (#7027)	há 1 ano atrás
Ren Xuancheng	229ffff872 llama : add BPE pre-tokenization for Qwen2 (#7114)	há 1 ano atrás
Xuan Son Nguyen	1fd9c1741d clean up json_value & server_log (#7142)	há 1 ano atrás
DAN™	4cd621c26d convert : add BPE pre-tokenization for DBRX (#7132)	há 1 ano atrás
Georgi Gerganov	7e0b6a7b3b py : also print the normalizers	há 1 ano atrás
Brian	acdce3cdef compare-llama-bench.py: add missing basicConfig (#7138)	há 1 ano atrás
Justine Tunney	3855416027 ggml : introduce bfloat16 support (#6412)	há 1 ano atrás
Georgi Gerganov	c0e6fbf8c3 metal : fix unused warning	há 1 ano atrás
Jeximo	c780e75305 Further tidy on Android instructions README.md (#7077)	há 1 ano atrás
jukofyork	48b2f9c1fc Fixed save_imatrix to match old behaviour for MoE (#7099)	há 1 ano atrás
Johannes Gäßler	af0a5b6163 server: fix incorrectly reported token probabilities (#7125)	há 1 ano atrás
nopperl	b6aa670203 Fix OLMo HF to GGUF conversion (#6910)	há 1 ano atrás
Kyle Mistele	260b7c6529 server : update readme with undocumented options (#7013)	há 1 ano atrás
Georgi Gerganov	53d6c52e22 readme : update hot topics	há 1 ano atrás

Recente Antigo

Histórico de Commits Pesquisar

Histórico de Commits