cturan/llama.cpp

Author	SHA1 Message	Date
Aaron Miller	2f8cd979ec metal : release buffers when freeing metal context (#2062)	2 years ago
Judd	471aab6e4c convert : add support of baichuan-7b (#2055)	2 years ago
Georgi Gerganov	463f2f4c4f llama : fix return value of llama_load_session_file_internal (#2022)	2 years ago
Rand Xie	cb44dbc7de llama : catch llama_load_session_file_internal exceptions (#2022)	2 years ago
Georgi Gerganov	79f634a19d embd-input : fix returning ptr to temporary	2 years ago
Georgi Gerganov	04606a1599 train : fix compile warning	2 years ago
Qingyou Meng	b1ca8f36a9 ggml : disable GGML_TASK_INIT and GGML_TASK_FINALIZE by default (#1995)	2 years ago
Howard Su	b8c8dda75f Use unsigned for random seed (#2006)	2 years ago
LostRuins	96a712ca1b Porting the improved K-Quant CUDA kernels to OpenCL (#1966)	2 years ago
m3ndax	d3494bb86b llama : replacing auto &kv with const auto &kv (#2041)	2 years ago
Salvador E. Tropea	5b351e94d0 cuda : remove nchannels_x argument from mul_mat_vec_nc_f16_f32 (#2028)	2 years ago
Salvador E. Tropea	6432aabb6d cuda : fix missing const qualifier in casts (#2027)	2 years ago
Howard Su	b922bc351b llama : remove shards weight file support (#2000)	2 years ago
Johannes Gäßler	7f9753fa12 CUDA GPU acceleration for LoRAs + f16 models (#1970)	2 years ago
ningshanwutuobang	cfa0750bc9 llama : support input embeddings directly (#1910)	2 years ago
Erik Scholz	9d23589d63 fix pthreads setaffinity usage on android (#2020)	2 years ago
Howard Su	0be54f75a6 baby-llama : fix build after ggml_rope change (#2016)	2 years ago
Georgi Gerganov	181e8d9755 llama : fix rope usage after ChatGLM change	2 years ago
Georgi Gerganov	d9779021bd ggml : add support for ChatGLM RoPE	2 years ago
Roman Parykin	d38e451578 readme : add Scala 3 bindings repo (#2010)	2 years ago
David Yang	eaa6ca5a61 ggml : increase max tensor name + clean up compiler warnings in train-text (#1988)	2 years ago
Gustavo Rocha Dias	aa777abbb7 readme : LD_LIBRARY_PATH complement for some Android devices when building with CLBlast inside Termux (#2007)	2 years ago
Georgi Gerganov	c824d2e368 ggml : avoid conv 2d kernel round up	2 years ago
zrm	b853d45601 ggml : add NUMA support (#1556)	2 years ago
Georgi Gerganov	9225baef71 k-quants : fix indentation	2 years ago
katsu560	a84ab1da8d tests : fix quantize perf (#1990)	2 years ago
katsu560	5743ca8092 k-quants : add AVX support to dot functions (#1916)	2 years ago
Georgi Gerganov	412c60e473 readme : add link to new k-quants for visibility	2 years ago
Kawrakow	6769e944c7 k-quants : support for super-block size of 64 (#2001)	2 years ago
Howard Su	cbebf61ca7 Fix assert when free invalid cuda pointer (#2005)	2 years ago
