cturan/llama.cpp

Author	SHA1 Message	Date
ds5t5	f8c90cdbaa llm : add Refact model (#3329)	2 years ago
Georgi Gerganov	f93af02488 sync : ggml (conv 1d + 2d updates, UB fixes) (#3468)	2 years ago
Merrick Christensen	f72f8f22c9 finetune : readme fix typo (#3465)	2 years ago
Tameem	79f34abddb ggml : add RISC-V Vector Support for K-Quants and improved the existing intrinsics (#3453)	2 years ago
h-h-h-h	8186242b6d main : consistent prefix/suffix coloring (#3425)	2 years ago
Georgi Gerganov	ac2219fef3 llama : fix session saving/loading (#3400)	2 years ago
Alex Klinkhamer	48be797ffb llama : expose model's rope_freq_scale in the API (#3418)	2 years ago
Jiahao Li	f56e1baec3 metal : alibi for arbitrary number of heads (#3426)	2 years ago
Eve	017efe899d cmake : make LLAMA_NATIVE flag actually use the instructions supported by the processor (#3273)	2 years ago
goerch	ff5a3f0c09 Work on the BPE tokenizer (#3252)	2 years ago
cebtenzzre	1c84003c08 convert : fix vocab size when not defined in hparams (#3421)	2 years ago
cebtenzzre	e78f0b0d05 cmake : increase minimum version for add_link_options (#3444)	2 years ago
shibe2	665018c749 CLBlast: Add broadcast support for matrix multiplication (#3402)	2 years ago
cebtenzzre	29a404a951 gguf : add BERT, MPT, and GPT-J arch info (#3408)	2 years ago
cebtenzzre	0fe321031a gguf : general usability improvements (#3409)	2 years ago
cebtenzzre	9476b01226 cmake : make CUDA flags more similar to the Makefile (#3420)	2 years ago
xaedes	a03ce38455 finetune : fix #3404 (#3437)	2 years ago
Adrian	a847676984 metal : set log callback before initializing (#3427)	2 years ago
bandoti	095231dfd3 cmake : fix transient definitions in find pkg (#3411)	2 years ago
Kevin Ji	ea55295a74 docker : ignore Git files (#3314)	2 years ago
vvhg1	c97f01c362 infill : add new example + extend server API (#3296)	2 years ago
slaren	f5ef5cfb18 ggml-cuda : perform cublas mat mul of quantized types as f16 (#3412)	2 years ago
slaren	40e07a60f9 llama.cpp : add documentation about rope_freq_base and scale values (#3401)	2 years ago
Georgi Gerganov	bc34dd4f5b train : fix KQ_pos allocation (#3392)	2 years ago
Cebtenzzre	2777a84be4 llama : quantize up to 31% faster on Linux and Windows with mmap (#3206)	2 years ago
BarfingLemurs	0a4a4a0982 readme : update hot topics + model links (#3399)	2 years ago
Andrew Duffy	569550df20 readme : add link to grammars app (#3388)	2 years ago
Jhen-Jie Hong	c71bf2c45c swift : fix build on xcode 15 (#3387)	2 years ago
Cebtenzzre	bc39553c90 build : enable more non-default compiler warnings (#3200)	2 years ago
Hua Jiang	0ccfc62a96 ggml_tensor: update the structure comments. (#3283)	2 years ago

