cturan/llama.cpp

Author	SHA1 Message	Date
Andrew Duffy	58c438cf7d Add Accelerate/BLAS when using Swift (#765)	2 years ago
mgroeber9110	53dbba7695 Windows: reactive sigint handler after each Ctrl-C (#736)	2 years ago
SebastianApel	437e77855a 10+% performance improvement of ggml_vec_dot_q4_0 on AVX2 (#654)	2 years ago
Ivan Stepanov	cd7fa95690 Define non-positive temperature behavior (#720)	2 years ago
bsilvereagle	a0c0516416 Remove torch GPU dependencies from the Docker.full image (#665)	2 years ago
Thatcher Chamberlin	d8d4e865cd Add a missing step to the gpt4all instructions (#690)	2 years ago
Christian Falch	e986f94829 Added api for getting/setting the kv_cache (#685)	2 years ago
Marian Cepok	c0bb1d3ce2 ggml : change ne to int64_t (#626)	2 years ago
Leonardo Neumann	6e7801d08d examples : add gpt4all script (#658)	2 years ago
Stephan Walter	81040f10aa llama : do not allocate KV cache for "vocab_only == true" (#682)	2 years ago
Fabian	c4f89d8d73 make : use -march=native -mtune=native on x86 (#609)	2 years ago
Murilo Santana	5b70e7de4c fix default params for examples/main (#697)	2 years ago
Ikko Eltociear Ashimine	a717cba844 py: huggingface -> Hugging Face (#686)	2 years ago
rimoliga	d0a7f742e7 readme: replace termux links with homepage, play store is deprecated (#680)	2 years ago
Slaren	0d054e292e Show error message when -f fails	2 years ago
Stephan Walter	3525899277 Enable -std= for cmake builds, fix warnings (#598)	2 years ago
slaren	1d08882afa Optimize AVX2 ggml_vec_dot_q4_0 (#642)	2 years ago
perserk	02c5b27e91 Add AVX acceleration (#617)	2 years ago
Pavol Rusnak	cbef542879 py : cleanup the code	2 years ago
Pavol Rusnak	9733104be5 drop quantize.py (now that models are using a single file)	2 years ago
Georgi Gerganov	3df890aef4 readme : update supported models	2 years ago
Justine Tunney	ee0c40dd6d Introduce GGML migration tool for new file format	2 years ago
Justine Tunney	6f23ba5ee2 Ensure --mlock works properly with mmap() support	2 years ago
Justine Tunney	78ca9838ee Make loading weights 10-100x faster	2 years ago
Slaren	a017390358 Initial windows support (untested)	2 years ago
Slaren	ac184d5147 Always initialize mm_addr and mm_length in llama_model	2 years ago
Slaren	276e5b7811 Unmap the file in llama_free	2 years ago
Slaren	d68c5dc435 Make mmap_file static	2 years ago
Slaren	64bde3ffd4 Fix ggml_init_params in quantize	2 years ago
Slaren	c03ae8dca1 Add mmap support for model files	2 years ago

Newer Older

Commit History Find

Commit History