cturan/llama.cpp

Author	SHA1 Message	Date
Luo Tian	7d8b24932f zig : fix build by introducing train.cpp (#3539)	2 years ago
Georgi Gerganov	b0ec5218c3 metal : support MTLGPUFamily < Apple7, formatting, style (#3524)	2 years ago
Kerfuffle	63d3b06a43 llama : fix missing break in Persimmon arch case statements (#3535)	2 years ago
Kerfuffle	a16e89cec8 Fix trying to strip newline from empty prompt and cfg prompt file content (#3534)	2 years ago
M. Yusuf Sarıgöz	4d03833211 gguf.py : fix CI for publishing GGUF package (#3532)	2 years ago
Tom C	c47066d833 py : change version of numpy requirement to 1.24.4 (#3515)	2 years ago
cebtenzzre	f1782c68de quantize : fail fast on write errors (#3521)	2 years ago
Jhen-Jie Hong	c26765a0a1 metal : support default.metallib load & reuse code for swift package (#3522)	2 years ago
Phillip Kravtsov	0e797c2fc5 llm : support Adept Persimmon 8B (#3410)	2 years ago
goerch	3a716b4dae Fix for #3454 (#3455)	2 years ago
BarfingLemurs	1faaae8c2b readme : update models, cuda + ppl instructions (#3510)	2 years ago
Mihai	cb13d73a72 server : docs fix default values and add n_probs (#3506)	2 years ago
Kerfuffle	9ca79d5cbb kv cache slot search improvements (#3493)	2 years ago
Georgi Gerganov	0c731ca403 prompts : fix editorconfig checks after #3416	2 years ago
pudepiedj	a8777ad84e parallel : add option to load external prompt file (#3416)	2 years ago
Jhen-Jie Hong	97af49fa39 server : reuse llama_sample_token common util (#3494)	2 years ago
l3utterfly	16820a5a0d llama : correct hparams comparison (#3446)	2 years ago
Jhen-Jie Hong	04b2f4386e ci : fix xcodebuild destinations (#3491)	2 years ago
cebtenzzre	48edda30ee convert : update Falcon script for new HF config (#3448)	2 years ago
Kenvix ⭐	45eba9369f build : use std::make_tuple() for compatibility with older GCC versions (#3488)	2 years ago
staviq	acec9eaaa9 common : process escape sequences in reverse prompts (#3461)	2 years ago
shibe2	e2583cbc29 CLBlast: Fix handling of on-device tensor data	2 years ago
Jhen-Jie Hong	e8b8d32e86 server : fix incorrect num_tokens_predicted (#3480)	2 years ago
Jhen-Jie Hong	8f3a642ec1 swift : disable ACCELERATE_NEW_LAPACK (#3481)	2 years ago
Jhen-Jie Hong	0745384449 ci : add swift build via xcodebuild (#3482)	2 years ago
Kerfuffle	019ba1dcd0 convert : fix Baichuan2 models by using vocab size in config.json (#3299)	2 years ago
Georgi Gerganov	beabc8cfb0 readme : add project status link	2 years ago
Georgi Gerganov	0d152b37fe ggml : fix build after #3329	2 years ago
ds5t5	f8c90cdbaa llm : add Refact model (#3329)	2 years ago
Georgi Gerganov	f93af02488 sync : ggml (conv 1d + 2d updates, UB fixes) (#3468)	2 years ago

Newer Older

Commit History Find

Commit History