cturan/llama.cpp

Author	SHA1 Message	Date
jiez	1966eb2615 quantize : add '--keep-split' to quantize model into shards (#6688)	1 year ago
Johannes Gäßler	784e11dea1 README: add graphic for matrix multiplication (#6881)	1 year ago
Douglas Hanley	b4e4b8a935 llama : add llama_get_pooling_type function (#6862)	1 year ago
mgroeber9110	3fe847b574 server : do not apply Markdown formatting in code sections (#6850)	1 year ago
Kyle Mistele	37246b1031 common : revert showing control tokens by default for server (#6860)	1 year ago
Johannes Gäßler	28103f4832 Server: fix seed for multiple slots (#6835)	1 year ago
Georgi Gerganov	c0d1b3e03e ggml : move 32-bit arm compat in ggml-impl.h (#6865)	1 year ago
Tristan Druyen	abd3314064 llama : add phi 3 chat template (#6857)	1 year ago
Junyang Lin	3fec68be4e convert : add support of codeqwen due to tokenizer (#6707)	1 year ago
liuwei-git	c8297c6af5 llama : add phi3 support (#6852)	1 year ago
Anas Ahouzi	4e96a812b3 [SYCL] Windows default build instructions without -DLLAMA_SYCL_F16 flag activated (#6767)	1 year ago
Justine Tunney	192090bae4 llamafile : improve sgemm.cpp (#6796)	1 year ago
Dave Airlie	e931888d50 ggml : fix calloc argument ordering. (#6820)	1 year ago
Georgi Gerganov	8960fe86ae llama : fix typo in <\|im_end\|> token text (#6745)	1 year ago
Pierrick Hymbert	c0956b09ba ci: fix job are cancelling each other (#6781)	1 year ago
github-actions[bot]	e9b4a1bf68 flake.lock: Update	1 year ago
Olivier Chafik	5cf5e7d490 `build`: generate hex dump of server assets during build (#6661)	1 year ago
Georgi Gerganov	40f74e4d73 llama : add option to render special/control tokens (#6807)	1 year ago
Georgi Gerganov	b9cc76d87e ggml : fix ggml_backend_cpu_supports_op() for CPY (#0)	1 year ago
Wouter	7dbdba5690 llama : add llama-3 chat template (#6751)	1 year ago
pmysl	c1386c936e gguf-py : add IQ1_M to GGML_QUANT_SIZES (#6761)	1 year ago
Jan Boon	e8d35f47cb doc : add link to falcon (#6789)	1 year ago
Mohammadreza Hendiani	2cca09d509 readme : add Fedora instructions (#6783)	1 year ago
Justine Tunney	89b0bf0d5d llava : use logger in llava-cli (#6797)	1 year ago
Pedro Cuenca	b97bc3966e llama : support Llama 3 HF conversion (#6745)	1 year ago
Jan Boon	b8109bc013 doc : server tests require llama to be built with curl enabled (#6788)	1 year ago
Georgi Gerganov	aed82f6837 common : try to fix Android CI (#6780)	1 year ago
loonerin	0e4802b2ec ci: add ubuntu latest release and fix missing build number (mac & ubuntu) (#6748)	1 year ago
Pierrick Hymbert	637e9a86c2 server: static: upstream upgrade (#6765)	1 year ago
nopperl	9958c81b79 Implement the OLMo architecture (#6741)	1 year ago

Newer Older

Commit History Find

Commit History