jiez
|
1966eb2615
quantize : add '--keep-split' to quantize model into shards (#6688)
|
1 year ago |
Johannes Gäßler
|
784e11dea1
README: add graphic for matrix multiplication (#6881)
|
1 year ago |
Douglas Hanley
|
b4e4b8a935
llama : add llama_get_pooling_type function (#6862)
|
1 year ago |
mgroeber9110
|
3fe847b574
server : do not apply Markdown formatting in code sections (#6850)
|
1 year ago |
Kyle Mistele
|
37246b1031
common : revert showing control tokens by default for server (#6860)
|
1 year ago |
Johannes Gäßler
|
28103f4832
Server: fix seed for multiple slots (#6835)
|
1 year ago |
Georgi Gerganov
|
c0d1b3e03e
ggml : move 32-bit arm compat in ggml-impl.h (#6865)
|
1 year ago |
Tristan Druyen
|
abd3314064
llama : add phi 3 chat template (#6857)
|
1 year ago |
Junyang Lin
|
3fec68be4e
convert : add support of codeqwen due to tokenizer (#6707)
|
1 year ago |
liuwei-git
|
c8297c6af5
llama : add phi3 support (#6852)
|
1 year ago |
Anas Ahouzi
|
4e96a812b3
[SYCL] Windows default build instructions without -DLLAMA_SYCL_F16 flag activated (#6767)
|
1 year ago |
Justine Tunney
|
192090bae4
llamafile : improve sgemm.cpp (#6796)
|
1 year ago |
Dave Airlie
|
e931888d50
ggml : fix calloc argument ordering. (#6820)
|
1 year ago |
Georgi Gerganov
|
8960fe86ae
llama : fix typo in <|im_end|> token text (#6745)
|
1 year ago |
Pierrick Hymbert
|
c0956b09ba
ci: fix job are cancelling each other (#6781)
|
1 year ago |
github-actions[bot]
|
e9b4a1bf68
flake.lock: Update
|
1 year ago |
Olivier Chafik
|
5cf5e7d490
`build`: generate hex dump of server assets during build (#6661)
|
1 year ago |
Georgi Gerganov
|
40f74e4d73
llama : add option to render special/control tokens (#6807)
|
1 year ago |
Georgi Gerganov
|
b9cc76d87e
ggml : fix ggml_backend_cpu_supports_op() for CPY (#0)
|
1 year ago |
Wouter
|
7dbdba5690
llama : add llama-3 chat template (#6751)
|
1 year ago |
pmysl
|
c1386c936e
gguf-py : add IQ1_M to GGML_QUANT_SIZES (#6761)
|
1 year ago |
Jan Boon
|
e8d35f47cb
doc : add link to falcon (#6789)
|
1 year ago |
Mohammadreza Hendiani
|
2cca09d509
readme : add Fedora instructions (#6783)
|
1 year ago |
Justine Tunney
|
89b0bf0d5d
llava : use logger in llava-cli (#6797)
|
1 year ago |
Pedro Cuenca
|
b97bc3966e
llama : support Llama 3 HF conversion (#6745)
|
1 year ago |
Jan Boon
|
b8109bc013
doc : server tests require llama to be built with curl enabled (#6788)
|
1 year ago |
Georgi Gerganov
|
aed82f6837
common : try to fix Android CI (#6780)
|
1 year ago |
loonerin
|
0e4802b2ec
ci: add ubuntu latest release and fix missing build number (mac & ubuntu) (#6748)
|
1 year ago |
Pierrick Hymbert
|
637e9a86c2
server: static: upstream upgrade (#6765)
|
1 year ago |
nopperl
|
9958c81b79
Implement the OLMo architecture (#6741)
|
1 year ago |