Judd | 36680f6e40 | convert : update for baichuan (#2081) | 2 years ago
Johannes Gäßler | 924dd22fd3 | Quantized dot products for CUDA mul mat vec (#2067) | 2 years ago
Georgi Gerganov | b472f3fca5 | readme : add link web chat PR | 2 years ago
Judd | 471aab6e4c | convert : add support of baichuan-7b (#2055) | 2 years ago
Roman Parykin | d38e451578 | readme : add Scala 3 bindings repo (#2010) | 2 years ago
Gustavo Rocha Dias | aa777abbb7 | readme : LD_LIBRARY_PATH complement for some Android devices when building with CLBlast inside Termux (#2007) | 2 years ago
Georgi Gerganov | 412c60e473 | readme : add link to new k-quants for visibility | 2 years ago
Georgi Gerganov | 447ccbe8c3 | readme : add new roadmap + manifesto | 2 years ago
Georgi Gerganov | 66a2555ba6 | readme : add Azure CI discussion link | 2 years ago
Georgi Gerganov | 11da1a85cd | readme : fix whitespaces | 2 years ago
Alberto | 235b610d65 | readme : fixed termux instructions (#1973) | 2 years ago
eiery | d7b7484f74 | Add OpenLLaMA instructions to the README (#1954) | 2 years ago
Rahul Vivek Nair | fb98254f99 | Fix typo in README.md (#1961) | 2 years ago
Georgi Gerganov | 049aa16b8c | readme : add link to p1 | 2 years ago
Xiake Sun | 2322ec223a | Fix typo (#1949) | 2 years ago
Johannes Gäßler | 16b9cd1939 | Convert vector to f16 for dequantize mul mat vec (#1913) | 2 years ago
Mike | e1886cf4fe | readme : update Android build instructions (#1922) | 2 years ago
Johannes Gäßler | 2c9380dd2f | Only one CUDA stream per device for async compute (#1898) | 2 years ago
Gustavo Rocha Dias | bac19927c3 | readme : alternative way to build for Android with CLBlast. (#1828) | 2 years ago
Aisuko | 059e99066d | doc : fix wrong address of BLIS.md (#1772) | 2 years ago
Georgi Gerganov | 4dc62c545d | readme : add June roadmap | 2 years ago
Yuval Peled | f4c55d3bd7 | docs : add performance troubleshoot + example benchmark documentation (#1674) | 2 years ago
Foul-Tarnished | f1465624c2 | readme : fix typo (#1700) | 2 years ago
Georgi Gerganov | 827f5eda91 | readme : update hot topics | 2 years ago
Georgi Gerganov | ecb217db4f | llama : Metal inference (#1642) | 2 years ago
Henri Vasserman | d8bd0013e8 | Add info about CUDA_VISIBLE_DEVICES (#1682) | 2 years ago
Henri Vasserman | 97c9b77c4f | Add documentation about CLBlast (#1604) | 2 years ago
Evan Jones | c31bbe934b | readme : add docs for chat-persistent.sh (#1568) | 2 years ago
Zenix | b8ee340abe | feature : support blis and other blas implementation (#1536) | 2 years ago
Georgi Gerganov | ea600071cb | Revert "feature : add blis and other BLAS implementation support (#1502)" | 2 years ago