Roman Parykin
|
d38e451578
readme : add Scala 3 bindings repo (#2010)
|
2 years ago |
Gustavo Rocha Dias
|
aa777abbb7
readme : LD_LIBRARY_PATH complement for some Android devices when building with CLBlast inside Termux (#2007)
|
2 years ago |
Georgi Gerganov
|
412c60e473
readme : add link to new k-quants for visibility
|
2 years ago |
Georgi Gerganov
|
447ccbe8c3
readme : add new roadmap + manifesto
|
2 years ago |
Georgi Gerganov
|
66a2555ba6
readme : add Azure CI discussion link
|
2 years ago |
Georgi Gerganov
|
11da1a85cd
readme : fix whitespaces
|
2 years ago |
Alberto
|
235b610d65
readme : fixed termux instructions (#1973)
|
2 years ago |
eiery
|
d7b7484f74
Add OpenLLaMA instructions to the README (#1954)
|
2 years ago |
Rahul Vivek Nair
|
fb98254f99
Fix typo in README.md (#1961)
|
2 years ago |
Georgi Gerganov
|
049aa16b8c
readme : add link to p1
|
2 years ago |
Xiake Sun
|
2322ec223a
Fix typo (#1949)
|
2 years ago |
Johannes Gäßler
|
16b9cd1939
Convert vector to f16 for dequantize mul mat vec (#1913)
|
2 years ago |
Mike
|
e1886cf4fe
readme : update Android build instructions (#1922)
|
2 years ago |
Johannes Gäßler
|
2c9380dd2f
Only one CUDA stream per device for async compute (#1898)
|
2 years ago |
Gustavo Rocha Dias
|
bac19927c3
readme : alternative way to build for Android with CLBlast. (#1828)
|
2 years ago |
Aisuko
|
059e99066d
doc : fix wrong address of BLIS.md (#1772)
|
2 years ago |
Georgi Gerganov
|
4dc62c545d
readme : add June roadmap
|
2 years ago |
Yuval Peled
|
f4c55d3bd7
docs : add performance troubleshoot + example benchmark documentation (#1674)
|
2 years ago |
Foul-Tarnished
|
f1465624c2
readme : fix typo (#1700)
|
2 years ago |
Georgi Gerganov
|
827f5eda91
readme : update hot topics
|
2 years ago |
Georgi Gerganov
|
ecb217db4f
llama : Metal inference (#1642)
|
2 years ago |
Henri Vasserman
|
d8bd0013e8
Add info about CUDA_VISIBLE_DEVICES (#1682)
|
2 years ago |
Henri Vasserman
|
97c9b77c4f
Add documentation about CLBlast (#1604)
|
2 years ago |
Evan Jones
|
c31bbe934b
readme : add docs for chat-persistent.sh (#1568)
|
2 years ago |
Zenix
|
b8ee340abe
feature : support blis and other blas implementation (#1536)
|
2 years ago |
Georgi Gerganov
|
ea600071cb
Revert "feature : add blis and other BLAS implementation support (#1502)"
|
2 years ago |
Zenix
|
07e9ace0f9
feature : add blis and other BLAS implementation support (#1502)
|
2 years ago |
Georgi Gerganov
|
2d5db48371
ggml : use F16 instead of F32 in Q4_0, Q4_1, Q8_0 (#1508)
|
2 years ago |
David Kennedy
|
79e3efb0e9
readme : adds WizardLM to the list of supported models (#1485)
|
2 years ago |
Georgi Gerganov
|
cdd5350892
readme : update Q4_0 perplexities
|
2 years ago |