mgroeber9110
|
c2df36d60d
llama : consistently catch and throw only exceptions deriving from std::exception (#1599)
|
2 лет назад |
kiltyj
|
9d0693bce3
metal : use shared buffers between CPU and GPU (#1696)
|
2 лет назад |
grahameth
|
efe0507632
ggml : fix internal overflow in ggml_time_us on Windows (#1702)
|
2 лет назад |
Georgi Gerganov
|
e7fe66e670
ci : disable auto tidy (#1705)
|
2 лет назад |
Kawrakow
|
99009e72f8
ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)
|
2 лет назад |
Henri Vasserman
|
5220a991a5
Increase 3B scratch buffers. (#1698)
|
2 лет назад |
Georgi Gerganov
|
d1f563a743
llama : fix Metal KV cache sync (close #1695)
|
2 лет назад |
Georgi Gerganov
|
827f5eda91
readme : update hot topics
|
2 лет назад |
Georgi Gerganov
|
ecb217db4f
llama : Metal inference (#1642)
|
2 лет назад |
0cc4m
|
dcb2ed4826
OpenCL: Fix duplication of layers in VRAM and RAM, add GPU mul kernel (#1653)
|
2 лет назад |
Henri Vasserman
|
d8bd0013e8
Add info about CUDA_VISIBLE_DEVICES (#1682)
|
2 лет назад |
Jiří Podivín
|
b5c85468a3
Docker: change to calling convert.py (#1641)
|
2 лет назад |
Evan Jones
|
136476e898
Fix prompt cache saving and chat-persistent rollover (#1678)
|
2 лет назад |
Henri Vasserman
|
ffb06a345e
OpenLLaMA 3B support (#1588)
|
2 лет назад |
Georgi Gerganov
|
7552ac5863
ggml : sync cgraph import / export API
|
2 лет назад |
Georgi Gerganov
|
5d1830b99d
ggml : fix bug in ggml_alibi
|
2 лет назад |
DannyDaemonic
|
248367605e
Work around for recalculating logits in cached prompts (Fixes #1585) (#1609)
|
2 лет назад |
Jiří Podivín
|
0e730dd23b
Adding git in container package dependencies (#1621)
|
2 лет назад |
Johannes Gäßler
|
3b126f654f
LLAMA_DEBUG adds debug symbols (#1617)
|
2 лет назад |
Kerfuffle
|
1b78ed2081
Only show -ngl option when relevant + other doc/arg handling updates (#1625)
|
2 лет назад |
Vladimir Zorin
|
337aea1139
examples : add --alias option to gpt_params to set use friendly model name (#1614)
|
2 лет назад |
Howard Su
|
bb051d9723
opencl : no need to allocate cl_mem on heap (#1612)
|
2 лет назад |
Howard Su
|
ca74884f66
opencl : use strstr to check if fp16 supported (#1611)
|
2 лет назад |
apcameron
|
a6704643b6
ggml : add support for the RISCV architecture (#1616)
|
2 лет назад |
Kerfuffle
|
0df7d63e5b
Include server in releases + other build system cleanups (#1610)
|
2 лет назад |
Henri Vasserman
|
97c9b77c4f
Add documentation about CLBlast (#1604)
|
2 лет назад |
Henri Vasserman
|
0ecb1bbbeb
[CI] Fix openblas (#1613)
|
2 лет назад |
Georgi Gerganov
|
93618031c7
ggml : add ggml_tensor_overhead()
|
2 лет назад |
Henri Vasserman
|
83c54e6da5
[CI] CLBlast: Fix directory name (#1606)
|
2 лет назад |
Georgi Gerganov
|
bdbda1b17a
ggml : sync ggml core (minor additions, e.g. ggml_get_tensor_by_name())
|
2 лет назад |