Tameem
|
5aec2cfaac
ggml : add RISC-V vector intrinsics support (#2929)
|
2 years ago |
Georgi Gerganov
|
13268c5331
metal : slight speed-up for add and mul kernels (#2917)
|
2 years ago |
staviq
|
4dcd47d71d
logs : fix mingw-like builds (fixes #2898) (#2911)
|
2 years ago |
Cebtenzzre
|
18705a30ef
llama2c : fix segfault and alloc-dealloc-mismatch (#2913)
|
2 years ago |
Kawrakow
|
e8d9158925
metal: somewhat faster f16 x f32 matrix multiply kernel (#2951)
|
2 years ago |
Cebtenzzre
|
bce1fef328
convert : fix another python 3.8 issue (#2949)
|
2 years ago |
slaren
|
528134dd02
remove convert-llama-7b-pth-to-gguf.py and convert-llama-hf-to-gguf.py (#2906)
|
2 years ago |
Kerfuffle
|
aeefac4ff7
scripts: Use local gguf package when running from repo (#2927)
|
2 years ago |
DannyDaemonic
|
e8422de39e
@vxiiduu's fix for PrefetchVirtualMemory (#2930)
|
2 years ago |
Cebtenzzre
|
92d0b751a7
convert : fix python 3.8 support, modernize type annotations (#2916)
|
2 years ago |
Johannes Gäßler
|
8afe228000
CUDA: mul_mat_q=true llama_context_params default (#2912)
|
2 years ago |
Henri Vasserman
|
71d6975559
[Docker] fix tools.sh argument passing. (#2884)
|
2 years ago |
Georgi Gerganov
|
b532a69b2f
convert.py : use dir name to name the llama
|
2 years ago |
Georgi Gerganov
|
c90d135eb4
examples : fix underscore in beam-search + .gitignore (close #2900)
|
2 years ago |
M. Yusuf Sarıgöz
|
0d1c706181
gguf : add workflow for Pypi publishing (#2896)
|
2 years ago |
alonfaraj
|
9509294420
make : add test and update CI (#2897)
|
2 years ago |
Gilad S
|
35092fb547
docs : add `node-llama-cpp` to `README.md` (#2885)
|
2 years ago |
Kerfuffle
|
dc07dc492e
convert : various script cleanups/fixes + merges and special token handling (#2842)
|
2 years ago |
chaihahaha
|
ad9ddcff6e
llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879)
|
2 years ago |
staviq
|
8341a25957
main : log file (#2748)
|
2 years ago |
Cebtenzzre
|
849408957c
tests : add a C compliance test (#2848)
|
2 years ago |
slaren
|
06abf8eeba
ggml : add view_src and view_offs to ggml_tensor for views (#2874)
|
2 years ago |
slaren
|
c03a243abf
remove outdated references to -eps and -gqa from README (#2881)
|
2 years ago |
Kawrakow
|
fa3582f509
Tell users attmepting to run perplexity with too few tokens to use more (#2882)
|
2 years ago |
Kawrakow
|
e37e69dcc3
10X faster BPE tokenizer (#2876)
|
2 years ago |
maddes8cht
|
53885d7256
py : fix "usage" messages (#2873)
|
2 years ago |
jameswu2014
|
bcce96ba4d
convert.py : fix baichuan7B support (#2870)
|
2 years ago |
Jhen-Jie Hong
|
74e0caeb82
readme : add react-native binding (#2869)
|
2 years ago |
Cebtenzzre
|
d4b5e16c32
make : fix clang tests build, add missing examples (#2859)
|
2 years ago |
Georgi Gerganov
|
3a007648f2
metal : add option to disable debug logs (close #2764)
|
2 years ago |