Martin Delille
|
5dcdf94676
Fix conan badge display [no ci] (#7645)
|
1 year ago |
Manuel
|
2e2340de17
Add brew installation instruction to README [no ci] (#7616)
|
1 year ago |
Martin Delille
|
7846540bd2
readme : add Conan badge (#7638)
|
1 year ago |
Brian
|
e6157f94c8
github: add contact links to issues and convert question into research [no ci] (#7612)
|
1 year ago |
Galunid
|
9c4c9cc83f
Move convert.py to examples/convert-legacy-llama.py (#7430)
|
1 year ago |
Chris Elrod
|
59b0d07766
faster avx512 exp implementation (#7551)
|
1 year ago |
junchao-loongson
|
d5c05821f3
ggml : fix loongarch build (O2 issue) (#7636)
|
1 year ago |
Johannes Gäßler
|
972b555ab9
README: explain parallel build [no ci] (#7618)
|
1 year ago |
Meng, Hengyu
|
3854c9d07f
[SYCL] fix intel docker (#7630)
|
1 year ago |
Galunid
|
eb57fee51f
gguf-py : Add tokenizer.ggml.pre to gguf-new-metadata.py (#7627)
|
1 year ago |
Georgi Gerganov
|
55d62262a9
metal : remove invalid asserts (#7617)
|
1 year ago |
Georgi Gerganov
|
975ec63ff2
metal : add missing asserts (#7617)
|
1 year ago |
Georgi Gerganov
|
fb76ec31a9
ggml : fix YARN + add tests + add asserts (#7617)
|
1 year ago |
Georgi Gerganov
|
cce3dcffc5
cuda : non-cont concat support (#7610)
|
1 year ago |
Radoslav Gerganov
|
210d99173d
llama-bench : add support for the RPC backend (#7435)
|
1 year ago |
slaren
|
87bdf2a199
ggml : use atomic_flag for critical section (#7598)
|
1 year ago |
Georgi Gerganov
|
00281b7be3
scripts : remove mpi remnants
|
1 year ago |
Georgi Gerganov
|
2ab977282b
sync : ggml
|
1 year ago |
Georgi Gerganov
|
72de268bec
ggml : restore ggml_rope_xpos_inplace (ggml/0)
|
1 year ago |
Akarshan Biswas
|
0e8d8bfd6c
Add Arc A750 and Arch linux to readme-sycl.md as verified GPU model and Linux distro (#7605)
|
1 year ago |
zhouwg
|
504f0c340f
ggml : fix typo in ggml.c (#7603)
|
1 year ago |
Meng, Hengyu
|
b864b50ce5
[SYCL] Align GEMM dispatch (#7566)
|
1 year ago |
jaime-m-p
|
02c1ecad07
Tokenizer WPM fixes (#7500)
|
1 year ago |
Georgi Gerganov
|
6bd12ce409
sycl : fix assert (#7563)
|
1 year ago |
Giuseppe Scrivano
|
5442939fcc
llama : support small Granite models (#7481)
|
1 year ago |
k.h.lai
|
56411a950f
vulkan: properly initialize vulkan devices for LLAMA_SPLIT_MODE_NONE (#7552)
|
1 year ago |
Radoslav Gerganov
|
2b737caae1
rpc : resource management rework (#7562)
|
1 year ago |
fairydreaming
|
ee3dff6b8e
Add support for DeepseekV2ForCausalLM (#7519)
|
1 year ago |
Georgi Gerganov
|
edc29433fa
tests : fix test-tokenizer-0.sh
|
1 year ago |
Georgi Gerganov
|
8b99e2aa66
llama : handle unknown utf8 bytes (#7588)
|
1 year ago |