Nuno
|
d7d1eccacc
docker: allow installing pip packages system-wide (#11437)
|
11 months ago |
someone13574
|
4bf3119d61
cmake : don't fail on `GGML_CPU=OFF` (#11457)
|
11 months ago |
Nuno
|
f643120bad
docker: add perplexity and bench commands to full image (#11438)
|
11 months ago |
Akarshan Biswas
|
6e84b0ab8e
SYCL : SOFTMAX F16 mask support and other fixes (#11261)
|
11 months ago |
Michael Engel
|
2b8525d5c8
Handle missing model in CLI parameters for llama-run (#11399)
|
11 months ago |
Eric Curtin
|
a4417ddda9
Add new hf protocol for ollama (#11449)
|
11 months ago |
Haus1
|
d6d24cd9ed
AMD: parse the architecture as supplied by gcnArchName (#11244)
|
11 months ago |
lexasub
|
a5203b4465
llama : minor fixes for up llama load model speed (#11448)
|
11 months ago |
Johannes Gäßler
|
df984e0147
llama: refactor llama_decode_impl (#11381)
|
11 months ago |
Ihar Hrachyshka
|
acd38efee3
metal: Handle null returned from MTLCreateSystemDefaultDevice() (#11441)
|
11 months ago |
Xuan Son Nguyen
|
caf773f249
docker : fix ARM build and Vulkan build (#11434)
|
1 year ago |
Georgi Gerganov
|
178a7eb952
metal : use residency sets (#11427)
|
1 year ago |
Nuno
|
6f53d8a6b4
docker: add missing vulkan library to base layer and update to 24.04 (#11422)
|
1 year ago |
bandoti
|
19f65187cb
cmake: add ggml find package (#11369)
|
1 year ago |
Frank Mai
|
1d8ee06000
rpc: fix register position (#11424)
|
1 year ago |
Georgi Gerganov
|
2cc9b8c32c
readme : update hot topics
|
1 year ago |
Jeff Bolz
|
f35726c2fb
build: apply MSVC /bigobj option to c/cpp files only (#11423)
|
1 year ago |
Jeff Bolz
|
4a75d19376
vulkan: compile shaders on-demand (#11406)
|
1 year ago |
uvos
|
26771a1491
Hip: disable VMM on hip as it seams that it dosent work in some configurations (#11420)
|
1 year ago |
Jeff Bolz
|
ca6baf76c1
build: add /bigobj to MSVC build (#11407)
|
1 year ago |
Diego Devesa
|
6e264a905b
docker : add GGML_CPU_ARM_ARCH arg to select ARM architecture to build for (#11419)
|
1 year ago |
Xuan Son Nguyen
|
49b0e3cec4
server : fix cleaning up stream task (#11418)
|
1 year ago |
Diego Devesa
|
20a758155b
docker : fix CPU ARM build (#11403)
|
1 year ago |
Georgi Gerganov
|
00c24acb2a
ci : fix line breaks on windows builds (#11409)
|
1 year ago |
jiahao su
|
466ea66f33
CANN: Add Ascend CANN build ci (#10217)
|
1 year ago |
uvos
|
5f0db9522f
hip : Add hipGraph and VMM support to ROCM (#11362)
|
1 year ago |
Johannes Gäßler
|
c5d9effb49
CUDA: fix FP16 cuBLAS GEMM (#11396)
|
1 year ago |
uvos
|
9fbadaef4f
rocBLAS: Avoid fp32->fp16->fp32 conversion on cdna (#11356)
|
1 year ago |
Georgi Gerganov
|
9755129c27
release : pack /lib in the packages (#11392)
|
1 year ago |
Jafar Uruç
|
a07c2c8a52
docs : Update readme to build targets for local docker build (#11368)
|
1 year ago |