Jeff Bolz
|
f095a649ec
vulkan: get the first command buffer submitted sooner (#10499)
|
1 year ago |
Ting Lou
|
678d7994f4
llava: return false instead of exit (#10546)
|
1 year ago |
Georgi Gerganov
|
dc22344088
ggml : remove redundant copyright notice + update authors
|
1 year ago |
Georgi Gerganov
|
4c0a95b107
llama : add missing model types
|
1 year ago |
Xuan Son Nguyen
|
6c59567689
server : (tests) don't use thread for capturing stdout/stderr, bump openai client library (#10568)
|
1 year ago |
Johannes Gäßler
|
890719311b
common: fix warning message when no GPU found (#10564)
|
1 year ago |
Random Fly
|
7281cf13ad
docs: fix outdated usage of llama-simple (#10565)
|
1 year ago |
Diego Devesa
|
e90688edd0
ci : fix tag name in cuda and hip releases (#10566)
|
1 year ago |
Georgi Gerganov
|
76b27d29c2
ggml : fix row condition for i8mm kernels (#10561)
|
1 year ago |
Georgi Gerganov
|
eea986f215
cmake : fix ARM feature detection (#10543)
|
1 year ago |
Shupei Fan
|
c202cef168
ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541)
|
1 year ago |
Sergio López
|
2025fa67e9
kompute : improve backend to pass test_backend_ops (#10542)
|
1 year ago |
Ruixin Huang
|
c6bc73951e
CANN: Update cann.md to display correctly in CLion (#10538)
|
1 year ago |
leo-pony
|
605fa66c50
CANN: Fix SOC_TYPE compile bug (#10519)
|
1 year ago |
Chenguang Li
|
b7420131bf
CANN: ROPE operator optimization (#10540)
|
1 year ago |
Xuan Son Nguyen
|
9f912511bc
common : fix duplicated file name with hf_repo and hf_file (#10550)
|
1 year ago |
uvos
|
3ad5451f3b
Add some minimal optimizations for CDNA (#10498)
|
1 year ago |
Diego Devesa
|
46c69e0e75
ci : faster CUDA toolkit installation method and use ccache (#10537)
|
1 year ago |
Georgi Gerganov
|
9e2301f4a4
metal : fix group_norm support condition (#0)
|
1 year ago |
Georgi Gerganov
|
fee824a1a1
sync : ggml
|
1 year ago |
Frankie Robertson
|
9150f8fef9
Do not include arm_neon.h when compiling CUDA code (ggml/1028)
|
1 year ago |
Jeff Bolz
|
c31ed2abfc
vulkan: define all quant data structures in types.comp (#10440)
|
1 year ago |
Jeff Bolz
|
5b3466bedf
vulkan: Handle GPUs with less shared memory (#10468)
|
1 year ago |
Jeff Bolz
|
249a7902ec
vulkan: further optimize q5_k mul_mat_vec (#10479)
|
1 year ago |
Jeff Bolz
|
71a64989a5
vulkan: skip integer div/mod in get_offsets for batch_idx==0 (#10506)
|
1 year ago |
Jeff Bolz
|
4a57d362e1
vulkan: optimize Q2_K and Q3_K mul_mat_vec (#10459)
|
1 year ago |
Diego Devesa
|
c9b00a70b0
ci : fix cuda releases (#10532)
|
1 year ago |
Shane A
|
de5097351c
Add OLMo 2 model in docs (#10530)
|
1 year ago |
Diego Devesa
|
5a349f2809
ci : remove nix workflows (#10526)
|
1 year ago |
Diego Devesa
|
30ec398321
llama : disable warnings for 3rd party sha1 dependency (#10527)
|
1 year ago |