Anton Mitkov
|
bdca38376f
sycl: Hotfix for non dnnl codepath (#14677)
|
6 months ago |
shalinib-ibm
|
55c509daf5
ggml : refactor llamafile_sgemm PPC code (#14673)
|
6 months ago |
Aman Gupta
|
9c9e4fc635
llama-context: add ability to get logits (#14672)
|
6 months ago |
Johannes Gäßler
|
494c5899cb
scripts: benchmark for HTTP server throughput (#14668)
|
6 months ago |
Akarshan Biswas
|
0f4c6ec0f1
SYCL: use 1D kernel for set_rows (#14618)
|
6 months ago |
Anton Mitkov
|
65a3ebb0aa
sycl: Batched mulmat rework for oneDNN dispatch (#14617)
|
6 months ago |
Molly Sophia
|
0d9226763c
llama : add jinja template for rwkv-world (#14665)
|
6 months ago |
Ed Addario
|
982e347255
quantize : fix minor logic flaw in --tensor-type (#14572)
|
6 months ago |
Sigbjørn Skjæret
|
923e3ea2e3
cuda : add set rows for bf16 (#14664)
|
6 months ago |
Yavor Ivanov
|
e743cddb60
cuda : add ELU support (#14657)
|
6 months ago |
Georgi Gerganov
|
05fec5bd29
ggml : add build-time message to remind about ggml_set_rows (#14661)
|
6 months ago |
Yavor Ivanov
|
dcf7f2ea3c
metal : Add missing unary ops Metal support (#14660)
|
6 months ago |
Yavor Ivanov
|
84b396e051
cmake : Add CMake presets for Linux and GCC (#14656)
|
6 months ago |
Tarek Dakhran
|
c31e60647d
tests : cover lfm2 cases in test_ssm_conv (#14651)
|
6 months ago |
Tarek Dakhran
|
67eade1bf9
docs : add LFM2 to models section (#14650)
|
6 months ago |
Aman Gupta
|
7de5c7cab6
CUDA: add set rows for f32 and f16 (#14551)
|
6 months ago |
Georgi Gerganov
|
8eff95544e
sync : ggml
|
6 months ago |
Georgi Gerganov
|
3120413ccd
vulkan : remove unused vars (#0)
|
6 months ago |
Georgi Gerganov
|
215535701d
sync : ggml
|
6 months ago |
Acly
|
74bb294591
vulkan : implement bilinear interpolation (ggml/1291)
|
6 months ago |
Acly
|
3e303b1107
vulkan : implement ggml_roll (ggml/1290)
|
6 months ago |
Douglas Hanley
|
0c1df14b5f
server : fix pooled embedding output (#14645)
|
6 months ago |
Jeff Bolz
|
b3ad3a0191
vulkan: support SET_ROWS (#14587)
|
6 months ago |
Jeff Bolz
|
98197e5c98
vulkan: optimizations for deepseek prompt processing (#14555)
|
6 months ago |
Tarek Dakhran
|
f5e96b368f
model : support LiquidAI LFM2 hybrid family (#14620)
|
6 months ago |
Slobodan Josic
|
756aa1020a
HIP : Add HIP 7.0+ compatibility for hipBLAS compute types (#14634)
|
6 months ago |
Georgi Gerganov
|
aaa088d87f
readme : add hot PRs (#14636)
|
6 months ago |
Georgi Gerganov
|
0d5375d54b
llama : move enum llama_vocab_pre_type to implementation (#14631)
|
6 months ago |
Dowon
|
576c82eda2
vocab : add midm-2.0 model pre-tokenizer (#14626)
|
6 months ago |
Gabe Goodhart
|
0aedae00e6
model : Granite Four (#13550)
|
6 months ago |