leejet
|
7d43c585dc
add some new ops, fix some operators and add batch operations to certain operators. (ggml/747)
|
1 year ago |
UEXTM.com
|
5f70671856
Introduce backend GUIDs (ggml/743)
|
1 year ago |
Kawrakow
|
0becb22ac0
IQ4_XS: a 4.25 bpw quantization (#5747)
|
1 year ago |
Kawrakow
|
a33e6a0d2a
Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (#5721)
|
1 year ago |
Georgi Gerganov
|
ab336a9d5e
code : normalize enum names (#5697)
|
1 year ago |
Kawrakow
|
4c4cb30736
IQ3_S: a much better alternative to Q3_K (#5676)
|
1 year ago |
Kawrakow
|
a14679cc30
IQ4_NL: 4-bit non-linear quants with blocks of 32 (#5590)
|
1 year ago |
Didzis Gosko
|
890559ab28
metal : option to embed MSL source into compiled binary (whisper/1842)
|
1 year ago |
Kawrakow
|
bd2d4e393b
1.5 bit quantization (#5453)
|
1 year ago |
Georgi Gerganov
|
8f1be0d42f
ggml : add ALiBi support for ggml_soft_max_ext (#5488)
|
1 year ago |
Ananta Bastola
|
6e4e973b26
ci : add an option to fail on compile warning (#3952)
|
1 year ago |
Ian Bull
|
f026f8120f
metal : use autoreleasepool to avoid memory leaks (#5437)
|
1 year ago |
Georgi Gerganov
|
efb7bdbbd0
metal : add im2col F32 dst support (#5132)
|
2 years ago |
Georgi Gerganov
|
549a1e6cd5
ci : fix yolo URLs + fix metal capture (ggml/712)
|
2 years ago |
Jack Mousseau
|
5f14ee0b0c
metal : add debug capture backend function (ggml/694)
|
2 years ago |
Kawrakow
|
f4d7e54974
SOTA 3-bit quants (#5196)
|
2 years ago |
slaren
|
fbe7dfa53c
ggml : add max buffer sizes to opencl and metal backends (#5181)
|
2 years ago |
Paul Tsochantaris
|
d2f650cb5b
metal : free metal objects (#5161)
|
2 years ago |
0cc4m
|
2307523d32
ggml : add Vulkan backend (#2059)
|
2 years ago |
Paul Tsochantaris
|
6dd3c28c9c
metal : remove unused `n_buffers` and `buffers` (#5129)
|
2 years ago |
Georgi Gerganov
|
ddc5a5033f
metal : show compile log messages
|
2 years ago |
Georgi Gerganov
|
26d607608d
metal : disable support for MUL_MAT F32 x F16
|
2 years ago |
Paul Tsochantaris
|
1e605f4102
metal : fix memory leak, dangling pointer and unused autorel (#5007)
|
2 years ago |
Georgi Gerganov
|
c918fe8dca
metal : create autorelease pool during library build (#4970)
|
2 years ago |
Paul Tsochantaris
|
7563293665
metal : remove unnecessary nil check (#4986)
|
2 years ago |
Paul Tsochantaris
|
158f8c9e21
metal : localized logic in `ggml_metal_graph_compute` (#4924)
|
2 years ago |
Alex Azarov
|
3a48d558a6
metal : replace loop of dispatch_async with dispatch_apply (#4934)
|
2 years ago |
Alex Azarov
|
7c8d3abd1a
metal : log `recommendedMaxWorkingSetSize` on iOS 16+ (#4936)
|
2 years ago |
Justine Tunney
|
a0b3ac8c48
ggml : introduce GGML_CALL function annotation (#4850)
|
2 years ago |
Alex Azarov
|
5f5fe1bd60
metal : correctly set SIMD support flags on iOS (#4923)
|
2 years ago |