Xuan-Son Nguyen
|
4b13a684c5
mtmd: fix patch_size initialized to random value in audio models (#17128)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
4882f0ff78
clip: implement minicpm-v sinusoidal embd using GGML (#17036)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
92bb84f775
mtmd: allow QwenVL to process larger image by default (#17020)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
2f0c2db43e
mtmd: improve struct initialization (#16981)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
070ff4d535
mtmd: add --image-min/max-tokens (#16921)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
bf7b0c9725
mtmd: pad mask for qwen2.5vl (#16954)
|
2 miesięcy temu |
Zhiyong Wang
|
6b9a52422b
model: add Janus Pro for image understanding (#16906)
|
2 miesięcy temu |
Georgi Gerganov
|
2f966b8ed8
clip : use FA (#16837)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
cf659bbb8e
mtmd: refactor preprocessing + support max/min pixels (#16878)
|
2 miesięcy temu |
JJJYmmm
|
d261223d24
model: add support for qwen3vl series (#16780)
|
2 miesięcy temu |
Tianyue-Zhao
|
bacddc049a
model: Add support for CogVLM model (#15002)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
e1ab084803
mtmd : fix idefics3 preprocessing (#16806)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
c55d53acec
model : add LightOnOCR-1B model (#16764)
|
2 miesięcy temu |
Xuan-Son Nguyen
|
1bb4f43380
mtmd : support home-cooked Mistral Small Omni (#14928)
|
3 miesięcy temu |
Gabe Goodhart
|
ca71fb9b36
model : Granite docling + Idefics3 preprocessing (SmolVLM) (#16206)
|
3 miesięcy temu |
Aleksei Nikiforov
|
cc1cfa277b
mtmd : fix uninitialized variable in bicubic_resize (#16275)
|
3 miesięcy temu |
Diego Devesa
|
50f4281a6f
llama : allow using iGPUs with --device (#15951)
|
4 miesięcy temu |
Xuan-Son Nguyen
|
79a546220c
mtmd : support Kimi VL model (#15458)
|
4 miesięcy temu |
tc-mb
|
c4e9239064
model : support MiniCPM-V 4.5 (#15575)
|
4 miesięcy temu |
Tarek Dakhran
|
e288693669
readme : model : mtdm : lfm2 improvements (#15476)
|
4 miesięcy temu |
Michael Giba
|
b108e42904
ci : fix -Werror=return-type in clip.cpp so ci/run.sh can run without issue (#15221)
|
5 miesięcy temu |
Xuan-Son Nguyen
|
f08c4c0d8d
mtmd : clean up clip_n_output_tokens (#15391)
|
5 miesięcy temu |
Sigbjørn Skjæret
|
baa9255a45
llama : merge conts and reshapes and remove unnecessary cont (#15380)
|
5 miesięcy temu |
Tarek Dakhran
|
65349f26f2
model : support vision LiquidAI LFM2-VL family (#15347)
|
5 miesięcy temu |
rainred
|
cf9e5648a7
mtmd : Fix MinicpmV model converter and clip to avoid using hardcode. (#14750)
|
5 miesięcy temu |
tc-mb
|
952a47f455
mtmd : support MiniCPM-V 4.0 (#14983)
|
5 miesięcy temu |
Xuan-Son Nguyen
|
00fa15fedc
mtmd : add support for Voxtral (#14862)
|
5 miesięcy temu |
kiwi
|
749e0d27f0
mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503)
|
5 miesięcy temu |
stduhpf
|
c8ade30036
Mtmd: add a way to select device for vision encoder (#14236)
|
5 miesięcy temu |
Sigbjørn Skjæret
|
28657a8229
ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)
|
6 miesięcy temu |