Xuan-Son Nguyen
|
1bb4f43380
mtmd : support home-cooked Mistral Small Omni (#14928)
|
hace 3 meses |
Gabe Goodhart
|
ca71fb9b36
model : Granite docling + Idefics3 preprocessing (SmolVLM) (#16206)
|
hace 3 meses |
Aleksei Nikiforov
|
cc1cfa277b
mtmd : fix uninitialized variable in bicubic_resize (#16275)
|
hace 3 meses |
Diego Devesa
|
50f4281a6f
llama : allow using iGPUs with --device (#15951)
|
hace 4 meses |
Xuan-Son Nguyen
|
79a546220c
mtmd : support Kimi VL model (#15458)
|
hace 4 meses |
tc-mb
|
c4e9239064
model : support MiniCPM-V 4.5 (#15575)
|
hace 4 meses |
Tarek Dakhran
|
e288693669
readme : model : mtdm : lfm2 improvements (#15476)
|
hace 4 meses |
Michael Giba
|
b108e42904
ci : fix -Werror=return-type in clip.cpp so ci/run.sh can run without issue (#15221)
|
hace 4 meses |
Xuan-Son Nguyen
|
f08c4c0d8d
mtmd : clean up clip_n_output_tokens (#15391)
|
hace 5 meses |
Sigbjørn Skjæret
|
baa9255a45
llama : merge conts and reshapes and remove unnecessary cont (#15380)
|
hace 5 meses |
Tarek Dakhran
|
65349f26f2
model : support vision LiquidAI LFM2-VL family (#15347)
|
hace 5 meses |
rainred
|
cf9e5648a7
mtmd : Fix MinicpmV model converter and clip to avoid using hardcode. (#14750)
|
hace 5 meses |
tc-mb
|
952a47f455
mtmd : support MiniCPM-V 4.0 (#14983)
|
hace 5 meses |
Xuan-Son Nguyen
|
00fa15fedc
mtmd : add support for Voxtral (#14862)
|
hace 5 meses |
kiwi
|
749e0d27f0
mtmd : fix 32-bit narrowing issue in export-lora and mtmd clip (#14503)
|
hace 5 meses |
stduhpf
|
c8ade30036
Mtmd: add a way to select device for vision encoder (#14236)
|
hace 5 meses |
Sigbjørn Skjæret
|
28657a8229
ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)
|
hace 6 meses |
yuiseki
|
5d5c066de8
mtmd : fix Pixtral OOM with large images by capping image_size to 1024 (#14326)
|
hace 6 meses |
Xuan-Son Nguyen
|
413977de32
mtmd : refactor llava-uhd preprocessing logic (#14247)
|
hace 7 meses |
Xuan-Son Nguyen
|
10961339b2
mtmd : move helpers to dedicated library (⚠️ breaking change) (#13866)
|
hace 7 meses |
Xuan-Son Nguyen
|
bc583e3c63
mtmd : support Qwen 2.5 Omni (input audio+vision, no audio output) (#13784)
|
hace 7 meses |
Xuan-Son Nguyen
|
40aaa8a403
mtmd : add support for Qwen2-Audio and SeaLLM-Audio (#13760)
|
hace 7 meses |
Xuan-Son Nguyen
|
797990c4bc
mtmd : add ultravox audio input (#13623)
|
hace 7 meses |
Xuan-Son Nguyen
|
92ecdcc06a
mtmd : add vision support for llama 4 (#13282)
|
hace 8 meses |
Xuan-Son Nguyen
|
71bdbdb587
clip : clip.h become private API (⚠️ breaking change) (#13510)
|
hace 8 meses |
Xuan-Son Nguyen
|
b4726345ac
mtmd : remove libllava, remove clip-quantize-cli (⚠️ breaking change) (#13460)
|
hace 8 meses |
Xuan-Son Nguyen
|
de4c07f937
clip : cap max image size 1024 for qwen vl model (#13478)
|
hace 8 meses |
City
|
c104023994
mtmd : Use RMS norm for InternVL 3 38B and 78B mmproj (#13459)
|
hace 8 meses |
David Huang
|
7f323a589f
Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)
|
hace 8 meses |
City
|
3eac209319
mtmd : support InternVL 3 38B and 78B mmproj (#13443)
|
hace 8 meses |