Russyyds d6d2c2ab8c Add performance print for gemma3 in example (#12929) 9 bulan lalu
..
android 1c641e6aac `build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809) 1 tahun lalu
CMakeLists.txt 8b9cc7cdd8 llava : introduce libmtmd (#12849) 9 bulan lalu
MobileVLM-README.md e665744317 llava : fix the script error in MobileVLM README (#9054) 1 tahun lalu
README-gemma3.md 267c1399f1 common : refactor downloading system, handle mmproj with -hf option (#12694) 9 bulan lalu
README-glmedge.md 0cec062a63 llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) 11 bulan lalu
README-granitevision.md 84d5f4bc19 Update granite vision docs for 3.2 model (#12105) 10 bulan lalu
README-minicpmo2.6.md 8352cdc87b llava : fix bug in minicpm-v code (#11513) 10 bulan lalu
README-minicpmv2.5.md 8352cdc87b llava : fix bug in minicpm-v code (#11513) 10 bulan lalu
README-minicpmv2.6.md 8352cdc87b llava : fix bug in minicpm-v code (#11513) 10 bulan lalu
README-quantize.md 1ec208083c llava: add quantization for the visual projector LLAVA, Qwen2VL (#11644) 11 bulan lalu
README.md 7a2c913e66 llava : Add Granite Vision Support (#11794) 10 bulan lalu
clip-impl.h 0c50923944 clip : use smart pointer (⚠️ breaking change) (#12869) 9 bulan lalu
clip-quantize-cli.cpp 1ec208083c llava: add quantization for the visual projector LLAVA, Qwen2VL (#11644) 11 bulan lalu
clip.cpp e59ea539b8 llava: Fix cpu-only clip image encoding sefault (#12907) 9 bulan lalu
clip.h 0c50923944 clip : use smart pointer (⚠️ breaking change) (#12869) 9 bulan lalu
convert_image_encoder_to_gguf.py e9b2f84f14 llava: add big-endian conversion for image encoder (#12218) 10 bulan lalu
gemma3-cli.cpp d6d2c2ab8c Add performance print for gemma3 in example (#12929) 9 bulan lalu
gemma3_convert_encoder_to_gguf.py 7841fc723e llama : Add Gemma 3 support (+ experimental vision capability) (#12343) 10 bulan lalu
glmedge-convert-image-encoder-to-gguf.py 0cec062a63 llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) 11 bulan lalu
glmedge-surgery.py 0cec062a63 llama : add support for GLM-Edge and GLM-Edge-V series models (#10573) 11 bulan lalu
llava-cli.cpp 0364178ca2 clip : refactor clip_init, add tests (#12757) 9 bulan lalu
llava.cpp 0c50923944 clip : use smart pointer (⚠️ breaking change) (#12869) 9 bulan lalu
llava.h 3071c0a5f2 llava : support MiniCPM-V-2.5 (#7599) 1 tahun lalu
llava_surgery.py e235b267a2 py : switch to snake_case (#8305) 1 tahun lalu
llava_surgery_v2.py 7a2c913e66 llava : Add Granite Vision Support (#11794) 10 bulan lalu
minicpmv-cli.cpp 0364178ca2 clip : refactor clip_init, add tests (#12757) 9 bulan lalu
minicpmv-convert-image-encoder-to-gguf.py 8352cdc87b llava : fix bug in minicpm-v code (#11513) 10 bulan lalu
minicpmv-surgery.py 3e3357fd77 llava : support Minicpm-omni (#11289) 1 tahun lalu
mtmd.cpp 0c50923944 clip : use smart pointer (⚠️ breaking change) (#12869) 9 bulan lalu
mtmd.h 8b9cc7cdd8 llava : introduce libmtmd (#12849) 9 bulan lalu
qwen2_vl_surgery.py 4ddd199f6f llava : Allow locally downloaded models for QwenVL (#10833) 1 tahun lalu
qwen2vl-cli.cpp 0364178ca2 clip : refactor clip_init, add tests (#12757) 9 bulan lalu
requirements.txt d3ae0ee8d7 py : fix requirements check '==' -> '~=' (#8982) 1 tahun lalu
test-1.jpeg 0364178ca2 clip : refactor clip_init, add tests (#12757) 9 bulan lalu
tests.sh 0364178ca2 clip : refactor clip_init, add tests (#12757) 9 bulan lalu

README-gemma3.md

Gemma 3 vision

[!IMPORTANT]

This is very experimental, only used for demo purpose.

Quick started

You can use pre-quantized model from ggml-org's Hugging Face account

# build
cmake -B build
cmake --build build --target llama-gemma3-cli

# alternatively, install from brew (MacOS)
brew install llama.cpp

# run it
llama-gemma3-cli -hf ggml-org/gemma-3-4b-it-GGUF
llama-gemma3-cli -hf ggml-org/gemma-3-12b-it-GGUF
llama-gemma3-cli -hf ggml-org/gemma-3-27b-it-GGUF

# note: 1B model does not support vision

How to get mmproj.gguf?

cd gemma-3-4b-it
python ../llama.cpp/examples/llava/gemma3_convert_encoder_to_gguf.py .

# output file is mmproj.gguf

How to run it?

What you need:

  • The text model GGUF, can be converted using convert_hf_to_gguf.py
  • The mmproj file from step above
  • An image file

    # build
    cmake -B build
    cmake --build build --target llama-gemma3-cli
    
    # run it
    ./build/bin/llama-gemma3-cli -m {text_model}.gguf --mmproj mmproj.gguf --image your_image.jpg