cturan/llama.cpp @ ae803bfc3d0fc2d0d3e1cce22ee103a30939e104

Jared Van Bortel 2f567611c0 llama-model : support Qwen2 embedding models and pooling_mode_lasttoken (#13245)		9 months ago
..
scripts	aff9d107b0 gguf-py : GGUF Editor GUI - Python + Qt6 (#12930)	9 months ago
__init__.py	672a6f1018 convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499)	1 year ago
constants.py	2f567611c0 llama-model : support Qwen2 embedding models and pooling_mode_lasttoken (#13245)	9 months ago
gguf.py	34b0a08207 gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2 years ago
gguf_reader.py	69050a11be Refactor gguf scripts to improve metadata handling (#11909)	11 months ago
gguf_writer.py	074e42ab31 convert : converting mmproj for Qwen2/2.5VL from convert_hf_to_gguf (#13209)	9 months ago
lazy.py	a226bc7a9a gguf-py : support lazy tensor splitting (#12809)	10 months ago
metadata.py	06c2b1561d convert : fix Norway problem when parsing YAML (#12114)	11 months ago
py.typed	dc07dc492e convert : various script cleanups/fixes + merges and special token handling (#2842)	2 years ago
quants.py	9bc6db28d0 ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)	1 year ago
tensor_mapping.py	074e42ab31 convert : converting mmproj for Qwen2/2.5VL from convert_hf_to_gguf (#13209)	9 months ago
utility.py	64eda5deb9 convert : ability to lazy-load safetensors remotely without downloading to disk (#12820)	10 months ago
vocab.py	a686171ea7 convert : Support chat_template.json (#12460)	10 months ago