cturan/llama.cpp

Author	SHA1 Message	Date
Diego Devesa	d5c63cd7f9 test-backend-ops : add option -p to filter by op params (#12155)	10 months ago
ag2s20150909	9660ffef58 ggml : fix kleidiai build (#12159)	10 months ago
Eric Curtin	c950a1f692 Adding UTF-8 support to llama.cpp (#12111)	10 months ago
Xuan-Son Nguyen	7b69003af7 webui : add ?m=... and ?q=... params (#12148)	10 months ago
Akarshan Biswas	ece9745bb8 SYCL: Move CPY kernels to a separate file and add few missing kernels (#12133)	10 months ago
Diego Devesa	cc473cac7c ggml-backend : keep paths in native string type when possible (#12144)	10 months ago
Sigbjørn Skjæret	14dec0c2f2 main: use jinja chat template system prompt by default (#12118)	10 months ago
Sigbjørn Skjæret	1782cdfed6 main: update outdated system prompt message (followup to #12131) (#12132)	11 months ago
Sigbjørn Skjæret	45a8e76745 common : add --system-prompt parameter, replace behavior of -p in conversation mode (#12131)	11 months ago
Erik Scholz	80c41ddd8f CUDA: compress mode option and default to size (#12029)	11 months ago
Vivian	2cc4a5e44a webui : minor typo fixes (#12116)	11 months ago
Xuan-Son Nguyen	06c2b1561d convert : fix Norway problem when parsing YAML (#12114)	11 months ago
William Tambellini	70680c48e5 ggml : upgrade init_tensor API to return a ggml_status (#11854)	11 months ago
Xuan-Son Nguyen	c43a3e7996 llama : add Phi-4-mini support (supersede #12099) (#12108)	11 months ago
Alex Brooks	84d5f4bc19 Update granite vision docs for 3.2 model (#12105)	11 months ago
Rémy O	438a83926a vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (#11595)	11 months ago
Johannes Gäßler	9c42b1718c CUDA: fix logic for V100 + GGML_CUDA_FORCE_MMQ (#12098)	11 months ago
Prashant Vithule	05e6f5aad0 ggml: aarch64: implement SVE kernels for q2_k_q8_k vector dot (#12064)	11 months ago
hipudding	673cfef9aa CANN: Fix build error with GCC 13 (#11990)	11 months ago
Eve	fbeda9002d vulkan: matmul dequantization improvements (#12015)	11 months ago
Daniele	581650b7ca vulkan: improve im2col (#11826)	11 months ago
Vladimir Vuksanovic	b95c8af37c cmake: Fix ggml backend dependencies and installation (#11818)	11 months ago
Ting Lou	a800ae46da llava : add struct for FFI bindgen (#12079)	11 months ago
Sigbjørn Skjæret	69050a11be Refactor gguf scripts to improve metadata handling (#11909)	11 months ago
Aleksei Nikiforov	3567ee3a94 gguf-py: enable reading non-native endian files (#12081)	11 months ago
Kante Yin	53e4db1012 readme : update infra list (#9096)	11 months ago
Olivier Chafik	d7cfe1ffe0 docs: add docs/function-calling.md to lighten server/README.md's plight (#12069)	11 months ago
Jeff Bolz	a82c9e7c23 vulkan: fix assertion when qy_needs_dequant (#12068)	11 months ago
rhjdvsgsgks	401af80b54 server: handle echo=false on /v1/completions (#12060)	11 months ago
Judd	c132239bfb add OP sigmoid (#12056)	11 months ago

Newer Older

Commit History Find

Commit History