cturan/llama.cpp @ a28e0d5eb18c18e6a4598286158f427269b1444e

Johannes Gäßler 5fa07c2f93 CUDA: optimize FA for GQA + large batches (#12014)	10 months ago
.gitignore	2c4f566c88 tests : gitignore ggml-common.h	1 year ago
CMakeLists.txt	ff227703d6 sampling : support for llguidance grammars (#10224)	11 months ago
get-model.cpp	413e7b0559 ci : add model tests + script wrapper (#4586)	2 years ago
get-model.h	413e7b0559 ci : add model tests + script wrapper (#4586)	2 years ago
run-json-schema-to-grammar.mjs	a71d81cf8c server : revamp chat UI with vuejs and daisyui (#10175)	1 year ago
test-arg-parser.cpp	d9d54e498d speculative : refactor and add a simpler example (#10362)	1 year ago
test-autorelease.cpp	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
test-backend-ops.cpp	5fa07c2f93 CUDA: optimize FA for GQA + large batches (#12014)	10 months ago
test-barrier.cpp	9f40989351 ggml : move CPU backend to a separate file (#10144)	1 year ago
test-c.c	fbf1ddec69 Nomic Vulkan backend (#4456)	1 year ago
test-chat-template.cpp	63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)	11 months ago
test-chat.cpp	63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900)	11 months ago
test-double-float.cpp	370b1f7e7a ggml : minor naming changes (#8433)	1 year ago
test-gguf.cpp	fef0cbeadf cleanup: fix compile warnings associated with gnu_printf (#11811)	11 months ago
test-grammar-integration.cpp	ff227703d6 sampling : support for llguidance grammars (#10224)	11 months ago
test-grammar-llguidance.cpp	ff227703d6 sampling : support for llguidance grammars (#10224)	11 months ago
test-grammar-parser.cpp	df270ef745 llama : refactor sampling v2 (#9294)	1 year ago
test-json-schema-to-grammar.cpp	ff227703d6 sampling : support for llguidance grammars (#10224)	11 months ago
test-llama-grammar.cpp	5cab3e4aaa llama : minor grammar refactor (#10897)	1 year ago
test-log.cpp	7eee341bee common : use common_ prefix for common library functions (#9805)	1 year ago
test-lora-conversion-inference.sh	f11cfdfd7f ci : use -no-cnv in gguf-split tests (#11254)	1 year ago
test-model-load-cancel.cpp	47182dd03f llama : update llama_model API names (#11063)	1 year ago
test-opt.cpp	24203e9dd7 ggml : inttypes.h -> cinttypes (#0)	1 year ago
test-quantize-fns.cpp	f6d12e7df8 tests : fix compile warning	1 year ago
test-quantize-perf.cpp	24203e9dd7 ggml : inttypes.h -> cinttypes (#0)	1 year ago
test-rope.cpp	ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361)	1 year ago
test-sampling.cpp	27e8a23300 sampling: add Top-nσ sampler (#11223)	11 months ago
test-tokenizer-0.cpp	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
test-tokenizer-0.py	6fbd432211 py : logging and flake8 suppression refactoring (#7081)	1 year ago
test-tokenizer-0.sh	edc29433fa tests : fix test-tokenizer-0.sh	1 year ago
test-tokenizer-1-bpe.cpp	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
test-tokenizer-1-spm.cpp	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago
test-tokenizer-random.py	afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110)	1 year ago