Johannes Gäßler 5fa07c2f93 CUDA: optimize FA for GQA + large batches (#12014) 10 months ago
.gitignore 2c4f566c88 tests : gitignore ggml-common.h 1 year ago
CMakeLists.txt ff227703d6 sampling : support for llguidance grammars (#10224) 11 months ago
get-model.cpp 413e7b0559 ci : add model tests + script wrapper (#4586) 2 years ago
get-model.h 413e7b0559 ci : add model tests + script wrapper (#4586) 2 years ago
run-json-schema-to-grammar.mjs a71d81cf8c server : revamp chat UI with vuejs and daisyui (#10175) 1 year ago
test-arg-parser.cpp d9d54e498d speculative : refactor and add a simpler example (#10362) 1 year ago
test-autorelease.cpp afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
test-backend-ops.cpp 5fa07c2f93 CUDA: optimize FA for GQA + large batches (#12014) 10 months ago
test-barrier.cpp 9f40989351 ggml : move CPU backend to a separate file (#10144) 1 year ago
test-c.c fbf1ddec69 Nomic Vulkan backend (#4456) 1 year ago
test-chat-template.cpp 63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900) 11 months ago
test-chat.cpp 63e489c025 tool-call: refactor common chat / tool-call api (+ tests / fixes) (#11900) 11 months ago
test-double-float.cpp 370b1f7e7a ggml : minor naming changes (#8433) 1 year ago
test-gguf.cpp fef0cbeadf cleanup: fix compile warnings associated with gnu_printf (#11811) 11 months ago
test-grammar-integration.cpp ff227703d6 sampling : support for llguidance grammars (#10224) 11 months ago
test-grammar-llguidance.cpp ff227703d6 sampling : support for llguidance grammars (#10224) 11 months ago
test-grammar-parser.cpp df270ef745 llama : refactor sampling v2 (#9294) 1 year ago
test-json-schema-to-grammar.cpp ff227703d6 sampling : support for llguidance grammars (#10224) 11 months ago
test-llama-grammar.cpp 5cab3e4aaa llama : minor grammar refactor (#10897) 1 year ago
test-log.cpp 7eee341bee common : use common_ prefix for common library functions (#9805) 1 year ago
test-lora-conversion-inference.sh f11cfdfd7f ci : use -no-cnv in gguf-split tests (#11254) 1 year ago
test-model-load-cancel.cpp 47182dd03f llama : update llama_model API names (#11063) 1 year ago
test-opt.cpp 24203e9dd7 ggml : inttypes.h -> cinttypes (#0) 1 year ago
test-quantize-fns.cpp f6d12e7df8 tests : fix compile warning 1 year ago
test-quantize-perf.cpp 24203e9dd7 ggml : inttypes.h -> cinttypes (#0) 1 year ago
test-rope.cpp ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361) 1 year ago
test-sampling.cpp 27e8a23300 sampling: add Top-nσ sampler (#11223) 11 months ago
test-tokenizer-0.cpp afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
test-tokenizer-0.py 6fbd432211 py : logging and flake8 suppression refactoring (#7081) 1 year ago
test-tokenizer-0.sh edc29433fa tests : fix test-tokenizer-0.sh 1 year ago
test-tokenizer-1-bpe.cpp afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
test-tokenizer-1-spm.cpp afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago
test-tokenizer-random.py afa8a9ec9b llama : add `llama_vocab`, functions -> methods, naming (#11110) 1 year ago