Georgi Gerganov 803f8baf4f llama : deprecate explicit kv_self defrag/update calls (#13921) hace 7 meses
..
CMakeLists.txt 7cc2d2c889 ggml : move AMX to the CPU backend (#10570) hace 1 año
README.md 68ff663a04 repo : update links to new url (#11886) hace 11 meses
passkey.cpp 803f8baf4f llama : deprecate explicit kv_self defrag/update calls (#13921) hace 7 meses

README.md

llama.cpp/example/passkey

A passkey retrieval task is an evaluation method used to measure a language models ability to recall information from long contexts.

See the following PRs for more info:

Usage

make -j && ./llama-passkey -m ./models/llama-7b-v2/ggml-model-f16.gguf --junk 250