Bizhao Shi 2d38b6e400 CANN: Add the basic supports of Flash Attention kernel (#13627) 7 months ago
backend 2d38b6e400 CANN: Add the basic supports of Flash Attention kernel (#13627) 7 months ago
development 1d36b3670b llama : move end-user examples to tools directory (#13249) 8 months ago
multimodal 9b61acf060 mtmd : rename llava directory to mtmd (#13311) 8 months ago
android.md 68ff663a04 repo : update links to new url (#11886) 11 months ago
build.md 84778e9770 CUDA/HIP: Share the same unified memory allocation logic. (#12934) 9 months ago
docker.md 33983057d0 musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (#13647) 8 months ago
function-calling.md f5cd27b71d `server`: streaming of tool calls and thoughts when `--jinja` is on (#12379) 7 months ago
install.md 18b663d8e4 install : add macports (#12518) 10 months ago
llguidance.md 89daa2564f llguidance build fixes for Windows (#11664) 11 months ago
multimodal.md 40aaa8a403 mtmd : add support for Qwen2-Audio and SeaLLM-Audio (#13760) 7 months ago