Bizhao Shi 2d38b6e400 CANN: Add the basic supports of Flash Attention kernel (#13627) il y a 7 mois
..
backend 2d38b6e400 CANN: Add the basic supports of Flash Attention kernel (#13627) il y a 7 mois
development 1d36b3670b llama : move end-user examples to tools directory (#13249) il y a 8 mois
multimodal 9b61acf060 mtmd : rename llava directory to mtmd (#13311) il y a 8 mois
android.md 68ff663a04 repo : update links to new url (#11886) il y a 11 mois
build.md 84778e9770 CUDA/HIP: Share the same unified memory allocation logic. (#12934) il y a 9 mois
docker.md 33983057d0 musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (#13647) il y a 8 mois
function-calling.md f5cd27b71d `server`: streaming of tool calls and thoughts when `--jinja` is on (#12379) il y a 7 mois
install.md 18b663d8e4 install : add macports (#12518) il y a 10 mois
llguidance.md 89daa2564f llguidance build fixes for Windows (#11664) il y a 11 mois
multimodal.md 40aaa8a403 mtmd : add support for Qwen2-Audio and SeaLLM-Audio (#13760) il y a 7 mois