| .. |
|
backend
|
2d38b6e400
CANN: Add the basic supports of Flash Attention kernel (#13627)
|
7 月之前 |
|
development
|
1d36b3670b
llama : move end-user examples to tools directory (#13249)
|
8 月之前 |
|
multimodal
|
9b61acf060
mtmd : rename llava directory to mtmd (#13311)
|
8 月之前 |
|
android.md
|
68ff663a04
repo : update links to new url (#11886)
|
11 月之前 |
|
build.md
|
84778e9770
CUDA/HIP: Share the same unified memory allocation logic. (#12934)
|
9 月之前 |
|
docker.md
|
33983057d0
musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (#13647)
|
8 月之前 |
|
function-calling.md
|
f5cd27b71d
`server`: streaming of tool calls and thoughts when `--jinja` is on (#12379)
|
7 月之前 |
|
install.md
|
18b663d8e4
install : add macports (#12518)
|
10 月之前 |
|
llguidance.md
|
89daa2564f
llguidance build fixes for Windows (#11664)
|
11 月之前 |
|
multimodal.md
|
40aaa8a403
mtmd : add support for Qwen2-Audio and SeaLLM-Audio (#13760)
|
7 月之前 |