cturan/llama.cpp

Auteur	SHA1 Message	Date
cturan	54ed0123a6 Add minimax model support	il y a 2 mois
Johannes Gäßler	945501f5ea llama: fix leaked buffers for mmap + split files (#16765)	il y a 2 mois
Aman Gupta	75cbdd3fce test-backend-ops: print failed tests at the end (#16785)	il y a 2 mois
tamarPal	2b9bd9bf4e sycl: add ROLL operation support (#16665)	il y a 2 mois
shani-f	59fc1ec8e8 sycl: add REPEAT_BACK operation support (#16734)	il y a 2 mois
Aman Gupta	75d33b9302 CUDA: support for weight clamp in top-k norm (#16702)	il y a 2 mois
Acly	3470a5c891 ggml-alloc : make gallocr prefer chunks that allow memory reuse (#16788)	il y a 2 mois
Sigbjørn Skjæret	bd562fe4f7 cuda : use fast copy when src and dst are of different type and contiguous (#16789)	il y a 2 mois
leejet	bbac6a26b2 ggml: fix cuda kernel launch configuration for k_compute_batched_ptrs to support large batch (#16744)	il y a 2 mois
Sigbjørn Skjæret	73a48c9790 convert : enable expert group selection for all models with it (#16691)	il y a 2 mois
Sigbjørn Skjæret	f696428ce8 graph : add clamping to ffn_moe_weights_sum to avoid div-by-zero (#16655)	il y a 2 mois
Sigbjørn Skjæret	7cce4f8158 model : set res->t_embd in SmallThinker models (#16782)	il y a 2 mois
amirai21	8d8862829c docs : add Jamba to Text-only models list (#16778)	il y a 2 mois
Aman Gupta	f77c13b91f CUDA: General GEMV fusion (#16715)	il y a 2 mois
Gilad S.	3cfa9c3f12 vulkan: deduplicate Microsoft Direct3D12 devices (#16689)	il y a 2 mois
Galunid	5d195f17bc convert : handle mmproj filename/path properly (#16760)	il y a 2 mois
Shunta Saito	226f295f4d model : set res->t_embd in PLaMo2 models (#16766)	il y a 2 mois
Giuseppe Scrivano	f90b4a8efe vulkan: delete dead code (#16732)	il y a 2 mois
Jeff Bolz	8423d01931 vulkan: Optimize SSM_SCAN (#16645)	il y a 2 mois
compilade	5cca2542ac convert : avoid dequantizing mxfp4 for GPT-OSS (#16756)	il y a 2 mois
leejet	55945d2ef5 ggml: fix CUDA grid launch condition for large block_nums.y in binbcast (#16742)	il y a 2 mois
Aman Gupta	0bcb40b48c CUDA: use CUB for arbitary size argsort (#16754)	il y a 2 mois
Florian Badie	69e9ff0103 webui: support q URL parameter (#16728)	il y a 2 mois
Daniel Bevenius	5a91109a5d model-conversion : add trust_remote_code for orig model run [no ci] (#16751)	il y a 2 mois
compilade	f8f071fadd convert : handle pre-quantized models (#14810)	il y a 2 mois
Johannes Gäßler	0bf47a1dbb server: add memory breakdown print (#16740)	il y a 2 mois
Julien Denize	dd62dcfab9 convert : Make mistral-common dependency optional (#16738)	il y a 2 mois
Xuan-Son Nguyen	d0660f237a mtmd-cli : allow using --jinja (#16718)	il y a 2 mois
Prajwal B Mehendarkar	fe6a9882ac Manually link -lbsd to resolve flock symbol on AIX (#16610)	il y a 2 mois
Aman Gupta	061f0eff02 ggml-cuda: use passed ops instead of hardcoded ops (#16712)	il y a 2 mois

Récemment Précédemment

Historique des commits Trouver

Historique des commits