cturan/llama.cpp

Author	SHA1 Message	Date
Sigbjørn Skjæret	e434e69183 common : suggest --jinja when autodetection fails (#14222)	7 months ago
Georgi Gerganov	89fea80d29 server : fix incorrect usage of llama_get_embeddings() (#14225)	7 months ago
Diego Devesa	6adc3c3ebc llama : add thread safety test (#14035)	7 months ago
bandoti	0dbcabde8c cmake: clean up external project logic for vulkan-shaders-gen (#14179)	7 months ago
Đinh Trọng Huy	ad590be98c model : add NeoBERT (#14164)	7 months ago
uvos	7d6d91babf HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202)	7 months ago
Georgi Gerganov	d3e64b9f49 llama : rework embeddings logic (#14208)	7 months ago
Charles Xu	3ba0d843c6 ggml: Add Android support for GGML_CPU_ALL_VARIANTS (#14206)	7 months ago
Bartowski	0bf49eb668 convert : remove arcee change in convert_hf_to_gguf_update.py (#14207)	7 months ago
Đinh Trọng Huy	4ad243677b gguf-py : allow key override when adding value to GGUFWriter (#14194)	7 months ago
Jeff Bolz	c89c2d1ab9 vulkan: mutex around vkQueueSubmit (#14127)	7 months ago
xctan	3555b3004b ggml-cpu : rework weak alias on apple targets (#14146)	7 months ago
Bartowski	d7da8dc83a model : Add support for Arcee AI's upcoming AFM model (#14185)	7 months ago
Eric Curtin	cd355eda7d server : When listening on a unix domain socket don't print http:// and port (#14180)	7 months ago
Ed Addario	30e5b01de2 quantize : change int to unsigned int for KV overrides (#14197)	7 months ago
uvos	e54b394082 CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (#14196)	7 months ago
uvos	2c2caa4443 HIP: Replace usage of depricated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (#14183)	7 months ago
Georgi Gerganov	5fce5f948d kv-cache : fix use-after-move of defrag info (#14189)	7 months ago
Mikko Juola	9ae4143bc6 model : add dots.llm1 architecture support (#14044) (#14118)	7 months ago
Georgi Gerganov	c311ac664d cparams : rename LLAMA_MAX_PARALLEL_SEQUENCES to LLAMA_MAX_SEQ (#14188)	7 months ago
Georgi Gerganov	b9912ac570 batch : auto-gen positions + verify multi-sequence input (#14177)	7 months ago
Pepijn de Vos	00ba772610 docs : remove WIP since PR has been merged (#13912)	7 months ago
Piotr	3cb203c89f llama-chat : Do not throw when tool parsing fails (#14012)	7 months ago
Aman Gupta	2e42be42bd compare-llama-bench: add option to plot (#14169)	7 months ago
Georgi Gerganov	fb85a288d7 vocab : fix build (#14175)	7 months ago
Svetlozar Georgiev	40643edb86 sycl: fix docker image (#14144)	7 months ago
Guy Goldenberg	3cfbbdb44e Merge commit from fork	7 months ago
Georgi Gerganov	80709b70a2 batch : add LLAMA_BATCH_DEBUG environment variable (#14172)	7 months ago
ddpasa	26ff3685bf docs : Update multimodal.md (#14122)	7 months ago
Georgi Gerganov	60c666347b batch : rework llama_batch_allocr (#14153)	7 months ago
