Commit History

Author SHA1 Message Date
  Sigbjørn Skjæret e434e69183 common : suggest --jinja when autodetection fails (#14222) 7 months ago
  Georgi Gerganov 89fea80d29 server : fix incorrect usage of llama_get_embeddings() (#14225) 7 months ago
  Diego Devesa 6adc3c3ebc llama : add thread safety test (#14035) 7 months ago
  bandoti 0dbcabde8c cmake: clean up external project logic for vulkan-shaders-gen (#14179) 7 months ago
  Đinh Trọng Huy ad590be98c model : add NeoBERT (#14164) 7 months ago
  uvos 7d6d91babf HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202) 7 months ago
  Georgi Gerganov d3e64b9f49 llama : rework embeddings logic (#14208) 7 months ago
  Charles Xu 3ba0d843c6 ggml: Add Android support for GGML_CPU_ALL_VARIANTS (#14206) 7 months ago
  Bartowski 0bf49eb668 convert : remove arcee change in convert_hf_to_gguf_update.py (#14207) 7 months ago
  Đinh Trọng Huy 4ad243677b gguf-py : allow key override when adding value to GGUFWriter (#14194) 7 months ago
  Jeff Bolz c89c2d1ab9 vulkan: mutex around vkQueueSubmit (#14127) 7 months ago
  xctan 3555b3004b ggml-cpu : rework weak alias on apple targets (#14146) 7 months ago
  Bartowski d7da8dc83a model : Add support for Arcee AI's upcoming AFM model (#14185) 7 months ago
  Eric Curtin cd355eda7d server : When listening on a unix domain socket don't print http:// and port (#14180) 7 months ago
  Ed Addario 30e5b01de2 quantize : change int to unsigned int for KV overrides (#14197) 7 months ago
  uvos e54b394082 CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (#14196) 7 months ago
  uvos 2c2caa4443 HIP: Replace usage of depricated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (#14183) 7 months ago
  Georgi Gerganov 5fce5f948d kv-cache : fix use-after-move of defrag info (#14189) 7 months ago
  Mikko Juola 9ae4143bc6 model : add dots.llm1 architecture support (#14044) (#14118) 7 months ago
  Georgi Gerganov c311ac664d cparams : rename LLAMA_MAX_PARALLEL_SEQUENCES to LLAMA_MAX_SEQ (#14188) 7 months ago
  Georgi Gerganov b9912ac570 batch : auto-gen positions + verify multi-sequence input (#14177) 7 months ago
  Pepijn de Vos 00ba772610 docs : remove WIP since PR has been merged (#13912) 7 months ago
  Piotr 3cb203c89f llama-chat : Do not throw when tool parsing fails (#14012) 7 months ago
  Aman Gupta 2e42be42bd compare-llama-bench: add option to plot (#14169) 7 months ago
  Georgi Gerganov fb85a288d7 vocab : fix build (#14175) 7 months ago
  Svetlozar Georgiev 40643edb86 sycl: fix docker image (#14144) 7 months ago
  Guy Goldenberg 3cfbbdb44e Merge commit from fork 7 months ago
  Georgi Gerganov 80709b70a2 batch : add LLAMA_BATCH_DEBUG environment variable (#14172) 7 months ago
  ddpasa 26ff3685bf docs : Update multimodal.md (#14122) 7 months ago
  Georgi Gerganov 60c666347b batch : rework llama_batch_allocr (#14153) 7 months ago