Commit History

Author SHA1 Message Date
  Georgi Gerganov d3e64b9f49 llama : rework embeddings logic (#14208) 7 months ago
  Charles Xu 3ba0d843c6 ggml: Add Android support for GGML_CPU_ALL_VARIANTS (#14206) 7 months ago
  Bartowski 0bf49eb668 convert : remove arcee change in convert_hf_to_gguf_update.py (#14207) 7 months ago
  Đinh Trọng Huy 4ad243677b gguf-py : allow key override when adding value to GGUFWriter (#14194) 7 months ago
  Jeff Bolz c89c2d1ab9 vulkan: mutex around vkQueueSubmit (#14127) 7 months ago
  xctan 3555b3004b ggml-cpu : rework weak alias on apple targets (#14146) 7 months ago
  Bartowski d7da8dc83a model : Add support for Arcee AI's upcoming AFM model (#14185) 7 months ago
  Eric Curtin cd355eda7d server : When listening on a unix domain socket don't print http:// and port (#14180) 7 months ago
  Ed Addario 30e5b01de2 quantize : change int to unsigned int for KV overrides (#14197) 7 months ago
  uvos e54b394082 CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (#14196) 7 months ago
  uvos 2c2caa4443 HIP: Replace usage of depricated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (#14183) 7 months ago
  Georgi Gerganov 5fce5f948d kv-cache : fix use-after-move of defrag info (#14189) 7 months ago
  Mikko Juola 9ae4143bc6 model : add dots.llm1 architecture support (#14044) (#14118) 7 months ago
  Georgi Gerganov c311ac664d cparams : rename LLAMA_MAX_PARALLEL_SEQUENCES to LLAMA_MAX_SEQ (#14188) 7 months ago
  Georgi Gerganov b9912ac570 batch : auto-gen positions + verify multi-sequence input (#14177) 7 months ago
  Pepijn de Vos 00ba772610 docs : remove WIP since PR has been merged (#13912) 7 months ago
  Piotr 3cb203c89f llama-chat : Do not throw when tool parsing fails (#14012) 7 months ago
  Aman Gupta 2e42be42bd compare-llama-bench: add option to plot (#14169) 7 months ago
  Georgi Gerganov fb85a288d7 vocab : fix build (#14175) 7 months ago
  Svetlozar Georgiev 40643edb86 sycl: fix docker image (#14144) 7 months ago
  Guy Goldenberg 3cfbbdb44e Merge commit from fork 7 months ago
  Georgi Gerganov 80709b70a2 batch : add LLAMA_BATCH_DEBUG environment variable (#14172) 7 months ago
  ddpasa 26ff3685bf docs : Update multimodal.md (#14122) 7 months ago
  Georgi Gerganov 60c666347b batch : rework llama_batch_allocr (#14153) 7 months ago
  Georgi Gerganov b7cc7745e3 readme : remove survey link (#14168) 7 months ago
  Christian Kastner cc8d081879 cmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT (#14167) 7 months ago
  Đinh Trọng Huy d714dadb57 pooling : make cls_b and cls_out_b optional (#14165) 7 months ago
  Georgi Gerganov ffad043973 server : fix SWA condition for full context reprocess (#14163) 7 months ago
  Anton Mitkov 0889eba570 sycl: Adding additional cpy dbg print output (#14034) 7 months ago
  Ewan Crawford c61285e739 SYCL: Bump oneMath commit (#14152) 7 months ago