Commit History

Author SHA1 Message Date
  Anton Mitkov bdca38376f sycl: Hotfix for non dnnl codepath (#14677) 6 months ago
  shalinib-ibm 55c509daf5 ggml : refactor llamafile_sgemm PPC code (#14673) 6 months ago
  Aman Gupta 9c9e4fc635 llama-context: add ability to get logits (#14672) 6 months ago
  Johannes Gäßler 494c5899cb scripts: benchmark for HTTP server throughput (#14668) 6 months ago
  Akarshan Biswas 0f4c6ec0f1 SYCL: use 1D kernel for set_rows (#14618) 6 months ago
  Anton Mitkov 65a3ebb0aa sycl: Batched mulmat rework for oneDNN dispatch (#14617) 6 months ago
  Molly Sophia 0d9226763c llama : add jinja template for rwkv-world (#14665) 6 months ago
  Ed Addario 982e347255 quantize : fix minor logic flaw in --tensor-type (#14572) 6 months ago
  Sigbjørn Skjæret 923e3ea2e3 cuda : add set rows for bf16 (#14664) 6 months ago
  Yavor Ivanov e743cddb60 cuda : add ELU support (#14657) 6 months ago
  Georgi Gerganov 05fec5bd29 ggml : add build-time message to remind about ggml_set_rows (#14661) 6 months ago
  Yavor Ivanov dcf7f2ea3c metal : Add missing unary ops Metal support (#14660) 6 months ago
  Yavor Ivanov 84b396e051 cmake : Add CMake presets for Linux and GCC (#14656) 6 months ago
  Tarek Dakhran c31e60647d tests : cover lfm2 cases in test_ssm_conv (#14651) 6 months ago
  Tarek Dakhran 67eade1bf9 docs : add LFM2 to models section (#14650) 6 months ago
  Aman Gupta 7de5c7cab6 CUDA: add set rows for f32 and f16 (#14551) 6 months ago
  Georgi Gerganov 8eff95544e sync : ggml 6 months ago
  Georgi Gerganov 3120413ccd vulkan : remove unused vars (#0) 6 months ago
  Georgi Gerganov 215535701d sync : ggml 6 months ago
  Acly 74bb294591 vulkan : implement bilinear interpolation (ggml/1291) 6 months ago
  Acly 3e303b1107 vulkan : implement ggml_roll (ggml/1290) 6 months ago
  Douglas Hanley 0c1df14b5f server : fix pooled embedding output (#14645) 6 months ago
  Jeff Bolz b3ad3a0191 vulkan: support SET_ROWS (#14587) 6 months ago
  Jeff Bolz 98197e5c98 vulkan: optimizations for deepseek prompt processing (#14555) 6 months ago
  Tarek Dakhran f5e96b368f model : support LiquidAI LFM2 hybrid family (#14620) 6 months ago
  Slobodan Josic 756aa1020a HIP : Add HIP 7.0+ compatibility for hipBLAS compute types (#14634) 6 months ago
  Georgi Gerganov aaa088d87f readme : add hot PRs (#14636) 6 months ago
  Georgi Gerganov 0d5375d54b llama : move enum llama_vocab_pre_type to implementation (#14631) 6 months ago
  Dowon 576c82eda2 vocab : add midm-2.0 model pre-tokenizer (#14626) 6 months ago
  Gabe Goodhart 0aedae00e6 model : Granite Four (#13550) 6 months ago