Histórico de Commits

Autor SHA1 Mensagem Data
  Georgi Gerganov 01612b7409 llama : reuse compute graphs (#14482) há 6 meses atrás
  Tarek Dakhran 086cf81e88 llama : fix parallel processing for lfm2 (#14705) há 6 meses atrás
  Georgi Gerganov d9b691081c kv-cache : opt mask set input (#14600) há 6 meses atrás
  Georgi Gerganov ad57d3edd2 batch : fix uninitialized has_cpl flag (#14733) há 6 meses atrás
  Sigbjørn Skjæret 1ba45d4982 ci : disable failing vulkan crossbuilds (#14723) há 6 meses atrás
  Sigbjørn Skjæret 19e5943d9e convert : make hf token optional (#14717) há 6 meses atrás
  Diner Burger 496957e1cb llama : fix parameter order for hybrid memory initialization (#14725) há 6 meses atrás
  Reese Levine 21c021745d ggml: Add initial WebGPU backend (#14521) há 6 meses atrás
  tempstudio b0f0ecc3dc model : support output bias for qwen2 (#14711) há 6 meses atrás
  Georgi Gerganov 225e7a1438 llama : add high-throughput mode (#14363) há 6 meses atrás
  Aman Gupta ab14019821 Support diffusion models: Add Dream 7B (#14644) há 6 meses atrás
  Georgi Gerganov 64978340b0 ggml : add asserts (#14720) há 6 meses atrás
  Georgi Gerganov 6ffd4e9c44 server : pre-calculate EOG logit biases (#14721) há 6 meses atrás
  Shunta Saito e4841d24d3 llama : fix parallel processing for plamo2 (#14716) há 6 meses atrás
  Georgi Gerganov 538cc77f7f server : fix handling of the ignore_eos flag (#14710) há 6 meses atrás
  Johannes Gäßler 5cae766541 scripts: synthetic prompt mode for server-bench.py (#14695) há 6 meses atrás
  Sigbjørn Skjæret 4b91d6f71f convert : only check for tokenizer folder if we need it (#14704) há 6 meses atrás
  Sigbjørn Skjæret cf91f217f1 convert : add pre-computed hashes first to prevent order mishaps (#14701) há 6 meses atrás
  Min-Hua 79e0b68c17 llama: add LLAMA_API to deprecated llama_kv_self_seq_div (#14708) há 6 meses atrás
  Ed Addario c81f4192f9 gguf-py : dump bpw per layer and model in markdown mode (#14703) há 6 meses atrás
  Gabriel Larson 4a4f426944 model : add Kimi-K2 support (#14654) há 6 meses atrás
  Jeff Bolz ba1ceb3456 vulkan: fix noncontig check for mat_mul_id splitting (#14683) há 6 meses atrás
  Jeff Bolz 10a0351a97 vulkan: add RTE variants for glu/add/sub/mul/div (#14653) há 6 meses atrás
  Shunta Saito 68e37a61a7 model : add PLaMo-2 support (#14560) há 6 meses atrás
  R0CKSTAR cbc68be51d cuda: fix build warnings in set-rows.cu (unused variable) (#14687) há 6 meses atrás
  Anton Mitkov bdca38376f sycl: Hotfix for non dnnl codepath (#14677) há 6 meses atrás
  shalinib-ibm 55c509daf5 ggml : refactor llamafile_sgemm PPC code (#14673) há 6 meses atrás
  Aman Gupta 9c9e4fc635 llama-context: add ability to get logits (#14672) há 6 meses atrás
  Johannes Gäßler 494c5899cb scripts: benchmark for HTTP server throughput (#14668) há 6 meses atrás
  Akarshan Biswas 0f4c6ec0f1 SYCL: use 1D kernel for set_rows (#14618) há 6 meses atrás