Histórico de Commits

Autor SHA1 Mensagem Data
  Georgi Gerganov d9c6ce46f7 kv-cache : support V-less cache (#19067) há 1 semana atrás
  Georgi Gerganov 557515be1e graph : utilize `ggml_build_forward_select()` to avoid reallocations (#18898) há 1 semana atrás
  Tarek Dakhran ad8d85bd94 memory : add llama_memory_hybrid_iswa (#18601) há 1 semana atrás
  Daniel Bevenius d3dce4e0a5 sampling : add support for backend sampling (#17004) há 4 semanas atrás
  Georgi Gerganov c560316440 graph : reuse SSM graphs (#16490) há 1 mês atrás
  Xuan-Son Nguyen 0759b09c90 graph: add f_attn_temp_offset (#18025) há 1 mês atrás
  Georgi Gerganov e38b7c6e9e graph : support cacheless embeddings with FA and iSWA (#16528) há 3 meses atrás
  Saba Fallah e08db42595 model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules (#16367) há 3 meses atrás
  Douglas Hanley b5bd037832 llama : add support for qwen3 reranker (#15824) há 4 meses atrás
  Daniel Bevenius fb15d649ed llama : add support for EmbeddingGemma 300m (#15798) há 5 meses atrás
  Johannes Gäßler e81b8e4b7f llama: use FA + max. GPU layers by default (#15434) há 5 meses atrás
  Georgi Gerganov 3f196be84b graph : remove build_attn_with_sinks overload (#15469) há 5 meses atrás
  Georgi Gerganov 715a6db02c kv-cache : drop the "unified" prefix (#15467) há 5 meses atrás
  Georgi Gerganov fd1234cb46 llama : add gpt-oss (#15091) há 6 meses atrás
  Georgi Gerganov ba42794c9e graph : fix equal_seq() check (#14986) há 6 meses atrás
  Dongliang Wei c1dacaa99b llama : merge build_moe_ffn_from_probs function into build_moe_ffn (#14968) há 6 meses atrás
  compilade 66625a59a5 graph : reduce splits for recurrent and hybrid models (#14825) há 6 meses atrás
  Georgi Gerganov 1e15bfd42c graph : fix stack-use-after-return (#14960) há 6 meses atrás
  Dongliang Wei 6c6e397aff model : add support for SmallThinker series (#14898) há 6 meses atrás
  Georgi Gerganov 8f974bc1e9 graph : refactor context to not pass gf explicitly (#14629) há 6 meses atrás
  Georgi Gerganov 01612b7409 llama : reuse compute graphs (#14482) há 6 meses atrás
  Georgi Gerganov 225e7a1438 llama : add high-throughput mode (#14363) há 6 meses atrás
  Xuan-Son Nguyen cb9178f885 llama : remove llm_graph_input_one (#14603) há 6 meses atrás
  compilade 4a5686da22 llama : support Jamba hybrid Transformer-Mamba models (#7531) há 6 meses atrás
  Georgi Gerganov 7b50f7c025 graph : prepare for 4D mask (#14515) há 7 meses atrás
  Georgi Gerganov a70c8a0c4b kv-cache : use ggml_set_rows (#14285) há 7 meses atrás
  compilade 5d46babdc2 llama : initial Mamba-2 support (#9126) há 7 meses atrás
  Sigbjørn Skjæret a0535ffa0d ggml : implement REGLU/GEGLU/SWIGLU ops (#14158) há 7 meses atrás
  Georgi Gerganov 72babea5de graph : make llm_graph_context destructor virtual (#14410) há 7 meses atrás
  Xuan-Son Nguyen 8846aace49 model : gemma3n text-only (#14400) há 7 meses atrás