تاریخچه Commit ها

نویسنده SHA1 پیام تاریخ
  Georgi Gerganov 08ea539df2 unicode : improve naming style (#10838) 1 سال پیش
  Georgi Gerganov 644fd71b44 sampling : refactor + optimize penalties sampler (#10803) 1 سال پیش
  Bartowski 4ddd199f6f llava : Allow locally downloaded models for QwenVL (#10833) 1 سال پیش
  Valentin Mamedov a0974156f3 llama : add Deepseek MoE v1 & GigaChat models (#10827) 1 سال پیش
  Georgi Gerganov 87cf323cef scripts : change build path to "build-bench" for compare-commits.sh (#10836) 1 سال پیش
  Vinesh Janarthanan 5478bbcd17 server: (UI) add syntax highlighting and latex math rendering (#10808) 1 سال پیش
  Georgi Gerganov b5ae1ddff9 gguf-py : bump to v0.13.0 1 سال پیش
  Michelle Tan 89d604f2c8 server: Fix `has_next_line` in JSON response (#10818) 1 سال پیش
  Evgeny Kurnevsky e52aba537a nix: allow to override rocm gpu targets (#10794) 1 سال پیش
  HimariO ba1cb19cdd llama : add Qwen2VL support + multimodal RoPE (#10361) 1 سال پیش
  cduk 56eea0781c Removes spurious \r in output that causes logging in journalctl to treat lines as binary and therefore hidden by default (#10771) 1 سال پیش
  lhez a76c56fa1a Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (#10693) 1 سال پیش
  Eric Curtin c27ac678dd Opt class for positional argument handling (#10508) 1 سال پیش
  Corentin REGAL 11e07fd63b fix: graceful shutdown for Docker images (#10815) 1 سال پیش
  Jett Janiak 4601a8bb67 gguf-py : numpy 2 newbyteorder fix (#9772) 1 سال پیش
  谢乃闻 9f35e44592 Fix crash caused by ggml_backend_load_all when launching on Android Activity (#10812) 1 سال پیش
  Eve 64ae065511 vulkan: small mul_mat_vec optimizations (#10665) 1 سال پیش
  Akarshan Biswas 83ed24a97b SYCL: Reduce most of the compiler warnings (#10748) 1 سال پیش
  Karol Kontny d583cd03f6 ggml : Fix compilation issues on ARM platform when building without fp16 (#10811) 1 سال پیش
  Xuan Son Nguyen adffa6ffd5 common : improve -ctv -ctk CLI arguments (#10806) 1 سال پیش
  Xuan Son Nguyen 274ec65af6 contrib : add ngxson as codeowner (#10804) 1 سال پیش
  a3sh 8faa1d4dd4 CUDA: faster non-contiguous concat (#10760) 1 سال پیش
  Diego Devesa cb13ef85a4 remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (#10797) 1 سال پیش
  0cc4m 4064c0e3b6 Vulkan: Use improved q4_k and q5_k dequant code in dequant shaders (#10798) 1 سال پیش
  0cc4m dc5301d565 Vulkan: Add VK_EXT_subgroup_size_control support to ensure full subgroups for coopmats (#10721) 1 سال پیش
  Xuan Son Nguyen 9fdb124304 common : add missing env var for speculative (#10801) 1 سال پیش
  CentricStorm 5555c0c1f6 docs: update server streaming mode documentation (#9519) 1 سال پیش
  Georgi Gerganov 973f328b1e Merge pull request #10788 from ggerganov/gg/gguf-py-0.11.0 1 سال پیش
  Georgi Gerganov fb18934a97 gguf-py : bump version to 0.11.0 1 سال پیش
  Xuan Son Nguyen 235f6e14bf server : (UI) add tok/s, get rid of completion.js (#10786) 1 سال پیش