Historique des commits

Auteur SHA1 Message Date
  Judd e976423005 ggml : check ggml_add src1 type (ggml/708) il y a 2 ans
  Michael Klimenko 35a2ee9143 Remove unused data and add fixes (#5154) il y a 2 ans
  Maximilian Winter ec903c0341 server : add self-extend support (#5104) il y a 2 ans
  0cc4m a1d6df129b Add OpenCL add kernel (#5151) il y a 2 ans
  Jared Van Bortel bbe7c56c99 cmake : pass CPU architecture flags to nvcc (#5146) il y a 2 ans
  slaren 62fead3ea0 cuda : fix tensor size calculation for non-split buffer (#5145) il y a 2 ans
  slaren 15b4538ff2 ggml-alloc : add 10% margin to the buffer sizes (#5149) il y a 2 ans
  snadampal 7032f4f634 ggml : update softmax n_task calculation (#5126) il y a 2 ans
  Georgi Gerganov 5f1925a8ce scripts : move run-with-preset.py from root to scripts folder il y a 2 ans
  Georgi Gerganov 3b7c914de2 tests : gitignore test-c.o il y a 2 ans
  Xuan Son Nguyen 48c857aa10 server : refactored the task processing logic (#5065) il y a 2 ans
  crasm 413e7b0559 ci : add model tests + script wrapper (#4586) il y a 2 ans
  Paul Tsochantaris 6dd3c28c9c metal : remove unused `n_buffers` and `buffers` (#5129) il y a 2 ans
  Riceball LEE 38b431de23 gguf : fix "general.alignment" type in gguf_reader.py (#5136) il y a 2 ans
  Georgi Gerganov aad0b01d73 readme : update hot topics il y a 2 ans
  Kawrakow 1182cf4d4f Another bucket sort (#5109) il y a 2 ans
  XiaotaoChen fe54033b69 readme : add MobileVLM 1.7B/3B to the supported models list (#5107) il y a 2 ans
  l3utterfly 5eaf9964fc llama : dynamic temperature sampling (#4972) il y a 2 ans
  Jared Van Bortel d292f4f204 examples : make pydantic scripts pass mypy and support py3.8 (#5099) il y a 2 ans
  Valentin Konovalov 256d1bb0dd android : use release cmake build type by default (#5123) il y a 2 ans
  Kawrakow faa3526a1e Fix Q3_K_XS for MoE models (#5113) il y a 2 ans
  Georgi Gerganov ddc5a5033f metal : show compile log messages il y a 2 ans
  Engininja2 cd4fddb29f cuda : fix 2-bit quants on amd hip (#5105) il y a 2 ans
  Michael Hueschen c9b316c78f nix-shell: use addToSearchPath il y a 2 ans
  Michael Hueschen bf63d695b8 nix: add cc to devShell LD_LIBRARY_PATH il y a 2 ans
  slaren 1387ea2117 llama : pre-allocate input tensors in a separate buffer (#5100) il y a 2 ans
  Georgi Gerganov 26d607608d metal : disable support for MUL_MAT F32 x F16 il y a 2 ans
  Kawrakow 44879ee885 Additional KL-divergence statistics (#5081) il y a 2 ans
  Johannes Gäßler 9ecdd12e95 CUDA: more info when no device code (#5088) il y a 2 ans
  Georgi Gerganov 89758723c7 minor : clean-up some warnings and style (#5094) il y a 2 ans