Histórico de Commits

Autor SHA1 Mensagem Data
  Sam/Samuel 3f750f8d76 metal: add support for opt_step_sgd (#16539) há 4 meses atrás
  Georgi Gerganov c515fc5771 ggml : fix scalar path for computing norm (#16558) há 4 meses atrás
  hipudding f9bc66c3eb CANN: Update several operators to support FP16 data format (#16251) há 4 meses atrás
  Sam/Samuel a31cf36ad9 metal : add opt_step_adamw and op_sum (#16529) há 4 meses atrás
  Pascal 81d54bbfd5 webui: remove client-side context pre-check and rely on backend for limits (#16506) há 4 meses atrás
  Neo Zhang Jianyu c7be9febcb [SYCL] fix UT fault cases: count-equal, argsort, pad OPs (#16521) há 4 meses atrás
  Mathieu Baudier 8415f61e23 ci : add Vulkan on Ubuntu with default packages build (#16532) há 4 meses atrás
  Aldehir Rojas 2c301e91ab common : handle unicode during partial json parsing (#16526) há 4 meses atrás
  Georgi Gerganov 4b2dae383d common : update presets (#16504) há 4 meses atrás
  sirus20x6 41aac5c69b ggml : Fix FP16 ELU positive branch (#16519) há 4 meses atrás
  Daniel Bevenius a2fba89a42 hparams : add check for layer index in is_recurrent (#16511) há 4 meses atrás
  sirus20x6 20cc625edc ggml: Correct SVE implementation in ggml_vec_dot_f16_unroll (#16518) há 4 meses atrás
  Johannes Gäßler 11f0af5504 CUDA: faster tile FA, add oob checks, more HSs (#16492) há 4 meses atrás
  Georgi Gerganov a3cb04744f metal : fix mul-mm condition + fix mul-mv permuted kernels (#16494) há 4 meses atrás
  Pascal 4a8fbe0a5e feat: render user content as markdown option (#16358) há 4 meses atrás
  Yann Follet 31d0ff1869 server / ranking : add sorting and management of top_n (#16403) há 4 meses atrás
  Diego Devesa 97870e6497 cuda : avoid initializing unused devices (#16510) há 4 meses atrás
  amirai21 477a66b035 convert : correctly handle LLaMA tokenizer for Jamba (#16470) há 4 meses atrás
  Georgi Gerganov e60f01d941 server : fix division by zero when reporting stats (#16501) há 4 meses atrás
  Georgi Gerganov 81086cd6a3 vocab : mark EOT token for Granite models (#16499) há 4 meses atrás
  Radoslav Gerganov 68ee98ae18 server : return HTTP 400 if prompt exceeds context length (#16486) há 4 meses atrás
  Radoslav Gerganov cdb6da468c server : log requests to /v1/completions (#16495) há 4 meses atrás
  Prajwal B Mehendarkar 6d69ab3f26 cmake : Dont define XOPENSOURCE on AIX (#16481) há 4 meses atrás
  Pascal 1faa13a118 webui: updated the chat service to only include max_tokens in the req… (#16489) há 4 meses atrás
  duduta 1deee0f8d4 cpu : optimize the ggml NORM operation (#15953) há 4 meses atrás
  Georgi Gerganov d00cbea63c server : host-memory prompt caching (#16391) há 4 meses atrás
  Pascal 8328fd4bae No markdown in cot (#16483) há 4 meses atrás
  Daniel Bevenius 56b4795842 model-conversion : add support for SentenceTransformers (#16387) há 4 meses atrás
  sudhiarm 2c0d875ae6 ci: add ARM64 Kleidiai build and test support (#16462) há 4 meses atrás
  Chenguang Li aa4711d369 CANN: Improve ACL graph matching (#16166) há 4 meses atrás