Commit History

Author SHA1 Message Date
  Georgi Gerganov 90a2fff0e7 flake.lock: Update (#9488) 1 year ago
  Georgi Gerganov 6262d13e0b common : reimplement logging (#9418) 1 year ago
  slaren e6deac31f7 gguf-split : add basic checks (#9499) 1 year ago
  Michael Podvitskiy 6988da94a2 cmake : correct order of sycl flags (#9497) 1 year ago
  Csaba Kecskemeti 3c7989fd29 py : add "LLaMAForCausalLM" conversion support (#9485) 1 year ago
  OSecret d6b37c881f readme : update tools list (#9475) 1 year ago
  Michael Podvitskiy 7596487beb cmake : try to fix sycl+intel build (#9487) 1 year ago
  Yuri Khrustalev 822b6322de ggml : ggml_type_name return "NONE" for invalid values (#9458) 1 year ago
  VoidIsVoid dcdcee3a74 server: add data: [DONE] to /chat/completions stream response (#9459) 1 year ago
  Georgi Gerganov 1f4111e540 cmake : use list(APPEND ...) instead of set() + dedup linker (#9463) 1 year ago
  Daniel Bevenius befaf1197f llama : make cell_id const in inp_s_mask block (#9470) 1 year ago
  Xuan Son Nguyen feff4aa846 server : add loading html page while model is loading (#9468) 1 year ago
  Georgi Gerganov 0abc6a2c25 llama : llama_perf + option to disable timings during decode (#9355) 1 year ago
  Gilad S. bd35cb0ae3 feat: remove a sampler from a chain (#9445) 1 year ago
  Mathijs Henquet 78203641fe server : Add option to return token pieces in /tokenize endpoint (#9108) 1 year ago
  Dou Xinpeng e6b7801bd1 cann: Add host buffer type for Ascend NPU (#9406) 1 year ago
  fengerhu1 e665744317 llava : fix the script error in MobileVLM README (#9054) 1 year ago
  Xuan Son Nguyen d4c3c10fad lora : raise error if lm_head is ignored (#9103) 1 year ago
  Michael Podvitskiy 2a825116b6 cmake : fix for builds without `GGML_CDEF_PUBLIC` (#9338) 1 year ago
  Huang Qi 4dc4f5f14a ci : update HIP SDK to 24.Q3 (ROCm 6.1) (#9329) 1 year ago
  daminho c837981bba py : add Phi-1.5/Phi-2 tokenizer (#9361) 1 year ago
  Trivikram Kamat 3c26a1644d ci : bump actions/checkout to v4 (#9377) 1 year ago
  Michael Podvitskiy ff76e18516 cmake : fixed the order of linking libraries for llama-quantize (#9450) 1 year ago
  Molly Sophia 39f852f440 py : add special tokens in hf_converter for RWKV v6 (#9428) 1 year ago
  Ahmad Tameem 2b00fa7997 riscv : modify Makefile and add a RISCV_VECT to print log info (#9442) 1 year ago
  Georgi Gerganov d6a04f872d ggml : hide ggml_object, ggml_cgraph, ggml_hash_set (#9408) 1 year ago
  Neo Zhang Jianyu c9c8575a1a enhance run script to be easy to change the parameters (#9448) 1 year ago
  Xinpeng Dou df4b7945ae cann: Fix error when running a non-exist op (#9424) 1 year ago
  Faisal Zaghloul 449ccfb6f5 Add Jais to list of supported models (#9439) 1 year ago
  slaren 1b28061400 llama : skip token bounds check when evaluating embeddings (#9437) 1 year ago