Csaba Kecskemeti
|
3c7989fd29
py : add "LLaMAForCausalLM" conversion support (#9485)
|
1 rok temu |
OSecret
|
d6b37c881f
readme : update tools list (#9475)
|
1 rok temu |
Michael Podvitskiy
|
7596487beb
cmake : try to fix sycl+intel build (#9487)
|
1 rok temu |
Yuri Khrustalev
|
822b6322de
ggml : ggml_type_name return "NONE" for invalid values (#9458)
|
1 rok temu |
VoidIsVoid
|
dcdcee3a74
server: add data: [DONE] to /chat/completions stream response (#9459)
|
1 rok temu |
Georgi Gerganov
|
1f4111e540
cmake : use list(APPEND ...) instead of set() + dedup linker (#9463)
|
1 rok temu |
Daniel Bevenius
|
befaf1197f
llama : make cell_id const in inp_s_mask block (#9470)
|
1 rok temu |
Xuan Son Nguyen
|
feff4aa846
server : add loading html page while model is loading (#9468)
|
1 rok temu |
Georgi Gerganov
|
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
|
1 rok temu |
Gilad S.
|
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
|
1 rok temu |
Mathijs Henquet
|
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
|
1 rok temu |
Dou Xinpeng
|
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
|
1 rok temu |
fengerhu1
|
e665744317
llava : fix the script error in MobileVLM README (#9054)
|
1 rok temu |
Xuan Son Nguyen
|
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
|
1 rok temu |
Michael Podvitskiy
|
2a825116b6
cmake : fix for builds without `GGML_CDEF_PUBLIC` (#9338)
|
1 rok temu |
Huang Qi
|
4dc4f5f14a
ci : update HIP SDK to 24.Q3 (ROCm 6.1) (#9329)
|
1 rok temu |
daminho
|
c837981bba
py : add Phi-1.5/Phi-2 tokenizer (#9361)
|
1 rok temu |
Trivikram Kamat
|
3c26a1644d
ci : bump actions/checkout to v4 (#9377)
|
1 rok temu |
Michael Podvitskiy
|
ff76e18516
cmake : fixed the order of linking libraries for llama-quantize (#9450)
|
1 rok temu |
Molly Sophia
|
39f852f440
py : add special tokens in hf_converter for RWKV v6 (#9428)
|
1 rok temu |
Ahmad Tameem
|
2b00fa7997
riscv : modify Makefile and add a RISCV_VECT to print log info (#9442)
|
1 rok temu |
Georgi Gerganov
|
d6a04f872d
ggml : hide ggml_object, ggml_cgraph, ggml_hash_set (#9408)
|
1 rok temu |
Neo Zhang Jianyu
|
c9c8575a1a
enhance run script to be easy to change the parameters (#9448)
|
1 rok temu |
Xinpeng Dou
|
df4b7945ae
cann: Fix error when running a non-exist op (#9424)
|
1 rok temu |
Faisal Zaghloul
|
449ccfb6f5
Add Jais to list of supported models (#9439)
|
1 rok temu |
slaren
|
1b28061400
llama : skip token bounds check when evaluating embeddings (#9437)
|
1 rok temu |
Pavel Zloi
|
8db003a19d
py : support converting local models (#7547)
|
1 rok temu |
Xuan Son Nguyen
|
0996c5597f
llava : correct args for minicpmv-cli (#9429)
|
1 rok temu |
Xuan Son Nguyen
|
5bb2c5dbd2
files : remove accidentally added `lora_test` submodule (#9430)
|
1 rok temu |
Farbod Bijary
|
67155ab7f5
feat: Implements retrying logic for downloading models using --model-url flag (#9255)
|
1 rok temu |