Shane A
|
0aadac10c7
llama : support OLMoE (#9462)
|
пре 1 година |
CarryFun
|
95ca85168b
llama : support MiniCPM3 (#9322)
|
пре 1 година |
Vinesh Janarthanan
|
441b72b91f
main : option to disable context shift (#9484)
|
пре 1 година |
Georgi Gerganov
|
c4965a64f7
metal : handle zero-sized allocs (#9466)
|
пре 1 година |
Georgi Gerganov
|
90a2fff0e7
flake.lock: Update (#9488)
|
пре 1 година |
Georgi Gerganov
|
6262d13e0b
common : reimplement logging (#9418)
|
пре 1 година |
slaren
|
e6deac31f7
gguf-split : add basic checks (#9499)
|
пре 1 година |
Michael Podvitskiy
|
6988da94a2
cmake : correct order of sycl flags (#9497)
|
пре 1 година |
Csaba Kecskemeti
|
3c7989fd29
py : add "LLaMAForCausalLM" conversion support (#9485)
|
пре 1 година |
OSecret
|
d6b37c881f
readme : update tools list (#9475)
|
пре 1 година |
Michael Podvitskiy
|
7596487beb
cmake : try to fix sycl+intel build (#9487)
|
пре 1 година |
Yuri Khrustalev
|
822b6322de
ggml : ggml_type_name return "NONE" for invalid values (#9458)
|
пре 1 година |
VoidIsVoid
|
dcdcee3a74
server: add data: [DONE] to /chat/completions stream response (#9459)
|
пре 1 година |
Georgi Gerganov
|
1f4111e540
cmake : use list(APPEND ...) instead of set() + dedup linker (#9463)
|
пре 1 година |
Daniel Bevenius
|
befaf1197f
llama : make cell_id const in inp_s_mask block (#9470)
|
пре 1 година |
Xuan Son Nguyen
|
feff4aa846
server : add loading html page while model is loading (#9468)
|
пре 1 година |
Georgi Gerganov
|
0abc6a2c25
llama : llama_perf + option to disable timings during decode (#9355)
|
пре 1 година |
Gilad S.
|
bd35cb0ae3
feat: remove a sampler from a chain (#9445)
|
пре 1 година |
Mathijs Henquet
|
78203641fe
server : Add option to return token pieces in /tokenize endpoint (#9108)
|
пре 1 година |
Dou Xinpeng
|
e6b7801bd1
cann: Add host buffer type for Ascend NPU (#9406)
|
пре 1 година |
fengerhu1
|
e665744317
llava : fix the script error in MobileVLM README (#9054)
|
пре 1 година |
Xuan Son Nguyen
|
d4c3c10fad
lora : raise error if lm_head is ignored (#9103)
|
пре 1 година |
Michael Podvitskiy
|
2a825116b6
cmake : fix for builds without `GGML_CDEF_PUBLIC` (#9338)
|
пре 1 година |
Huang Qi
|
4dc4f5f14a
ci : update HIP SDK to 24.Q3 (ROCm 6.1) (#9329)
|
пре 1 година |
daminho
|
c837981bba
py : add Phi-1.5/Phi-2 tokenizer (#9361)
|
пре 1 година |
Trivikram Kamat
|
3c26a1644d
ci : bump actions/checkout to v4 (#9377)
|
пре 1 година |
Michael Podvitskiy
|
ff76e18516
cmake : fixed the order of linking libraries for llama-quantize (#9450)
|
пре 1 година |
Molly Sophia
|
39f852f440
py : add special tokens in hf_converter for RWKV v6 (#9428)
|
пре 1 година |
Ahmad Tameem
|
2b00fa7997
riscv : modify Makefile and add a RISCV_VECT to print log info (#9442)
|
пре 1 година |
Georgi Gerganov
|
d6a04f872d
ggml : hide ggml_object, ggml_cgraph, ggml_hash_set (#9408)
|
пре 1 година |