Daniel Bevenius
|
6443ddd985
llama : use reserve/emplace_back in sampler_sample (#9534)
|
1 year ago |
Vinesh Janarthanan
|
8a308354f6
server : match OAI structured output response (#9527)
|
1 year ago |
Eric Zhang
|
f799155ab8
server : fix OpenSSL build (remove obsolete `LOG_INFO`) (#9529)
|
1 year ago |
Neo Zhang Jianyu
|
faf67b3de4
[SYCL]set context default value to avoid memory issue, update guide (#9476)
|
1 year ago |
Michael Podvitskiy
|
7be099fa81
llama-bench: correct argument parsing error message (#9524)
|
1 year ago |
Bert Wagner
|
8b836ae731
arg : add env variable for parallel (#9513)
|
1 year ago |
Michael Podvitskiy
|
8344ef58f8
llama : fix n_vocab init for 'no_vocab' case (#9511)
|
1 year ago |
Max Krasnyansky
|
0226613853
threadpool : skip polling for unused threads (#9461)
|
1 year ago |
Yuri Khrustalev
|
503147a9f9
unicode : add <algorithm> (#9508)
|
1 year ago |
Gabe Goodhart
|
0d2ec43833
llama : support IBM Granite architecture (#9412)
|
1 year ago |
Michael Podvitskiy
|
37f3a3810e
llama : add llama_n_head() (#9512)
|
1 year ago |
slaren
|
23e0d70bac
ggml : move common CPU backend impl to new header (#9509)
|
1 year ago |
Daniel Bevenius
|
acb2c32c33
llama : rename n_embed to n_embd in rwkv6_time_mix (#9504)
|
1 year ago |
Michael Podvitskiy
|
a6a3a5c531
ggml : link MATH_LIBRARY not by its full path (#9339)
|
1 year ago |
compilade
|
d54c21df7e
convert : identify missing model files (#9397)
|
1 year ago |
Georgi Gerganov
|
19514d632e
cmake : do not hide GGML options + rename option (#9465)
|
1 year ago |
Eve
|
5c3d0f1824
ggml : IQ4_NL sgemm + Q4_0 AVX optimization (#9422)
|
1 year ago |
Shane A
|
0aadac10c7
llama : support OLMoE (#9462)
|
1 year ago |
CarryFun
|
95ca85168b
llama : support MiniCPM3 (#9322)
|
1 year ago |
Vinesh Janarthanan
|
441b72b91f
main : option to disable context shift (#9484)
|
1 year ago |
Georgi Gerganov
|
c4965a64f7
metal : handle zero-sized allocs (#9466)
|
1 year ago |
Georgi Gerganov
|
90a2fff0e7
flake.lock: Update (#9488)
|
1 year ago |
Georgi Gerganov
|
6262d13e0b
common : reimplement logging (#9418)
|
1 year ago |
slaren
|
e6deac31f7
gguf-split : add basic checks (#9499)
|
1 year ago |
Michael Podvitskiy
|
6988da94a2
cmake : correct order of sycl flags (#9497)
|
1 year ago |
Csaba Kecskemeti
|
3c7989fd29
py : add "LLaMAForCausalLM" conversion support (#9485)
|
1 year ago |
OSecret
|
d6b37c881f
readme : update tools list (#9475)
|
1 year ago |
Michael Podvitskiy
|
7596487beb
cmake : try to fix sycl+intel build (#9487)
|
1 year ago |
Yuri Khrustalev
|
822b6322de
ggml : ggml_type_name return "NONE" for invalid values (#9458)
|
1 year ago |
VoidIsVoid
|
dcdcee3a74
server: add data: [DONE] to /chat/completions stream response (#9459)
|
1 year ago |