Xuan Son Nguyen
|
842500144e
gguf-split: add --no-tensor-first-split (#7072)
|
пре 1 година |
Jeximo
|
cf768b7e71
Tidy Android Instructions README.md (#7016)
|
пре 1 година |
viric
|
fcd84a0f5a
Fix Linux /sys cpu path to guess number of cores (#7064)
|
пре 1 година |
maor-ps
|
03fb8a002d
If first token generated from the server is the stop word the server will crash (#7038)
|
пре 1 година |
Georgi Gerganov
|
92139b90af
tests : add test-tokenizer-0.sh + fix some tokenizers (#7036)
|
пре 1 година |
Brian
|
a2ac89d6ef
convert.py : add python logging instead of print() (#6511)
|
пре 1 година |
Daniel Bevenius
|
433def286e
llama : rename ctx to user_data in progress_callback (#7045)
|
пре 1 година |
Bartowski
|
60325fa56f
Remove .attention from skipped tensors to match more accurately (#7051)
|
пре 1 година |
alwqx
|
6ecf3189e0
chore: fix typo in llama.cpp (#7032)
|
пре 1 година |
Andrew Downing
|
b0d943de17
Update LOG_IMPL and LOG_TEE_IMPL (#7029)
|
пре 1 година |
l3utterfly
|
8d608a81b7
main : fix off by one error for context shift (#6921)
|
пре 1 година |
Johannes Gäßler
|
3ea0d36000
Server: add tests for batch size, different seeds (#6950)
|
пре 1 година |
Johannes Gäßler
|
1613ef8d8e
CUDA: CUDART < 11.7 workaround for __hmax, __hmax2 (#7019)
|
пре 1 година |
slaren
|
c4ec9c0d3d
ci : exempt confirmed bugs from being tagged as stale (#7014)
|
пре 1 година |
Johannes Gäßler
|
a8f9b07631
perplexity: more statistics, added documentation (#6936)
|
пре 1 година |
Kevin Gibbons
|
f364eb6fb5
switch to using localizedDescription (#7010)
|
пре 1 година |
Georgi Gerganov
|
77e15bec62
metal : remove deprecated error code (#7008)
|
пре 1 година |
Kevin Gibbons
|
a68a1e7ed0
metal : log more info on error (#6987)
|
пре 1 година |
Georgi Gerganov
|
9c67c2773d
ggml : add Flash Attention (#5021)
|
пре 1 година |
Georgi Gerganov
|
952d03dbea
convert : use utf8 encoding (#7000)
|
пре 1 година |
Olivier Chafik
|
8843a98c2b
Improve usability of --model-url & related flags (#6930)
|
пре 1 година |
Clint Herron
|
b8c1476e44
Extending grammar integration tests (#6644)
|
пре 1 година |
Daniel Bevenius
|
5539e6fdd1
main : fix typo in comment in main.cpp (#6985)
|
пре 1 година |
Olivier Chafik
|
b8a7a5a90f
build(cmake): simplify instructions (`cmake -B build && cmake --build build ...`) (#6964)
|
пре 1 година |
Georgi Gerganov
|
d2c898f746
ci : tmp disable gguf-split (#6983)
|
пре 1 година |
Georgi Gerganov
|
544f1f10ad
ggml : fix __MSC_VER -> _MSC_VER (#6977)
|
пре 1 година |
cpumaxx
|
ffe666572f
llava-cli : multiple images (#6969)
|
пре 1 година |
Georgi Gerganov
|
24affa7db3
readme : update hot topics
|
пре 1 година |
Georgi Gerganov
|
f4ab2a4147
llama : fix BPE pre-tokenization (#6920)
|
пре 1 година |
David Renshaw
|
3f167476b1
sampling : use std::random_device{}() for default random seed (#6962)
|
пре 1 година |